Approximately one in seven couples has difficulty conceiving. Infertility may be due to a single cause or a combination of factors (e.g., genetic factors, diseases, or environmental factors) that prevents a pregnancy from occurring or continuing.
When a couple seeks medical assistance for reproductive difficulties, the couple is advised to undergo a number of diagnostic procedures to ascertain potential causes. Often the procedures can be highly invasive, costly, and time consuming. Moreover, traditional diagnostic procedures may not be informative of the ultimate chance of achieving live birth. The uncertainty surrounding prognosis and treatment decisions is a significant challenge for fertility specialists.
A number of other factors add to the complexity around diagnosing and treating infertility. For example, obesity has been associated in some studies with reduced success in achieving pregnancy. Understanding the differential impact of obesity on the success or failure of different types of fertility treatments, in different individuals would be helpful in guiding couples and women experiencing fertility difficulties to better outcomes.
The invention provides methods and systems for assessing a patient's ability of achieving ongoing pregnancy and for informing course of treatment. The invention provides methods for generating a likelihood of achieving pregnancy and for selecting appropriate treatment to promote successful pregnancy by accounting for the body mass index (BMI) of the patient as an element of increasing the likelihood of a successful pregnancy. The invention recognizes that
BMI plays an important role in a patient's ability to achieve ongoing pregnancy and impacts the selection and success of fertility treatment protocols, as determined through several large-scale studies detailed below in the description of the invention.
Methods of the invention involve the prospective likelihood of success of a selected fertility/infertility treatment based upon a number of factors indicative of the potential for achieving ongoing pregnancy with respect to a specific fertility treatment and then informing the patient of a recommended fertility treatment. Factors to be considered in the clinical algorithm include BMI, age, weight, basal antral follicle count (BAFC), current medications, basal follicle stimulating hormone (FSH) levels, progesterone levels, estrogen levels, AMH levels, total gonadotropin used, infertility diagnosis, parity, number of oocytes retrieved, embryo stage at transfer, number of usable embryos, number of embryos transferred, and clinic. In one aspect, methods involve the steps of conducting one or more tests on the patient to determine a body mass index, obtaining one or more clinical characteristics from the patient, comparing the results obtained from the obtaining and conducting steps to a reference set of data obtained from a female reference population for which results of fertility treatments are known, and informing the patient of the potential for achieving ongoing pregnancy with respect to a specific fertility treatment, wherein the potential is determined based on the comparing step. At least one of the clinical characteristics includes the patient's age, clinical diagnoses, weight, BAFC, current medications, basal FSH, estrogen levels, progesterone levels, AMH, total gonadotropin used, infertility diagnosis, parity, infertility diagnosis, total gonadotropin used, number of oocytes retrieved, embryo stage at transfer, number of usable embryos, number of embryos transferred, and/or clinic.
In one aspect, the fertility treatment is intrauterine insemination (IUI). The fertility treatment can also include the use of one or more ovulation induction agents. These ovulation induction agents include gonadotropins (sometimes referred to as “mini-stim”), such as FSH, luteinizing hormone (LH), and human chorionic gonadotropin (hCG); and oral ovulation induction agents. Exemplary oral ovulation induction agents include, but are not limited to: clomiphene citrate; aromatase inhibitors, such as letrozole and anastrozole; insulin sensitizing drugs, such as metformin, rosiglitazone, and pioglitazone; bromocriptine; cabergoline; gonadotropin-releasing hormone (GnRH); and GnRH analogs, such as leuprolide acetate, nafarelin acetate, goserelin acetate, ganirelix, and cetrorelix acetate (the former three being agonists and the latter two being antagonists); and combinations thereof. Whether gonadotropins alone or gonadotropins with oral ovulation induction agents are recommended, is dependent, in part, upon the patient's BMI. In another embodiment, the fertility treatment is an assisted reproductive technology (ART), such as in vitro fertilization (IVF) or intracytoplasmic sperm injection (ICSI).
In addition to clinical data, genetic data can also be included in generating a probability of achieving ongoing pregnancy. Genetic data, such as mutations in fertility-related genes and gene expression profiles, can be obtained from the patient and used in the generation of the probability for achieving ongoing pregnancy. In one aspect, the genetic data is compared to data from the reference population, which includes both clinical and genetic data, in order to provide the probability of achieving ongoing success.
In another embodiment, a method for treating a patient suspected of having impaired fertility is provided. The method involves the steps of conducting one or more tests on the patient to determine BMI of the patient; administering a gonadotropin along if the patient's BMI is near or above a threshold BMI, or a gonadotropin and optionally an oral ovulation induction agent if the patient's BMI is below the threshold BMI; and subjecting the patient to a fertility treatment. In one aspect, the fertility treatment is IUI. In another aspect, the gonadotropin includes one or more of LH, FSH, and/or hCG. In yet another aspect, the oral ovulation induction agent can be letrozole, clomiphene citrate, bromocriptine, metformin, or cabergoline. The threshold limit can be 20, 25, or 30.
The invention relates to methods and systems for assessing likelihood (i.e., probability) of achieving pregnancy and for therapeutic intervention to achieve pregnancy. The invention provides methods for generating a likelihood of achieving pregnancy and guiding treatment therefrom using clinical data and specifically accounting for the body mass index (BMI) of the patient. In accordance with the present invention, the generation of a likelihood of achieving ongoing pregnancy for a certain fertility treatment in an individual takes into consideration clinical data, and optionally genetic data. The methods involve the determination of one or more correlations between clinical characteristics and known pregnancy and infertility-related outcomes from a reference set of data to provide a model representative of a cumulative probability of ongoing pregnancy for a certain fertility treatment. The methods can further involve the determination of one or more correlations between genetic characteristics and known pregnancy and infertility-related outcomes from the reference set of data to adjust the model. The model can then be applied to the patient's data to generate the likelihood of achieving ongoing pregnancy in the subject for a certain fertility treatment. Based on the generated likelihood, a treatment protocol can be recommended. As discussed in more detail below, the treatment protocol may depend, in part, on the BMI of the patient.
Assessment and analysis of the potential for achieving ongoing pregnancy and live birth incorporates the use of clinical fertility-associated information, or data, such as phenotypic and/or environmental characteristics. Exemplary clinical information is provided in Table 1 below.
In one embodiment, the assessment of a patient's probability of achieving an ongoing pregnancy incorporates clinical data such as age, BAFC, medication type, sperm motility, clinical diagnoses, BMI, hormone levels, and previous fertility treatments (including the use of ovulation induction agents).
Clinical information can be obtained by any means known in the art. In many cases this information can be obtained from a questionnaire completed by the subject that contains questions regarding certain clinical data, such as age. Additional information can be obtained from a questionnaire completed by the subject's partner and blood relatives. The questionnaire includes questions regarding the subject's clinical traits, such as his or her age, smoking habits, or frequency of alcohol consumption.
Information can also be obtained from the medical history of the subject, as well as the medical history of blood relatives and other family members, such as any clinical diagnoses, prior fertility treatments and current medications. Additional information can be obtained from the medical history and family medical history of the subject's partner. Medical history information can be obtained through analysis of electronic medical records, paper medical records, a series of questions about medical history included in the questionnaire, and a combination thereof.
In other embodiments, an assay specific to a phenotypic trait or an environmental exposure of interest is used. Such assays are known to those of skill in the art, and may be used with methods of the invention. For example, hormones, such as FSH and LH, may be detected from a urine or blood test. Venners et al. (Hum. Reprod. 21(9): 2272-2280, 2006) reports assays for detecting estrogen and progesterone in urine and blood samples. Venners also reports assays for detecting the chemicals used in fertility treatments.
Illicit drug use may be detected from a tissue or body fluid, such as hair, urine, sweat, or blood, and there are numerous commercially available assays (LabCorp) for conducting such tests. Standard drug tests look for ten different classes of drugs, and the test is commercially known as a “10-panel urine screen”. The 10-panel urine screen consists of the following: 1. Amphetamines (including Methamphetamine) 2. Barbiturates 3. Benzodiazepines 4.
Cannabinoids (THC) 5. Cocaine 6. Methadone 7. Methaqualone 8. Opiates (Codeine, Morphine, Heroin, Oxycodone, Vicodin, etc.) 9. Phencyclidine (PCP) 10. Propoxyphene. Use of alcohol can also be detected by such tests.
Numerous assays can be used to test a patient's exposure to plastics (e.g., Bisphenol A (BPA)). BPA is most commonly found as a component of polycarbonates (about 74% of total
BPA produced) and in the production of epoxy resins (about 20%). As well as being found in a myriad of products including plastic food and beverage contains (including baby and water bottles), BPA is also commonly found in various household appliances, electronics, sports safety equipment, adhesives, cash register receipts, medical devices, eyeglass lenses, water supply pipes, and many other products. Assays for testing blood, sweat, or urine for presence of BPA are described, for example, in Genuis et al. (Journal of Environmental and Public Health, Volume 2012, Article ID 185731, 10 pages, 2012).
A subject's BMI can be determined by first obtaining the subject's weight and height and then comparing to or inputting that information into a physical or computer-based table or chart. BMI is a value derived from the mass and height of an individual that is used to quantify the amount of tissue mass (including muscle, fat, and bone) in an individual, such that the individual can be categorized as underweight, normal weight, overweight, or obese. The commonly accepted ranges can be found in Table 2 below.
Antral follicle count (AFC) can be determined through the use of ultrasound, preferably a vaginal ultrasound. Antral follicles are small follicles within the ovaries that present during a later stage of folliculogenesis. BAFC are often used as a biomarker for ovarian reserve.
In one aspect of the invention, the assessment of the patient's probability of achieving ongoing pregnancy and subsequent determination of a treatment protocol includes the use of genetic data from both the patient and a reference population. This genetic data is utilized to provide more accurate prognoses that can inform downstream diagnostic tests and treatments that may benefit the subject.
Genetic data for use with methods of the invention include any biomarkers that are associated with reproduction, infertility/ability to achieving ongoing pregnancy. Exemplary biomarkers include genes (e.g. any region of DNA encoding a functional product), genetic regions (e.g. regions including genes and intergenic regions with a particular focus on regions conserved throughout evolution in placental mammals), and gene products (e.g., RNA and protein). In certain embodiments, the biomarker is an infertility-associated gene or genetic region. A reproductive health-associated genetic region is any DNA sequence in which variation is associated with a change in reproductive health. Examples of changes in reproductive health include, but are not limited to, the following: a homozygous mutation of an infertility-associated gene leads to a complete loss of fertility; a homozygous mutation of an infertility-associated gene is incompletely penetrant and leads to reduction in fertility that varies from individual to individual; a heterozygous mutation is completely recessive, having no effect on fertility; and the infertility-associated gene is X-linked, such that a potential defect in fertility depends on whether a non-functional allele of the gene is located on an inactive X chromosome (Barr body) or on an expressed X chromosome.
In particular embodiments, the assessed infertility-associated genetic region is a maternal effect gene. Maternal effects genes are genes that have been found to encode key structures and functions in mammalian oocytes (Yurttas et al., Reproduction 139:809-823, 2010). Maternal effect genes are described, for example in, Christians et al. (Mol Cell Biol 17:778-88, 1997); Christians et al., Nature 407:693-694, 2000); Xiao et al. (EMBO J 18:5943-5952, 1999); Tong et al. (Endocrinology 145:1427-1434, 2004); Tong et al. (Nat Genet 26:267-268, 2000); Tong et al. (Endocrinology, 140:3720-3726, 1999); Tong et al. (Hum Reprod 17:903-911, 2002); Ohsugi et al. (Development 135:259-269, 2008); Borowczyk et al. (Proc Natl Acad Sci U S A., 2009); and
Wu (Hum Reprod 24:415-424, 2009). Maternal effects genes are also described in U.S. 12/889,304. The content of each of these is incorporated by reference herein in its entirety.
In particular embodiments, the reproductive health-associated genetic region is one or more genes (including exons, introns, and 10 kb of DNA flanking either side of said gene) selected from the genes shown in Table 3 below. In Table 3, OMIM reference numbers are provided when available.
The genes listed in Table 3 can be involved in different aspects of reproduction/fertility related processes. Furthermore, additional genes beyond those maternal effect genes listed in Table 3 can also affect fertility. Genes affecting fertility can be involved with a number of male- and female-specific processes, or functional biological classifications, such as those shown in
Gonadogenesis encompasses the processes regulating the development of the ovaries and testes, and involves, but is not limited to, primordial germ cell specification and proliferation. The neuroendocrine axis encompasses for example the physiological pathways and structures regulating the production and activity of hormones in a number of different tissues in the human body, including the brain and gonads. Folliculogenesis encompasses the physiological mechanisms regulating the development of primordial follicles to cystic follicles in the ovary. Oogenesis encompasses the physiological mechanisms regulating the development of primordial oocytes to mature meiosis-II stage oocytes ready to be fertilized, hence those that are specific to female reproductive biology. Oocyte-embryo transition encompasses the physiological mechanisms regulating the development of the early embryo and includes mechanisms related to egg quality, such as oocyte cytoplasmic lattice formation, and paternal effect mechanisms. Placentation (Embryonic) encompasses the embryo-specific physiological mechanisms regulating implantation and the development of the placenta. Placentation (Uterine) encompasses the uterus-specific physiological mechanisms regulating embryo implantation and the development of the placenta. Post-implantation development encompasses the physiological mechanisms regulating post-implantation embryo development, particularly those whose disruption might lead to abnormal development or pregnancy loss in humans. Adiposity encompasses the physiological mechanisms regulating adipose tissue and body weight, which are known to play an important, indirect role in mammalian fecundity and infertility. Reproductive anatomy encompasses any phenotype relating to anatomical changes that could impact reproduction, fecundity or fertility. Immune response encompasses phenotypes that are specific to aspects of immune response mechanisms, which are known to play an important role in mammalian reproduction and fertility.
Spermatogenesis encompasses the processes involved in the production or development of mature spermatozoa, hence those that are specific to male reproductive biology. Maturation encompasses processes that enable spermatozoa to fertilize eggs, hence those that are specific to male reproductive biology. Capacitation encompasses processes specific to functional capacitation of spermatozoa in the vaginal canal and uterus. Fertilization encompasses processes relating to the union of a human egg and sperm. Mitosis encompasses processes involving changes to the cell division process such that it does not end with two daughter cells that have the same chromosomal complement as the parent cell. Such changes to the mitotic process may affect for example fertility-related cell proliferation or tissue maintenance. Meiosis encompasses processes regulating meiosis such that it results in four daughter cells each with exactly half the chromosome complement of the parent cell, for example during gametogenesis. Spermiogenesis encompasses processes regulating the morphological differentiation of haploid cells into sperm.
Mutations in genes associated with these various processes result in fertility difficulties for males and/or females containing these mutations.
Genetic data can be obtained, for example, by conducting an assay on a sample from a male or female that detects either a variant in a reproductive health-associated genetic region or abnormal (over or under) expression of a reproductive health-associated genetic region. The presence of certain variants in those genetic regions or abnormal expression levels of those genetic regions is indicative of fertility outcomes, i.e., whether ongoing pregnancy or live birth is achievable. Exemplary variants include, but are not limited to, a single nucleotide polymorphism, a single nucleotide variant, a deletion, an insertion, an inversion, a genetic rearrangement, a copy number variation, chromosomal microdeletion, genetic mosaicism, karyotype abnormality, or a combination thereof.
A sample may include a human tissue or bodily fluid and may be collected in any clinically acceptable manner. A tissue is a mass of connected cells and/or extracellular matrix material, e.g. skin tissue, hair, nails, nasal passage tissue, CNS tissue, neural tissue, eye tissue, liver tissue, kidney tissue, placental tissue, mammary gland tissue, gastrointestinal tissue, musculoskeletal tissue, genitourinary tissue, bone marrow, and the like, derived from, for example, a human or other mammal and includes the connecting material and the liquid material in association with the cells and/or tissues. A body fluid is a liquid material derived from, for example, a human or other mammal. Such body fluids include, but are not limited to, mucous, blood, plasma, serum, serum derivatives, bile, blood, maternal blood, phlegm, saliva, sputum, sweat, amniotic fluid, menstrual fluid, mammary fluid, follicular fluid of the ovary, fallopian tube fluid, peritoneal fluid, urine, semen, and cerebrospinal fluid (CSF), such as lumbar or ventricular CSF. A sample may also be a fine needle aspirate or biopsied tissue, e.g. an endometrial aspirate, breast tissue biopsy, and the like. A sample also may be media containing cells or biological material. A sample may also be a blood clot, for example, a blood clot that has been obtained from whole blood after the serum has been removed. In certain embodiments, the sample may include reproductive cells or tissues, such as gametic cells, gonadal tissue, fertilized embryos, and placenta. In certain embodiments, the sample is blood, saliva, or semen collected from the subject.
Genetic information from the sample can be obtained by nucleic acid extraction from the sample. Methods for extracting nucleic acid from a sample are known in the art. See for example, Maniatis, et al., Molecular Cloning: A Laboratory Manual, Cold Spring Harbor, N.Y., pp. 280-281, 1982, the contents of which are incorporated by reference herein in their entirety. In certain embodiments, a sample is collected from a subject followed by enrichment for genes or gene fragments of interest, for example by hybridization to a nucleotide array including fertility-related genetic regions or genetic fragments of interest. The sample may be enriched for genetic regions of interest (e.g., reproductive health-associated genetic regions) using methods known in the art, such as hybrid capture. See for examples, Lapidus (U.S. Pat. No. 7,666,593), the content of which is incorporated by reference herein in its entirety.
In particular embodiments, the assay is conducted on fertility-related genes or genetic regions containing the gene or a part thereof, such as those genes found in Table 3. Detailed descriptions of conventional methods, such as those employed to make and use nucleic acid arrays, amplification primers, hybridization probes, and the like can be found in standard laboratory manuals such as: Genome Analysis: A Laboratory Manual Series (Vols. I-IV), Cold Spring Harbor Laboratory Press; PCR Primer: A Laboratory Manual, Cold Spring Harbor Laboratory Press; and Sambrook, J et al., (2001) Molecular Cloning: A Laboratory Manual, 2nd ed. (Vols. 1-3), Cold Spring Harbor Laboratory Press. Custom nucleic acid arrays are commercially available from, e.g., Affymetrix (Santa Clara, Calif.), Applied Biosystems (Foster City, Calif.), and Agilent Technologies (Santa Clara, Calif.).
Methods of detecting variations (e.g., mutations) are known in the art. In certain embodiments, a known single nucleotide polymorphism (SNP) at a particular position can be detected by single base extension for a primer that binds to the sample DNA adjacent to that position. See for example Shuber et al. (U.S. Pat. No. 6,566,101), the content of which is incorporated by reference herein in its entirety. In other embodiments, a hybridization probe might be employed that overlaps the SNP of interest and selectively hybridizes to sample nucleic acids containing a particular nucleotide at that position. See for example Shuber et al. (U.S. Pat. Nos. 6,214,558 and 6,300,077), the content of which is incorporated by reference herein in its entirety.
In particular embodiments, nucleic acids are sequenced in order to detect variants in the nucleic acid compared to wild-type and/or non-mutated forms of the sequence. The nucleic acid can include a plurality of nucleic acids derived from a plurality of genetic elements. Methods of detecting sequence variants are known in the art, and sequence variants can be detected by any sequencing method known in the art.
DNA sequencing techniques include classic dideoxy sequencing reactions (Sanger method) using labeled terminators or primers and gel separation in slab or capillary, sequencing by synthesis using reversibly terminated labeled nucleotides, pyrosequencing, allele specific hybridization to a library of labeled oligonucleotide probes, sequencing by synthesis using allele specific hybridization to a library of labeled clones that is followed by ligation, real time monitoring of the incorporation of labeled nucleotides during a polymerization step, polony sequencing, and SOLiD sequencing. Sequencing of separated molecules has more recently been demonstrated by sequential or single extension reactions using polymerases or ligases as well as by single or sequential differential hybridizations with libraries of probes. Exemplary sequencing methods include but are not limited to the following: sequencing by chain termination and gel separation, as described by Sanger et al., Proc Natl. Acad. Sci. USA, 74(12): 5463 67 (1977); chemical degradation of nucleic acid fragments. See, Maxam et al., Proc. Natl. Acad. Sci., 74: 560 564 (1977); sequencing by hybridization. See, e.g., Harris et al., (U.S. patent application number 2009/0156412); Helicos True Single Molecule Sequencing (tSMS). See Harris T. D. et al. (2008) Science 320:106-109; see also, e.g., Lapidus et al. (U.S. Pat. No. 7,169,560), Lapidus et al. (U.S. patent application number 2009/0191565), Quake et al. (U.S. Pat. No. 6,818,395), Harris (U.S. Pat. No. 7,282,337), Quake et al. (U.S. patent application number 2002/0164629), and Braslaysky, et al., PNAS (USA), 100: 3960-3964 (2003); 454 sequencing (Roche) (Margulies, M et al. 2005, Nature, 437, 376-380); SOLiD technology (Applied Biosystems); Ion Torrent sequencing (U.S. patent application numbers 2009/0026082, 2009/0127589, 2010/0035252, 2010/0137143, 2010/0188073, 2010/0197507, 2010/0282617, 2010/0300559), 2010/0300895, 2010/0301398, and 2010/0304982); single molecule, real-time (SMRT) technology of Pacific Biosciences; nanopore sequencing (Soni G V and Meller A. (2007) Clin Chem 53: 1996-2001); chemical-sensitive field effect transistor (chemFET) arrays (See e.g., U.S. Patent Application Publication No. 2009/0026082); and use of an electron microscope (Moudrianakis E. N. and Beer M. Proc Natl Acad Sci USA. 1965 March; 53:564-71), the content of each of which is incorporated by reference herein in its entirety.
Yet another example of a sequencing technology that can be used in the methods of the provided invention is next-generation sequencing, such as Illumina sequencing, using Illumina HiSeq sequencers. Illumina sequencing is based on the amplification of DNA on a solid surface using fold-back PCR and anchored primers. Genomic DNA is fragmented, and adapters are added to the 5′ and 3′ ends of the fragments. DNA fragments that are attached to the surface of flow cell channels are extended and bridge amplified. The fragments become double stranded, and the double stranded molecules are denatured. Multiple cycles of the solid-phase amplification followed by denaturation can create several million clusters of approximately 1,000 copies of single-stranded DNA molecules of the same template in each channel of the flow cell. Primers, DNA polymerase and four fluorophore-labeled, reversibly terminating nucleotides are used to perform sequential sequencing. After nucleotide incorporation, a laser is used to excite the fluorophores, and an image is captured and the identity of the first base is recorded. The 3′ terminators and fluorophores from each incorporated base are removed and the incorporation, detection and identification steps are repeated.
In certain aspects, the invention provides a microarray including a plurality of oligonucleotides attached to a substrate at discrete addressable positions, in which at least one of the oligonucleotides hybridizes to a portion of a gene suspected of affecting fertility in a man or woman. Methods of constructing microarrays are known in the art. See for example Yeatman et al. (U.S. patent application number 2006/0195269), the content of which is hereby incorporated by reference in its entirety.
If the nucleic acid from the sample is degraded or only a minimal amount of nucleic acid can be obtained from the sample, PCR can be performed on the nucleic acid in order to obtain a sufficient amount of nucleic acid for sequencing (See e.g., Mullis et al. U.S. Pat. No. 4,683,195, the contents of which are incorporated by reference herein in its entirety).
Sequencing by any of the methods described above and known in the art produces sequence reads. Sequence reads can be analyzed to call variants by any number of methods known in the art. Variant calling can include aligning sequence reads to a reference (e.g. hg18) and reporting single nucleotide polymorphism (SNP)/single nucleotide variant alleles. An example of methods for analyzing sequence reads and calling variants includes standard Genome Analysis Toolkit (GATK) methods. See The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data, Genome Res 20(9):1297-1303, the contents of each of which are incorporated by reference. GATK is a software package for analysis of high-throughput sequencing data capable of identifying variants, including SNPs. SNP/SNV alleles can be reported in a format such as a Sequence Alignment Map (SAM) or a Variant Call Format (VCF) file. Some background may be found in Li & Durbin, 2009, Fast and accurate short read alignment with Burrows-Wheeler Transform. Bioinformatics 25:1754-60 and McKenna et al., 2010. Variant calling produces results (“variant calls”) that may be stored as a sequence alignment map (SAM) or binary alignment map (BAM) file—comprising an alignment string (the SAM format is described, e.g., in Li, et al., The Sequence Alignment/Map format and SAMtools, Bioinformatics, 2009, 25(16):2078-9). Additionally or alternatively, output from the variant calling may be provided in a variant call format (VCF) file, e.g., in report. A typical VCF file will include a header section and a data section. The header contains an arbitrary number of meta-information lines, each starting with characters ‘##’, and a TAB delimited field definition line starting with a single ‘#’ character. The field definition line names eight mandatory columns and the body section contains lines of data populating the columns defined by the field definition line. The VCF format is described in Danecek et al., 2011, The variant call format and VCFtools, Bioinformatics 27(15):2156-2158. Further discussion may be found in U.S. Pub. 2013/0073214; U.S. Pub. 2013/0345066; U.S. Pub. 2013/0311106; U.S. Pub. 2013/0059740; U.S. Pub. 2012/0157322; U.S. Pub. 2015/0057946 and U.S. Pub. 2015/0056613, each incorporated by reference.
Furthermore, methods of the invention include conducting an assay on a sample from a subject that detects an abnormal (over or under) expression of a reproductive health-associated gene (e.g. a differentially or abnormally expressed gene). A differentially or abnormally expressed gene refers to a gene whose expression is activated to a higher or lower level in a subject suffering from a disorder, such as infertility, relative to its expression in a normal or control subject. The terms also include genes whose expression is activated to a higher or lower level at different stages of the same disorder. It is also understood that a differentially expressed gene may be either activated or inhibited at the nucleic acid level or protein level, or may be subject to alternative splicing to result in a different polypeptide product. Such differences may be evidenced by a change in mRNA levels, surface expression, secretion or other partitioning of a polypeptide, for example.
Differential gene expression may include a comparison of expression between two or more genes or their gene products, or a comparison of the ratios of the expression between two or more genes or their gene products, or even a comparison of two differently processed products of the same gene, which differ between normal subjects and subjects suffering from a disorder, such as infertility, or between various stages of the same disorder. Differential expression includes both quantitative, as well as qualitative, differences in the temporal or cellular expression pattern in a gene or its expression products. Differential gene expression (increases and decreases in expression) is based upon percent or fold changes over expression in normal cells. Increases may be of 1, 5, 10, 20, 30, 40, 50, 60, 70, 80, 90, 100, 120, 140, 160, 180, or 200% relative to expression levels in normal cells. Alternatively, fold increases may be of 1, 1.5, 2, 2.5, 3, 3.5, 4, 4.5, 5, 5.5, 6, 6.5, 7, 7.5, 8, 8.5, 9, 9.5, or 10 fold over expression levels in normal cells. Decreases may be of 1, 5, 10, 20, 30, 40, 50, 55, 60, 65, 70, 75, 80, 82, 84, 86, 88, 90, 92, 94, 96, 98, 99 or 100% relative to expression levels in normal cells.
Methods of detecting levels of gene products (e.g., RNA or protein) are known in the art. Commonly used methods known in the art for the quantification of mRNA expression in a sample include northern blotting and in situ hybridization (Parker & Barnes, Methods in Molecular Biology 106:247 283 (1999); RNAse protection assays (Hod, Biotechniques 13:852 854 (1992); and PCR-based methods, such as reverse transcription polymerase chain reaction (RT-PCR) (Weis et al., Trends in Genetics 8:263 264 (1992); the contents of all of which are incorporated by reference herein in their entirety. Alternatively, antibodies may be employed that can recognize specific duplexes, including RNA duplexes, DNA-RNA hybrid duplexes, or DNA-protein duplexes. Other methods known in the art for measuring gene expression (e.g., RNA or protein amounts) are shown in Yeatman et al. (U.S. patent application number 2006/0195269), the content of which is hereby incorporated by reference in its entirety.
In certain embodiments, reverse transcriptase PCR (RT-PCR) is used to measure gene expression. RT-PCR is a quantitative method that can be used to compare mRNA levels in different sample populations to characterize patterns of gene expression, to discriminate between closely related mRNAs, and to analyze RNA structure. Various methods are well known in the art. See, e.g., Ausubel et al., Current Protocols of Molecular Biology, John Wiley and Sons (1997); Rupp and Locker, Lab Invest. 56:A67 (1987), and De Andres et al., BioTechniques 18:42044 (1995); Held et al., Genome Research 6:986 994 (1996), the contents of which are incorporated by reference herein in their entirety.
Further PCR-based techniques include, for example, differential display (Liang and Pardee, Science 257:967 971 (1992)); amplified fragment length polymorphism (iAFLP) (Kawamoto et al., Genome Res. 12:1305 1312 (1999)); BeadArray™ technology (Illumina, San Diego, Calif.; Oliphant et al., Discovery of Markers for Disease (Supplement to Biotechniques), June 2002; Ferguson et al., Analytical Chemistry 72:5618 (2000)); BeadsArray for Detection of Gene Expression (BADGE), using the commercially available Luminex100 LabMAP system and multiple color-coded microspheres (Luminex Corp., Austin, Tex.) in a rapid assay for gene expression (Yang et al., Genome Res. 11:1888 1898 (2001)); and high coverage expression profiling (HiCEP) analysis (Fukumura et al., Nucl. Acids. Res. 31(16) e94 (2003)). The contents of each of which are incorporated by reference herein in their entirety.
In another embodiment, a MassARRAY-based gene expression profiling method is used to measure gene expression. For further details see, e.g. Ding and Cantor, Proc. Natl. Acad. Sci. USA 100:3059 3064 (2003), incorporated herein by reference.
In certain embodiments, differential gene expression can also be identified, or confirmed using a microarray technique. In this method, polynucleotide sequences of interest (including cDNAs and oligonucleotides) are plated, or arrayed, on a microchip substrate. The arrayed sequences are then hybridized with specific DNA probes from cells or tissues of interest. Methods for making microarrays and determining gene product expression (e.g., RNA or protein) are shown in Yeatman et al. (U.S. patent application number 2006/0195269); see also Schena et al., Proc. Natl. Acad. Sci. USA 93(2):106 149 (1996), the content of each of which is incorporated by reference herein in their entirety. Microarray analysis can be performed by commercially available equipment, following manufacturer's protocols, such as by using the Affymetrix GenChip technology, or Incyte's microarray technology.
In another aspect, protein levels can be determined by constructing an antibody microarray in which binding sites comprise immobilized, preferably monoclonal, antibodies specific to a plurality of protein species encoded by the cell genome. Methods for making monoclonal antibodies are well known (see, e.g., Harlow and Lane, 1988, ANTIBODIES: A LABORATORY MANUAL, Cold Spring Harbor, N.Y., which is incorporated in its entirety for all purposes).
In yet another aspect, levels of transcripts of marker genes in a number of tissue specimens may be characterized using a “tissue array” (Kononen et al., Nat. Med 4(7):844-7 (1998)). In other embodiments, Serial Analysis of Gene Expression (SAGE) is used to measure gene expression. Serial analysis of gene expression (SAGE) is a method that allows the simultaneous and quantitative analysis of a large number of gene transcripts, without the need of providing an individual hybridization probe for each transcript. For more details see, e.g. Velculescu et al., Science 270:484 487 (1995); and Velculescu et al., Cell 88:243 51 (1997, the contents of each of which are incorporated by reference herein in their entirety).
In other embodiments, Massively Parallel Signature Sequencing (MPSS) is used to measure gene expression. For more details see, e.g. Brenner et al., Nature Biotechnology 18:630 634 (2000).
Immunohistochemistry methods are also suitable for detecting the expression levels of the gene products of the present invention. In these methods, antibodies (monoclonal or polyclonal) or antisera, such as polyclonal antisera, specific for each marker are used to detect expression. Immunohistochemistry protocols and kits are well known in the art and are commercially available.
In certain embodiments, a proteomics approach is used to measure gene expression. Proteomics typically includes the following steps: (1) separation of individual proteins in a sample by 2-D gel electrophoresis (2-D PAGE); (2) identification of the individual proteins recovered from the gel, e.g. my mass spectrometry or N-terminal sequencing, and (3) analysis of the data using bioinformatics. Proteomics methods are valuable supplements to other methods of gene expression profiling, and can be used, alone or in combination with other methods, to detect the products of the prognostic markers of the present invention.
In some embodiments, mass spectrometry (MS) analysis can be used alone or in combination with other methods (e.g., immunoassays or RNA measuring assays) to determine the presence and/or quantity of the one or more biomarkers disclosed herein in a biological sample. In some embodiments, the MS analysis includes matrix-assisted laser desorption/ionization (MALDI) time-of-flight (TOF) MS analysis, such as for example direct-spot MALDI-TOF or liquid chromatography MALDI-TOF mass spectrometry analysis. In some embodiments, the MS analysis comprises electrospray ionization (ESI) MS, such as for example liquid chromatography (LC) ESI-MS. Mass analysis can be accomplished using commercially-available spectrometers. Methods for utilizing MS analysis, including MALDI-TOF MS and ESI-MS, to detect the presence and quantity of biomarker peptides in biological samples are known in the art. See, for example, U.S. Pat. Nos. 6,925,389; 6,989,100; and 6,890,763, each of which is incorporated by reference herein in their entirety.
The present invention provides methods for generating a predicted probability of achieving ongoing pregnancy in an individual with respect to a specific fertility treatment and informing course of treatment therefrom, wherein the method specifically incorporates the effect of BMI on the predicted probability. Methods for generating a likelihood of achieving ongoing pregnancy generally involve the determination of one or more correlations between clinical characteristics and known pregnancy and infertility-related outcomes from a reference set of data to provide a model representative of a probability of ongoing pregnancy. In certain aspects, the methods further involve the determination of one or more correlations between genetic characteristics and known pregnancy and infertility-related outcomes from the reference set of data to adjust the model. The model can then be applied to the input data to generate the likelihood of achieving ongoing pregnancy in the subject, which will in turn, inform the course of treatment for the subject.
Clinical characteristics obtained from the reference population include, but are not limited to, any or all of the characteristics described above in the “Clinical Characteristics” section. Exemplary characteristics include BMI, fertility treatment history, age, BAFC, sperm motility, clinical diagnoses, and medication type. With respect to fertility treatment history, the reference set of data includes information as to what fertility treatments, including any ovulation induction agents, were used. Exemplary fertility treatments include, but are not limited to, ART, non-ART fertility treatments (RE), and fertility preservation technologies (egg, embryo, or ovarian preservation). Exemplary assisted reproductive technologies include, without limitation, IVF, zygote intrafallopian transfer (ZIFT), gametic intrafallopian transfer (GIFT), or ICSI paired with one of the methods above. Exemplary non-ART fertility treatments include ovulation induction protocols with or without IUI with sperm. Exemplary ovulation induction agents include gonadotropins such as LH, FSH, human menopausal gonadotropin (hMG), and hCG; and oral ovulation induction agents. Exemplary oral ovulation induction agents include, but are not limited to: clomiphene citrate; aromatase inhibitors, such as letrozole and anastrozole; insulin sensitizing drugs, such as metformin, rosiglitazone, and pioglitazone; bromocriptine; cabergoline; GnRH; and GnRH analogs, such as leuprolide acetate, nafarelin acetate, goserelin acetate, ganirelix, and cetrorelix acetate (the former three being agonists and the latter two being antagonists); and combinations thereof. Preferably, the clinical characteristics are to include BMI and previous fertility treatments, wherein at least some of the reference population has undergone non-ART treatments, such as IUI, with and without ovulation induction agents.
The clinical characteristics obtained from the reference population can then be passed through an association analysis in order to determine whether and to what extent the characteristics obtained from the subjects in the reference population are associated with the cumulative odds of achieving ongoing pregnancy.
In one embodiment, the methods also incorporate genetic characteristics from the reference population and their impact on the cumulative probability of achieving ongoing pregnancy. First, variants within genes and genetic regions, including those described above, are identified. In a preferred embodiment, whole genome sequencing is conducted on DNA extracted from whole blood samples using the Illumina HiSeq platform. As described above, variants can be called using standard Genome Analysis Toolkit (GATK) methods.
Once the variants are called, a customized pipeline is used to identify deleterious variants among the genetic signatures of patients. Deleterious variants can be determined using, for example, the SnpEff and Variant Effect Predictor (www.ensembl.org) engines. SnpEff is capable of rapidly categorizing the effects of SNPs and other variants in whole genome sequences. See, Cingolani et al., A program for annotating and predicting the effects of single nucleotide polymorphisms, SnpEff: SNPs in the genome of Drosophila melanogaster strain w1118; iso-2; iso-3; Landes Bioscience, 6:2, 1-13; April/May/June 2012, incorporated herein by reference. Variants predicted to have a high impact or be ‘moderate missense variants’ (moderate is defined by SnpEff as causing an amino acid change) using programs such as SnpEff are then selected.
Upon identification of these high and moderate impact variants, the variants are then passed through a scoring system based on various annotation tools. One of ordinary skill in the art would understand that both molecular and computational approaches are available for annotating variants (e.g. by comparing to a known database, through the use of ANOVA technology, through the use of multivariant analysis). Exemplary annotation tools include the Database for Annotation, Visualization and Integrated Discover (DAVID). Nature Protocols 2009; 4(1):44; and Nucleic Acids Res. 2009; 37(1):1, incorporated herein by reference.
Variants that were considered deleterious by at least two annotation tools can then be passed through to the association analysis, along with the clinical characteristics to determine whether the genetic variant signatures obtained from the subjects are associated with their cumulative odds of ongoing pregnancy.
The association analysis involves the use of any one of a number of models to calculate cumulative odds of ongoing pregnancy for the reference population, such as a cohort of patients. As noted above, the model incorporates and adjusts for clinical information, such as the clinical characteristics listed in Table 1, obtained from the subjects, including BMI, age, BAFC, medication type, sperm motility, clinical diagnoses, and prior fertility treatments. In one aspect, the model can be weighted towards more recent data.
Suitable analysis methods include, without limitation, logistic regression, ordinal logistic regression, linear or quadratic discriminant analysis, clustering, principal component analysis, nearest neighbor classifier analysis, and discrete time-proportional hazards models.
Logistic regression analysis may be used to generate an odds ratio and relative risk for each characteristic. Method of logistic regression are described, for example in, Ruczinski (Journal of Computational and Graphical Statistics 12:475-512, 2003); Agresti (An Introduction to Categorical Data Analysis, John Wiley & Sons, Inc., 1996, New York, Chapter 8); and Yeatman et al. (U.S. patent application number 2006/0195269), the content of each of which is hereby incorporated by reference in its entirety.
Some embodiments of the present invention provide generalizations of the logistic regression model that handle multicategory (polychotomous) responses. Such embodiments can be used to discriminate an organism into one or more prognosis groups (e.g., good prognosis, poor prognosis). Such regression models use multicategory logit models that simultaneously refer to all pairs of categories, and describe the odds of response in one category instead of another. Once the model specifies logits for a certain (J-1) pairs of categories, the rest are redundant. See, for example, Agresti, An Introduction to Categorical Data Analysis, John Wiley & Sons, Inc., 1996, New York, Chapter 8, which is hereby incorporated by reference.
Regularization techniques may be used in certain embodiments of the invention in order to prevent over-fitting to training data and to identify the most important features to include in predictive models. Examples of regularization techniques include, without limitation, least absolute shrinkage and selection operator (lasso), ridge regression, elastic net, or certain specifications of hierarchical Bayesian models. Regularization can be used with any known algorithm which optimizes an objective function including, without limitation, linear regression, logistic regression, or artificial neural networks. See, for example, Chapter 16 of Hastie, 2001, The Elements of Statistical Learning, Springer, New York, hereby incorporated by reference.
Lasso regularization results in penalized parameter estimates which are smaller in magnitude than non-penalized estimates. Lasso regression can be used as a variable selection technique by driving parameter estimates exactly to 0, suggesting that these features do not have predictive ability in the model. Only features with parameter estimates larger than 0 after lasso regularization are then taken as features in predictive models.
Linear discriminant analysis (LDA) attempts to classify a subject into one of two categories based on certain object properties. In other words, LDA tests whether object attributes measured in an experiment predict categorization of the objects. LDA typically requires continuous independent variables and a dichotomous categorical dependent variable. In one embodiment, the selected fertility-associated phenotypic traits serve as the requisite continuous independent variables. The prognosis group classification of each of the members of the training population serves as the dichotomous categorical dependent variable. For more information on linear discriminant analysis, see Duda, Pattern Classification, Second Edition, 2001, John Wiley & Sons, Inc; and Hastie, 2001, The Elements of Statistical Learning, Springer, New York; Venables & Ripley, 1997, Modern Applied Statistics with s-plus, Springer, New York, incorporated herein by reference.
Quadratic discriminant analysis (QDA) takes the same input parameters and returns the same results as LDA. QDA uses quadratic equations, rather than linear equations, to produce results. LDA and QDA are interchangeable, and which to use is a matter of preference and/or availability of software to support the analysis. Logistic regression takes the same input parameters and returns the same results as LDA and QDA.
In some embodiments of the present invention, decision trees are used to classify patients using expression data for a selected set of molecular markers of the invention. Decision tree algorithms belong to the class of supervised learning algorithms. The aim of a decision tree is to induce a classifier (a tree) from real-world example data. This tree can be used to classify unseen examples which have not been used to derive the decision tree. In general there are a number of different decision tree algorithms, many of which are described in Duda, Pattern Classification, Second Edition, 2001, John Wiley & Sons, Inc. Decision tree algorithms often require consideration of feature processing, impurity measure, stopping criterion, and pruning. Specific decision tree algorithms include, but are not limited to classification and regression trees (CART), multivariate decision trees, ID3, and C4.5.
In some embodiments, the fertility-associated characteristics are used to cluster a training set. Additional information and examples are described in Duda and Hart, Pattern Classification and Scene Analysis, 1973, John Wiley & Sons, Inc., New York; Kaufman and Rousseeuw, 1990, Finding Groups in Data: An Introduction to Cluster Analysis, Wiley, New York, N.Y.; Duda, Pattern Classification, Second Edition, 2001, John Wiley & Sons, Inc; and Hastie, 2001, The Elements of Statistical Learning, Springer, New York; Everitt, 1993, Cluster analysis (3d ed.), Wiley, New York, N.Y.; and Backer, 1995, Computer-Assisted Reasoning in Cluster Analysis, Prentice Hall, Upper Saddle River, N.J. Particular exemplary clustering techniques that can be used in the present invention include, but are not limited to, hierarchical clustering (agglomerative clustering using nearest-neighbor algorithm, farthest-neighbor algorithm, the average linkage algorithm, the centroid algorithm, or the sum-of-squares algorithm), k-means clustering, fuzzy k-means clustering algorithm, and Jarvis-Patrick clustering.
Other algorithms for analyzing associations are known. For example, the stochastic gradient boosting is used to generate multiple additive regression tree (MART) models to predict a range of outcome probabilities. A different approach called the generalized linear model, expresses the outcome as a weighted sum of functions of the predictor variables. The weights are calculated based on least squares or Bayesian methods to minimize the prediction error on the training set. A predictor's weight reveals the effect of changing that predictor, while holding the others constant, on the outcome. In cases where one or more predictors are highly correlated, in a phenomenon known as collinearity, the relative values of their weights are less meaningful; steps must be taken to remove that collinearity, such as by excluding the nearly redundant variables from the model. Thus, when properly interpreted, the weights express the relative importance of the predictors. Less general formulations of the generalized linear model include linear regression, multiple regression, and multifactor logistic regression models, and are highly used in the medical community as clinical predictors.
In a preferred embodiment, a discrete time-proportional odds model, such as the Cox proportional hazards model, is used to determine the cumulative probability of ongoing pregnancy in a group of subjects. See e.g., Cox, David R (1972). “Regression Models and Life-Tables”. Journal of the Royal Statistical Society, Series B. 34 (2): 187-220, incorporated herein by reference. Proportional hazards models relate the time that passes before some event occurs to one or more covariates that may be associated with that quantity of time, wherein the unique effect of a unit increase in a covariate is multiplicative with respect to the hazard rate (e.g., odds of achieving ongoing pregnancy/live birth).
To further enhance the predictive power of the analysis, as noted above, genetic information from the subjects can be incorporated. One method for determining the effect that genetic information has on the cumulative odds of ongoing pregnancy includes the sequence kernel association testing (SKAT) method, which is a gene set level methodology for testing if SNP-sets (gene sets) are associated with phenotypes (continuous or discrete) of interest. See Wu M C, Lee S, Cai T, Li Y, Boehnke M, Lin X. Rare-Variant Association Testing for Sequencing Data with the Sequence Kernel Association Test. American Journal of Human Genetics. 2011;89(1):82-93. doi:10.1016/j.ajhg.2011.05.029, incorporated herein by reference. For additional description of the incorporation of genetic factors into a model for assessing the probability of achieving ongoing pregnancy and specifically regarding the use of SKAT in adjusting the model, see U.S. Provisional Application No. 62/408,632, filed Oct. 14, 2016, incorporated herein by reference. Furthermore, burden testing can be used to enhance the results of the SKAT analysis given that SKAT only provides a P-value for evidence of an association between the SNP-set and phenotype of interest. Adjustment of models using SKAT-type analysis, allows one to see whether there is statistical evidence that genomic information, at the category level (e.g. functional biological classification level), provides additional information beyond known clinical metrics that is sufficient to significantly affect the model, and therefore be associated with the odds of achieving ongoing pregnancy.
Once the model has been developed based on a reference set of data which includes BMI and for which results of fertility treatments with or without ovulation induction agents are known, the model can be applied to data obtained from a patient suspected of having impaired fertility in order to predict the potential for achieving ongoing pregnancy with respect to a specific fertility treatment. The data obtained from the patient will include a determination of the patient's BMI. In this way, the patient can receive the predicted probabilities for achieving ongoing pregnancy using any number of fertility treatments, such as intrauterine insemination (IUI), in addition to whether one or more ovulation induction agents will affect the probability of achieving ongoing pregnancy. This information will inform the course of treatment for the individual.
Methods for Recommending Treatment and/or Treating a Patient
In one embodiment, methods of treatment or targeting treatment upon assessment of the patient's potential for achieving ongoing pregnancy are provided. As provided above, a patient's probability for achieving ongoing pregnancy can be determined with respect to a specific fertility treatment. Thus, the recommended treatment protocol will depend, in part, on the probability generated in accordance with the description above.
Exemplary fertility treatments include, but are not limited to, assisted reproductive technologies (ART), non-ART fertility treatments (RE), and fertility preservation technologies (egg, embryo, or ovarian preservation). Exemplary assisted reproductive technologies include, without limitation, in vitro fertilization (IVF), zygote intrafallopian transfer (ZIFT), gametic intrafallopian transfer (GIFT), or intracytoplasmic sperm injection (ICSI) paired with one of the methods above.
In IVF, eggs are removed from the female subject, fertilized outside the body, and implanted inside the uterus of the female subject. ZIFT is similar to IVF in that eggs are removed and fertilization of the eggs occurs outside the body. In ZIFT, however, the eggs are implanted in the Fallopian tube rather than the uterus. GIFT involves transferring eggs and sperm into the female subject's Fallopian tube. Accordingly, fertilization occurs inside the woman's body. In ICSI, a single sperm is injected into a mature egg that has removed from the body. The embryo is then transferred to the uterus or Fallopian tube. In RE, hormone stimulation is used to improve the woman's fertility. Exemplary fertility preservation treatments include egg freezing in which eggs are removed, vitrified or otherwise frozen, and then stored indefinitely. Preservation can similarly be achieved through cryo-preservation of embryos generated through IVF and cryo-preservation of ovarian tissue, including slices of the ovarian cortex. Preservation could also involve removal of the ovary from the pelvic region and subcutaneous implantation in an ectopic location such as under the skin the in periphery of the body (i.e. arm).
Exemplary non-ART fertility treatments include ovulation induction protocols with or without IUI with sperm. Exemplary ovulation induction agents include gonadotropins such as LH, FSH, hMG, and hCG; and oral ovulation induction agents. Exemplary oral ovulation induction agents include, but are not limited to: clomiphene citrate; aromatase inhibitors, such as letrozole and anastrozole; insulin sensitizing drugs, such as metformin, rosiglitazone, and pioglitazone; bromocriptine; cabergoline; GnRH; and GnRH analogs, such as leuprolide acetate, nafarelin acetate, goserelin acetate, ganirelix, and cetrorelix acetate (the former three being agonists and the latter two being antagonists); and combinations thereof.
In one embodiment, the fertility treatment protocol involves the use of ovulation induction agents, the effect of which on the probability of achieving ongoing pregnancy is dependent, in part, on the patient's BMI. A patient's BMI can influence the effectiveness of certain fertility treatments. For instance, the use of gonadotropins alone as ovulation induction agents prior to intrauterine insemination (IUI), regardless of BMI, are associated with a higher change of achieving ongoing pregnancy. However, the use of gonadotropins alone is associated with an increased risk for multiples. Additionally, the divergence in success rate increases as BMI increases, with the use of gonadotropins alone being associated with a higher chance of achieving ongoing pregnancy than the use of gonadotropins with oral ovulation induction agents. Thus, the treatment protocol for a given patient will vary with respect to the patient's BMI and other competing factors, such as the risk for multiples.
In one embodiment, the BMI of a patient and its effect on the chances for achieving ongoing pregnancy with respect to certain fertility treatment protocols, including those that involve ovulation induction agents, is built into the assessment of a patient's probability for achieving ongoing pregnancy, such that the probability of achieving ongoing pregnancy in a patient can be provided for both an ovulation induction protocol involving the use of one or more gonadotropins only and an ovulation induction protocol involving the use of one or more gonadotropins in addition to the use of one or more oral ovulation induction agents. In this way, depending on which protocol provides the highest probability of achieving ongoing pregnancy and to what extent the probabilities diverge, a treatment protocol can be recommended. For example, for those individuals having a BMI near or under a threshold BMI, for which BMI does not play as much of a role in affecting a patient's probability of achieving ongoing pregnancy, the weighting of factors such as the risk for higher multiples will play a role in determining whether to add an oral ovulation induction agent to the ovulation induction protocol. Whereas, for those individuals having a higher BMI, such as a BMI near or above a threshold BMI which impacts the patient's probability of achieving ongoing pregnancy, the ovulation induction protocol will only include the use of gonadotropins.
In another embodiment, methods of the invention involve the determination of a patient's BMI and recommending/administering a certain ovulation induction protocol depending on the BMI. The methods can involve recommending and/or administering a gonadotropin alone if the patient's BMI is near or above a threshold limit. If the patient's BMI is near or below the threshold limit, a gonadotropin and optionally an oral ovulation induction agent is recommended and/or administered, with recommendation of the oral ovulation induction agent based on a weighting of other factors, such as the risk for multiples. The threshold BMI limit can be 18.5, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, or higher. In one embodiment, the threshold limit is 20. In another embodiment, the threshold limit is 25. In yet another embodiment, the threshold limit is 30.
In one embodiment, the treatment involves the use of IUI with an ovulation induction protocol, wherein the protocol involves the use of one or more gonadotropins, and optionally the use of one or more oral ovulation induction agents depending, in part, on the BMI of the patient.
Aspects of the invention described herein can be performed using any type of computing device, such as a computer, that includes a processor, e.g., a central processing unit, or any combination of computing devices where each device performs at least part of the process or method. In some embodiments, systems and methods described herein may be performed with a handheld device, e.g., a smart tablet, or a smart phone, or a specialty device produced for the system.
Methods of the invention can be performed using software, hardware, firmware, hardwiring, or combinations of any of these. Features implementing functions can also be physically located at various positions, including being distributed such that portions of functions are implemented at different physical locations (e.g., imaging apparatus in one room and host workstation in another, or in separate buildings, for example, with wireless or wired connections).
Processors suitable for the execution of computer program include, by way of example, both general and special purpose microprocessors, and any one or more processor of any kind of digital computer. Generally, a processor will receive instructions and data from a read-only memory or a random access memory or both. The essential elements of computer are a processor for executing instructions and one or more memory devices for storing instructions and data. Generally, a computer will also include, or be operatively coupled to receive data from or transfer data to, or both, one or more mass storage devices for storing data, e.g., magnetic, magneto-optical disks, or optical disks. Information carriers suitable for embodying computer program instructions and data include all forms of non-volatile memory, including by way of example semiconductor memory devices, (e.g., EPROM, EEPROM, solid state drive (SSD), and flash memory devices); magnetic disks, (e.g., internal hard disks or removable disks); magneto-optical disks; and optical disks (e.g., CD and DVD disks). The processor and the memory can be supplemented by, or incorporated in, special purpose logic circuitry.
To provide for interaction with a user, the subject matter described herein can be implemented on a computer having an I/O device, e.g., a CRT, LCD, LED, or projection device for displaying information to the user and an input or output device such as a keyboard and a pointing device, (e.g., a mouse or a trackball), by which the user can provide input to the computer. Other kinds of devices can be used to provide for interaction with a user as well. For example, feedback provided to the user can be any form of sensory feedback, (e.g., visual feedback, auditory feedback, or tactile feedback), and input from the user can be received in any form, including acoustic, speech, or tactile input.
The subject matter described herein can be implemented in a computing system that includes a back-end component (e.g., a data server), a middleware component (e.g., an application server), or a front-end component (e.g., a client computer having a graphical user interface or a web browser through which a user can interact with an implementation of the subject matter described herein), or any combination of such back-end, middleware, and front-end components. The components of the system can be interconnected through network by any form or medium of digital data communication, e.g., a communication network. For example, the reference set of data may be stored at a remote location and the computer communicates across a network to access the reference set to compare data derived from the female subject to the reference set. In other embodiments, however, the reference set is stored locally within the computer and the computer accesses the reference set within the CPU to compare subject data to the reference set. Examples of communication networks include cell network (e.g., 3G or 4G), a local area network (LAN), and a wide area network (WAN), e.g., the Internet.
The subject matter described herein can be implemented as one or more computer program products, such as one or more computer programs tangibly embodied in an information carrier (e.g., in a non-transitory computer-readable medium) for execution by, or to control the operation of, data processing apparatus (e.g., a programmable processor, a computer, or multiple computers). A computer program (also known as a program, software, software application, app, macro, or code) can be written in any form of programming language, including compiled or interpreted languages (e.g., C, C++, Perl), and it can be deployed in any form, including as a stand-alone program or as a module, component, subroutine, or other unit suitable for use in a computing environment. Systems and methods of the invention can include instructions written in any suitable programming language known in the art, including, without limitation, C, C++, Perl, Java, ActiveX, HTMLS, Visual Basic, or JavaScript.
A computer program does not necessarily correspond to a file. A program can be stored in a file or a portion of file that holds other programs or data, in a single file dedicated to the program in question, or in multiple coordinated files (e.g., files that store one or more modules, sub-programs, or portions of code). A computer program can be deployed to be executed on one computer or on multiple computers at one site or distributed across multiple sites and interconnected by a communication network.
A file can be a digital file, for example, stored on a hard drive, SSD, CD, or other tangible, non-transitory medium. A file can be sent from one device to another over a network (e.g., as packets being sent from a server to a client, for example, through a Network Interface Card, modem, wireless card, or similar).
Writing a file according to the invention involves transforming a tangible, non-transitory computer-readable medium, for example, by adding, removing, or rearranging particles (e.g., with a net charge or dipole moment into patterns of magnetization by read/write heads), the patterns then representing new collocations of information about objective physical phenomena desired by, and useful to, the user. In some embodiments, writing involves a physical transformation of material in tangible, non-transitory computer readable media (e.g., with certain optical properties so that optical read/write devices can then read the new and useful collocation of information, e.g., burning a CD-ROM). In some embodiments, writing a file includes transforming a physical flash memory apparatus such as NAND flash memory device and storing information by transforming physical elements in an array of memory cells made from floating-gate transistors. Methods of writing a file are well-known in the art and, for example, can be invoked manually or automatically by a program or by a save command from software or a write command from a programming language.
Suitable computing devices typically include mass memory, at least one graphical user interface, at least one display device, and typically include communication between devices. The mass memory illustrates a type of computer-readable media, namely computer storage media. Computer storage media may include volatile, nonvolatile, removable, and non-removable media implemented in any method or technology for storage of information, such as computer readable instructions, data structures, program modules, or other data. Examples of computer storage media include RAM, ROM, EEPROM, flash memory, or other memory technology, CD-ROM, digital versatile disks (DVD) or other optical storage, magnetic cassettes, magnetic tape, magnetic disk storage or other magnetic storage devices, radiofrequency identification tags or chips, or any other medium which can be used to store the desired information and which can be accessed by a computing device.
As one skilled in the art would recognize as necessary or best-suited for performance of the methods of the invention, a computer system or machines of the invention include one or more processors (e.g., a central processing unit (CPU) a graphics processing unit (GPU) or both), a main memory and a static memory, which communicate with each other via a bus.
In an exemplary embodiment shown in
Memory according to the invention can include a machine-readable medium on which is stored one or more sets of instructions (e.g., software) embodying any one or more of the methodologies or functions described herein. The software may also reside, completely or at least partially, within the main memory and/or within the processor during execution thereof by the computer system, the main memory and the processor also constituting machine-readable media. The software may further be transmitted or received over a network via the network interface device.
Other embodiments are within the scope and spirit of the invention. For example, due to the nature of software, functions described above can be implemented using software, hardware, firmware, hardwiring, or combinations of any of these. Features implementing functions can also be physically located at various positions, including being distributed such that portions of functions are implemented at different physical locations.
In this example, a multi-center retrospective analysis was performed and the effect of BMI on fertility treatment outcomes was examined.
Between 2009 to 2015, 249,436 treatment cycles were analyzed from 113,809 patients undergoing timed intercourse (TI) with oral fertility medications (“orals”), IUI with/without orals and/or gonadotropin stimulation (Gnd), and IVF (excluding canceled cycles) were studied. Cox proportional hazards models with time-dependent covariates to model number of treatment cycles to ongoing pregnancy was utilized. Then regression was used to compare BAFC and
AMH across BMIs. We adjusted our models for age, BAFC, medication type, sperm motility, clinic, and diagnoses.
It was observed that women <18.5 BMI had reduced BAFC (P=0.02), but similar AMH levels to women with normal BMI (P=0.12). These results suggest that BMI was correlated with the probability of achieving ongoing pregnancy in a protocol-dependent fashion (
These results suggest that normal BMI do not translate to optimal BMI for reproductive outcomes and success with fertility treatments.
In this example, the influence of obesity and ovulation induction agents, specifically the effectiveness of ovulation induction with gonadotropins alone and with gonadotropins plus oral ovulation induction agents in relation to BMI was investigated.
We analyzed 67,662 treatment cycles from 33,867 patients undergoing IUI ovulation induction (OI) with gonadotropins alone or with oral agents and gonadotropin from 2009 to 2016.
In this study, discrete-time proportional-odds models with time-dependent covariates to model the odds of achieving ongoing pregnancy during a given cycle were used. Interaction terms between BMI and stimulation type were included in the models to test the hypothesis that the effect of BMI was modified by stimulation type. BMI was treated continuously and categorized according to NIH guidelines (<18.5, 18.5-25, 25-30, 30-35, >35). Models were adjusted for age, BAFC, medication type, sperm motility, clinic, and diagnoses and were weighted towards more recent data.
Overall, OI with only gonadotropin agents had significantly higher odds of achieving ongoing pregnancy compared to OI with oral agents and gonadotropins (odds ratio (OR)=1.16, p<0.05). However, we found that the impact of BMI on success rate differed significantly with the type of IUI treatment protocol. As shown in
In sum, oral OI agents when used in conjunction with gonadotropins appear to be less effective for obese women. Although it had been previously demonstrated that oral agents alone result in lower rates of pregnancy with timed intercourse and IUI in obese women, the detrimental effect of increased BMI was not observed with gonadotropin alone. Surprisingly, we demonstrate that OI with only gonadotropin agents is associated with higher success rates than OI with oral agents and gonadotropins. This difference is most pronounced in higher BMI patients. Additionally, the study demonstrates that gonadotropin OI alone is related to a higher probability of conceiving multiples than oral and gonadotropin independent of BMI. These data suggest that the best IUI protocol for a given patient varies with respect to their BMI. As such, a personalized medicine approach including BMI should be taken into account when selecting an IUI treatment protocol.
In this example, we performed a multi-center retrospective analysis and examined the effect of BMI on IVF cycle outcomes (cycle cancellation, oocytes retrieved, usable embryos, ongoing pregnancy) while controlling for confounding factors.
We performed a retrospective review of 51,198 women who initiated their first autologous IVF or ICSI cycles at 13 fertility treatment centers in the United States between 2009-2015. We excluded frozen transfers of supernumerary embryos, cycles using preimplantation genetic screening (PGS), and cycles that were missing BMI data. We used BMI categories defined by World Health Organization (WHO): <18.50 kg/m2 (underweight), 18.50-24.99 kg/m2 (normal), 25.00-29.99 kg/m2 (overweight), 30.00-34.99 kg/m2 (obese class I), 35.00-39.99 kg/m2 (obese class II), and ≥40.00 kg/m2 (obese class III). Obese class II and class III were combined in our study because of limited sample size in obese class III group. The least absolute shrinkage and selection operator (LASSO) regression was used to select confounders that were statistically significantly (p<0.05) associated with each outcome measure. Logistic or Poisson regression was used to calculate the multivariate adjusted odds ratio (aOR; for cycle cancellation and ongoing clinical pregnancy) or adjusted incidence rate ratio (aIRR; for number of oocytes retrieved and number of usable embryos), respectively, with their 95% confidence interval (CI), with normal BMI as the reference category. Sensitivity analyses for number of usable embryos and ongoing clinical pregnancy were performed among cycles that transferred only blastocyst stage embryos.
We found BMI above the normal range was associated with worse IVF outcomes, including higher odds of cycle cancellation (cancelation of stimulation cycles that do not produce an adequate number of follicles to go through egg retrieval procedure), lower oocyte and embryo counts, and lower odds of an ongoing clinical pregnancy.
For cycle cancellation, factors that were controlled for in the analysis included: female age, gravidity, BAFC, basal FSH and E2 levels, AMH, total gonadotropin used, infertility diagnosis, and clinic. After adjusting for these confounders, odds of cycle cancellation were comparable between underweight and normal weight patients (p=0.23). For patients with BMI≥18.5 kg/m2, odds of cycle cancellation increased with rising BMI (overweight vs. normal weight aOR 1.17, 95% CI 1.08-1.26, p<0.001; obese class I vs. normal weight aOR 1.28, 95% CI 1.15-1.41, p<0.001), with obese class II/III patients having the highest aOR for cycle cancellation when compared with normal weight patients (aOR 1.50, 95% CI 1.33-1.68, p<0.001) (Table 5).
For number of oocytes retrieved, factors that were controlled for in the analysis included: female age, parity, BAFC, basal FSH, LH, and E2 levels, AMH, total gonadotropin used, infertility diagnosis, and clinic. After adjusting for these confounders, among 45,950 patients that reached oocytes retrieval, obese patients had fewer oocytes retrieved compared to normal weight patients (Table 5). When comparing number of oocytes retrieved between obese class I and normal weight patients, aIRR was 0.98 (95% CI 0.98-0.99, p<0.001), while aIRR comparing obese class II/III with normal weight, was 0.93 (95% CI 0.92-0.94, p<0.001). Underweight and overweight patients had similar number of oocytes retrieved compared to normal weight patients (p=0.11 and p=0.19 respectively).
For number of usable embryos, factors that were controlled for in the analysis included: female age, BAFC, basal FSH, LH, and E2 levels, AMH, gravidity, parity, ICSI, number of oocytes retrieved, embryo stage at the end of culture, total gonadotropin used, infertility diagnosis, and clinic. After adjusting for these confounders, among 45,767 patients with one or more oocytes retrieved, underweight, overweight, and obese patients had fewer usable embryos that were transferred or cryopreserved (underweight vs. normal weight aIRR 0.95, 95% CI 0.92-0.97, p<0.001; overweight vs. normal weight aIRR 0.98, 95% CI 0.97-0.99, p=0.006; obese class I vs. normal weight aIRR 0.97, 95% CI 0.96-0.99, p<0.001) (Table 5). Obese class II/III had the largest impact on number of usable embryos (aIRR=0.95, 95% CI 0.93-0.97, p<0.001). When only blastocyst cycles were included, the difference in number of usable blastocysts was not statistically significant between overweight vs. normal weight patients, whereas the underweight and obese patients had fewer usable blastocysts than normal weight patients (Table 6).
For ongoing clinical pregnancy, factors that were controlled for in the analysis included: female age, basal E2 levels, parity, infertility diagnosis, total gonadotropin used, number of oocytes retrieved, embryo stage at transfer, number of usable embryos, number of embryos transferred, and clinic. After adjusting for these confounders, among 39,055 patients with fresh embryo transfer, obese patients had lower odds of ongoing clinical pregnancy compared to normal weight patients (Table 5). When odds of ongoing clinical pregnancy were compared between obese class I and normal weight patients, aOR was 0.89 (95% CI 0.83-0.95, p<0.001), while aOR comparing obese class II/III and normal weight patients was 0.86 (95% CI 0.79-0.93, p<0.001). Underweight and overweight patients did not have significantly lower odds of ongoing clinical pregnancy compared to normal weight patients (p=0.21 and p=0.09 respectively). When only blastocyst transfers were included, results remained consistent, with obese patients having lower odds of ongoing clinical pregnancy (Table 6).
In this example, a multi-center retrospective analysis and examined the effect of BMI on IVF cycle outcomes (cycle cancellation, oocytes retrieved, usable embryos, ongoing pregnancy) for 8 patient subpopulations with different primary infertility diagnosis, while controlling for confounding factors was performed.
A retrospective review of 51,198 women who initiated their first autologous IVF or ICSI cycles at 13 fertility treatment centers in the United States between 2009-2015 was performed. Frozen transfers of supernumerary embryos, cycles using PGS, and cycles that were missing BMI data were excluded. BMI categories defined by WHO: <18.50 kg/m2 (underweight), 18.50-24.99 kg/m2 (normal), 25.00-29.99 kg/m2 (overweight), 30.00-34.99 kg/m2 (obese class I), 35.00-39.99 kg/m2(obese class II), and ≥40.00 kg/m2 (obese class III) were used. Obese class II and class III were combined in our study because of limited sample size in obese class III group. The least absolute shrinkage and selection operator (LASSO) regression was used to selected confounders that were statistically significantly (p<0.05) associated with each outcome measure. For 8 patient subpopulations with different primary infertility diagnoses reported by the clinics (diminished ovarian reserve (DOR), endometriosis, idiopathic, male factor, ovulatory dysfunction, PCOS, tubal factor, and uterine factor), logistic or Poisson regression was used to calculate the multivariate aOR for cycle cancellation and ongoing clinical pregnancy or aIRR for number of oocytes retrieved and number of usable embryos), respectively, with their 95% CI, with normal BMI as the reference category.
We examined the effect of BMI on IVF cycle outcomes for patient subpopulations with different primary infertility diagnoses, and found that the effect of BMI on IVF cycle outcomes was more pronounced for PCOS, ovulatory dysfunction, male factor, and uterine factor patients (Tables 7-10).
In cycles with primary diagnosis listed as “PCOS”, odds of cycle cancellation were higher in obese patients compared to normal weight patients (obese class I vs. normal weight: aOR 2.59, 95% CI 1.40-4.78, p=0.002; obese class II/III vs. normal weight: aOR 2.56, 95% CI 1.35-4.84, p=0.004) (
In cycles with primary diagnosis listed as “Ovulatory dysfunction”, odds of cycle cancellation were higher in obese class II/III patients compared to normal weight patients (aOR 1.58, 95% CI 1.10-2.26, p=0.01) (
In cycles with primary diagnosis listed as “Male factor”, odds of cycle cancellation were higher in obese patients compared to normal weight patients (obese class I vs. normal weight: aOR 1.57, 95% CI 1.22-2.02, p<0.001; obese class II/III vs. normal weight: aOR 1.80, 95% CI 1.34-2.42, p<0.001) (
In cycles with primary diagnosis listed as “Uterine factor”, odds of cycle cancellation were higher in overweight and obese patients compared to normal weight patients (overweight vs. normal weight: aOR 1.33, 95% CI 1.09-1.61, p=0.004; obese class I vs. normal weight: aOR 1.35, 95% CI 1.03-1.77, p=0.03; obese class II/III vs. normal weight: aOR 1.90, 95% CI 1.37-2.63, p<0.001) (
References and citations to other documents, such as patents, patent applications, patent publications, journals, books, papers, web contents, have been made throughout this disclosure. All such documents are hereby incorporated herein by reference in their entirety for all purposes.
The invention may be embodied in other specific forms without departing from the spirit or essential characteristics thereof. The foregoing embodiments are therefore to be considered in all respects illustrative rather than limiting on the invention described herein. Scope of the invention is thus indicated by the appended claims rather than by the foregoing description, and all changes which come within the meaning and range of equivalency of the claims are therefore
This application claims the benefit of and priority to U.S. Provisional Application Ser. No. 62/460,415, filed Feb. 17, 2017, the entire contents of which are incorporated herein by reference.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/US2018/017402 | 2/8/2018 | WO | 00 |
Number | Date | Country | |
---|---|---|---|
62460415 | Feb 2017 | US |