OPTIMIZED ULTRA-LOW VOLUME LIQUID BIOPSY METHODS, SYSTEMS, AND DEVICES

Information

  • Patent Application
  • 20220162591
  • Publication Number
    20220162591
  • Date Filed
    March 25, 2020
    4 years ago
  • Date Published
    May 26, 2022
    2 years ago
Abstract
Provided herein are devices, systems, kits and methods for obtaining genetic information from cell-free fetal nucleic acids in ultra-low amounts of biological samples. Due to the convenience of obtaining ultra-low amounts of samples, devices, systems, kits and methods can be at least partially employed at a point of need.
Description
BACKGROUND

Genetic testing is a means for obtaining information about a subject's DNA and/or expression of that DNA. Genetic tests are continually being developed to obtain biological information about a subject. This biological information has many uses, including determining a health status of an individual, diagnosing an individual with an infection or disease, determining a suitable treatment for the individual, solving a crime and identifying paternity. Currently, genetic testing is mainly performed in clinics and laboratories by trained personnel with expensive and bulky equipment that requires technical training and expertise to use. It typically takes days to weeks, from the time a biological sample is obtained from a patient, to provide the patient with results of a genetic test.


As an example, many people who become aware of a pregnancy are eager to know the sex (hereby referred to as gender throughout this application) of the baby as soon as possible. There are tests that allow for obtaining gender information from DNA in maternal blood. Blood obtained from the mother must be analyzed with sophisticated equipment by a highly-trained technician. If the blood is obtained at a site distant from the laboratory where DNA analysis is performed, the sample must be stored, shipped, and analyzed in a timely fashion, or otherwise risk sample degradation.


Cell-free nucleic acids originate from various tissue types and are released into the circulation of an individual. The pool of cell-free nucleic acids in circulation often represents the genetic makeup of contributing tissue types. In the case of a healthy individual, it can be a very homogenous pool without much variation. However, when a tissue contains a noticeably different genome, a more heterogeneous cell-free nucleic acid pool can be observed. Common examples of subjects having tissues with noticeably different genomes include, but are not limited to: (a) cancer patients, where the tumor DNA contains mutated sites (b) transplant patients, where the transplanted organ releases donor DNA into the pool of cell-free DNA and (c) pregnant women, where the placenta contributes cell-free DNA that is largely representative of the fetal DNA. In some instances, a genome may be noticeably different due to epigenetic modifications. DNA from different tissues, organs and cell types has been shown to have distinct epigenetic patterns. Thus, it may be possible to detect cell-free nucleic acids from tissues, organs, and cells including, but not limited to, brain, liver, adipose, pancreas, endothelium, and immune cells. In addition, when a tissue or cell type of an individual is affected by a disease or infection, there may be more cell-free DNA from that tissue or cell-type circulating in that individual.


SUMMARY

Disclosed herein are devices, systems, kits and methods for analyzing components (e.g., nucleic acids, proteins) of a biological sample, including a sample from an animal (human or non-human). In general, devices, systems, kits and methods disclosed herein are capable of providing genetic information from an ultra-low volume of a sample by taking advantage of cell-free DNA fragmentation. For brevity, this may be referred to as “ultra-low volume liquid biopsy.” Prior to the instant disclosure, it was not expected that one could obtain reliable and useful genetic information from ultra-low volumes of samples because it was not believed that ultra-low volumes would provide a sufficient amount of cell-free nucleic acids from a particular tissue of interest (e.g., brain, liver, placenta, tumor) to be detectable or informative. Moreover, with an abundance of background signal from other cell-free nucleic acids, particularly those from blood cells, and the variation of that background from subject to subject, reproducibility and reliable comparisons between test subjects and control subject seemed nearly impossible. Prior to the instant disclosure it was also not expected that the relative amount and size distribution profile of DNA extracted from ultra-low volumes of sample can differ significantly from what has been previously described.


In contrast to cellular DNA, cell-free DNA is fragmented. In order to analyze cell-free DNA from ultra-low volumes of sample, methods, devices, systems and kits disclosed herein utilize cell-free DNA fragments from repetitive regions (e.g., regions with a common sequence) and/or multiple regions as statistically independent markers. Methods, devices, systems and kits disclosed herein are possible because cell-free DNA fragments from repetitive regions (e.g., regions of the genome containing multiple copies of the same or similar sequence), or many regions collectively, are present at a higher effective concentration in a sample than non-fragmented DNA sequences would be. Thus, sample volumes that contain a number of analytes sufficient to obtain useful genetic information are lower than previously thought. Advantageously, fragments from repetitive regions may be amplified with a single pair of primers or detected with a single probe. Alternatively or additionally, multiple detection regions that do not share similar sequences may be detected in small volumes, e.g., by tagging and amplifying them with a universal primer or amplifying with multiple primer pairs (e.g. in a multiplexed format).


Due to their ability to obtain useful genetic information from ultra-low volumes of biological samples, the devices, systems, kits and methods offer the advantages of being (1) minimally invasive, (2) applicable in home with little or no technical training (e.g., do not require complex equipment); and (3) informative at early stages of a condition (e.g., pregnancy, infection). These advantages reduce or negate the requirement for a laboratory or technician, thereby improving patient accessibility, compliance, and monitoring. This ultimately leads to improved health outcomes at lower cost to the healthcare system.


Analysis of cell-free circulating nucleic acids is met with a number of technical challenges. For instance, amplification of circulating nucleic acids in blood may be inhibited by some of the components in whole blood (e.g., hemoglobin). One of the ways the instant methods, systems and devices solve this technical challenge, is by obtaining plasma (containing cell-free nucleic acids) from capillary blood in a manner that avoids damage to the sample or contamination of the sample, either from components in whole blood or surrounding tissue (for e.g., transdermal puncture of the skin causing DNA in skin to contaminate sample of whole blood obtained).


Analysis of cell-free circulating nucleic acids in small sample volumes is particularly challenging. Despite past attempts to accomplish this goal, and for the reasons described herein, the field has remained skeptical that useful and accurate genetic information can be obtained from cell-free DNA in small sample volumes that can be collected at a point of need (e.g., capillary blood from a finger prick). The methods, systems and devices disclosed herein not only overcome these technical challenges, but can be used at a point of need, something that was practically inconceivable given the state of the art.


For example, past attempts to analyze circulating cell-free tumor DNA in five or more milliliters of blood were only informative when the sample had a relatively high tumor burden (e.g., 15-20%) and a fraction of the genome altered (FGA) of 15%, or more, which make the alterations easier to detect. However, in a majority of patients, the tumor burden is much lower. Thus, past attempts exclude a significant portion of the patient population. The methods, systems, devices and kits disclosed herein enable detection of cell free nucleic acids from a tumor in a sample with a lower tumor burden (e.g., below 15%).


In further attempts, analyzing smaller amounts of biological sample were unsuccessful due to white blood cell contamination or nucleic acid damage. Disclosed herein, in some embodiments, is an example of the implications of DNA damage or contamination in the context of measuring the fetal component of cell-free nucleic acids in a sample obtained from the mother. As shown herein (see Example 19), DNA damage and/or contamination at the transdermal puncture site (e.g. finger prick) results in a presence of nucleic acids of fragment lengths in the sample, which in some cases, is mistakenly assumed to be cell-free nucleic acid fragments. In fact, as shown herein, the overrepresentation of shorter fragment lengths was derived from DNA from the surrounding skin, and DNA damage caused by the lancet, caused by the transdermal puncture to obtain the sample. DNA damage and contamination described herein, for the first time, impose a major challenge when a small amount of sample is collected. For example, in a 20 microliter blood draw, the fetal fraction would drop from 10% to about 5% in accordance with these findings. The methods, systems, devices, and kits disclosed herein provide solutions to the contamination and DNA damage introduced by a transdermal puncture, including: (1) discarding the first drop of blood and obtaining for analysis a subsequent drop of blood; (2) capture methods that select against longer DNA fragments; (3) electrophoretic methods; (4) selection of library products by size; (5) or using bioinformatic methods to account/remove or differentially analyze based on size information.


In addition, obtaining five or more milliliters of blood from an individual requires a laboratory technician, which increases the cost of the genetic analysis and inconvenience to the patient (e.g., inconvenience caused by the time, discomfort, and expense of the genetic analysis). The present methods, devices, and systems are configured to provide useful and accurate genetic information by analyzing a biological sample, such as capillary blood, in amounts much lower than five milliliters that can be collected at a point of need (e.g., capillary blood from a finger prick).


Even if past attempts analyzed smaller sample volumes, they produced artificial results. For example, some past attempts of analyzing smaller amounts of sample dilute genomic DNA from cell lines and shear the genomic DNA to produce and detect cell free (cfDNA) surrogates. Down-sampling or dilutions of cell line DNA/sheared DNA and in silico methods produce artificial results because they are not reflective of size and length distributions and bin information in individual samples with low input number of molecules. In another example, past attempts to analyze smaller amounts of a biological sample produce artificial results because they rely on detecting predetermined mutations, which can also be referred to as “known events.” The instant disclosure presents methods, systems and devices for obtaining plasma (containing cell-free nucleic acids) from a small amount of capillary blood (e.g., finger prick) in a manner that provides accurate and non-predetermined genetic information from non-surrogate cfDNA.


The past attempts described herein would have to use a combination of low pass/low coverage whole genome sequencing in an initial detection step, and thereafter perform additional analysis in further detail to perform a genetic analysis accurately. Low pass/low coverage whole genome sequences is not optimal for detecting unknown events with high sensitivity, and likely will require more detailed follow up assays. Use of multiple assays to provide a genetic analysis is costly, inefficient, and is not a fungible solution at the point of need. By contrast, the present methods, devices, and systems solve the above problems by providing a way to obtain an accurate genetic analysis from ultra-low sample volumes by using multiple fragments of cell-free DNA that collectively are present at a high concentration that is detectable even in small samples.


Devices, systems, kits and methods disclosed herein are summarized as follows.


Aspects disclosed herein provide methods comprising: (a) obtaining a biological sample from a subject, wherein the biological sample comprises a cell-free deoxyribonucleic acid (cfDNA), and wherein the biological sample has a volume of at most 120 microliters (μl) when it is obtained from the subject; (b) tagging at least a portion of the cfDNA to produce tagged cfDNA by: (i) generating ligation-competent cfDNA by one or more steps comprising: (1) generating a blunt end of the cfDNA, wherein a 5′ overhang or a 3′ recessed end is removed using one or more polymerases and one or more exonucleases; (2) dephosphorylating the blunt end of the cfDNA; (3) contacting the cfDNA with a crowding reagent thereby enhancing a reaction between the one or more polymerases, one or more exonucleases, and the cfDNA; or (4) repairing or remove DNA damage in the cfDNA using a ligase; and (ii) ligating the ligation competent cfDNA to adaptor oligonucleotides by contacting the ligation competent cfDNA to the adaptor oligonucleotides in the presence of the ligase and one or more of a crowding reagent and a small molecule enhancer; and (c) optionally, amplifying the tagged cfDNA; and (d) sequencing at least a portion of the tagged cfDNA. In some embodiments, the volume is at most 100 microliters, when it is obtained from the subject. In some embodiments, the volume is at most 55 microliters when it is obtained from the subject. In some embodiments, the volume is at most 50 microliters when it is obtained from the subject. In some embodiments, the volume is at most 40 microliters when it is obtained from the subject. In some embodiments, the volume is at between about 10 microliters and about 40 microliters when it is obtained from the subject. In some embodiments, the biological sample obtained from the subject is capillary blood. In some embodiments, the biological sample is not a plasma sample from blood. In some embodiments, the biological sample contains about 25 picograms (pg) to about 250 pg of total circulating cfDNA molecules. In some embodiments, the biological sample contains about 104 to about 109 cfDNA molecules. In some embodiments, the biological sample contains about 104 to about 107 cfDNA molecules. In some embodiments, the cfDNA in the biological sample is about 10 genome equivalents. In some embodiments, the cfDNA in the biological sample is at most 10 genome equivalents. In some embodiments, the cfDNA in the biological sample is between 5-6, 6-7, 7-8, 8-9, 9-10, 10-11, 11-12, 12-13, 13-14, or 14-15 genome equivalents. In some embodiments, the biological sample was obtained from the subject by a process of: (a) inducing a first transdermal puncture to produce a first fraction of a biological sample; (b) discarding the first fraction of the biological sample; and (c) collecting a second fraction of the biological sample, thereby reducing or eliminating contamination of the biological sample due to white blood cell lysis. In some embodiments, methods further comprise detecting a normal representation, an overrepresentation or an underrepresentation of at least one target sequence in the at least a portion of the tagged cfDNA. In some embodiments, the subject is pregnant with a fetus. In some embodiments, a component of the cfDNA is a fetal cfDNA component from the fetus. In some embodiments, methods comprise analyzing genotype information from an individual and the fetal cfDNA component to determine whether the individual paternally contributed to the fetus by identifying a genotypic match between the fetal cfDNA component and the genotype information. In some embodiments, methods further comprise amplifying in (c), and wherein generating the ligation-competent cfDNA comprises: (a) generating the blunt end of the cfDNA, wherein a 5′ overhang or a 3′ recessed end is removed using one or more polymerases and one or more exonucleases; (b) dephosphorylating the blunt end of the cfDNA; (c) contacting the cfDNA with a crowding reagent thereby enhancing a reaction between the one or more polymerases, one or more exonucleases, and the cfDNA; and (d) repairing or remove DNA damage in the cfDNA using the ligase. In some embodiments, the cfDNA is selected from a tumor, transplanted tissue or organ, or one or more pathogens, in the subject. In some embodiments, the one or more pathogens comprises a bacterium or component thereof. In some embodiments, the one or more pathogens comprises a virus or a component thereof. In some embodiments, the one or more pathogens comprises a fungus or a component thereof. In some embodiments, methods comprise amplifying by massively multiplexed amplification. In some embodiments, the massively multiplex amplification assay is isothermal amplification. In some embodiments, the massively multiplex amplification assay is massively multiplexed polymerase chain reaction (mmPCR). In some embodiments, methods further comprise pooling two or more biological samples, each sample obtained from a different subject. In some embodiments, methods further comprise contacting the biological sample with a white blood cell stabilizer after obtaining the biological sample from the subject. In some embodiments, the biological sample obtained from the subject was collected using a device configured to lyse intercellular junctions of an epidermis of the subject. In some embodiments, the tagging produces a library of tagged cfDNA with an efficiency of at least 0.5, when the library is prepared by: (a) performing end-repair, 5′ phosphorylation and A-tailing with incubation at 20 degrees Celsius for 30 minutes followed by 65 degrees Celsius for 30 minutes; (b) ligating the cfDNA to adaptor oligonucleotides with incubation at 20 degrees Celsius for 15 minutes; (c) cleaving a ligated adaptor loop from the adaptor oligonucleotides with incubation at 37 degrees Celsius for 15 minutes, to produce ligation-competent cfDNA; (d) amplifying the ligation-competent cfDNA by: (i) denaturing the ligation-competent cfDNA at 98 degrees Celsius for 1 minute, followed by 13 cycles at 98 degrees Celsius for 10 seconds; (ii) annealing the denatured ligation-competent cfDNA to one or more complementary primers from (i) at 65 degrees Celsius for 75 seconds; and (iii) extending the ligation-competent cfDNA at 65 degrees Celsius for 5 minutes, to produce an amplified library of ligation-competent cfDNA; and (iv) purifying the amplified library of ligation-competent cfDNA using SPRI beads. In some embodiments, the tagging produces the library of tagged cfDNA with an efficiency of at least 0.55, 0.60, 0.65, 0.70, 0.75, 0.80, 0.85, 0.90, 0.95, 0.96, 0.97, 0.98, 0.99, or 1.00.


Aspects disclosed herein provide methods comprising: (a) obtaining a biological sample from a pregnant subject with a fetus, wherein the biological sample comprises a cell-free deoxyribonucleic acid (cfDNA), and wherein the biological sample has a volume that is not greater than about 120 microliters when obtained from the subject; (b) contacting at least one cfDNA in the biological sample with an amplification reagent and a polynucleotide primer that anneals to a sequence corresponding to a sequence of interest to produce an amplification product; and (c) detecting a presence or an absence of the amplification product. In some embodiments, methods further comprise annealing a oligonucleotide probe with a detectable label to the at least one cfDNA. In some embodiments, methods further comprise detecting epigenetic modification of the cfDNA. In some embodiments, the epigenetic modification comprises methylation at a genetic locus of the cfDNA. In some embodiments, detecting a presence of the amplification product indicates a gender of the fetus. In some embodiments, a component of the cfDNA is from the fetus. In some embodiments, methods comprise contacting the biological sample with a white blood cell stabilizer following obtaining the biological sample from the subject. In some embodiments, the volume is at most 100 microliters, when it is obtained from the subject. In some embodiments, the volume is at most 55 microliters when it is obtained from the subject. In some embodiments, the volume is at most 50 microliters when it is obtained from the subject. In some embodiments, the volume is at most 40 microliters when it is obtained from the subject. In some embodiments, the volume is at between about 10 microliters and about 40 microliters when it is obtained from the subject. In some embodiments, the biological sample obtained from the subject is capillary blood. In some embodiments, the biological sample is not a plasma sample from blood. In some embodiments, the biological sample contains about 25 picograms (pg) to about 250 pg of total circulating cfDNA molecules. In some embodiments, the biological sample contains about 104 to about 109 cfDNA molecules. In some embodiments, the biological sample contains about 104 to about 107 cfDNA molecules. In some embodiments, the cfDNA in the biological sample is about 10 genome equivalents. In some embodiments, the cfDNA in the biological sample is at most 10 genome equivalents. In some embodiments, the cfDNA in the biological sample is between 5-6, 6-7, 7-8, 8-9, 9-10, 10-11, 11-12, 12-13, 13-14, or 14-15 genome equivalents. In some embodiments, the biological sample was collected by a process of: (a) inducing a first transdermal puncture to produce a first fraction of a biological sample; (b) discarding the first fraction of the biological sample; and (c) collecting a second fraction of the biological sample, thereby reducing or eliminating contamination of the biological sample due to white blood cell lysis.


Aspects disclosed herein provide methods of increasing a relative amount of a target nucleic acid in a biological sample obtained from a subject comprising: (a) inducing a transdermal puncture at a site of the subject to produce a first fraction and a second fraction of a biological sample; (b) discarding the first fraction of the biological sample; and (c) collecting the second fraction of the biological sample, thereby reducing or eliminating contamination or nucleic acid damage of the biological sample, wherein the first fraction comprises a lower fraction of a target nucleic acid, as compared to a fraction of the target nucleic acid in the second fraction. In some embodiments, methods further comprise cleaning the site before inducing the transdermal puncture, thereby removing or reducing unwanted contaminant. In some embodiments, the unwanted contaminant comprises DNA from the transdermal puncture site. In some embodiments, the nucleic acid damage comprises damage to non-apoptotic DNA in the biological sample. In some embodiments, the biological sample is capillary blood. In some embodiments, methods further comprise detecting the target nucleic acid in the second fraction of the biological sample using an assay selected from massively multiplexed polymerase chain reaction (mmPCR) or nucleic acid sequencing. In some embodiments, the first fraction or the second fraction, or a combination thereof, has a volume of at most 300 microliters when obtained from the subject. In some embodiments, the volume is at most 100 microliters, when obtained from the subject. In some embodiments, the volume is at most 55 microliters when obtained from the subject. In some embodiments, the volume is at most 50 microliters when obtained from the subject. In some embodiments, the volume is at most 40 microliters when obtained from the subject. In some embodiments, the volume is at between about 10 microliters and about 40 microliters when obtained from the subject. In some embodiments, the biological sample obtained from the subject is capillary blood. In some embodiments, the biological sample is not a plasma sample from blood. In some embodiments, the target nucleic acid is a circulating cell-free nucleic acid molecule. In some embodiments, the biological sample contains about 25 picograms (pg) to about 250 pg of total circulating cell-free nucleic acid molecules. In some embodiments, the biological sample contains about 104 to about 109 cell-free nucleic acid molecules. In some embodiments, the biological sample contains about 104 to about 107 cell-free nucleic acid molecules. In some embodiments, the cell-free nucleic acids in the biological sample is about 10 genome equivalents. In some embodiments, the cell-free nucleic acids in the biological sample is at most 10 genome equivalents. In some embodiments, the cell-free nucleic acids in the biological sample is between 5-6, 6-7, 7-8, 8-9, 9-10, 10-11, 11-12, 12-13, 13-14, or 14-15 genome equivalents. In some embodiments, the cell-free nucleic acid molecules are cell-free DNA molecules.


Aspects disclosed herein provide devices comprising: (a) a sample collector for obtaining from a subject a biological sample comprising a volume of at most 120 microliters, wherein the biological sample comprises a target cell-free DNA (cfDNA); (b) a sample purifier for removing a cell from the biological sample to produce a cell-depleted sample; and (c) a nucleic acid detector configured to detect the target cfDNA in the cell-depleted sample. In some embodiments, devices further comprise a nucleic acid ligator comprising: (a) an ligation formulation for producing ligation-competent target cfDNA, the ligation formulation comprising one or more of: (i) one or more exonucleases adapted to generate a blunt end of the target cfDNA and remove a 5′ overhang or a 3′ recessed end of the blunt end of the target cfDNA; (ii) a blunt end cfDNA dephosphorylating agent; (iii) a crowding reagent; (iv) a DNA damage repair agent; or (v) a DNA ligase; and (b) one or more adaptor oligonucleotides ligated to the ligation-competent target cfDNA. In some embodiments, devices further comprise a white blood cell stabilizer. In some embodiments, the nucleic acid detector is a massively multiplexed PCR device (mmPCR). In some embodiments, the ligation formulation comprises: (a) the one or more exonucleases adapted to generate a blunt end of the target cfDNA and remove a 5′ overhang or a 3′ recessed end of the blunt end of the target cfDNA; (i) the blunt end cfDNA dephosphorylating agent; (ii) the DNA damage repair agent; and (iii) the DNA ligase. In some embodiments, the nucleic acid detector comprises a nucleic acid sequencer or lateral flow strip. In some embodiments, the nucleic acid sequencer comprises a signal detector. In some embodiments, the sample purifier comprises a filter, and wherein the filter has a pore size of about 0.05 microns to about 2 microns. In some embodiments, the filter is a vertical filter. In some embodiments, the sample purifier comprises a binding moiety selected from an antibody, antigen binding antibody fragment, a ligand, a receptor, a peptide, a small molecule, and a combination thereof. In some embodiments, the sample collector is configured to lyse intercellular junctions of an epidermis of the subject to obtain the biological sample. In some embodiments, the volume is at most 100 microliters, when obtained from the subject. In some embodiments, the volume is at most 55 microliters when obtained from the subject. In some embodiments, the volume is at most 50 microliters when obtained from the subject. In some embodiments, the volume is at most 40 microliters when obtained from the subject. In some embodiments, the volume is at between about 10 microliters and about 40 microliters when obtained from the subject. In some embodiments, the biological sample obtained from the subject is capillary blood. In some embodiments, the biological sample is not a plasma sample from blood. In some embodiments, the target nucleic acid is a circulating cell-free nucleic acid molecule. In some embodiments, the biological sample contains about 25 picograms (pg) to about 250 pg of total circulating cell-free nucleic acid molecules. In some embodiments, the biological sample contains about 104 to about 109 cell-free nucleic acid molecules. In some embodiments, the biological sample contains about 104 to about 107 cell-free nucleic acid molecules. In some embodiments, the cell-free nucleic acids in the biological sample is about 10 genome equivalents. In some embodiments, the cell-free nucleic acids in the biological sample is at most 10 genome equivalents. In some embodiments, the cell-free nucleic acids in the biological sample is between 5-6, 6-7, 7-8, 8-9, 9-10, 10-11, 11-12, 12-13, 13-14, or 14-15 genome equivalents. In some embodiments, the cell-free nucleic acid molecules are cell-free DNA molecules.


Aspects disclosed herein, in some embodiments, are methods comprising: (a) obtaining a biological sample from a subject; (b) optionally tagging at least a portion of the cell-free nucleic acids to produce a library of optionally tagged cell-free nucleic acids; (c) optionally amplifying the optionally tagged cell-free nucleic acids; (d) sequencing at least a portion of the optionally tagged cell-free nucleic acids; and (e) detecting a normal representation, an overrepresentation or an underrepresentation of at least one target sequence in the at least a portion of the optionally tagged cell-free nucleic acids. In some embodiments, the biological sample comprises blood, plasma, serum, urine, interstitial fluid, vaginal cells, vaginal fluid, cervical cells, buccal cells, or saliva. In some embodiments, the blood comprises capillary blood. In some embodiments, the methods further comprise pooling two or more biological samples, each sample obtained from a different subject. In some embodiments, the methods further comprise contacting the biological sample with a white blood cell stabilizer following obtaining the biological sample from the subject. In some embodiments, the biological sample obtained from the subject was collected by transdermal puncture. In some embodiments, the biological sample obtained from the subject was not collected by transdermal puncture. In some embodiments, the biological sample obtained from the subject was collected using a device configured to lyse intercellular junctions of an epidermis of the subject. In some embodiments, the biological sample obtained from the subject was collected by a process of: (a) inducing a first transdermal puncture to produce a first fraction of a biological sample; (b) discarding the first fraction of the biological sample; and (c) collecting a second fraction of the biological sample, thereby reducing or eliminating contamination of the biological sample due to white blood cell lysis. In some embodiments, the tagging of (c) comprises: (a) generating ligation competent cell-free DNA by one or more steps comprising: (i) generating a blunt end of the cell-free DNA, In some embodiments, a 5′ overhang or a 3′ recessed end is removed using one or more polymerase and one or more exonuclease; (ii) dephosphorylating the blunt end of the cell-free DNA; (iii) contacting the cell-free DNA with a crowding reagent thereby enhancing a reaction between the one or more polymerases, one or more exonucleases, and the cell-free DNA; or (iv) repairing or remove DNA damage in the cell-free DNA using a ligase; and (b) ligating the ligation competent cell-free DNA to adaptor oligonucleotides by contacting the ligation competent cell-free DNA to adaptor oligonucleotides in the presence of a ligase, crowding reagent, and/or a small molecule enhancer. In some embodiments, the one or more polymerases comprises T4 DNA polymerase or DNA polymerase I. In some embodiments, the one or more exonucleases comprises T4 polynucleotide kinase or exonuclease III. In some embodiments, the ligase comprises T3 DNA ligase, T4 DNA ligase, T7 DNA ligase, Taq Ligase, Ampligase, E. coli Ligase, or Sso7-ligase fusion protein. In some embodiments, the crowding reagent comprises polyethylene glycol (PEG), glycogen, or dextran, or a combination thereof. In some embodiments, the small molecule enhancer comprises dimethyl sulfoxide (DMSO), polysorbate 20, formamide, or a diol, or a combination thereof. In some embodiments, ligating in (b) comprises blunt end ligating, or single nucleotide overhang ligating. In some embodiments, the adaptor oligonucleotides comprise Y shaped adaptors, hairpin adaptors, stem loop adaptors, degradable adaptors, blocked self-ligating adaptors, or barcoded adaptors, or a combination thereof. In some embodiments, the library in (c) is produced with an efficiency of at least 0.5. In some embodiments, the target cell-free nucleic acids are cell-free nucleic acids from a tumor. In some embodiments, the target cell-free nucleic acids are cell-free nucleic acids from a fetus. In some embodiments, the target cell-free nucleic acids are cell-free nucleic acids from a transplanted tissue or organ. In some embodiments, the target cell-free nucleic acids are genomic nucleic acids from one or more pathogens. In some embodiments, the pathogen comprises a bacterium or component thereof. In some embodiments, the pathogen comprises a virus or a component thereof. In some embodiments, the pathogen comprises a fungus or a component thereof. In some embodiments, the cell-free nucleic acids comprise one or more single nucleotide polymorphisms (SNPs), insertion or deletion (indel), or a combination thereof. In some embodiments, the massively multiplex amplification assay is isothermal amplification. In some embodiments, the massively multiplex amplification assay is polymerase chain reaction (mmPCR). In some embodiments, the biological sample comprises a cell type or tissue type in which fetal cell-free nucleic acids are low, as compared to peripheral blood. In some embodiments, methods do not consist of performing a phlebotomy, or deriving the biological sample from venous blood of the subject. In some embodiments, the biological sample has a volume that is at most 100 microliters when obtained from the subject. In some embodiments, the volume is at most 55 microliters when obtained from the subject. In some embodiments, the volume is at most 50 microliters when obtained from the subject. In some embodiments, the volume is at most 40 microliters when obtained from the subject. In some embodiments, the volume is at between about 10 microliters and about 40 microliters when obtained from the subject. In some embodiments, the biological sample obtained from the subject is capillary blood. In some embodiments, the biological sample is not a plasma sample from blood. In some embodiments, the biological sample comprises circulating cell-free nucleic acids. In some embodiments, the biological sample contains about 25 picograms (pg) to about 250 pg of total circulating cell-free nucleic acid molecules. In some embodiments, the biological sample contains about 104 to about 109 cell-free nucleic acid molecules. In some embodiments, the biological sample contains about 104 to about 107 cell-free nucleic acid molecules. In some embodiments, the cell-free nucleic acids in the biological sample is about 10 genome equivalents. In some embodiments, the cell-free nucleic acids in the biological sample is at most 10 genome equivalents. In some embodiments, the cell-free nucleic acids in the biological sample is between 5-6, 6-7, 7-8, 8-9, 9-10, 10-11, 11-12, 12-13, 13-14, or 14-15 genome equivalents. In some embodiments, the cell-free nucleic acid molecules are cell-free DNA molecules.


Aspects disclosed herein, in some embodiments, are methods prenatal paternity testing methods comprising: (a) obtaining a biological sample from a subject pregnant with a fetus; (b) optionally tagging at least a portion of the cell-free nucleic acids to produce a library of optionally tagged cell-free nucleic acids; (c) optionally amplifying the optionally tagged cell-free nucleic acids; (d) sequencing at least a portion of the optionally tagged cell-free nucleic acids; (e) receiving paternal genotype information from an individual suspected to be a paternal father of the fetus; and (0 comparing the paternal genotype information with a fetal component of the cell-free nucleic acids to determine whether there is a genotypic match between the fetal component and paternal genotype. In some embodiments, the biological sample comprises cell-free nucleic acids. In some embodiments, the biological sample comprises blood, plasma, serum, urine, interstitial fluid, vaginal cells, vaginal fluid, cervical cells, buccal cells, or saliva. In some embodiments, the blood comprises capillary blood. In some embodiments, the capillary blood comprises not more than 40 microliters of blood. In some embodiments, the methods further comprise pooling two or more biological samples, each sample obtained from a different subject. In some embodiments, the methods further comprise contacting the biological sample with a white blood cell stabilizer following obtaining the biological sample from the subject. In some embodiments, the biological sample obtained from the subject was collected by transdermal puncture. In some embodiments, the biological sample obtained from the subject was not collected by transdermal puncture. In some embodiments, the biological sample obtained from the subject was collected using a device configured to lyse intercellular junctions of an epidermis of the subject. In some embodiments, the biological sample obtained from the subject was collected by a process of: (a) inducing a first transdermal puncture to produce a first fraction of a biological sample; (b) discarding the first fraction of the biological sample; and (c) collecting a second fraction of the biological sample, thereby reducing or eliminating contamination of the biological sample due to white blood cell lysis. In some embodiments, methods further comprise cleaning a surface of a transdermal puncture site (e.g., skin) prior to obtaining the biological sample from the subject. In some instances, the cleaning comprises removing or reducing unwanted contaminant. In some instances, the unwanted contaminant comprises DNA from the transdermal puncture site. In some instances, the unwanted contaminant comprises DNA from cells or tissue surrounding the transdermal puncture site. In some instances, DNA is damaged. In some instances, the DNA is not damaged. In some instances, the transdermal puncture site is skin of a finger. In some embodiments, the tagging of (c) comprises: (a) generating ligation competent cell-free DNA by one or more steps comprising: (i) generating a blunt end of the cell-free DNA, In some embodiments, a 5′ overhang or a 3′ recessed end is removed using one or more polymerase and one or more exonuclease; (ii) dephosphorylating the blunt end of the cell-free DNA; (iii) contacting the cell-free DNA with a crowding reagent thereby enhancing a reaction between the one or more polymerases, one or more exonucleases, and the cell-free DNA; or (iv) repairing or remove DNA damage in the cell-free DNA using a ligase; and (b) ligating the ligation competent cell-free DNA to adaptor oligonucleotides by contacting the ligation competent cell-free DNA to adaptor oligonucleotides in the presence of a ligase, crowding reagent, and/or a small molecule enhancer. In some embodiments, the one or more polymerases comprises T4 DNA polymerase or DNA polymerase I. In some embodiments, the one or more exonucleases comprises T4 polynucleotide kinase or exonuclease III. In some embodiments, the ligase comprises T3 DNA ligase, T4 DNA ligase, T7 DNA ligase, Taq Ligase, Ampligase, E. coli Ligase, or Sso7-ligase fusion protein. In some embodiments, the crowding reagent comprises polyethylene glycol (PEG), glycogen, or dextran, or a combination thereof. In some embodiments, the small molecule enhancer comprises dimethyl sulfoxide (DMSO), polysorbate 20, formamide, or a diol, or a combination thereof. In some embodiments, ligating in (b) comprises blunt end ligating, or single nucleotide overhang ligating. In some embodiments, the adaptor oligonucleotides comprise Y shaped adaptors, hairpin adaptors, stem loop adaptors, degradable adaptors, blocked self-ligating adaptors, or barcoded adaptors, or a combination thereof. In some embodiments, the library in (c) is produced with an efficiency of at least 0.5. In some embodiments, the target cell-free nucleic acids are cell-free nucleic acids from a tumor. In some embodiments, the target cell-free nucleic acids are cell-free nucleic acids from a fetus. In some embodiments, the target cell-free nucleic acids are cell-free nucleic acids from a transplanted tissue or organ. In some embodiments, the target cell-free nucleic acids are genomic nucleic acids from one or more pathogens. In some embodiments, the pathogen comprises a bacterium or component thereof. In some embodiments, the pathogen comprises a virus or a component thereof. In some embodiments, the pathogen comprises a fungus or a component thereof. In some embodiments, the cell-free nucleic acids comprise one or more single nucleotide polymorphisms (SNPs), insertion or deletion (indel), or a combination thereof. In some embodiments, the massively multiplex amplification assay is isothermal amplification. In some embodiments, the massively multiplex amplification assay is polymerase chain reaction (mmPCR). In some embodiments, the biological sample comprises a cell type or tissue type in which fetal cell-free nucleic acids are low, as compared to peripheral blood. In some embodiments, methods do not consist of performing a phlebotomy, or deriving the biological sample from venous blood of the subject. In some embodiments, the biological sample has a volume that is at most 100 microliters when obtained from the subject. In some embodiments, the volume is at most 55 microliters when obtained from the subject. In some embodiments, the volume is at most 50 microliters when obtained from the subject. In some embodiments, the volume is at most 40 microliters when obtained from the subject. In some embodiments, the volume is at between about 10 microliters and about 40 microliters when obtained from the subject. In some embodiments, the biological sample obtained from the subject is capillary blood. In some embodiments, the biological sample is not a plasma sample from blood. In some embodiments, the biological sample comprises circulating cell-free nucleic acids. In some embodiments, the biological sample contains about 25 picograms (pg) to about 250 pg of total circulating cell-free nucleic acid molecules. In some embodiments, the biological sample contains about 104 to about 109 cell-free nucleic acid molecules. In some embodiments, the biological sample contains about 104 to about 107 cell-free nucleic acid molecules. In some embodiments, the cell-free nucleic acids in the biological sample is about 10 genome equivalents. In some embodiments, the cell-free nucleic acids in the biological sample is at most 10 genome equivalents. In some embodiments, the cell-free nucleic acids in the biological sample is between 5-6, 6-7, 7-8, 8-9, 9-10, 10-11, 11-12, 12-13, 13-14, or 14-15 genome equivalents. In some embodiments, the cell-free nucleic acid molecules are cell-free DNA molecules.


Aspects disclosed herein, in some embodiments, are methods of analyzing a biological sample obtained from a subject, the method comprising: (a) obtaining a biological sample from a subject; (b) optionally, tagging at least a portion of the cell-free nucleic acids to produce a library of tagged cell-free nucleic acids; (c) amplifying the optionally tagged cell-free nucleic acids by massively multiplexed amplification assay; (d) optionally, pooling the amplified optionally tagged cell-free nucleic acids; (e) sequencing at least a portion of the amplified optionally tagged cell-free nucleic acids; and (f) detecting a normal representation, an overrepresentation or an underrepresentation of at least one target sequence in the at least a portion of the optionally tagged cell-free nucleic acids. In some embodiments, the biological sample comprises blood, plasma, serum, urine, interstitial fluid, vaginal cells, vaginal fluid, cervical cells, buccal cells, or saliva. In some embodiments, the blood comprises capillary blood. In some embodiments, the methods further comprise pooling two or more biological samples, each sample obtained from a different subject. In some embodiments, the methods further comprise contacting the biological sample with a white blood cell stabilizer following obtaining the biological sample from the subject. In some embodiments, the biological sample obtained from the subject was collected by transdermal puncture. In some embodiments, the biological sample obtained from the subject was not collected by transdermal puncture. In some embodiments, the biological sample obtained from the subject was collected using a device configured to lyse intercellular junctions of an epidermis of the subject. In some embodiments, the biological sample obtained from the subject was collected by a process of: (a) inducing a first transdermal puncture to produce a first fraction of a biological sample; (b) discarding the first fraction of the biological sample; and (c) collecting a second fraction of the biological sample, thereby reducing or eliminating contamination of the biological sample due to white blood cell lysis. In some embodiments, methods further comprise cleaning a surface of a transdermal puncture site (e.g., skin) prior to obtaining the biological sample from the subject. In some instances, the cleaning comprises removing or reducing unwanted contaminant. In some instances, the unwanted contaminant comprises DNA from the transdermal puncture site. In some instances, the unwanted contaminant comprises DNA from cells or tissue surrounding the transdermal puncture site. In some instances, DNA is damaged. In some instances, the DNA is not damaged. In some instances, the transdermal puncture site is skin of a finger. In some embodiments, the tagging of (c) comprises: (a) generating ligation competent cell-free DNA by one or more steps comprising: (i) generating a blunt end of the cell-free DNA, In some embodiments, a 5′ overhang or a 3′ recessed end is removed using one or more polymerase and one or more exonuclease; (ii) dephosphorylating the blunt end of the cell-free DNA; (iii) contacting the cell-free DNA with a crowding reagent thereby enhancing a reaction between the one or more polymerases, one or more exonucleases, and the cell-free DNA; or (iv) repairing or remove DNA damage in the cell-free DNA using a ligase; and (b) ligating the ligation competent cell-free DNA to adaptor oligonucleotides by contacting the ligation competent cell-free DNA to adaptor oligonucleotides in the presence of a ligase, crowding reagent, and/or a small molecule enhancer. In some embodiments, the one or more polymerases comprises T4 DNA polymerase or DNA polymerase I. In some embodiments, the one or more exonucleases comprises T4 polynucleotide kinase or exonuclease III. In some embodiments, the ligase comprises T3 DNA ligase, T4 DNA ligase, T7 DNA ligase, Taq Ligase, Ampligase, E. coli Ligase, or Sso7-ligase fusion protein. In some embodiments, the crowding reagent comprises polyethylene glycol (PEG), glycogen, or dextran, or a combination thereof. In some embodiments, the small molecule enhancer comprises dimethyl sulfoxide (DMSO), polysorbate 20, formamide, or a diol, or a combination thereof. In some embodiments, ligating in (b) comprises blunt end ligating, or single nucleotide overhang ligating. In some embodiments, the adaptor oligonucleotides comprise Y shaped adaptors, hairpin adaptors, stem loop adaptors, degradable adaptors, blocked self-ligating adaptors, or barcoded adaptors, or a combination thereof. In some embodiments, the library in (c) is produced with an efficiency of at least 0.5. In some embodiments, the target cell-free nucleic acids are cell-free nucleic acids from a tumor. In some embodiments, the target cell-free nucleic acids are cell-free nucleic acids from a fetus. In some embodiments, the target cell-free nucleic acids are cell-free nucleic acids from a transplanted tissue or organ. In some embodiments, the target cell-free nucleic acids are genomic nucleic acids from one or more pathogens. In some embodiments, the pathogen comprises a bacterium or component thereof. In some embodiments, the pathogen comprises a virus or a component thereof. In some embodiments, the pathogen comprises a fungus or a component thereof. In some embodiments, the cell-free nucleic acids comprise one or more single nucleotide polymorphisms (SNPs), insertion or deletion (indel), or a combination thereof. In some embodiments, the massively multiplex amplification assay is isothermal amplification. In some embodiments, the massively multiplex amplification assay is polymerase chain reaction (mmPCR). In some embodiments, the biological sample comprises a cell type or tissue type in which fetal cell-free nucleic acids are low, as compared to peripheral blood. In some embodiments, methods do not consist of performing a phlebotomy, or deriving the biological sample from venous blood of the subject. In some embodiments, the biological sample has a volume that is at most 100 microliters when obtained from the subject. In some embodiments, the volume is at most 55 microliters when obtained from the subject. In some embodiments, the volume is at most 50 microliters when obtained from the subject. In some embodiments, the volume is at most 40 microliters when obtained from the subject. In some embodiments, the volume is at between about 10 microliters and about 40 microliters when obtained from the subject. In some embodiments, the biological sample obtained from the subject is capillary blood. In some embodiments, the biological sample is not a plasma sample from blood. In some embodiments, the biological sample comprises circulating cell-free nucleic acids. In some embodiments, the biological sample contains about 25 picograms (pg) to about 250 pg of total circulating cell-free nucleic acid molecules. In some embodiments, the biological sample contains about 104 to about 109 cell-free nucleic acid molecules. In some embodiments, the biological sample contains about 104 to about 107 cell-free nucleic acid molecules. In some embodiments, the cell-free nucleic acids in the biological sample is about 10 genome equivalents. In some embodiments, the cell-free nucleic acids in the biological sample is at most 10 genome equivalents. In some embodiments, the cell-free nucleic acids in the biological sample is between 5-6, 6-7, 7-8, 8-9, 9-10, 10-11, 11-12, 12-13, 13-14, or 14-15 genome equivalents. In some embodiments, the cell-free nucleic acid molecules are cell-free DNA molecules.


Aspects disclosed herein, in some embodiments, are methods comprising: (a) obtaining about 1-100 microliters (μl) of a biological sample from a subject comprising deoxyribose nucleic acid (DNA); and (b) detecting an epigenetic modification of the DNA. In some embodiments, the epigenetic modification comprises DNA methylation at a genetic locus, a histone methylation, histone, ubiquitination, histone acetylation, histone phosphorylation, micro RNA (miRNA). In some embodiments, the DNA methylation comprises CpG methylation or CpH methylation. In some embodiments, the genetic locus comprises a promoter or regulatory element of a gene. In some embodiments, the genetic locus comprises a variable long terminal repeat (LTR). In some embodiments, the genetic locus comprises a cell-free DNA or fragment thereof. In some embodiments, the genetic locus comprises a single nucleotide polymorphism (SNP). In some embodiments, histone acetylation is indicated by a presence or level of histone deacetylases. In some embodiments, the histone modification is at a histone selected from the group consisting of histone 2A (H2A), histone 2B (H2B, histone 3 (H3), and histone 4 (H4). In some embodiments, the histone methylation is methylation of H3 lysine 4 (H3K4me2). In some embodiments, the histone acetylation is deacetylation at H4. In some embodiments, the miRNA are selected from the group consisting of miR-21; miR-126, mi-R142; mi-R146a, mi-R12a, mi-R181a, miR-29c, miR-29a, miR-291), miR-101, miRNA-155, and miR-148a. In some embodiments, biological sample comprises blood, plasma, serum, urine, interstitial fluid, vaginal cells, vaginal fluid, cervical cells, buccal cells, or saliva. In some embodiments, the methods further comprise pooling two or more biological samples, each sample obtained from a different subject. In some embodiments, the biological sample obtained from the subject was collected by transdermal puncture. In some embodiments, the biological sample obtained from the subject was not collected by transdermal puncture. In some embodiments, the biological sample obtained from the subject was collected using a device configured to lyse intercellular junctions of an epidermis of the subject. In some embodiments, the biological sample obtained from the subject was collected by a process of: (a) inducing a first transdermal puncture to produce a first fraction of a biological sample; (b) discarding the first fraction of the biological sample; and (c) collecting a second fraction of the biological sample, thereby reducing or eliminating contamination of the biological sample due to white blood cell lysis. In some embodiments, methods further comprise cleaning a surface of a transdermal puncture site (e.g., skin) prior to obtaining the biological sample from the subject. In some instances, the cleaning comprises removing or reducing unwanted contaminant. In some instances, the unwanted contaminant comprises DNA from the transdermal puncture site. In some instances, the unwanted contaminant comprises DNA from cells or tissue surrounding the transdermal puncture site. In some instances, DNA is damaged. In some instances, the DNA is not damaged. In some instances, the transdermal puncture site is skin of a finger. In some embodiments, the methods further comprise contacting the biological sample with a white blood cell stabilizer following obtaining the biological sample from the subject. In some embodiments, methods do not consist of performing a phlebotomy, or deriving the biological sample from venous blood of the subject. In some embodiments, the biological sample has a volume that is at most 100 microliters when obtained from the subject. In some embodiments, the volume is at most 55 microliters when obtained from the subject. In some embodiments, the volume is at most 50 microliters when obtained from the subject. In some embodiments, the volume is at most 40 microliters when obtained from the subject. In some embodiments, the volume is between about 10 microliters and about 40 microliters when obtained from the subject. In some embodiments, the biological sample obtained from the subject is capillary blood. In some embodiments, the biological sample is not a plasma sample from blood. In some embodiments, the biological sample comprises circulating cell-free nucleic acids. In some embodiments, the biological sample contains about 25 picograms (pg) to about 250 pg of total circulating cell-free nucleic acid molecules. In some embodiments, the biological sample contains about 104 to about 109 cell-free nucleic acid molecules. In some embodiments, the biological sample contains about 104 to about 107 cell-free nucleic acid molecules. In some embodiments, the cell-free nucleic acids in the biological sample is about 10 genome equivalents. In some embodiments, the cell-free nucleic acids in the biological sample is at most 10 genome equivalents. In some embodiments, the cell-free nucleic acids in the biological sample is between 5-6, 6-7, 7-8, 8-9, 9-10, 10-11, 11-12, 12-13, 13-14, or 14-15 genome equivalents. In some embodiments, the cell-free nucleic acid molecules are cell-free DNA molecules.


Aspects disclosed herein, in some embodiments, are methods comprising: (a) obtaining a biological sample from a subject, and wherein the biological sample contains up to about 109 cell-free nucleic acid molecules; (b) sequencing at least a portion of the cell-free nucleic acid molecules to produce sequencing reads; (c) measuring at least a portion of sequencing reads corresponding to at least one chromosomal region; and (d) detecting a normal representation, an overrepresentation or an underrepresentation of the at least one chromosomal region. In some embodiments, the methods further comprising tagging the at least a portion of the cell-free nucleic acid molecules. In some embodiments, the tagging comprises: (a) generating ligation competent cell-free DNA by one or more steps comprising: (i) generating a blunt end of the cell-free DNA, In some embodiments, a 5′ overhang or a 3′ recessed end is removed using one or more polymerase and one or more exonuclease; (ii) dephosphorylating the blunt end of the cell-free DNA; (iii) contacting the cell-free DNA with a crowding reagent thereby enhancing a reaction between the one or more polymerases, one or more exonucleases, and the cell-free DNA; or (iv) repairing or remove DNA damage in the cell-free DNA using a ligase; and (b) ligating the ligation competent cell-free DNA to adaptor oligonucleotides by contacting the ligation competent cell-free DNA to adaptor oligonucleotides in the presence of a ligase, crowding reagent, and/or a small molecule enhancer. In some embodiments, the methods further comprise pooling two or more biological samples, each sample obtained from a different subject. In some embodiments, the methods further comprise contacting the biological sample with a white blood cell stabilizer following obtaining the biological sample from the subject. In some embodiments, the one or more polymerases comprises T4 DNA polymerase or DNA polymerase I. In some embodiments, the one or more exonucleases comprises T4 polynucleotide kinase or exonuclease III. In some embodiments, the ligase comprises T3 DNA ligase, T4 DNA ligase, T7 DNA ligase, Taq Ligase, Ampligase, E. coli Ligase, or Sso7-ligase fusion protein. In some embodiments, the crowding reagent comprises polyethylene glycol (PEG), glycogen, or dextran, or a combination thereof. In some embodiments, the small molecule enhancer comprises dimethyl sulfoxide (DMSO), polysorbate 20, formamide, or a diol, or a combination thereof. In some embodiments, ligating in (b) comprises blunt end ligating, or single nucleotide overhang ligating. In some embodiments, the adaptor oligonucleotides comprise Y shaped adaptors, hairpin adaptors, stem loop adaptors, degradable adaptors, blocked self-ligating adaptors, or barcoded adaptors, or a combination thereof. In some embodiments, the biological sample is a biological sample having a volume of less than about 500 microliters (μl). In some embodiments, the biological sample is a biological sample having a volume of about 1 μL to about 100 μl. In some embodiments, the biological sample is a biological sample having a volume of about 5 μL to about 80 μl. In some embodiments, the biological sample comprises blood, plasma, serum, urine, interstitial fluid, vaginal cells, vaginal fluid, cervical cells, buccal cells, or saliva. In some embodiments, the biological sample has a volume that is at most 100 microliters when obtained from the subject. In some embodiments, the volume is at most 55 microliters when obtained from the subject. In some embodiments, the volume is at most 50 microliters when obtained from the subject. In some embodiments, the volume is at most 40 microliters when obtained from the subject. In some embodiments, the volume is at between about 10 microliters and about 40 microliters when obtained from the subject. In some embodiments, the biological sample obtained from the subject is capillary blood. In some embodiments, the biological sample is not a plasma sample from blood. In some embodiments, the biological sample comprises circulating cell-free nucleic acids. In some embodiments, the biological sample contains about 25 picograms (pg) to about 250 pg of total circulating cell-free nucleic acid molecules. In some embodiments, the biological sample contains less than 300 pg of cell-free nucleic acid molecules. In some embodiments, the biological sample contains less than 3 ng of cell-free nucleic acid molecules. In some embodiments, the biological sample contains about 104 to about 109 cell-free nucleic acid molecules. In some embodiments, the biological sample contains about 104 to about 107 cell-free nucleic acid molecules. In some embodiments, the cell-free nucleic acids in the biological sample is about 10 genome equivalents. In some embodiments, the cell-free nucleic acids in the biological sample is at most 10 genome equivalents. In some embodiments, the cell-free nucleic acids in the biological sample is between 5-6, 6-7, 7-8, 8-9, 9-10, 10-11, 11-12, 12-13, 13-14, or 14-15 genome equivalents. In some embodiments, the cell-free nucleic acid molecules are cell-free DNA molecules. In some embodiments, the methods further comprise separating the plasma or serum from a blood sample. In some embodiments, separating comprises filtering the blood sample to remove cells, cell fragments, microvesicles, or a combination thereof, from the blood sample to produce the plasma sample. In some embodiments, obtaining the blood sample comprises pricking a finger. In some embodiments, the biological sample obtained from the subject was collected using a device configured to lyse intercellular junctions of an epidermis of the subject. In some embodiments, the biological sample obtained from the subject was collected by a process of: (a) inducing a first transdermal puncture to produce a first fraction of a biological sample; (b) discarding the first fraction of the biological sample; and (c) collecting a second fraction of the biological sample, thereby reducing or eliminating contamination of the biological sample due to white blood cell lysis. In some embodiments, methods further comprise cleaning a surface of a transdermal puncture site (e.g., skin) prior to obtaining the biological sample from the subject. In some instances, the cleaning comprises removing or reducing unwanted contaminant. In some instances, the unwanted contaminant comprises DNA from the transdermal puncture site. In some instances, the unwanted contaminant comprises DNA from cells or tissue surrounding the transdermal puncture site. In some instances, DNA is damaged. In some instances, the DNA is not damaged. In some embodiments, the subject is a pregnant subject and the cell-free nucleic acid molecules comprise cell-free fetal nucleic acid molecules. In some embodiments, the cell-free nucleic acids comprise nucleic acids from a tumor in a tissue. In some embodiments, the target cell-free nucleic acids are cell-free nucleic acids from a fetus. In some embodiments, the target cell-free nucleic acids are cell-free nucleic acids from a transplanted tissue or organ. In some embodiments, the target cell-free nucleic acids are genomic nucleic acids from one or more pathogens. In some embodiments, the pathogen comprises a bacterium or component thereof. In some embodiments, the pathogen comprises a virus or a component thereof. In some embodiments, the pathogen comprises a fungus or a component thereof. In some embodiments, the cell-free nucleic acids comprise one or more single nucleotide polymorphisms (SNPs), insertion or deletion (indel), or a combination thereof. In some embodiments, the massively multiplex amplification assay is isothermal amplification. In some embodiments, the massively multiplex amplification assay is polymerase chain reaction (mmPCR). In some embodiments, the biological sample comprises a cell type or tissue type in which fetal cell-free nucleic acids are low, as compared to peripheral blood. In some embodiments, methods do not consist of performing a phlebotomy, or deriving the biological sample from venous blood of the subject.


Aspects disclosed herein, in some embodiments, are prenatal paternity testing methods comprising: (a) obtaining a biological sample from a subject pregnant with a fetus, In some embodiments, the biological sample contains up to about 109 cell-free nucleic acid molecules; (b) sequencing at least a portion of the cell-free nucleic acid molecules to produce sequencing reads; (c) measuring at least a portion of sequencing reads corresponding to at least one chromosomal region; (d) receiving paternal genotype information from an individual suspected to be a paternal father of the fetus; and (e) comparing the paternal genotype information with a fetal component of the cell-free nucleic acids to determine whether there is a genotypic match between the fetal component and paternal genotype. In some embodiments, the methods further comprise amplifying the cell-free nucleic acids. In some embodiments, the methods further comprise tagging at least a portion of the cell-free nucleic acids to produce a library of tagged cell-free nucleic acids. In some embodiments, the methods further comprise amplifying the tagged cell-free nucleic acids. In some embodiments, the tagging comprises: (a) generating ligation competent cell-free DNA by one or more steps comprising: (i) generating a blunt end of the cell-free DNA, In some embodiments, a 5′ overhang or a 3′ recessed end is removed using one or more polymerase and one or more exonuclease; (ii) dephosphorylating the blunt end of the cell-free DNA; (iii) contacting the cell-free DNA with a crowding reagent thereby enhancing a reaction between the one or more polymerases, one or more exonucleases, and the cell-free DNA; or (iv) repairing or remove DNA damage in the cell-free DNA using a ligase; and (b) ligating the ligation competent cell-free DNA to adaptor oligonucleotides by contacting the ligation competent cell-free DNA to adaptor oligonucleotides in the presence of a ligase, crowding reagent, and/or a small molecule enhancer. In some embodiments, the methods further comprise pooling two or more biological samples, each sample obtained from a different subject. In some embodiments, the methods further comprise contacting the biological sample with a white blood cell stabilizer following obtaining the biological sample from the subject. In some embodiments, the one or more polymerases comprises T4 DNA polymerase or DNA polymerase I. In some embodiments, the one or more exonucleases comprises T4 polynucleotide kinase or exonuclease III. In some embodiments, the ligase comprises T3 DNA ligase, T4 DNA ligase, T7 DNA ligase, Taq Ligase, Ampligase, E. coli Ligase, or Sso7-ligase fusion protein. In some embodiments, the crowding reagent comprises polyethylene glycol (PEG), glycogen, or dextran, or a combination thereof. In some embodiments, the small molecule enhancer comprises dimethyl sulfoxide (DMSO), polysorbate 20, formamide, or a diol, or a combination thereof. In some embodiments, ligating in (b) comprises blunt end ligating, or single nucleotide overhang ligating. In some embodiments, the adaptor oligonucleotides comprise Y shaped adaptors, hairpin adaptors, stem loop adaptors, degradable adaptors, blocked self-ligating adaptors, or barcoded adaptors, or a combination thereof. In some embodiments, the biological sample comprises blood, plasma, serum, urine, interstitial fluid, vaginal cells, vaginal fluid, cervical cells, buccal cells, or saliva. In some embodiments, the biological sample has a volume that is at most 500 microliters when obtained from the subject. In some embodiments, the volume is at most 300 microliters when obtained from the subject. In some embodiments, the volume is at most 100 microliters when obtained from the subject. In some embodiments, the volume is at most 55 microliters when obtained from the subject. In some embodiments, the volume is at most 50 microliters when obtained from the subject. In some embodiments, the volume is at most 40 microliters when obtained from the subject. In some embodiments, the volume is at between about 10 microliters and about 40 microliters when obtained from the subject. In some embodiments, the volume is at between about 10 microliters and about 100 microliters when obtained from the subject. In some embodiments, the biological sample obtained from the subject is capillary blood. In some embodiments, the biological sample is not a plasma sample from blood. In some embodiments, the biological sample comprises circulating cell-free nucleic acids. In some embodiments, the biological sample contains about 25 picograms (pg) to about 250 pg of total circulating cell-free nucleic acid molecules. In some embodiments, the biological sample contains less than 300 pg of cell-free nucleic acid molecules. In some embodiments, the biological sample contains less than 3 ng of cell-free nucleic acid molecules. In some embodiments, the biological sample contains about 104 to about 109 cell-free nucleic acid molecules. In some embodiments, the biological sample contains about 104 to about 107 cell-free nucleic acid molecules. In some embodiments, the cell-free nucleic acids in the biological sample is about 10 genome equivalents. In some embodiments, the cell-free nucleic acids in the biological sample is at most 10 genome equivalents. In some embodiments, the cell-free nucleic acids in the biological sample is between 5-6, 6-7, 7-8, 8-9, 9-10, 10-11, 11-12, 12-13, 13-14, or 14-15 genome equivalents. In some embodiments, the cell-free nucleic acid molecules are cell-free DNA molecules. In some embodiments, the methods further comprise separating the plasma or serum from a blood sample. In some embodiments, separating comprises filtering the blood sample to remove cells, cell fragments, microvesicles, or a combination thereof, from the blood sample to produce the plasma sample. In some embodiments, obtaining the blood sample comprises pricking a finger. In some embodiments, the biological sample obtained from the subject was collected using a device configured to lyse intercellular junctions of an epidermis of the subject. In some embodiments, the biological sample obtained from the subject was collected by a process of: (a) inducing a first transdermal puncture to produce a first fraction of a biological sample; (b) discarding the first fraction of the biological sample; and (c) collecting a second fraction of the biological sample, thereby reducing or eliminating contamination of the biological sample due to white blood cell lysis. In some embodiments, methods further comprise cleaning a surface of a transdermal puncture site (e.g., skin) prior to obtaining the biological sample from the subject. In some instances, the cleaning comprises removing or reducing unwanted contaminant. In some instances, the unwanted contaminant comprises DNA from the transdermal puncture site. In some instances, the unwanted contaminant comprises DNA from cells or tissue surrounding the transdermal puncture site. In some instances, DNA is damaged. In some instances, the DNA is not damaged. In some instances, the transdermal puncture site is skin of a finger. In some embodiments, the biological sample contains about 104 to about 109 cell-free nucleic acid molecules. In some embodiments, the subject is a pregnant subject and the cell-free nucleic acid molecules comprise cell-free fetal nucleic acid molecules. In some embodiments, the cell-free nucleic acids comprise nucleic acids from a tumor in a tissue. In some embodiments, the target cell-free nucleic acids are cell-free nucleic acids from a fetus. In some embodiments, the target cell-free nucleic acids are cell-free nucleic acids from a transplanted tissue or organ. In some embodiments, the target cell-free nucleic acids are genomic nucleic acids from one or more pathogens. In some embodiments, the pathogen comprises a bacterium or component thereof. In some embodiments, the pathogen comprises a virus or a component thereof. In some embodiments, the pathogen comprises a fungus or a component thereof. In some embodiments, the cell-free nucleic acids comprise one or more single nucleotide polymorphisms (SNPs), insertion or deletion (indel), or a combination thereof. In some embodiments, the massively multiplex amplification assay is isothermal amplification. In some embodiments, the massively multiplex amplification assay is polymerase chain reaction (mmPCR). In some embodiments, the biological sample comprises a cell type or tissue type in which fetal cell-free nucleic acids are low, as compared to peripheral blood. In some embodiments, methods do not consist of performing a phlebotomy, or deriving the biological sample from venous blood of the subject.


Aspects disclosed herein, in some embodiments, are methods comprising: (a) obtaining a biological sample from a subject; (b) amplifying the cell-free nucleic acids; (c) optionally tagging at least a portion of the cell-free nucleic acids to produce a library of tagged cell-free nucleic acids; (d) amplifying the optionally tagged cell-free nucleic acids by a massively multiplexed amplification assay; (e) optionally, pooling the amplified optionally tagged cell-free nucleic acids; (f) sequencing at least a portion of the amplified optionally tagged cell-free nucleic acid molecules to produce sequencing reads; (g) measuring at least a portion of sequencing reads corresponding to at least one chromosomal region; and (h) detecting a normal representation, an overrepresentation or an underrepresentation of the at least one chromosomal region. In some embodiments, the tagging comprises: (a) generating ligation competent cell-free DNA by one or more steps comprising: (i) generating a blunt end of the cell-free DNA, In some embodiments, a 5′ overhang or a 3′ recessed end is removed using one or more polymerase and one or more exonuclease; (ii) dephosphorylating the blunt end of the cell-free DNA; (iii) contacting the cell-free DNA with a crowding reagent thereby enhancing a reaction between the one or more polymerases, one or more exonucleases, and the cell-free DNA; or (iv) repairing or remove DNA damage in the cell-free DNA using a ligase; and (b) ligating the ligation competent cell-free DNA to adaptor oligonucleotides by contacting the ligation competent cell-free DNA to adaptor oligonucleotides in the presence of a ligase, crowding reagent, and/or a small molecule enhancer. In some embodiments, the methods further comprise pooling two or more biological samples, each sample obtained from a different subject. In some embodiments, the methods further comprise contacting the biological sample with a white blood cell stabilizer following obtaining the biological sample from the subject. In some embodiments, the one or more polymerases comprises T4 DNA polymerase or DNA polymerase I. In some embodiments, the one or more exonucleases comprises T4 polynucleotide kinase or exonuclease III. In some embodiments, the ligase comprises T3 DNA ligase, T4 DNA ligase, T7 DNA ligase, Taq Ligase, Ampligase, E. coli Ligase, or Sso7-ligase fusion protein. In some embodiments, the crowding reagent comprises polyethylene glycol (PEG), glycogen, or dextran, or a combination thereof. In some embodiments, the small molecule enhancer comprises dimethyl sulfoxide (DMSO), polysorbate 20, formamide, or a diol, or a combination thereof. In some embodiments, ligating in (b) comprises blunt end ligating, or single nucleotide overhang ligating. In some embodiments, the adaptor oligonucleotides comprise Y shaped adaptors, hairpin adaptors, stem loop adaptors, degradable adaptors, blocked self-ligating adaptors, or barcoded adaptors, or a combination thereof. In some embodiments, the biological sample comprises blood, plasma, serum, urine, interstitial fluid, vaginal cells, vaginal fluid, cervical cells, buccal cells, or saliva. In some embodiments, the biological sample has a volume that is at most 300 microliters when obtained from the subject. In some embodiments, the volume is at most 100 microliters when obtained from the subject. In some embodiments, the volume is at most 55 microliters when obtained from the subject. In some embodiments, the volume is at most 50 microliters when obtained from the subject. In some embodiments, the volume is at most 40 microliters when obtained from the subject. In some embodiments, the volume is at between about 10 microliters and about 40 microliters when obtained from the subject. In some embodiments, the volume is at between about 10 microliters and about 100 microliters when obtained from the subject. In some embodiments, the biological sample obtained from the subject is capillary blood. In some embodiments, the biological sample is not a plasma sample from blood. In some embodiments, the biological sample comprises circulating cell-free nucleic acids. In some embodiments, the biological sample contains about 25 picograms (pg) to about 250 pg of total circulating cell-free nucleic acid molecules. In some embodiments, the biological sample contains less than 300 pg of cell-free nucleic acid molecules. In some embodiments, the biological sample contains less than 3 ng of cell-free nucleic acid molecules. In some embodiments, the biological sample contains about 104 to about 109 cell-free nucleic acid molecules. In some embodiments, the biological sample contains about 104 to about 107 cell-free nucleic acid molecules. In some embodiments, the cell-free nucleic acids in the biological sample is about 10 genome equivalents. In some embodiments, the cell-free nucleic acids in the biological sample is at most 10 genome equivalents. In some embodiments, the cell-free nucleic acids in the biological sample is between 5-6, 6-7, 7-8, 8-9, 9-10, 10-11, 11-12, 12-13, 13-14, or 14-15 genome equivalents. In some embodiments, the methods further comprise separating the plasma or serum from a blood sample. In some embodiments, separating comprises filtering the blood sample to remove cells, cell fragments, microvesicles, or a combination thereof, from the blood sample to produce the plasma sample. In some embodiments, obtaining the blood sample comprises pricking a finger. In some embodiments, the biological sample obtained from the subject was collected using a device configured to lyse intercellular junctions of an epidermis of the subject. In some embodiments, the biological sample obtained from the subject was collected by a process of: (a) inducing a first transdermal puncture to produce a first fraction of a biological sample; (b) discarding the first fraction of the biological sample; and (c) collecting a second fraction of the biological sample, thereby reducing or eliminating contamination of the biological sample due to white blood cell lysis. In some embodiments, methods further comprise cleaning a surface of a transdermal puncture site (e.g., skin) prior to obtaining the biological sample from the subject. In some instances, the cleaning comprises removing or reducing unwanted contaminant. In some instances, the unwanted contaminant comprises DNA from the transdermal puncture site. In some instances, the unwanted contaminant comprises DNA from cells or tissue surrounding the transdermal puncture site. In some instances, DNA is damaged. In some instances, the DNA is not damaged. In some instances, the transdermal puncture site is skin of a finger. In some embodiments, the subject is a pregnant subject and the cell-free nucleic acid molecules comprise cell-free fetal nucleic acid molecules. In some embodiments, the cell-free nucleic acids comprise nucleic acids from a tumor in a tissue. In some embodiments, the target cell-free nucleic acids are cell-free nucleic acids from a fetus. In some embodiments, the target cell-free nucleic acids are cell-free nucleic acids from a transplanted tissue or organ. In some embodiments, the target cell-free nucleic acids are genomic nucleic acids from one or more pathogens. In some embodiments, the pathogen comprises a bacterium or component thereof. In some embodiments, the pathogen comprises a virus or a component thereof. In some embodiments, the pathogen comprises a fungus or a component thereof. In some embodiments, the cell-free nucleic acids comprise one or more single nucleotide polymorphisms (SNPs), insertion or deletion (indel), or a combination thereof. In some embodiments, the massively multiplex amplification assay is isothermal amplification. In some embodiments, the massively multiplex amplification assay is polymerase chain reaction (mmPCR). In some embodiments, the biological sample comprises a cell type or tissue type in which fetal cell-free nucleic acids are low, as compared to peripheral blood. In some embodiments, methods do not consist of performing a phlebotomy, or deriving the biological sample from venous blood of the subject.


Aspects disclosed herein, in some embodiments, are methods comprising: (a) obtaining a biological sample from a subject; (b) isolating fetal trophoblast from the biological sample using an antibody specific to a fetal trophoblast cell-surface antigen; (c) lysing the fetal trophoblast a nucleus in the fetal trophoblast; (e) extracting fetal genomic DNA (gDNA) from the lysed fetal trophoblast; (f) contacting the fetal gDNA with an amplification reagent and an oligonucleotide primer that anneals to a sequence corresponding to a sequence of interest in order to produce an amplification product; and (g) detecting the (i) presence or absence of the amplification product, or (ii) a normal representation, an overrepresentation or an underrepresentation of at least one target sequence in the at least a portion of the fetal gDNA. In some embodiments, the presence or absence indicates a health status of the fetus. In some embodiments, the methods further comprise contacting the biological sample with a white blood cell stabilizer following obtaining the biological sample from the subject. In some embodiments, the biological sample comprises blood, plasma, serum, urine, interstitial fluid, vaginal cells, vaginal fluid, cervical cells, buccal cells, or saliva. In some embodiments, the biological sample obtained from the subject was collected by the subject with a finger prick. In some embodiments, the biological sample obtained from the subject was collected by the subject without a finger prick. In some embodiments, the biological sample obtained from the subject was collected by the subject using a device configured to lyse intercellular junctions of an epidermis of the subject. In some embodiments, the biological sample obtained from the subject was collected by a process of: (a) inducing a first transdermal puncture to produce a first fraction of a biological sample; (b) discarding the first fraction of the biological sample; and (c) collecting a second fraction of the biological sample, thereby reducing or eliminating contamination of the biological sample due to white blood cell lysis. In some embodiments, methods further comprise cleaning a surface of a transdermal puncture site (e.g., skin) prior to obtaining the biological sample from the subject. In some instances, the cleaning comprises removing or reducing unwanted contaminant. In some instances, the unwanted contaminant comprises DNA from the transdermal puncture site. In some instances, the unwanted contaminant comprises DNA from cells or tissue surrounding the transdermal puncture site. In some instances, DNA is damaged. In some instances, the DNA is not damaged. In some instances, the transdermal puncture site is skin of a finger. In some embodiments, contacting comprises performing isothermal amplification. In some embodiments, contacting occurs at room temperature. In some embodiments, the method comprises incorporating a tag into the amplification product as the amplifying occurs, and further comprising, detecting the presence of the amplification product comprises detecting the tag. In some embodiments, the tag does not comprise a nucleotide. In some embodiments, detecting the amplification product comprises contacting the amplification product with a binding moiety that is capable of interacting with the tag. In some embodiments, methods further comprise contacting the amplification product with the binding moiety on a lateral flow device. In some embodiments, the steps (a) through (c) are performed in less than fifteen minutes. In some embodiments, the method is performed by the subject. In some embodiments, the method is performed by an individual without receiving technical training for performing the method. In some embodiments, obtaining, contacting, and detecting is performed with a single handheld device. In some embodiments, the health status is selected from the presence and the absence of a pregnancy. In some embodiments, the health status is selected from the presence and the absence of a neurological disorder, a metabolic disorder, a cancer, an autoimmune disorder, an allergic reaction, and an infection. In some embodiments, the health status is a response to a drug or a therapy. In some embodiments, methods do not consist of performing a phlebotomy, or deriving the biological sample from venous blood of the subject. In some embodiments, the biological sample has a volume that is at most 300 microliters when obtained from the subject. In some embodiments, the volume is at most 100 microliters when obtained from the subject. In some embodiments, the volume is at most 55 microliters when obtained from the subject. In some embodiments, the volume is at most 50 microliters when obtained from the subject. In some embodiments, the volume is at most 40 microliters when obtained from the subject. In some embodiments, the volume is at between about 10 microliters and about 40 microliters when obtained from the subject. In some embodiments, the volume is at between about 10 microliters and about 100 microliters when obtained from the subject. In some embodiments, the biological sample obtained from the subject is capillary blood. In some embodiments, the biological sample is not a plasma sample from blood. In some embodiments, the biological sample comprises circulating cell-free nucleic acids. In some embodiments, the biological sample contains about 25 picograms (pg) to about 250 pg of total circulating cell-free nucleic acid molecules. In some embodiments, the biological sample contains less than 300 pg of cell-free nucleic acid molecules. In some embodiments, the biological sample contains less than 3 ng of cell-free nucleic acid molecules. In some embodiments, the biological sample contains about 104 to about 109 cell-free nucleic acid molecules. In some embodiments, the biological sample contains about 104 to about 107 cell-free nucleic acid molecules. In some embodiments, the cell-free nucleic acids in the biological sample is about 10 genome equivalents. In some embodiments, the cell-free nucleic acids in the biological sample is at most 10 genome equivalents. In some embodiments, the cell-free nucleic acids in the biological sample is between 5-6, 6-7, 7-8, 8-9, 9-10, 10-11, 11-12, 12-13, 13-14, or 14-15 genome equivalents.


Aspects disclosed herein provide methods comprising: (a) obtaining a biological sample from a subject pregnant with a fetus; (b) contacting at least one cell-free nucleic acid in the biological sample with an amplification reagent and an oligonucleotide primer that anneals to a sequence corresponding to a sex chromosome; and (c) detecting the presence or absence of an amplification product. In some embodiments, the presence or absence indicates a gender of the fetus. In some embodiments, the methods further comprise contacting the biological sample with a white blood cell stabilizer following obtaining the biological sample from the subject. In some embodiments, the biological sample is a blood, plasma, serum, urine, interstitial fluid, vaginal cells, vaginal fluid, cervical cells, buccal cells, or saliva, In some embodiments, the biological sample has a volume that is at most 300 microliters when obtained from the subject. In some embodiments, the volume is at most 100 microliters when obtained from the subject. In some embodiments, the volume is at most 55 microliters when obtained from the subject. In some embodiments, the volume is at most 50 microliters when obtained from the subject. In some embodiments, the volume is at most 40 microliters when obtained from the subject. In some embodiments, the volume is at between about 10 microliters and about 40 microliters when obtained from the subject. In some embodiments, the volume is at between about 10 microliters and about 100 microliters when obtained from the subject. In some embodiments, the biological sample obtained from the subject is capillary blood. In some embodiments, the biological sample is not a plasma sample from blood. In some embodiments, the biological sample comprises circulating cell-free nucleic acids. In some embodiments, the biological sample contains about 25 picograms (pg) to about 250 pg of total circulating cell-free nucleic acid molecules. In some embodiments, the biological sample contains less than 300 pg of cell-free nucleic acid molecules. In some embodiments, the biological sample contains less than 3 ng of cell-free nucleic acid molecules. In some embodiments, the biological sample contains about 104 to about 109 cell-free nucleic acid molecules. In some embodiments, the biological sample contains about 104 to about 107 cell-free nucleic acid molecules. In some embodiments, the cell-free nucleic acids in the biological sample is about 10 genome equivalents. In some embodiments, the cell-free nucleic acids in the biological sample is at most 10 genome equivalents. In some embodiments, the cell-free nucleic acids in the biological sample is between 5-6, 6-7, 7-8, 8-9, 9-10, 10-11, 11-12, 12-13, 13-14, or 14-15 genome equivalents. In some embodiments, the biological sample obtained from the subject was collected by the subject with a finger prick. In some embodiments, the biological sample obtained from the subject was collected by the subject without a finger prick. In some embodiments, the biological sample obtained from the subject was collected by the subject using a device configured to lyse intercellular junctions of an epidermis of the subject. In some embodiments, the biological sample obtained from the subject was collected by a process of: (a) inducing a first transdermal puncture to produce a first fraction of a biological sample; (b) discarding the first fraction of the biological sample; and (c) collecting a second fraction of the biological sample, thereby reducing or eliminating contamination of the biological sample due to white blood cell lysis. In some embodiments, methods do not consist of performing a phlebotomy, or deriving the biological sample from venous blood of the subject.


Aspects disclosed herein, in some embodiments are methods comprising: (a) obtaining a biological sample from a subject, wherein the volume of the sample is not greater than about 300 microliters, and wherein the biological sample comprises a fetal trophoblast; (b) isolating the fetal trophoblast from the biological sample using a monoclonal antibody specific to a fetal trophoblast cell-surface antigen; (c) lysing the fetal trophoblast; (d), optionally, purifying the fetal trophoblast nucleus; € optionally, lysing the fetal trophoblast nucleus; (f) extracting fetal genomic DNA (gDNA) from the lysed fetal trophoblast; (g) contacting the fetal gDNA with an amplification reagent and an oligonucleotide primer that anneals to a sequence corresponding to a sequence of interest in order to produce an amplification product; and (h) detecting the presence or absence of the amplification product. In some embodiments, the presence or absence indicates a gender of the fetus. In some embodiments, the methods further comprise contacting the biological sample with a white blood cell stabilizer following obtaining the biological sample from the subject. In some embodiments, the biological sample is a blood, plasma, serum, urine, interstitial fluid, vaginal cells, vaginal fluid, cervical cells, buccal cells, or saliva. In some embodiments, the volume of the blood sample is not greater than 120 μl. In some embodiments, the biological sample is a plasma sample from blood. In some embodiments, the volume of the plasma sample is not greater than 50 μl. In some embodiments, the volume of the plasma sample is between about 10 μl and about 40 μl. In some embodiments, the biological sample obtained from the subject was collected by the subject with a finger prick. In some embodiments, the biological sample obtained from the subject was collected by the subject without a finger prick. In some embodiments, the biological sample obtained from the subject was collected by the subject using a device configured to lyse intercellular junctions of an epidermis of the subject. In some embodiments, the biological sample obtained from the subject was collected by a process of: (a) inducing a first transdermal puncture to produce a first fraction of a biological sample; (b) discarding the first fraction of the biological sample; and (c) collecting a second fraction of the biological sample, thereby reducing or eliminating contamination of the biological sample due to white blood cell lysis. In some embodiments, methods do not consist of performing a phlebotomy, or deriving the biological sample from venous blood of the subject. Aspects disclosed herein, in some embodiments, are methods comprising: (a) obtaining a biological sample from a subject, wherein the biological sample comprises cell-free nucleic acids; (b) generating a library of fragmented nucleic acid molecules by bringing the cell-free nucleic acids into contact with an endonuclease, thereby fragmenting at least one of the cell-free nucleic acids; (c) optionally amplifying the tagged cell-free nucleic acids; and (d) sequencing at least a portion of the tagged cell-free nucleic acids to detect a sequence of interest. In some embodiments, the endonuclease is a Cas enzyme. In some embodiments, the Cas enzyme is Cas9, Cas12, Cascade and Cas13, or one or more subtypes or orthologue thereof. In some embodiments, the endonuclease is guided to at the cell-free nucleic acids by a guide strand that is complementary to at least one of the cell-free nucleic acids. In some embodiments, the method further comprises fragmenting, by the endonuclease, the cell-free nucleic acids. In some embodiments, the methods further comprise contacting the biological sample with a white blood cell stabilizer following obtaining the biological sample from the subject. In some embodiments, the biological sample is a blood, plasma, serum, urine, interstitial fluid, vaginal cells, vaginal fluid, cervical cells, buccal cells, or saliva. In some embodiments, the volume of the blood sample is not greater than 300 μl. In some embodiments, the volume of the blood sample is not greater than 120 μl. In some embodiments, the biological sample is a plasma sample from blood. In some embodiments, the volume of the plasma sample is not greater than 50 μl. In some embodiments, the volume of the plasma sample is between about 10 μl and about 40 μl. In some embodiments, the biological sample contains about 25 picograms (pg) to about 250 pg of total circulating cell-free DNA. In some embodiments, the biological sample contains about 104 to about 109 cell-free nucleic acid molecules. In some embodiments, the biological sample contains about 104 to about 107 cell-free nucleic acid molecules. In some embodiments, the biological sample contains about 5 to about 100 copies of the sequence of interest. In some embodiments, the biological sample obtained from the subject was collected by the subject with a finger prick. In some embodiments, the biological sample obtained from the subject was collected by the subject without a finger prick. In some embodiments, the biological sample obtained from the subject was collected by the subject using a device configured to lyse intercellular junctions of an epidermis of the subject. In some embodiments, the biological sample obtained from the subject was collected by a process of: (a) inducing a first transdermal puncture to produce a first fraction of a biological sample; (b) discarding the first fraction of the biological sample; and (c) collecting a second fraction of the biological sample, thereby reducing or eliminating contamination of the biological sample due to white blood cell lysis. In some embodiments, methods further comprise cleaning a surface of a transdermal puncture site (e.g., skin) prior to obtaining the biological sample from the subject. In some instances, the cleaning comprises removing or reducing unwanted contaminant. In some instances, the unwanted contaminant comprises DNA from the transdermal puncture site. In some instances, the unwanted contaminant comprises DNA from cells or tissue surrounding the transdermal puncture site. In some instances, the DNA is damaged. In some instances, the DNA is not damaged. In some instances, the transdermal puncture site is skin of a finger. In some embodiments, methods do not consist of performing a phlebotomy, or deriving the biological sample from venous blood of the subject. In some embodiments, detecting the sequence of interest comprises detecting (i) a presence or an absence of the sequence of interest, or (ii) a normal representation, an overrepresentation or an underrepresentation of the sequence of interest. In some embodiments, detecting comprises massively multiplex amplification. In some embodiments, the massively multiplex amplification assay is isothermal amplification. In some embodiments, the massively multiplex amplification assay is polymerase chain reaction (mmPCR). In some embodiments, the subject is a pregnant subject and the cell-free nucleic acid molecules comprise cell-free fetal nucleic acid molecules. In some embodiments, the cell-free nucleic acids comprise nucleic acids from a tumor in a tissue. In some embodiments, the cell-free nucleic acids are cell-free nucleic acids from a fetus. In some embodiments, the cell-free nucleic acids are cell-free nucleic acids from a transplanted tissue or organ. In some embodiments, the cell-free nucleic acids are genomic nucleic acids from one or more pathogens. In some embodiments, the pathogen comprises a bacterium or component thereof. In some embodiments, the pathogen comprises a virus or a component thereof. In some embodiments, the pathogen comprises a fungus or a component thereof. In some embodiments, the cell-free nucleic acids comprise one or more single nucleotide polymorphisms (SNPs), insertion or deletion (indel), or a combination thereof.


Aspects disclosed herein, in some embodiments, are methods comprising: (a) obtaining a biological sample from a subject, wherein the biological sample comprises cell-free nucleic acids; (b) generating a library of fragmented nucleic acid molecules by bringing the cell-free nucleic acids into contact with a transposase enzyme, thereby fragmenting at least one of the cell-free nucleic acids and tagging the fragmented cell-free nucleic acids with a synthetic tag; (c) optionally amplifying the tagged cell-free nucleic acids; and (d) sequencing at least a portion of the tagged cell-free nucleic acids. In some embodiments, the transposase fragments the cell-free nucleic acids using a cut-and-paste mechanism. In some embodiments, the transposase is Tn5 transposase. In some embodiments, the tag is about 9 base pairs in length. In some embodiments, the method further comprises fragmenting, by the transposase, the cell-free nucleic acids. In some embodiments, the method further comprises tagging, by the transposase, the fragmented cell-free nucleic acids. In some embodiments, the methods further comprise contacting the biological sample with a white blood cell stabilizer following obtaining the biological sample from the subject. In some embodiments, the biological sample is a blood, plasma, serum, urine, interstitial fluid, vaginal cells, vaginal fluid, cervical cells, buccal cells, or saliva. In some embodiments, the volume of the blood sample is not greater than 300 μl. In some embodiments, the volume of the blood sample is not greater than 120 μl. In some embodiments, the biological sample is a plasma sample from blood. In some embodiments, the volume of the plasma sample is not greater than 50 μl. In some embodiments, the volume of the plasma sample is between about 10 μl and about 40 μl. In some embodiments, the biological sample contains about 25 picograms (pg) to about 250 pg of total circulating cell-free DNA. In some embodiments, the biological sample contains about 104 to about 109 cell-free nucleic acid molecules. In some embodiments, the biological sample contains about 104 to about 107 cell-free nucleic acid molecules. In some embodiments, the biological sample contains about 5 to about 100 copies of the sequence of interest. In some embodiments, the biological sample obtained from the subject was collected by the subject with a finger prick. In some embodiments, the biological sample obtained from the subject was collected by the subject without a finger prick. In some embodiments, the biological sample obtained from the subject was collected by the subject using a device configured to lyse intercellular junctions of an epidermis of the subject. In some embodiments, the biological sample obtained from the subject was collected by a process of: (a) inducing a first transdermal puncture to produce a first fraction of a biological sample; (b) discarding the first fraction of the biological sample; and (c) collecting a second fraction of the biological sample, thereby reducing or eliminating contamination of the biological sample due to white blood cell lysis. In some embodiments, methods further comprise cleaning a surface of a transdermal puncture site (e.g., skin) prior to obtaining the biological sample from the subject. In some instances, the cleaning comprises removing or reducing unwanted contaminant. In some instances, the unwanted contaminant comprises DNA from the transdermal puncture site. In some instances, the unwanted contaminant comprises DNA from cells or tissue surrounding the transdermal puncture site. In some instances, the DNA is damaged. In some instances, the DNA is not damaged. In some instances, the transdermal puncture site is skin of a finger. In some embodiments, methods do not consist of performing a phlebotomy, or deriving the biological sample from venous blood of the subject. In some embodiments, detecting the sequence of interest comprises detecting (i) a presence or an absence of the sequence of interest, or (ii) a normal representation, an overrepresentation or an underrepresentation of the sequence of interest. In some embodiments, detecting comprises massively multiplex amplification. In some embodiments, the massively multiplex amplification assay is isothermal amplification. In some embodiments, the massively multiplex amplification assay is polymerase chain reaction (mmPCR). In some embodiments, the subject is a pregnant subject and the cell-free nucleic acid molecules comprise cell-free fetal nucleic acid molecules. In some embodiments, the cell-free nucleic acids comprise nucleic acids from a tumor in a tissue. In some embodiments, the cell-free nucleic acids are cell-free nucleic acids from a fetus. In some embodiments, the cell-free nucleic acids are cell-free nucleic acids from a transplanted tissue or organ. In some embodiments, the cell-free nucleic acids are genomic nucleic acids from one or more pathogens. In some embodiments, the pathogen comprises a bacterium or component thereof. In some embodiments, the pathogen comprises a virus or a component thereof. In some embodiments, the pathogen comprises a fungus or a component thereof. In some embodiments, the cell-free nucleic acids comprise one or more single nucleotide polymorphisms (SNPs), insertion or deletion (indel), or a combination thereof.


Aspects disclosed herein, in some embodiments, are systems comprising: (a) a sample collector configured to collect a biological sample of a subject; (b) a sample processor that is configured to isolate a sample component from the biological sample; (c) a nucleic acid detector that is configured to detect nucleic acids in the biological sample or the sample component; and (d) a nucleic acid information output. In some embodiments, the systems further comprise a white blood cell stabilizer. In some embodiments, the sample collector comprises a transdermal puncture device. In some embodiments, the transdermal puncture device comprises at least one of a needle, a lancet, a microneedle, a vacuum, and a microneedle array. In some embodiments, the sample collector comprises a device that is configured to lyse intercellular junctions of an epidermis of the subject. In some embodiments, the sample component is selected from a cell, a carbohydrate, a phospholipid, a protein, a nucleic acid, and a microvesicle. In some embodiments, the sample component is a blood cell. In some embodiments, the sample component does not comprise a cell-free nucleic acid. In some embodiments, the sample component comprises a cell-free nucleic acid. In some embodiments, the cell-free nucleic acids are from a tumor. In some embodiments, the cell-free nucleic acids are from a fetus. In some embodiments, the cell-free nucleic acids are from a transplanted tissue or organ. In some embodiments, the cell-free nucleic acids are from one or more pathogens. In some embodiments, the cell-free nucleic acids are from a cell type or a tissue type with low abundance of cell-free nucleic acids, as compared to peripheral blood. In some embodiments, the pathogen comprises a bacterium or component thereof. In some embodiments, the pathogen comprises a virus or a component thereof. In some embodiments, the pathogen comprises a fungus or a component thereof. In some embodiments, the sample component comprises one or more single nucleotide polymorphisms (SNPs), one or more indels, or a combination thereof. In some embodiments, the nucleic acid detector is configured to perform a genotyping assay. In some embodiments, the genotype assay comprises quantitative real-time polymerase chain reaction (qPCR), a genotype array, or automated sequencing. In some embodiments, the qPCR comprises multiplexed polymerase chain reaction (mmPCR), In some embodiments, the sample component is plasma or serum. In some embodiments, the sample purifier is configured to isolate plasma from less than 1 milliliter of blood. In some embodiments, the sample purifier is configured to isolate plasma from less than 250 μl of blood. In some embodiments, the volume of the biological sample is not greater than 50 μl. In some embodiments, the volume of the biological sample is between about 10 μl and about 40 μl. In some embodiments, the biological sample contains about 25 pg to about 250 pg of total circulating cell-free DNA. In some embodiments, the sample contains about 5 to about 100 copies of a sequence of interest in the biological sample or the sample component. In some embodiments, the biological sample contains about 104 to about 109 cell-free nucleic acid molecules. In some embodiments, the biological sample contains about 104 to about 107 cell-free nucleic acid molecules. In some embodiments, the biological sample contains less than 300 pg of cell-free nucleic acid molecules. In some embodiments, the biological sample contains less than 3 ng of cell-free nucleic acid molecules. In some embodiments, the nucleic acid detector comprises a nucleic acid sequencer. In some embodiments, systems further comprise at least one nucleic acid amplification reagent and at least one crowding agent. In some embodiments, the systems further comprise at least a first tag for producing a library of cell-free nucleic acids from the biological sample, and at least one amplification reagent. In some embodiments, the at least one nucleic acid amplification reagent comprises a primer, a polymerase, and a combination thereof. In some embodiments, the nucleic acid detector is configured to generate a library of tagged nucleic acids by: (a) generating ligation competent nucleic acids by one or more steps comprising: (i) generating a blunt end of the nucleic acids, In some embodiments, a 5′ overhang or a 3′ recessed end is removed using one or more polymerase and one or more exonuclease; (ii) dephosphorylating the blunt end of the nucleic acids; (iii) contacting the nucleic acids with a crowding reagent thereby enhancing a reaction between the one or more polymerases, one or more exonucleases, and the nucleic acids; or (iv) repairing or remove damaged nucleic acids in the nucleic acids using a ligase; and ligating the ligation competent nucleic acids to adaptor oligonucleotides by contacting the ligation competent nucleic acids to adaptor oligonucleotides in the presence of a ligase, crowding reagent, and/or a small molecule enhancer. In some embodiments, the one or more polymerases comprises T4 DNA polymerase or DNA polymerase I. In some embodiments, the one or more exonucleases comprises T4 polynucleotide kinase or exonuclease III. In some embodiments, the ligase comprises T3 DNA ligase, T4 DNA ligase, T7 DNA ligase, Taq Ligase, Ampligase, E. coli Ligase, or Sso7-ligase fusion protein. In some embodiments, the crowding reagent comprises polyethylene glycol (PEG), glycogen, or dextran, or a combination thereof. In some embodiments, the small molecule enhancer comprises dimethyl sulfoxide (DMSO), polysorbate 20, formamide, or a diol, or a combination thereof. In some embodiments, ligating in (b) comprises blunt end ligating, or single nucleotide overhang ligating. In some embodiments, the adaptor oligonucleotides comprise Y shaped adaptors, hairpin adaptors, stem loop adaptors, degradable adaptors, blocked self-ligating adaptors, or barcoded adaptors, or a combination thereof. In some embodiments, the system is further configured to pool two or more biological samples, each sample obtained from a different subject. In some embodiments, the nucleic acid detector is further configured to count the tags to detect a representation of the nucleic acids of interest in the sample. In some embodiments, the nucleic acid sequence output is selected from a wireless communication device, a wired communication device, a cable port, and an electronic display. In some embodiments, all components of the system are present in a single location. In some embodiments, all components of the system are housed in a single device. In some embodiments, the sample collector is located at a first location and at least one of the sample purifier and nucleic acid detector are second location. In some embodiments, the sample collector and at least one of the sample purifier and nucleic acid detector are at the same location. In some embodiments, the sample purifier comprises a filter. In some embodiments, the filter has a pore size of about 0.05 microns to about 2 microns. In some embodiments, system further comprise a transport or storage compartment for transporting or storing at least a portion of the biological sample. In some embodiments, the transport or storage compartment comprises an absorption pad, a fluid container, a sample preservative, or a combination thereof. In some embodiments, the systems further comprise a nucleic acid amplifier configured to the amplify nucleic acids from the sample component or the biological sample, and In some embodiments, the nucleic acid detector is further configured to detect amplified nucleic acids in the biological sample or the sample component. In some embodiments, the nucleic acid amplifier is a polymerase chain reaction (PCR) device. In some embodiments, the PCR device is a massively multiplexed PCR device (mmPCR). In some embodiments, the biological sample is not derived from venous blood of the subject.


Aspects disclosed herein, in some embodiments, are systems comprising: (a) a sample collector configured to collect about 1-100 microliter (μl) a biological sample of a subject; (b) a sample processor that is configured to isolate a sample component from the biological sample; (c) a detector that is configured to detect an epigenetic modification in the biological sample or the sample component; and (d) an information output. In some embodiments, the epigenetic modification comprises DNA methylation at a genetic locus, a histone methylation, histone, ubiquitination, histone acetylation, histone phosphorylation, micro RNA (miRNA). In some embodiments, the DNA methylation comprises CpG methylation or CpH methylation. In some embodiments, the genetic locus comprises a promoter or regulatory element of a gene. In some embodiments, the genetic locus comprises a variable long terminal repeat (LTR). In some embodiments, the genetic locus comprises a cell-free DNA or fragment thereof. In some embodiments, the genetic locus comprises a single nucleotide polymorphism (SNP). In some embodiments, histone acetylation is indicated by a presence or level of histone deacetylases. In some embodiments, the histone modification is at a histone selected from the group consisting of histone 2A (H2A), histone 2B (H2B, histone 3 (H3), and histone 4 (H4). In some embodiments, the histone methylation is methylation of H3 lysine 4 (H3K4me2). In some embodiments, the histone acetylation is deacetylation at H4. In some embodiments, the miRNA are selected from the group consisting of miR-21, miR-126, mi-R142, mi-R146a, mi-R12a, mi-R181a, miR-29c, miR-29a, miR-29b), miR-101, miRNA-155, and miR-148a. In some embodiments, the biological sample comprises blood, plasma, serum, urine, interstitial fluid, vaginal cells, vaginal fluid, cervical cells, buccal cells, or saliva. In some embodiments, the blood comprises capillary blood. In some embodiments, the capillary blood comprises not more than 40 microliters of blood. In some embodiments, the biological sample obtained from the subject was collected by transdermal puncture. In some embodiments, the biological sample obtained from the subject was not collected by transdermal puncture. In some embodiments, the biological sample obtained from the subject was collected using a device configured to lyse intercellular junctions of an epidermis of the subject. In some embodiments, the sample collector is configured to permit the biological sample obtained from the subject to collected by a process of: (a) inducing a first transdermal puncture to produce a first fraction of a biological sample; (b) discarding the first fraction of the biological sample; and (c) collecting a second fraction of the biological sample, thereby reducing or eliminating contamination of the biological sample due to white blood cell lysis. In some embodiments, the sample collector is configured to clean surface of a transdermal puncture site (e.g., skin) prior to obtaining the biological sample from the subject. In some instances, the cleaning comprises removing or reducing unwanted contaminant. In some instances, the unwanted contaminant comprises DNA from the transdermal puncture site. In some instances, the unwanted contaminant comprises DNA from cells or tissue surrounding the transdermal puncture site. In some instances, DNA is damaged. In some instances, the DNA is not damaged. In some instances, the transdermal puncture site is skin of a finger. In some embodiments, the systems further comprise a white blood cell stabilizer. In some embodiments, the biological sample is not derived from venous blood of the subject.


Aspects disclosed herein, in some embodiments, are devices comprising: (a) a sample collector for obtaining a biological sample from a subject in need thereof; (b) a sample purifier for removing a cell from the biological sample to produce a cell-depleted sample; and (c) a nucleic acid detector configured to detect a plurality of cell-free DNA fragments in the cell-depleted sample. In some embodiments, the devices further comprise a white blood cell stabilizer. In some embodiments, the sample collector is configured to lyse intercellular junctions of an epidermis of the subject. In some embodiments, the sample collector is configured to collect a sample from a transdermal puncture. In some embodiments, a first sequence is present on a first cell-free DNA fragment of the plurality of cell-free DNA fragments and a second sequence is present on a second cell-free DNA fragment of the plurality of cell-free DNA fragments, and In some embodiments, the first sequence is at least 80% identical to the second sequence. In some embodiments, at least one of the first sequence and the second sequence is repeated at least twice in a genome of a subject. In some embodiments, the first sequence and the second sequence are each at least 10 nucleotides in length. In some embodiments, the first sequence is on a first chromosome and the second sequence is on a second chromosome. In some embodiments, the first sequence and the second sequence are on the same chromosome but separated by at least 1 nucleotide. In some embodiments, the first sequence and the second sequence are in functional linkage. In some embodiments, the nucleic acid detector comprises at least one of a detection reagent. In some embodiments, the at least one detection reagent comprises an oligonucleotide probe capable of detecting the at least one cell-free DNA fragment of the plurality. In some embodiments, the at least one detection reagent is capable of detecting a fetal epigenetic signature. In some embodiments, the fetal epigenetic signature comprises nucleic acid methylation. In some embodiments, the methylation is allele-specific. In some embodiments, the devices further comprise a nucleic acid amplifier configured to the amplify nucleic acids from the sample component or the biological sample, and In some embodiments, the nucleic acid detector is further configured to detect amplified nucleic acids in the biological sample or the sample component. In some embodiments, the nucleic acid amplifier is an isothermal polymerase chain reaction (PCR) device. In some embodiments, the isothermal PCR device is a massively multiplexed PCR device (mmPCR). In some embodiments, the devices further comprise a genotype analyzer configured to compare the plurality of cell-free DNA fragments detected with a known genotype. In some embodiments, the plurality of cell-free DNA fragments comprise a fetal component, and the known genotype is a paternal genotype. In some embodiments, the nucleic acid amplifier comprises at least one nucleic acid amplification reagent and a single pair of primers to amplify the first sequence and the second sequence. In some embodiments, the nucleic acid detector comprises a nucleic acid sequencer. In some embodiments, the nucleic acid sequence comprises a signal detector. In some embodiments, the nucleic acid detector is a lateral flow strip. In some embodiments, the cell-free DNA comprise one or more single nucleotide polymorphisms (SNPs), insertion or deletion (indel), or a combination thereof. In some embodiments, the cell-free DNA is from a tumor. In some embodiments, the cell-free DNA is from a fetus. In some embodiments, the cell-free DNA is from a transplanted tissue or organ. In some embodiments, the cell-free nucleic acids are from a cell type or a tissue type with low abundance of cell-free nucleic acids, as compared to peripheral blood. In some embodiments, the cell-free DNA is from one or more pathogens. In some embodiments, the pathogen comprises a bacterium or component thereof. In some embodiments, the pathogen comprises a virus or a component thereof. In some embodiments, the pathogen comprises a fungus or a component thereof. In some embodiments, the sample purifier comprises a filter, and In some embodiments, the filter has a pore size of about 0.05 microns to about 2 microns. In some embodiments, the filter is a vertical filter. In some embodiments, the sample purifier comprises a binding moiety selected from an antibody, antigen binding antibody fragment, a ligand, a receptor, a peptide, a small molecule, and a combination thereof. In some embodiments, the binding moiety is capable of binding an extracellular vesicle. In some embodiments, the nucleic acid detector is configured to generate a library of tagged cell-free DNA fragments by: (a) generating ligation competent cell-free DNA fragments by one or more steps comprising: (i) generating a blunt end of the cell-free DNA fragments, In some embodiments, a 5′ overhang or a 3′ recessed end is removed using one or more polymerase and one or more exonuclease; (ii) dephosphorylating the blunt end of the cell-free DNA fragments; (iii) contacting the cell-free DNA fragments with a crowding reagent thereby enhancing a reaction between the one or more polymerases, one or more exonucleases, and the cell-free DNA fragments; or (iv) repairing or remove DNA damage in the cell-free DNA fragments using a ligase; and (v) ligating the ligation competent cell-free DNA fragments to adaptor oligonucleotides by contacting the ligation competent cell-free DNA fragments to adaptor oligonucleotides in the presence of a ligase, crowding reagent, and/or a small molecule enhancer. In some embodiments, the one or more polymerases comprises T4 DNA polymerase or DNA polymerase I. In some embodiments, the one or more exonucleases comprises T4 polynucleotide kinase or exonuclease III. In some embodiments, the ligase comprises T3 DNA ligase, T4 DNA ligase, T7 DNA ligase, Taq Ligase, Ampligase, E. coli Ligase, or Sso7-ligase fusion protein. In some embodiments, the crowding reagent comprises polyethylene glycol (PEG), glycogen, or dextran, or a combination thereof. In some embodiments, the small molecule enhancer comprises dimethyl sulfoxide (DMSO), polysorbate 20, formamide, or a diol, or a combination thereof. In some embodiments, ligating in (b) comprises blunt end ligating, or single nucleotide overhang ligating. In some embodiments, the adaptor oligonucleotides comprise Y shaped adaptors, hairpin adaptors, stem loop adaptors, degradable adaptors, blocked self-ligating adaptors, or barcoded adaptors, or a combination thereof. In some embodiments, the device is further configured to pool two or more biological samples, each sample obtained from a different subject. In some embodiments, the nucleic acid detector is further configured to count the tags to detect a representation of the nucleic acids of interest in the sample. In some embodiments, the devices further comprise a nucleic acid sequence output comprising a wireless communication device, a wired communication device, a cable port, or an electronic display. In some embodiments, the device is contained in a single housing. In some embodiments, the device operates at room temperature. In some embodiments, the device is capable of detecting the plurality of biomarkers in the cell-depleted sample within about five minutes to about twenty minutes of receiving the biological fluid. In some embodiments, the devices further comprise a communication connection. In some embodiments, the biological sample comprises blood, plasma, serum, urine, interstitial fluid, vaginal cells, vaginal fluid, cervical cells, buccal cells, or saliva. In some embodiments, the blood comprises capillary blood. In some embodiments, the sample purifier is configured to isolate plasma from less than 250 μl of blood. In some embodiments, the volume of the biological sample is not greater than 50 μl. In some embodiments, the volume of the biological sample is between about 10 μl and about 40 μl. In some embodiments, the biological sample contains about 25 pg to about 250 pg of total circulating cell-free DNA. In some embodiments, the biological sample contains about 5 to about 100 copies of a sequence of interest in the biological sample or the sample component. In some embodiments, the biological sample contains about 104 to about 109 cell-free nucleic acid molecules. In some embodiments, the biological sample contains about 104 to about 107 cell-free nucleic acid molecules. In some embodiments, the biological sample contains less than 300 pg of cell-free nucleic acid molecules. In some embodiments, the biological sample contains less than 3 ng of cell-free nucleic acid molecules. In some embodiments, the sample collector is configured to permit the biological sample obtained from the subject to collected by a process of: (a) inducing a first transdermal puncture to produce a first fraction of a biological sample; (b) discarding the first fraction of the biological sample; and (c) collecting a second fraction of the biological sample, thereby reducing or eliminating contamination of the biological sample due to white blood cell lysis. In some embodiments, the sample collector is configured to clean surface of a transdermal puncture site (e.g., skin) prior to obtaining the biological sample from the subject. In some instances, the cleaning comprises removing or reducing unwanted contaminant. In some instances, the unwanted contaminant comprises DNA from the transdermal puncture site. In some instances, the unwanted contaminant comprises DNA from cells or tissue surrounding the transdermal puncture site. In some instances, DNA is damaged. In some instances, the DNA is not damaged. In some instances, the transdermal puncture site is skin of a finger. Aspects disclosed herein, in some embodiments, are devices comprising: (a) a sample collector configured to collect about 1-100 microliter (μl) a biological sample of a subject; (b) a sample processor that is configured to isolate a sample component from the biological sample; (c) a detector that is configured to detect an epigenetic modification in the biological sample or the sample component; and (d) an information output. In some embodiments, the sample collector is configured to collect a sample from a transdermal puncture. In some embodiments, the epigenetic modification comprises DNA methylation at a genetic locus, a histone methylation, histone, ubiquitination, histone acetylation, histone phosphorylation, micro RNA (miRNA). In some embodiments, the DNA methylation comprises CpG methylation or CpH methylation. In some embodiments, the genetic locus comprises a promoter or regulatory element of a gene. In some embodiments, the genetic locus comprises a variable long terminal repeat (LTR). In some embodiments, the genetic locus comprises a cell-free DNA or fragment thereof. In some embodiments, the genetic locus comprises a single nucleotide polymorphism (SNP). In some embodiments, histone acetylation is indicated by a presence or level of histone deacetylases. In some embodiments, the histone modification is at a histone selected from the group consisting of histone 2A (H2A), histone 2B (H2B, histone 3 (H3), and histone 4 (H4). In some embodiments, the histone methylation is methylation of H3 lysine 4 (H3K4me2). In some embodiments, the histone acetylation is deacetylation at H4. In some embodiments, the miRNA are selected from the group consisting of miR-21, miR-126, mi-R142, mi-R146a, mi-R12a, mi-R181a, miR-29c, miR-29a, miR-29b, miR-101, miRNA-155, and miR-148a. In some embodiments, the biological sample comprises blood, plasma, serum, urine, interstitial fluid, vaginal cells, vaginal fluid, cervical cells, buccal cells, or saliva. In some embodiments, the blood comprises capillary blood. In some embodiments, the capillary blood comprises not more than 40 microliters of blood. In some embodiments, the devices further comprise pooling two or more biological samples, each sample obtained from a different subject. In some embodiments, the biological sample obtained from the subject was collected by transdermal puncture. In some embodiments, the biological sample obtained from the subject was not collected by transdermal puncture. In some embodiments, the biological sample obtained from the subject was collected using a device configured to lyse intercellular junctions of an epidermis of the subject. In some embodiments, the biological sample obtained from the subject was collected by a process of: (a) inducing a first transdermal puncture to produce a first fraction of a biological sample; (b) discarding the first fraction of the biological sample; and (c) collecting a second fraction of the biological sample, thereby reducing or eliminating contamination of the fluid. In some embodiments, the devices further comprise a white blood cell stabilizer In some embodiments, the sample collector is configured to permit the biological sample obtained from the subject to collected by a process of: (a) inducing a first transdermal puncture to produce a first fraction of a biological sample; (b) discarding the first fraction of the biological sample; and (c) collecting a second fraction of the biological sample, thereby reducing or eliminating contamination of the biological sample due to white blood cell lysis. In some embodiments, the sample collector is configured to clean surface of a transdermal puncture site (e.g., skin) prior to obtaining the biological sample from the subject. In some instances, the cleaning comprises removing or reducing unwanted contaminant. In some instances, the unwanted contaminant comprises DNA from the transdermal puncture site. In some instances, the unwanted contaminant comprises DNA from cells or tissue surrounding the transdermal puncture site. In some instances, DNA is damaged. In some instances, the DNA is not damaged. In some instances, the transdermal puncture site is skin of a finger. In some embodiments, the biological sample is not derived from venous blood of the subject.


Aspects disclosed herein comprise methods of increasing a relative amount of a target nucleic acid in a biological sample obtained forma subject comprising: (a) inducing a transdermal puncture at a site to produce a first fraction and a second fraction of a biological sample; (b) discarding the first fraction of the biological sample; and (c) collecting the second fraction of the biological sample, thereby reducing or eliminating contamination or nucleic acid damage of the biological sample, wherein the first fraction comprises a lower fraction of a target nucleic acid, as compared to a fraction of the target nucleic acid in the second fraction. In some embodiments, the methods further comprising cleaning the site before inducing the transdermal puncture, thereby removing or reducing unwanted contaminant. In some embodiments, the unwanted contaminant comprises DNA from the transdermal puncture site. In some instances, the unwanted contaminant comprises DNA from cells or tissue surrounding the transdermal puncture site. In some instances, DNA is damaged. In some instances, the DNA is not damaged. In some embodiments, the transdermal puncture site is skin of a finger. In some embodiments, the contamination comprises nucleic acids from tissue surrounding the site. In some embodiments, the nucleic acid damage comprises damage to non-apoptotic DNA in the biological sample. In some embodiments, the biological sample is not derived from venous blood of the subject.


Other objects, features and advantages of the present disclosure will become apparent to those skilled in the art from the following detailed description. It is to be understood, however, that the detailed description and specific examples, while indicating some embodiments of the present disclosure are given by way of illustration and not limitation. Many changes and modifications within the scope of the present disclosure may be made without departing from the spirit thereof, and the disclosure includes all such modifications. Moreover aspects of one embodiment may be utilized in other, different embodiments.


INCORPORATION BY REFERENCE

All publications, patents, and patent applications mentioned in this specification are herein incorporated by reference to the same extent as if each individual publication, patent, or patent application was specifically and individually indicated to be incorporated by reference.





BRIEF DESCRIPTION OF THE DRAWINGS

The novel features of the methods, devices, systems and kits disclosed herein are set forth with particularity in the appended claims. A better understanding of the features and advantages of the present devices, systems and kits disclosed herein will be obtained by reference to the following detailed description that sets forth illustrative embodiments, in which the principles of the devices, systems and kits disclosed herein are utilized, and the accompanying drawings of which:



FIG. 1 shows optional workflows for methods disclosed herein.



FIG. 2 shows a diagram of how components of the methods and systems disclosed herein can be distributed amongst various locations (physical and/or electronic), or focused primarily in one physical location.



FIG. 3 shows results of trisomy detection from ultra-low sample amounts generated by low coverage whole genome sequencing-by-synthesis. Depicted are the Z scores for the representation of chromosome 21 from reference and test samples. The dotted line represents a Z score of 3. A test sample showing a Z score equal or higher than 3 means that the sample contains a higher representation of chromosome 21 and is considered trisomic for chromosome 21. If the sample came from a pregnant women, the extra amount of chromosome 21 detected is contributed by the fetus and therefore it is concluded the fetus is trisomic for chromosome 21.



FIG. 4A shows a process overview for devices that are connected to remote systems and individuals. FIG. 4B shows an exemplary interface for devices that are connected to remote systems and individuals.



FIG. 5 shows a mobile device and how a mobile application is configured to connect with, communicate with, and receive genetic information and other information from the devices, systems and kits disclosed herein. FIG. 5A shows various functions that the mobile application provides. FIG. 5B shows a step-by-step walkthrough to guide a user through use of the devices, systems and kits disclosed herein. FIG. 5C shows interface elements allowing a user to start a test, view and share test results, and interact with others. FIG. 5D shows an interface for monitoring the status of a test. FIG. 5E shows how results can be shared.



FIG. 6 shows typical amounts of cfDNA fragments expected in different process steps of low-coverage whole genome sequencing using 8-10 ml of venous blood as a starting amount.



FIG. 7 shows the importance of increasing sequencing library efficiency to significantly improve sensitivity for applications using ultra-low cfDNA input amounts.



FIG. 8A-C shows electropherograms of sequencing libraries generated from decreasing amounts of cell-free (cfDNA) input. The input amount of cell-free DNA varied from 20 genome equivalents (20 GE) in FIG. 8A down to 1 genome equivalent (1 GE) in FIG. 8C. While the overall yield in library decreases, the amount adaptor dimers do not increase significantly and there is still sufficient amount and quality of library available for successful sequence analysis.



FIG. 9 shows detection of low fraction Y-chromosome (2.5% or greater) using low coverage Whole Genome Sequencing-by-Synthesis with ultra-low amounts of cfDNA (10 genome equivalents) isolated from venous blood.



FIG. 10 shows detection of low fraction Y-chromosome (2.5% or greater) using low coverage Whole Genome Sequencing-by-Synthesis with ultra-low input amounts of cfDNA isolated from capillary blood/plasma mixtures of female and male DNA.



FIG. 11 shows a cfDNA fragment size distribution comparison between cfDNA from capillary blood and venous blood based on paired end sequencing data.



FIG. 12 shows the detection of fetal chromosomal aneuploidy using low coverage Whole Genome Sequencing-by-Synthesis with ultra-low input amounts of cfDNA derived from blood/plasma of pregnant women. Ultra-low input amounts of cfDNA from non-trisomic reference samples were used to determine the median and median absolute deviation of the chromosome 21 representation. Test samples were ultra-low amounts of cfDNA (10 GE) from pregnant women carrying either a normal fetus (no trisomy) or a fetus with a chromosome 21 trisomy.



FIG. 13 shows the detection of fetal chromosomal aneuploidy using low coverage Whole Genome Sequencing-by-Synthesis with ultra-low input amounts of cfDNA derived from blood/plasma of pregnant women. Analysis was performed without a reference sample set using an sample internal method of determining trisomy 21 status. Test samples were ultra-low amounts of cfDNA (10 GE) from pregnant women carrying either a normal fetus (no trisomy) or a fetus with a chromosome 21 trisomy.



FIGS. 14A-14C show that a standard library preparation and sequencing method results in a lower representation of fetal cell-free DNA, as compared to a low-input optimized protocol, when ten (10) genomic equivalents are tested. FIG. 14A and FIG. 14B show the relationship between median bin count and median absolute deviation (MAD) per bin for the standard versus optimized protocol data sets. FIG. 14C. shows a matrix that allows to correlate sequence reads and genome equivalents for different library preparation efficiencies. FIG. 14D shows optimized protocol data points in yellow, standard protocol points in blue. Library preparation and sequencing with the standard protocol yields fewer effective sampled Genome Equivalents in sequencing, as compared to the optimized protocol of the present disclosure (median for Life=1.355, median for ILMN=6.065). FIG. 14E shows that the standard protocol data showed good specificity (0 false positives, 100% specificity) but poor sensitivity (2 false negatives, 50% sensitivity). FIG. 14F shows that the data derived from the standard protocol library preparation and sequencing is noisy and does not allow for an easy delineation of samples carrying a male versus female fetus. FIG. 14G shows that a combined fetal fraction measurement for all samples correlated well with the observed effect introduced by chr21 using the standard protocol (left) and the optimized protocol (right)). FIG. 14H shows that higher effective copy numbers resulted from the optimized protocol as compared to the standard protocol causing even wrong results on fetal sex for the standard protocol. FIG. 14I provides an explanation for the poor sensitivity (2 false negatives) of the standard protocol, with the red line simulating a 50% sensitivity using an estimated PCR efficiency of 90%, a library efficiency of only 5% and 36M sequence reads, in line with the actual data plotted from the 4 samples analyzed with the standard protocol (FIG. 14I).



FIG. 15 shows capillary blood based circulating cell-free DNA sequencing data retrieved from a patient with advanced cancer prior to and after treatment.



FIG. 16 shows expected results detecting circulating cell-free DNA from K. pneumoniae in patient capillary blood obtained from a hospitalized patient with a bloodstream infection (BSI).



FIG. 17A-17B shows the size distribution profile of cfDNA from venous blood (FIG. 17A) and capillary blood (FIG. 17B), as determined by sequencing of cfDNA fragments.



FIG. 18A shows a comparison of venous and capillary blood cfDNA fragment distribution.



FIG. 18B: Comparison of DNA size distribution between venous blood and surface bound DNA. FIG. 18C shows DNA fragment size distribution between cfDNA from venous blood and capillary blood with damaged cells. FIG. 18D shows DNA fragment size distribution between cfDNA from venous blood and capillary blood with damaged cells, zoom in on fragment size below 500 bp.



FIG. 19 shows a comparison of “wiped” and “non-wiped” capillary blood collection samples for differences in DNA fragment size distributions.



FIG. 20. shows Z-scores of control euploidy samples or control trisomy samples that are classified as euploidy or trisomy with 100% accuracy using the methods described herein. A Z-score of 3.5 and higher were classified as trisomic and samples with a Z-score of less than 3.5 were classified as euploid.





CERTAIN TERMINOLOGIES

The following descriptions are provided to aid the understanding of the methods, systems and kits disclosed herein. The following descriptions of terms used herein are not intended to be limiting definitions of these terms. These terms are further described and exemplified throughout the present application.


In general, the terms “cell-free polynucleotide,” “cell-free nucleic acid,” used interchangeably herein, refer to polynucleotides and nucleic acids that can be isolated from a sample without extracting the polynucleotide or nucleic acid from a cell. A cell-free nucleic acid may comprise DNA. A cell-free nucleic acid may comprise RNA. A cell-free nucleic acid is a nucleic acid that is not contained within a cell membrane, i.e., it is not encapsulated in a cellular compartment. In some embodiments, a cell-free nucleic acid is a nucleic acid that is not bounded by a cell membrane and is circulating or present in blood or other fluid. In some embodiments, the cell-free nucleic acid is cell-free before and/or upon collection of the biological sample containing it, and is not released from the cell as a result of sample manipulation by man, intentional or otherwise, including manipulation upon or after collection of the sample. In some instances, cell-free nucleic acids are produced in a cell and released from the cell by physiological means, including, e.g., apoptosis, and non-apoptotic cell death, necrosis, autophagy, spontaneous release (e.g., of a DNA/RNA-lipoprotein complex), secretion, and/or mitotic catastrophe. In some embodiments, a cell-free nucleic acid comprises a nucleic acid that is released from a cell by a biological mechanism, (e.g., apoptosis, cell secretion, vesicular release). In further or additional embodiments, a cell-free nucleic acid is not a nucleic acid that has been extracted from a cell by human manipulation of the cell or sample processing (e.g., cell membrane disruption, lysis, vortex, shearing, etc.).


In some instances, the cell-free nucleic acid is a cell-free fetal nucleic acid. In general, the term, “cell-free fetal nucleic acid,” as used herein, refers to a cell-free nucleic acid, as described herein, wherein the cell-free nucleic acid is from a cell that comprises fetal DNA. In pregnant women, the cell-free DNA originating from the placenta can contribute a noticeable portion of the total amount of cell-free DNA. Placental DNA is often a good surrogate for the fetal DNA, because in most cases it is highly similar to the DNA of the fetus. Applications like chorionic villus sampling have exploited this fact to establish diagnostic application. Often, a large portion of cell-free fetal nucleic acids are found in maternal biological samples as a result of placental tissue being regularly shed during the pregnant subject's pregnancy. Often, many of the cells in the placental tissue shed are cells that contain fetal DNA. Cells shed from the placenta release fetal nucleic acids. Thus, in some instances, cell-free fetal nucleic acids disclosed herein are nucleic acids release from a placental cell.


As used herein, the term “cellular nucleic acid” refers to a polynucleotide that is contained in a cell or released from a cell due to manipulation of the biological sample. Non-limiting examples of manipulation of the biological sample include centrifuging, vortexing, shearing, mixing, lysing, and adding a reagent (e.g., detergent, buffer, salt, enzyme) to the biological sample that is not present in the biological sample when it is obtained. In some instances, the cellular nucleic acid is a nucleic acid that has been released from a cell due to disruption or lysis of the cell by a machine, human or robot. In some instances, cellular nucleic acids (nucleic acids contained by cells) are intentionally or unintentionally released from cells by devices and methods disclosed herein. However, these are not considered “cell-free nucleic acids,” as the term is used herein. In some instances, devices, systems, kits and methods disclosed herein provide for analyzing cell-free nucleic acids in biological samples, and in the process analyze cellular nucleic acids as well.


As used herein, the term “biomarker” generally refers to any marker of a subject's biology or condition. A biomarker may be an indicator or result of a disease or condition. A biomarker may be an indicator of health. A biomarker may be an indicator of a genetic abnormality or inherited condition. A biomarker may be a circulating biomarker (e.g., found in a biological fluid such as blood). A biomarker may be a tissue biomarker (e.g., found in a solid organ such as liver or bone marrow). Non-limiting examples of biomarkers include nucleic acids, epigenetic modifications, proteins, peptides, antibodies, antibody fragments, lipids, fatty acids, sterols, polysaccharides, carbohydrates, viral particles, microbial particles. In some cases, biomarkers may even include whole cells or cell fragments.


As used herein, the term, “tag” generally refers to a molecule that can be used to identify, detect or isolate a nucleic acid of interest. The term, “tag,” may be used interchangeably with other terms, such as “label,” “adapter,” “oligo,” and “barcode,” unless specified otherwise. Note, however, that the term, “adapter,” can be used to ligate two ends of a nucleic acid or multiple nucleic acids without acting as a tag.


As used herein, the term “genetic information” generally refers to one or more nucleic acid sequences. In some instances, genetic information may be a single nucleotide or amino acid. For example, genetic information could be the presence (or absence) of a single nucleotide polymorphism. Unless specified otherwise, the term “genetic information” may also refer to epigenetic modification patterns, gene expression data, and protein expression data. In some instances, the presence, absence or quantity of a biomarker provides genetic information. For instance, cholesterol levels may be indicative of a genetic form of hypercholesterolemia. Thus, genetic information should not be limited to nucleic acid sequences.


As used herein, the term, “genetic mutation,” generally refers to an alteration of a nucleotide sequence of a genome. A genetic mutation is different from natural variation or allelic differences. The genetic mutation may be found in less than 10% of the subject's species. The genetic mutation may be found in less than 5% of the subject's species. The genetic mutation may be found in less than 1% of the subject's species. A genetic mutation in a subject may cause a disease or a condition in the subject. The genetic mutation may result in a frameshift of a protein-coding sequence. The genetic mutation may result in a deletion of at least a portion of a protein-coding sequence. The genetic mutation may result in a loss of a stop codon in a protein-coding sequence. The genetic mutation may result in a premature stop codon in a protein-coding sequence. The genetic mutation may result in a sequence that encodes a misfolded protein. The genetic mutation may result in a sequence that encodes a dysfunctional protein or non-functional protein (e.g., loss of binding or enzymatic activity). The genetic mutation may result in a sequence that encodes an overactive protein (e.g., increased binding or enzymatic activity). The genetic mutation my affect a single nucleotide (e.g., a single nucleotide variation or single nucleotide polymorphism). The genetic mutation my affect multiple nucleotides (e.g., frameshift, translocation).


As used herein, the term “specific to,” refers to a sequence or biomarker that is found only in, on or at the thing that the sequence or biomarker is specific to. For example, if a sequence is specific to a Y chromosome that means that it is only found on the Y chromosome and not on another chromosome.


As used herein, the terms, “normal individual” and “normal subject” refer to a subject that does not have a condition or disease of interest. For example, if the method or device being described is being used to detect a type of cancer, a normal subject does not have that type of cancer. The normal subject may not have cancer at all. In some instances, the normal subject is not diagnosed with any disease or condition. In some instances, the normal subject does not have a known genetic mutation. In some instances, the normal subject does not have a genetic mutation that results in a detectable phenotype that would distinguish the subject from a normal subject that does not have a known genetic mutation. In some instances, the normal subject is not infected by a pathogen. In some instances, the normal subject is infected by a pathogen, but has no known genetic mutation.


As used herein, the term “genomic equivalent” generally refers to the amount of DNA necessary to be present in a purified sample to guarantee that all genes will be present.


As used herein, the term “tissue-specific,” or the phrase, “specific to a tissue,” generally refers to a polynucleotide that is predominantly expressed in a specific tissue. Often, methods, systems and kits disclosed herein utilize cell-free, tissue-specific polynucleotides. Cell-free, tissue-specific polynucleotides described herein are polynucleotides expressed at levels that can be quantified in a biological fluid upon damage or disease of the tissue or organ in which they are expressed. In some cases, the presence of cell-free tissue-specific polynucleotides disclosed herein in a biological fluid is due to release of cell-free tissue-specific polynucleotides upon damage or disease of the tissue or organ, and not due to a change in expression of the cell-free tissue-specific polynucleotides. Elevated levels of cell-free tissue-specific polynucleotides disclosed herein may be indicative of damage to the corresponding tissue or organ. In some instances, cell-free polynucleotides disclosed herein are expressed/produced in several tissues, but at tissue-specific levels in at least one of those tissues. In these cases, the absolute or relative quantity of the cell-free tissue-specific polynucleotide is indicative of damage to, or disease of, a specific tissue or organ, or collection of tissues or organs. Alternatively or additionally, tissue-specific polynucleotides are nucleic acids with tissue-specific modifications. Tissue-specific polynucleotides may comprise RNA. Tissue-specific polynucleotides may comprise DNA. By way of non-limiting example, tissue-specific polynucleotides or markers disclosed herein include DNA molecules (e.g., a portion of a gene or non-coding region) with tissue-specific methylation patterns. In other words, the polynucleotides and markers may be expressed similarly in many tissues, or even ubiquitously throughout a subject, but the modifications are tissue-specific. Generally, tissue-specific polynucleotides or levels thereof disclosed herein are not specific to a disease. Generally, tissue-specific polynucleotides disclosed herein do not encode a protein implicated in a disease mechanism.


In some instances, a tissue-specific polynucleotide is present in a greater amount in a tissue of interest than it is present in blood of the subject. In some instances, the RNA is present in a greater amount in a tissue of interest than it is present in a blood cell. In some instances, the tissue-specific polynucleotide is not expressed by a blood cell. In some instances, the presence of the tissue-specific polynucleotide is at least two fold greater in the tissue than in the blood. In some instances, the presence of the tissue-specific polynucleotide is at least five fold greater in the tissue than in the blood. In some instances, the presence of the tissue-specific polynucleotide is at least ten fold greater in the tissue than in the blood.


In some instances, the presence of the tissue-specific polynucleotide is at least three fold higher in the tissue than any other tissue of the subject. In some instances, the presence of the tissue-specific polynucleotide is at least five fold higher in the tissue than any other tissue. In some instances, the presence of the tissue-specific polynucleotide is at least ten fold higher in the tissue than any other tissue. In some instances, the presence of the tissue-specific polynucleotide is at least three fold higher in no more than two tissues than any other tissue. In some instances, the presence of the tissue-specific polynucleotide is at least five fold higher in no more than two tissues than any other tissue. In some instances, the presence of the tissue-specific polynucleotide is at least ten fold higher in no more than two tissues than any other tissue. In some instances, the presence of the tissue-specific polynucleotide is at least three fold higher in no more than three tissues than any other tissue. In some instances, the presence of the tissue-specific polynucleotide is at least five fold higher in no more than three tissues than any other tissue. In some instances, the presence of the tissue-specific polynucleotide is at least ten fold higher in no more than three tissues than any other tissue.


In some instances, the tissue-specific polynucleotide is specific to a target cell type. In some instances, the presence of the tissue-specific polynucleotide is at least three fold higher in the target cell type than a non-target cell type. In some instances, the presence of the tissue-specific polynucleotide is at least five fold higher in the target cell type than the non-target other cell type. In some instances, the presence of the tissue-specific polynucleotide is at least ten fold higher in the target cell type than the non-target cell type. In some instances, the presence of the tissue-specific polynucleotide is at least three fold higher in no more than two target cell types than non-target cell types. In some instances, the presence of the tissue-specific polynucleotide is at least five fold higher in no more than two target cell types than non-target cell types. In some instances, the RNA is expressed at least ten fold higher in no more than two target cell types than non-target cell types. In some instances, the presence of the tissue-specific polynucleotide is at least three fold higher in no more than three target cell types than non-target cell types. In some instances, the presence of the tissue-specific polynucleotide is at least five fold higher in no more than three target cell types than non-target cell types. In some instances, the presence of the tissue-specific polynucleotide is at least ten fold higher in no more than three target cell types than non-target cell types.


As used herein, the terms, “isolate,” “purify,” “remove,” “capture,” and “separate,” may all be used interchangeably unless specified otherwise.


As used herein, the terms, “clinic,” “clinical setting,” “laboratory” or “laboratory setting” refer to a hospital, a clinic, a pharmacy, a research institution, a pathology laboratory, a or other commercial business setting where trained personnel are employed to process and/or analyze biological and/or environmental samples. These terms are contrasted with point of care, a remote location, a home, a school, and otherwise non-business, non-institutional setting.


As used herein, the term ‘about’ a number refers to that number plus or minus 10% of that number. The term ‘about’ when used in the context of a range refers to that range minus 10% of its lowest value and plus 10% of its greatest value.


As used herein, the singular forms “a”, “an” and “the” include plural references unless the context clearly dictates otherwise. For example, the term “a sample” includes a plurality of samples, including mixtures thereof.


The term, “accuracy,” should be given its broadest definition in light of the specification. However, the term “accuracy” may be used to refer to a statistical measure of how well a binary classification test correctly identifies or excludes a condition. As used herein, the term “accuracy” may also refer to the proportion of true results (both true positives and true negatives) among all samples examined. As used herein, the term “accuracy” may encompass “Rand accuracy” or accuracy as determined by the “Rand index.”


As used herein, the terms “homologous,” “homology,” or “percent homology” describe sequence similarity of a first amino acid sequence or a nucleic acid sequence relative to a second amino acid sequence or a nucleic acid sequence. In some instances, homology can be determined using the formula described by Karlin and Altschul (Proc. Natl. Acad. Sci. USA 87: 2264-2268, 1990, modified as in Proc. Natl. Acad. Sci. USA 90:5873-5877, 1993). Such a formula is incorporated into the basic local alignment search tool (BLAST) programs of Altschul et al. (J. Mol. Biol. 215: 403-410, 1990). Percent homology of sequences can be determined using the most recent version of BLAST, as of the filing date of this application. In some cases, 2 or more sequences may be homologous if they share at least 20%, 25%, 30%. 35%, 40%, 45% 50%, 55%, 60% identity, 65%, 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or higher identity when compared and aligned for maximum correspondence over a comparison window, or designated region as measured using one of the following sequence comparison algorithms or by manual alignment and visual inspection. In some cases, 2 or more sequences may be homologous if they share at most 20%, 25%, 30%. 35%, 40%, 45% 50%, 55%, 60% identity, 65%, 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or higher identity. Preferably, the % identity or homology exists over a region that is at least 16 amino acids or nucleotides in length or in some cases over a region that is about 50 to about 100 amino acids or nucleotides in length. In some cases, the % identity or homology exists over a region that is about 100 to about 1000 amino acids or nucleotides in length. In some cases, 2 or more sequences may be homologous and share at least 20% identity over at least 100 amino acids in a sequence. For sequence comparison, generally one sequence acts as a reference sequence, to which test sequences may be compared. When using a sequence comparison algorithm, test and reference sequences may be entered into a computer, subsequent coordinates may be designated, if necessary, and sequence algorithm program parameters may be designated. Any suitable algorithm may be used, including but not limited to Smith-Waterman alignment algorithm, Viterbi, Bayesians, Hidden Markov and the like. Default program parameters can be used, or alternative parameters can be designated. The sequence comparison algorithm may then be used to calculate the percent sequence identities for the test sequences relative to the reference sequence, based on the program parameters. Any suitable algorithm may be used, whereby a percent identity is calculated. Some programs for example, calculate percent identity as the number of aligned positions that identical residues, divided by the total number of aligned positions. A “comparison window”, as used herein, includes reference to a segment of any one of the number of contiguous or non-contiguous positions which may range from 10 to 600 positions. In some cases the comparison window may comprise at least 10, 20, 50, 100, 200, 300, 400, 500, or 600 positions. In some cases the comparison window may comprise at most 10, 20, 50, 100, 200, 300, 400, 500, or 600 positions. In some cases the comparison window may comprise at least 50 to 200 positions, or at least 100 to at least 150 positions in which a sequence may be compared to a reference sequence of the same number of contiguous or non-contiguous positions after the two sequences are optimally aligned. Methods of alignment of sequences for comparison are well-known in the art. Optimal alignment of sequences for comparison can be conducted, e.g., by the local homology algorithm of Smith and Waterman, Adv. Appl. Math. 2:482 (1981), by the homology alignment algorithm of Needleman and Wunsch, J. Mol. Biol. 48:443 (1970), by the search for similarity method of Pearson and Lipman, Proc. Nat'l. Acad. Sci. USA 85:2444 (1988), by computerized implementations of these algorithms (GAP, BESTFIT, FASTA, and TFASTA in the Wisconsin Genetics Software Package, Genetics Computer Group, 575 Science Dr., Madison, Wis.), or by manual alignment and visual inspection (see, e.g., Current Protocols in Molecular Biology (Ausubel et al, eds. 1995 supplement)), In some cases, a comparison window may comprise any subset of the total alignment, either contiguous positions in primary sequence, adjacent positions in tertiary space but discontinuous in the primary sequence, or any other subset of 1 up to all residues in the alignment.


As used herein, the terms overrepresentation and underrepresentation generally refer to the difference between a sample and a control representation of target nucleic acids. A representation may be significantly less than a control representation and therefore underrepresented. A representation may be significantly more than a control representation and therefore underrepresented. The significance may be statistical. Methods of determining statistical significance are well known in the art. By way of non-limiting example, statistical significance performed using standard two tailed t-test (*: p<0.05; **p<0.01).


As used herein, the term “cloud” refers to shared or sharable storage of electronic data. The cloud may be used for archiving electronic data, sharing electronic data, and analyzing electronic data.


Throughout the application, there is recitation of the phrases “nucleic acid corresponding to a chromosome,” and “sequence corresponding to a chromosome.” As used herein, these phrases are intended to convey that the “nucleic acid corresponding to the chromosome” is represented by a nucleic acid sequence that is identical or homologous to a sequence found in that chromosome. The term “homologous” is described in the foregoing description.


A “single nucleotide polymorphism (SNP),” as used herein, refers to a single nucleotide that may differ between the genomes of two members of the same species. The usage of the term should not imply any limit on the frequency with which each variant occurs. In some instances, the SNP is mono-, bi-, tri- or tetra-allelic.


The term, “indel,” as used herein refers to an insertion or a deletion of a nucleobase that may differ between the genomes of two members of the same species. In some instances, the indel is mono-, bi-, tri- or tetra-allelic. In some instances, the insertion comprises one nucleobase, two nucleobases, three nucleobases, four nucleobases, five nucleobases, or more.


Throughout the application, there is recitation of chromosome positions. These position numbers are in reference to Genome Build hg38 (UCSC) and GRCh38 (NCBI). A genome build may also be referred to in the art as a reference genome or reference assembly. It may be derived from multiple subjects. It is understood that there are multiple reference assemblies available and more reference assemblies may be produced over time. However, one skilled in the art would be able to determine the relative positions provided herein in another genome build or reference genome.


DETAILED DESCRIPTION

Genetic testing is traditionally performed in a laboratory or clinical setting. However, in many instances where genetic testing would be useful, access to a laboratory or clinic is unavailable or impractical. High complexity testing such as analysis of circulating tumor DNA or fetal DNA testing is still rare because of limited access to such tests (e.g., requirement for a phlebotomy, timing, appointments required, distance to a clinic/laboratory) and cost of such tests (e.g., costs of performing a phlebotomy, processing milliliters of samples, sample tubes and reagents, shipping, particularly cold shipping). Thus, genetic tests that are operable at a point of need (e.g., locations remote from laboratories and clinics) are desirable. Genetic tests for operation at a point of need (e.g., home, school, farm) are preferably cost effective and simple for an untrained individual to perform. Genetic tests at point of need preferably require only small amounts of a biological sample. Traditionally, genetic testing requires a venous blood draw (phlebotomy) to obtain milliliters of blood containing enough DNA to be analyzed. However, a phlebotomy is not practical at a point of need. Ideally, a genetic test would only require amounts of blood achieved through the retrieval of capillary blood, e.g., via a transdermal puncture device or other means. This means point of need devices and methods for genetic testing need to be designed to function with ultra-low inputs of sample and a lower abundance of target molecules that are intended to be detected.


Genetic testing has also traditionally required transdermal puncture. However, transdermal puncture, regardless of sample volume, may cause inconvenience, discomfort, and in some instances, pain, to the subject. Accordingly, genetic testing devices that obviate the need for transdermal puncture are desirable.


In addition to accommodating ultra-low inputs of sample, it is desirable to have a genetic test that is capable of analyzing circulating cell-free nucleic acids (DNA and RNA), e.g., circulating cell-free fetal DNA, circulating tumor DNA, circulating DNA from a transplanted donor organ, and circulating DNA released from a specific tissue as part of a health related issue, disease progression or treatment response. However, analysis of circulating cell-free nucleic acids is challenging due to their short half-life and therefore low abundance. In addition, circulating cell-free nucleic acids in blood can be diluted by DNA released from white blood cells if care is not taken with the sample to avoid white blood cell lysis. White blood cell DNA creates background noise during detection of circulating cell-free nucleic acids, decreasing assay sensitivity and specificity.


Devices, systems, kits and methods disclosed herein overcome these challenges by combining gentle and efficient processing of small sample volumes (e.g., less than 1 ml) with a unique target region selection and assay design that takes advantage of the highly fragmented nature of circulating cell-free DNA (cfDNA). For example, devices, systems, kits and methods disclosed herein may provide reliable genetic information from a single finger prick. In some embodiments, the reliable genetic information is obtained from a single finger prick after the initial perfusion of capillary blood is discarded. In other embodiments, the devices, systems, kits and methods provide reliable genetic information without a need for transdermal puncture, for e.g., lysing the tight junctions of the skin such that fluid containing reliable genetic information may be extracted from the skin without a need for transdermal puncture.


Devices, systems, kits and methods disclosed herein provide for analysis of multiple target regions along a target gene that are spaced far enough apart that the target regions are likely going to be physically separate when the target gene is fragmented in circulation. Thus, while the above described limits of statistical sampling exist for individual long DNA fragments that are traditionally analyzed in genetic testing, the sampling statistics change favorably for cfDNA fragments. While there may be an aggregate of only 1 genome equivalent present in a capillary blood sample, there are many individual cfDNA fragments. Consequently, sensitive amplification can be achieved from ultra-low input amounts.


As an example, if twenty target regions are present along a genomic region and they are spaced far enough apart that they can be independently analyzed and detected when the region is fragmented, the input volume required to have at least 1 target region in 99% of all samples changes from 140 microliters (μl) to 25 μl, significantly increasing sensitivity. In some instances, the target regions contain identical sequences or similar sequences. These target regions may be referred to as copies.


In other instances, target regions may not share similar sequences, but share another characteristic such as a similar epigenetic status. For example, the target regions may have different sequences but they are all hyper-methylated. Regardless of the basis for the similarity between the regions, they are spaced appropriately to leverage the fragmentation pattern of circulating cell-free DNA which produces many circulating cfDNA fragments of which at least one can be detected in a small volume. By way of non-limiting example, selected target regions that are distant enough from each other to be on separate cfDNA fragments and are all hyper-methylated when a subject has cancer can be detected with bisulfate sequencing. In a small sample volume (e.g., a finger prick of blood), the likelihood that all of these fragments are present (which is equivalent to non-fragmented DNA) is low, but the likelihood that at least one fragment is present is high, and the cancer can be detected.


In yet other instances, the target regions may not contain similar sequences and may not contain similar epigenetic status. In this case, detection may require multiple primer sets or library preparation followed by amplification with universal primers to detect several distinct target regions. By way of non-limiting example, the detection of a fetal RHD gene in an RHD negative pregnant mother could be achieved from a finger prick amount of blood by using multiple sets of primers to detect multiple different exons of the RHD gene in cell-free fetal DNA fragments. Sensitivity can be increased by choosing primers that amplify regions that are physically distant in the RHD gene and therefore likely to be present on different cell-free DNA fragments. Detecting a fetal RHD gene in an RHD negative pregnant mother is important to prevent hemolytic disease of a newborn by the mother having antibodies against the child's blood. RHD testing is currently performed today from full blood draws (eight milliliters of blood) to achieve the appropriate reliable results. This volume is believed to be necessary to achieve reliable results because it is based on the likelihood that the entire RHD gene will be present in the sample. Based on this assumption, the likelihood of getting the whole RHD gene in a finger prick amount of blood is low and would easily lead to false negative results.


Regardless of how target regions are chosen, these regions are present in the sample as individual biomarkers when amplification or detection is performed on cell-free fragmented DNA. The concentration of the fragments containing the target region is greater than the corresponding non-fragmented DNA or a fragment that cannot be assayed as a group. Thus, there will be more signal from the target region than one would get from non-fragmented DNA or from assaying for one copy of the target region. One will be much more likely to detect a target region present in an ultra-low volume of sample than a non-target region that is not repeated or does not share some commonality with another region. By assaying multiple target regions in multiple DNA fragments, assay sensitivity is increased relative to traditional testing. Blood is a reliable source of cell-free nucleic acids. The methods disclosed herein for analyzing cell-free nucleic acids from blood involve isolating the plasma or serum fraction containing the cell-free nucleic acids. Devices, systems, kits and methods disclosed herein allow for gentle processing of a blood sample at a point of need. Devices, systems, kits and methods disclosed herein further allow, in some embodiments, obtaining a blood sample after a transdermal puncture (e.g., finger prick) only after the initial perfusion of blood is discarded, thereby removing damaged white blood cells caused by the transdermal puncture, if any. In other embodiments, methods, devices, and systems allow for lysing of tight junctions in the skin enabling extraction of fluid containing the same cell-free nucleic acids as in the capillary blood from the skin, without a need for transdermal puncture. These methods, systems, and devices may avoid, prevent or reduce white blood cell lysis. In some embodiments, devices, systems, kits, and methods disclosed herein further enable a biological sample obtained from the subject collected by a process of: (a) inducing a first transdermal puncture to produce a first fraction of a biological sample; (b) discarding the first fraction of the biological sample; and (c) collecting a second fraction of the biological sample, thereby reducing or eliminating contamination of the biological sample due to white blood cell lysis. In some embodiments, cleaning a surface of a transdermal puncture site (e.g., skin) is provided herein prior to obtaining the biological sample from the subject. In some instances, the cleaning comprises removing or reducing unwanted contaminant. In some instances, the unwanted contaminant comprises DNA from the transdermal puncture site. In some instances, the transdermal puncture site is skin of a finger.


Devices, systems, kits and methods disclosed herein allow for rapid processing of a blood sample at a point of need. This avoids elongated storage and shipment of samples that can lead to blood cell lysis. In some instances, devices disclosed herein perform integrated separation, e.g. immediate isolation of plasma through filtration, to avoid, reduce or prevent cell lysis. Immediate separation of cells from cfDNA may be desirable when a reagent (e.g., probe, primer, antibody) or detection method does not provide much specificity. In some instances, methods are performed with whole blood in an effort to avoid any white blood cell lysis. When relatively higher specificity can be achieved, analysis from whole blood may be more desirable.


In addition to requiring only small volumes of samples, devices, systems, kits and methods disclosed herein are highly desirable for at least the following reasons. Devices, systems, kits and methods disclosed herein generally require little to no technical training. Thus, the costs of performing genetic testing is reduced relative to the cost of testing performed by trained personnel, and the test is available to subjects who do not have access to trained personnel. Furthermore, results may be obtained within minutes (e.g., less than an hour). This may be especially important when testing for an infection. An individual or animal testing positive for an infection may be isolated and treated quickly, preventing the spread of infection. Moreover, results may be obtained privately. In some cases, only the patient that is being tested is privy to the genetic information obtained. Devices, systems and kits disclosed herein are generally lightweight and handheld, making them suitable and accessible to remote locations. Thus, they may be employed at home, in a school, in a workplace, on a battlefield, on a farm, or any other site where it would be impractical or inconvenient to visit a laboratory or clinical setting. Furthermore, since the sample may be analyzed at the point of care, the sample does not need to be stored or shipped, reducing the risk of sample degradation and misidentification (e.g., sample swapping).



FIG. 1 shows a general flow chart with various routes that methods, devices and systems disclosed herein can follow. Initially a sample is obtained in step 110. A minimal amount of sample must be obtained in order to gather useful information from the sample. The sample may be obtained by transdermal puncture. The sample may be obtained by means of extracting useful genetic information, without the need for transdermal puncture, such as those described herein. The sample may be a biological sample disclosed herein. The sample may be a crude, unprocessed sample (e.g., whole blood, interstitial fluid). The sample may be a processed sample (e.g., plasma). The amount of sample is likely based on the sample type. Typically, the sample is processed or an analyte (e.g., a nucleic acid or other biomarker) is purified from the sample in step 120 to produce an analyte that can be amplified and/or detected. Processing may comprise filtering a sample, binding a component of the sample that contains an analyte, binding the analyte, stabilizing the analyte, purifying the analyte, or a combination thereof. Non-limiting examples of sample components are cells, viral particles, bacterial particles, exosome, and nucleosomes. In some instances, the analyte is a nucleic acid and it is amplified to produce an amplicon for analysis in step 130. In other instances, the analyte may or may not be a nucleic acid, but regardless is not amplified. The analyte or amplicon is optionally modified (140) before detection and analysis. In some instances, modification occurs during amplification (not shown). For example, the analyte or amplicon may be tagged or labeled. Detection may involve sequencing, target-specific probes, isothermal amplification and detection methods, quantitative PCR, or single molecule detection. FIG. 1 is provided as a broad overview of devices and methods disclosed herein, but devices and methods disclosed herein are not limited by FIG. 1. Devices and methods may comprise additional components and steps, respectively that are not shown in FIG. 1.


In some instances, devices, systems, kits and methods disclosed herein are desirable because the genetic information can be kept private to the user. In fact, even the use of the device can be kept private. Alternatively, devices, systems, kits and methods are configured to share information with others or can be easily adapted by the user to share information (e.g., turning on a Bluetooth signal). For example, information may be easily shared with a nurse or doctor. In some instances, the device or system can send/share test results through a secure portal or application programming interface (API) to a medical practitioner or staff at an office or hospital. In some instances, the user may choose to share information with the medical practitioner in person after receiving the result. In some instances, the information may even be shared in real-time. This kind of communication would be desirable for couples or families that are split up, for example, by military commitments, employment obligations, migration policies, or health issues.


There are myriad applications for the devices, systems, kits, and methods disclosed herein. Devices, systems, kits and methods disclosed herein allow for diagnosing and monitoring medical conditions. Non-limiting examples of medical conditions include autoimmune conditions, metabolic conditions, cancer, and neurological conditions. Devices, systems, kits and methods disclosed herein allow for personalized medicine, including microbiome testing, determining an appropriate personal medical dosage and/or detecting a response to a drug or dose thereof. Devices, systems, kits and methods disclosed herein provide for detecting an infection by a pathogen and/or a subject's resistance to drugs that could be used to treat the infection. In almost all cases, there is little to no need for technical training or large, expensive laboratory equipment.



FIG. 1 shows that one using the devices, systems, kits or methods disclosed herein may start with a microvolume (e.g., less than a milliliter) of a biological sample from a subject. The biological sample generally contains less than 5000 genome equivalents of cell-free DNA. The sample may be processed by filtration, stabilization, purification, or a combination thereof, to allow for analysis. In some instances the sample does not require processing, such as filtration, stabilization or purification. Several different analytes in the sample can be informative, e.g., cell-free DNA, cell-free RNA, microvesicle-associated nucleic acids, and epigenetic markers on the cell-free DNA. One or more of these analytes may be analyzed. In some instances, the analyte is not amplified. In some instances, the analyte is sequenced without amplification or modification of the analyte. In some instances, the analyte is amplified (e.g., polymerase-mediated nucleic acid amplification) to generate amplicons of the analyte. In some instances, the amplicons are sequenced. In some instances, the amplicons are sequenced without further preparation or modification. In some instances, a feature such as a polymorphism, mutation, epigenetic mark or aberration within an amplicon or target region is used for further analysis.


In some instances, the analyte, the amplicons, or a combination thereof are converted to a library by labeling the analyte with a label, bar-code or tag. The terms label, bar-code and tag are used interchangeably herein, unless otherwise specified. In some instances, library members are amplified to produce amplified library members. In some instances, library members are subjected to whole genome amplification. In some instances, library members are products of whole genome amplification. In some instances, library members are not amplified to produce amplified library members. In some instances, library members are not subjected to whole genome amplification. In some instances, library members are not the products of whole genome amplification. In some instances, library members are captured to produce captured library members. In some instances, library members are captured and amplified to produce captured, amplified library members. In some instances, library members are sequenced. In some instances, amplified library members are sequenced. In some instances, captured library members are sequenced. In some instances, captured, amplified library members are sequenced. In some instances, library members are not sequenced. For instance, library members may be detected or quantified by an array of probes or by single molecule counting. In some instances, the amplified library members are detected or quantified by an array of probes or by single molecule counting. In some instances, the captured library members are detected or quantified by an array of probes or by single molecule counting. In some instances, the captured, amplified library members are detected or quantified by an array of probes or by single molecule counting.



FIG. 2 shows that methods, systems, devices and kits disclosed herein can be distributed amongst several locations. For instances, methods disclosed herein may be fully performed in a home setting, or other point of need. This is particularly important for subjects that do not have access (e.g., physically, financially) to a laboratory, nucleic acid processing and analyzing equipment, or a technician or doctor. In some instances, samples may be processed in a laboratory (e.g., laboratory equipment required). However, methods, systems, devices and kits disclosed herein may still allow for sample collection and reporting in the home. By way of example, the sample may be collected at home, shipped to a laboratory where processing occurs, and the results are delivered to the subject in the home via electronic communication. Therefore, even when processing in a laboratory is required, methods, systems, devices and kits disclosed herein are still convenient to the user, requiring only that they have a means to ship/transport their sample. In some instances, it is convenient for data processing to occur in a cloud or on a server that can communicate test results to the subject. In some instances, it is convenient for data processing to occur in a laboratory and the results reported to the subject in their home without relying on a cloud or internet server.


Methods

Provided herein are methods comprising obtaining a biological sample and detecting a component thereof. In some instances, methods disclosed herein are performed with a device, system or kit described herein. In some instances, the component comprises cell-free nucleic acids, such as cell-free DNA or cell-free RNA. In some instances, the biological sample comprises maternal blood, such as capillary blood obtained from a mother. In some instances, the component detected is a fetal cell-free DNA component of the maternal blood.


Obtaining the biological sample may occur in a clinical or laboratory setting, such as, by way of non-limiting example, a medical clinic, a hospital, a scientific research laboratory, a pathology laboratory, or a clinical test laboratory. Alternatively, obtaining may occur at a location remote from a clinical or laboratory setting, such as, by way of non-limiting example, a home, a family planning center, a workplace, a school, a farm, or a battlefield. In some instances, obtaining is performed using a device or a system described herein.


Detecting the component, in some instances, is performed by analyzing the biological sample to detect a presence, an absence or a level of the component (e.g., biomarker, cell-free DNA) in the biological sample. In some instances, methods comprise determining whether there is an overrepresentation or an underrepresentation of a genomic region of interest in the component relative to representation of the genomic region of interest in at least one control subject.


Methods disclosed herein comprise obtaining and analyzing a relatively small volume of a biological sample, regardless of whether collection occurs in a clinical setting or a remote location. In some instances, detecting occurs in a clinical or laboratory setting. In other instances, detecting occurs at a location remote from a clinical or laboratory setting. Other steps of the methods disclosed herein, e.g., amplifying a nucleic acid, may occur in the clinical/laboratory setting or at a remote location. In some instances, the methods may be performed by the subject. In some instances, methods disclosed herein are performed by a user that has not received any technical training necessary to perform the method.


Obtaining Samples

In some instances, methods disclosed herein comprise obtaining a biological sample described herein. A sample may be obtained directly (e.g., a doctor takes a blood sample from a subject). A sample may be obtained indirectly (e.g., through shipping, by a technician from a doctor or a subject). In some instances, the biological sample is a biological fluid. In some instances, the biological sample is a swab sample (e.g., buccal swab, vaginal and/or cervical swab). In some instances, methods disclosed herein comprise obtaining whole blood, plasma, serum, urine, saliva, interstitial fluid, or vaginal fluid. In some instances, methods disclosed herein comprise obtaining a blood sample via a finger prick. In some instances, methods disclosed herein comprise obtaining a blood sample via a single finger prick. In some instances, methods disclosed herein comprise obtaining a blood sample with not more than a single finger prick. In some instances, the blood sample is obtained via a finger prick only after the initial perfusion of blood is discarded (e.g., finger is pricked, initial blood sample is wiped clean, and second blood sample is collected). In some embodiments, the biological sample obtained from the subject is collected by a process of: (a) inducing a first transdermal puncture to produce a first fraction of a biological sample; (b) discarding the first fraction of the biological sample; and (c) collecting a second fraction of the biological sample, thereby reducing or eliminating contamination of the biological sample due to white blood cell lysis. In some embodiments, surface of a transdermal puncture site (e.g., skin) is cleaned prior to obtaining the biological sample from the subject. In some instances, the cleaning comprises removing or reducing unwanted contaminant. In some instances, the unwanted contaminant comprises DNA from the transdermal puncture site. In some instances, the transdermal puncture site is skin of a finger. In some instances, methods disclosed herein comprise obtaining capillary blood (e.g., blood obtained from a finger or a prick of the skin). In some instances, methods comprise squeezing or milking blood from a prick to obtain a desired volume of blood. In other instances, methods do not comprise squeezing or milking blood from a prick to obtain a desired volume of blood. While a finger prick is a common method for obtaining capillary blood, other locations on the body would also be suitable, e.g., toe, heel, arm, palm, shoulder, earlobe. In some instances, methods disclosed herein comprise obtaining a blood sample without a phlebotomy. In some instances, methods disclosed herein comprise obtaining capillary blood. In some instances, methods disclosed herein comprise obtaining venous blood. In some instances, methods disclosed herein do not comprise obtaining venous blood (e.g., blood obtained from a vein). In some instances, methods comprise obtaining a biological sample via a biopsy. In some instances, methods comprise obtaining a biological fluid via a liquid biopsy.


In some instances, methods, systems, and devices described herein comprising obtaining a biological sample containing reliable genetic information, without a need for transdermal puncture. In some embodiments, the tight junctions in the skin of the subject are lysed, making them permeable to fluid that may be pushed into the intercellular space and reabsorbed in the capillary, and which may be extracted from the permeable skin without transdermal puncture.


In some instances, methods comprise obtaining samples with fragmented nucleic acids. The sample may have been subjected to conditions that are not conducive to preserving the integrity of nucleic acids. By way of non-limiting example, the sample may be a forensic sample. Forensic samples are often contaminated, exposed to air, heat, light, etc. The sample may have been frozen and thawed. The sample may have been exposed to chemicals or enzymes that degrade nucleic acids. In some instances, methods comprise obtaining a tissue sample wherein the tissue sample comprises fragmented nucleic acids. In some instances, methods comprise obtaining a tissue sample wherein the tissue sample comprises nucleic acids and fragmenting the nucleic acids to produced fragmented nucleic acids. In some instances, the tissue sample is a frozen sample. In some instances, the sample is a preserved sample. In some instances the tissue sample is a fixed sample (e.g. formaldehyde-fixed). Methods may comprise isolating the (fragmented) nucleic acids from the sample. Methods may comprise providing the fragmented nucleic acids in a solution for genetic analysis.


In some instances, methods disclosed herein are performed with not more than 50 μl of the biological fluid sample. In some instances, methods disclosed herein are performed with not more than 75 μl of the biological fluid sample. In some instances, methods disclosed herein are performed with not more than 100 μl of the biological fluid sample. In some instances, methods disclosed herein are performed with not more than 125 μl of the biological fluid sample. In some instances, methods disclosed herein are performed with not more than 150 μl of the biological fluid sample. In some instances, methods disclosed herein are performed with not more than 200 μl of the biological fluid sample. In some instances, methods disclosed herein are performed with not more than 300 μl of the biological fluid sample. In some instances, methods disclosed herein are performed with not more than 400 μl of the biological fluid sample. In some instances, methods disclosed herein are performed with not more than 500 μl of the biological fluid sample.


In some instances, methods disclosed herein comprise obtaining an ultra-low volume of a biological fluid sample, wherein the ultra-low volume falls within a range of sample volumes. In some instances, the range of sample volumes is about 5 μl to about one milliliter. In some instances, the range of sample volumes is about 5 μl to about 900 μl. In some instances, the range of sample volumes is about 5 μl to about 800 μl. In some instances, the range of sample volumes is about 5 μl to about 700 μl. In some instances, the range of sample volumes is about 5 μl to about 600 μl. In some instances, the range of sample volumes is about 5 μl to about 500 μl. In some instances, the range of sample volumes is about 5 μl to about 400 μl. In some instances, the range of sample volumes is about 5 μl to about 300 μl. In some instances, the range of sample volumes is about 5 μl to about 200 μl. In some instances, the range of sample volumes is about 5 μl to about 150 μl. In some instances, the range of sample volumes is 5 μl to about 100 μl. In some instances, the range of sample volumes is about 5 μl to about 90 μl. In some instances, the range of sample volumes is about 5 μl to about 85 μl. In some instances, the range of sample volumes is about 5 μl to about 80 μl. In some instances, the range of sample volumes is about 5 μl to about 75 μl. In some instances, the range of sample volumes is about 5 μl to about 70 μl. In some instances, the range of sample volumes is about 5 μl to about 65 μl. In some instances, the range of sample volumes is about 5 μl to about 60 μl. In some instances, the range of sample volumes is about 5 μl to about 55 μl. In some instances, the range of sample volumes is about 5 μl to about 50 μl. In some instances, the range of sample volumes is about 15 μl to about 150 μl. In some instances, the range of sample volumes is about 15 μl to about 120 μl. In some instances, the range of sample volumes is 15 μl to about 100 μl. In some instances, the range of sample volumes is about 15 μl to about 90 μl. In some instances, the range of sample volumes is about 15 μl to about 85 μl. In some instances, the range of sample volumes is about 15 μl to about 80 μl. In some instances, the range of sample volumes is about 15 μl to about 75 μl. In some instances, the range of sample volumes is about 15 μl to about 70 μl. In some instances, the range of sample volumes is about 15 μl to about 65 μl. In some instances, the range of sample volumes is about 15 μl to about 60 μl. In some instances, the range of sample volumes is about 15 μl to about 55 μl. In some instances, the range of sample volumes is about 15 μl to about 50 μl.


In some instances, methods disclosed herein comprise obtaining an ultra-low volume of a biological fluid sample, wherein the ultra-low volume is about 100 μl to about 500 μl. In some instances, methods disclosed herein comprise obtaining an ultra-low volume of the biological fluid sample, wherein the ultra-low volume about 100 μl to about 1000 μl. In some instances, the ultra-low volume is about 500 μl to about 1 ml. In some instances, the ultra-low volume is about 500 μl to about 2 ml. In some instances, the ultra-low volume is about 500 μl to about 3 ml. In some instances, the ultra-low volume is about 500 μl to about 5 ml.


In some instances, methods disclosed herein comprise obtaining an ultra-low volume of a biological sample, wherein the biological sample is whole blood. The ultra-low volume may be about 1 μl to about 250 μl. The ultra-low volume may be about 5 μl to about 250 μl. The ultra-low volume may be about 10 μl to about 25 μl. The ultra-low volume may be about 10 μl to about 35 μl. The ultra-low volume may be about 10 μl to about 45 μl. The ultra-low volume may be about 10 μl to about 50 μl. The ultra-low volume may be about 10 μl to about 60 μl. The ultra-low volume may be about 10 μl to about 80 μl. The ultra-low volume may be about 10 μl to about 100 μl. The ultra-low volume may be about 10 μl to about 120 μl. The ultra-low volume may be about 10 μl to about 140 μl. The ultra-low volume may be about 10 μl to about 150 μl. The ultra-low volume may be about 10 μl to about 160 μl. The ultra-low volume may be about 10 μl to about 180 μl. The ultra-low volume may be about 10 μl to about 200 μl.


In some instances, methods disclosed herein comprise obtaining a ultra-low volume of a biological sample wherein the biological sample is plasma or serum. The ultra-low volume may be about 1 μl to about 200 μl. The ultra-low volume may be about 1 μl to about 190 μl. The ultra-low volume may be about 1 μl to about 180 μl. The ultra-low volume may be about 1 μl to about 160 μl. The ultra-low volume may be about 1 μl to about 150 μl. The ultra-low volume may be about 1 μl to about 140 μl. The ultra-low volume may be about 5 μl to about 15 μl. The ultra-low volume may be about 5 μl to about 25 μl. The ultra-low volume may be about 5 μl to about 35 μl. The ultra-low volume may be about 5 μl to about 45 μl. The ultra-low volume may be about 5 μl to about 50 μl. The ultra-low volume may be about 5 μl to about 60 μl. The ultra-low volume may be about 5 μl to about 70 μl. The ultra-low volume may be about 5 μl to about 80 μl. The ultra-low volume may be about 5 μl to about 90 μl. The ultra-low volume may be about 5 μl to about 100 μl. The ultra-low volume may be about 5 μl to about 125 μl. The ultra-low volume may be about 5 μl to about 150 μl. The ultra-low volume may be about 5 μl to about 175 μl. The ultra-low volume may be about 5 μl to about 200 μl.


In some instances, methods disclosed herein comprise obtaining an ultra-low volume of a biological sample, wherein the biological sample is urine. Generally, the concentration of DNA in urine is about 40 ng/ml to about 200 ng/ml. In some instances, the ultra-low volume of urine is about 0.25 μl to 1 milliliter. In some instances, the ultra-low volume of urine is about 0.25 μl to about 1 milliliter. In some instances, the ultra-low volume of urine is at least about 0.25 μl. In some instances, the ultra-low volume of urine is at most about 1 milliliter. In some instances, the ultra-low volume of urine is about 0.25 μl to about 0.5 μl, about 0.25 μl to about 0.75 μl, about 0.25 μl to about 1 μl, about 0.25 μl to about 5 μl, about 0.25 μl to about 10 μl, about 0.25 μl to about 50 μl, about 0.25 μl to about 100 μl, about 0.25 μl to about 150 μl, about 0.25 μl to about 200 μl, about 0.25 μl to about 500 μl, about 0.25 μl to about 1 milliliter, about 0.5 μl to about 0.75 μl, about 0.5 μl to about 1 μl, about 0.5 μl to about 5 μl, about 0.5 μl to about 10 μl, about 0.5 μl to about 50 μl, about 0.5 μl to about 100 μl, about 0.5 μl to about 150 μl, about 0.5 μl to about 200 μl, about 0.5 μl to about 500 μl, about 0.5 μl to about 1 milliliter, about 0.75 μl to about 1 μl, about 0.75 μl to about 5 μl, about 0.75 μl to about 10 μl, about 0.75 μl to about 50 μl, about 0.75 μl to about 100 μl, about 0.75 μl to about 150 μl, about 0.75 μl to about 200 μl, about 0.75 μl to about 500 μl, about 0.75 μl to about 1 milliliter, about 1 μl to about 5 μl, about 1 μl to about 10 μl, about 1 μl to about 50 μl, about 1 μl to about 100 μl, about 1 μl to about 150 μl, about 1 μl to about 200 μl about 1 μl to about 500 μl, about 1 μl to about 1 milliliter, about 5 μl to about 10 μl, about 5 μl to about 50 μl, about 5 μl to about 100 μl, about 5 μl to about 150 μl, about 5 μl to about 200 μl, about 5 μl to about 500 μl, about 5 μl to about 1 milliliter, about 10 μl to about 50 μl, about 10 μl to about 100 μl about 10 μl to about 150 μl, about 10 μl to about 200 μl, about 10 μl to about 500 μl, about 10 μl to about 1 milliliter, about 50 μl to about 100 μl, about 50 μl to about 150 μl, about 50 μl to about 200 μl, about 50 μl to about 500 μl, about 50 μl to about 1 milliliter, about 100 μl to about 150 μl, about 100 μl to about 200 about 100 μl to about 500 μl, about 100 μl to about 1 milliliter, about 150 μl to about 200 μl, about 150 μl to about 500 μl, about 150 μl to about 1 milliliter, about 200 μl to about 500 μl, about 200 μl to about 1 milliliter, or about 500 μl to about 1 milliliter. In some instances, the volume of urine used is about 0.25 μl, about 0.5 μl, about 0.75 μl, about 1 μl, about 5 μl, about 10 μl, about 50 μl, about 100 μl, about 150 μl, about 200 μl, about 500 μl, or about 1 milliliter.


In some instances, methods disclosed herein comprise obtaining at least about 5 μL of blood to provide a test result with at least about 90% confidence or accuracy. In some instances, methods disclosed herein comprise obtaining at least about 10 μL of blood to provide a test result with at least about 90% confidence or accuracy. In some instances, methods disclosed herein comprise obtaining at least about 15 μL of blood to provide a test result with at least about 90% confidence or accuracy. In some instances, methods disclosed herein comprise obtaining at least about 20 μL of blood to provide a test result with at least about 90% confidence or accuracy. In some instances, methods disclosed herein comprise obtaining at least about 20 μL of blood to provide a test result with at least about 90% confidence or accuracy. In some instances, methods disclosed herein comprise obtaining at least about 20 μL of blood to provide a test result with at least about 95% confidence or accuracy. In some instances, methods disclosed herein comprise obtaining at least about 20 μL of blood to provide a test result with at least about 98% confidence or accuracy. In some instances, methods disclosed herein comprise obtaining at least about 20 μL of blood to provide a test result with at least about 99% confidence or accuracy. In some instances, methods disclosed herein comprise obtaining only about 20 μL to about 120 μL of blood to provide a test result with at least about 90% confidence or accuracy. In some instances, methods disclosed herein comprise obtaining only about 20 μL to about 120 μL of blood to provide a test result with at least about 95% confidence or accuracy. In some instances, the methods disclosed herein comprise obtaining only about 20 μL to about 120 μL of blood to provide a test result with at least about 97% confidence or accuracy. In some instances, methods disclosed herein comprise obtaining only about 20 μL to about 120 μL of blood to provide a test result with at least about 98% confidence or accuracy. In some instances, the methods disclosed herein comprise obtaining only about 20 μL to about 120 μL of blood to provide a test result with at least about 99% confidence or accuracy. In some instances, methods disclosed herein comprise obtaining only about 20 μL to about 120 μL of blood to provide a test result with at least about 99.5% confidence or accuracy.


In some instances, the biological fluid sample is plasma or serum. Plasma or serum makes up roughly 55% of whole blood. In some instances, methods disclosed herein comprise obtaining at least about 10 μL of plasma or serum to provide a test result with at least about 90% confidence or accuracy. In some instances, methods disclosed herein comprise obtaining at least about 10 μL of plasma or serum to provide a test result with at least about 98% confidence or accuracy. In some instances, methods disclosed herein comprise obtaining at least about 12 μL of plasma or serum to provide a test result with at least about 90% confidence or accuracy. In some instances, methods disclosed herein comprise obtaining at least about 12 μL of plasma or serum to provide a test result with at least about 95% confidence or accuracy. In some instances, methods disclosed herein comprise obtaining at least about 12 μL of plasma or serum to provide a test result with at least about 98% confidence or accuracy. In some instances, methods disclosed herein comprise obtaining at least about 12 μL of plasma or serum to provide a test result with at least about 99% confidence or accuracy. In some instances, methods disclosed herein comprise obtaining only about 10 μL to about 60 μL of plasma or serum to provide a test result with at least about 90% confidence or accuracy. In some instances, methods disclosed herein comprise obtaining only about 10 μL to about 60 μL of plasma or serum to provide a test result with at least about 95% confidence or accuracy. In some instances, methods disclosed herein comprise obtaining only about 10 μL to about 60 μL of plasma or serum to provide a test result with at least about 97% confidence or accuracy. In some instances, methods disclosed herein comprise obtaining only about 10 μL to about 60 μL of plasma or serum to provide a test result with at least about 98% confidence or accuracy. In some instances, v only about 10 μL to about 60 μL of plasma or serum to provide a test result with at least about 99% confidence or accuracy. In some instances, methods disclosed herein comprise obtaining only about 10 μL to about 60 μL of plasma or serum to provide a test result with at least about 99.5% confidence or accuracy.


In some instances, methods disclosed herein comprise obtaining a biological sample from a subject, wherein the biological sample contains an amount of cell-free nucleic acid molecules. In some instances, obtaining the biological sample results in disrupting or lysing cells in the biological sample. Thus, in some instances, the biological sample comprises cellular nucleic acid molecules. In some instances, cellular nucleic acid molecules make up less than about 1% of the total cellular nucleic acid molecules in the biological sample. In some instances, cellular nucleic acid molecules make up less than about 5% of the total cellular nucleic acid molecules in the biological sample. In some instances, cellular nucleic acid molecules make up less than about 10% of the total cellular nucleic acid molecules in the biological sample. In some instances, cellular nucleic acid molecules make up less than about 20% of the total cellular nucleic acid molecules in the biological sample. In some instances, cellular nucleic acid molecules make up more than about 50% of the total cellular nucleic acid molecules in the biological sample. In some instances, cellular nucleic acid molecules make up less than about 90% of the total cellular nucleic acid molecules in the biological sample.


In some instances, methods disclosed herein comprise obtaining an ultra-low volume of a biological fluid sample from a subject, wherein the biological fluid sample contains an ultra-low amount of cell-free nucleic acids. In some instances, the ultra-low amount is between about 4 pg to about 100 pg. In some instances, the ultra-low amount is between about 4 pg to about 150 pg. In some instances, the ultra-low amount is between about 4 pg to about 200 pg. In some instances, the ultra-low amount is between about 4 pg to about 300 pg. In some instances, the ultra-low amount is between about 4 pg to about 400 pg. In some instances, the ultra-low amount is between about 4 pg to about 500 pg. In some instances, the ultra-low amount is between about 4 pg to about 1 ng. In some instances, the ultra-low amount is between about 10 pg to about 100 pg. In some instances, the ultra-low amount is between about 10 pg to about 150 pg. In some instances, the ultra-low amount is between about 10 pg to about 200 pg. In some instances, the ultra-low amount is between about 10 pg to about 300 pg. In some instances, the ultra-low amount is between about 10 pg to about 400 pg. In some instances, the ultra-low amount is between about 10 pg to about 500 pg. In some instances, the ultra-low amount is between about 10 pg to about 1 ng. In some instances, the ultra-low amount is between about 20 pg to about 100 pg. In some instances, the ultra-low amount is between about 20 pg to about 200 pg. In some instances, the ultra-low amount is between about 20 pg to about 500 pg. In some instances, the ultra-low amount is between about 20 pg to about 1 ng. In some instances, the ultra-low amount is between about 30 pg to about 150 pg. In some instances, the ultra-low amount is between about 30 pg to about 180 pg. In some instances, the ultra-low amount is between about 30 pg to about 200 pg. In some instances, the ultra-low amount is between is about 30 pg to about 300 pg. In some instances, the ultra-low amount is between about 30 pg to about 400 pg. In some instances, the ultra-low amount is between about 30 pg to about 500 pg. In some instances, the ultra-low amount is between is about 30 pg to about 1 ng. In some instance, the subject is a pregnant subject and the cell-free nucleic acids comprise cell-free fetal DNA. In some instances, the subject has a tumor and the cell-free nucleic acids comprise cell-free tumor DNA. In some instances, the subject is an organ transplant recipient and the cell-free nucleic acids comprise organ donor DNA.


In some instances, methods comprise obtaining less than about 1 ng of cell-free fetal nucleic acids. In some instances, methods comprise obtaining less than about 500 pg of cell-free fetal nucleic acids. In some instances, methods comprise obtaining less than about 100 pg of cell-free fetal nucleic acids. In some instances, methods comprise obtaining at least 3.5 pg of cell-free fetal nucleic acids. In some instances, methods comprise obtaining at least 10 pg of cell-free fetal nucleic acids. In some instances, methods comprise obtaining not more than about 100 pg of cell-free fetal nucleic acids. In some instances, methods comprise obtaining not more than about 500 pg of cell-free fetal nucleic acids. In some instances, methods comprise obtaining not more than about 1 ng of cell-free fetal nucleic acids.


In some instances, methods disclosed herein comprise obtaining a biological fluid sample from a subject, wherein the biological fluid sample contains at least 1 genome equivalent of cell-free DNA. One skilled in the art understands that a genome equivalent is the amount of DNA necessary to be present in a sample to guarantee that all genes will be present. Ultra-low volumes of biological fluid samples disclosed herein may contain an ultra-low number of genome equivalents. In some instances, the biological fluid sample contains less than 1 genome equivalent of cell-free nucleic acids. In some instances, the biological fluid sample contains at least 5 genome equivalents of cell-free nucleic acids. In some instances, the biological fluid sample contains at least 10 genome equivalents of cell-free nucleic acids. In some instances, the biological fluid sample contains at least 15 genome equivalents of cell-free nucleic acids. In some instances, the biological fluid sample contains at least 20 genome equivalents of cell-free nucleic acids. In some instances, the biological fluid sample contains about 5 to about 50 genome equivalents. In some instances, the biological fluid sample contains about 10 to about 50 genome equivalents. In some instances, the biological fluid sample contains about 10 to about 100 genome equivalents. In some instances, the biological fluid sample contains not more than 50 genome equivalents of cell-free nucleic acids. In some instances, the biological fluid sample contains not more than 60 genome equivalents of cell-free nucleic acids. In some instances, the biological fluid sample contains not more than 80 genome equivalents of cell-free nucleic acids. In some instances, the biological fluid sample contains not more than 100 genome equivalents of cell-free nucleic acids.


Ultra-low volumes of biological fluid samples disclosed herein may contain an ultra-low number of cell equivalents. In some instances, methods disclosed herein comprise obtaining a biological fluid sample from a subject, wherein the biological fluid sample contains at least 1 cell equivalent of cell-free DNA. In some instances, the biological fluid sample contains at least 2 cell equivalents of cell-free nucleic acids. In some instances, the biological fluid sample contains at least 5 cell equivalents of cell-free nucleic acids. In some instances, the biological fluid sample contains about 5 cell equivalents of cell-free nucleic acids to about 40 cell equivalents. In some instances, the biological fluid sample contains at least 5 cell equivalents to about 100 cell equivalents of cell-free nucleic acids. In some instances, the biological fluid sample contains not more than 30 cell equivalents of cell-free nucleic acids. In some instances, the biological fluid sample contains not more than 50 cell equivalents of cell-free nucleic acids. In some instances, the biological fluid sample contains not more than 80 cell equivalents of cell-free nucleic acids. In some instances, the biological fluid sample contains not more than 100 cell equivalents of cell-free nucleic acids.


In some instances, methods disclosed herein comprise obtaining a biological sample from a subject, wherein the biological sample contains at least one cell-free nucleic acid of interest. By way of non-limiting example, the cell-free nucleic acid of interest may be a cell-free fetal nucleic acid, cell-free tumor DNA, or DNA from a transplanted organ. In some instances, methods disclosed herein comprise obtaining a biological sample from the subject, wherein the biological sample contains about 1 to about 5 cell-free nucleic acids. In some instances, methods disclosed herein comprise obtaining a biological sample from the subject, wherein the biological sample contains about 1 to about 15 cell-free nucleic acids. In some instances, methods disclosed herein comprise obtaining a biological sample from the subject, wherein the biological sample contains about 1 to about 25 cell-free nucleic acids. In some instances, methods disclosed herein comprise obtaining a biological sample from the subject, wherein the biological sample contains about 1 to about 100 cell-free nucleic acids. In some instances, methods disclosed herein comprise obtaining a biological sample from the subject, wherein the biological sample contains about 5 to about 100 cell-free nucleic acids. In some instances, the at least one cell-free nucleic acid is represented by a sequence that is unique to a target chromosome disclosed herein.


In some instances, methods disclosed herein comprise obtaining a biological sample from a subject, wherein the biological sample contains about 102 cell-free nucleic acids to about 1010 cell-free nucleic acids. In some instances, the biological sample contains about 102 cell-free nucleic acids to about 109 cell-free nucleic acids. In some instances, the biological sample contains about 102 cell-free nucleic acids to about 108 cell-free nucleic acids. In some instances, the biological sample contains about 102 cell-free nucleic acids to about 107 cell-free nucleic acids. In some instances, the biological sample contains about 102 cell-free nucleic acids to about 106 cell-free nucleic acids. In some instances, the biological sample contains about 102 cell-free nucleic acids to about 105 cell-free nucleic acids.


In some instances, methods disclosed herein comprise obtaining a biological sample from a subject, wherein the biological sample contains about 103 cell-free nucleic acids to about 1010 cell-free nucleic acids. In some instances, the biological sample contains about 103 cell-free nucleic acids to about 109 cell-free nucleic acids. In some instances, the biological sample contains about 103 cell-free nucleic acids to about 108 cell-free nucleic acids. In some instances, the biological sample contains about 103 cell-free nucleic acids to about 107 cell-free nucleic acids. In some instances, the biological sample contains about 103 cell-free nucleic acids to about 106 cell-free nucleic acids. In some instances, the biological sample contains about 103 cell-free nucleic acids to about 105 cell-free nucleic acids.


In some instances, methods disclosed herein comprise obtaining a biological sample from a subject, wherein the biological sample has a number of cell-free nucleic acids that correspond to a typical sample type volume. By way of non-limiting example, 4 ml of human blood from a pregnant subject typically contains about 1010 cell-free fetal nucleic acids. However, the concentration of cell-free fetal nucleic acids in a sample, and thus, the sample volume required to be informative about fetal genetics, will depend on the sample type. Example 7, provided herein, also demonstrates how one of skill in the art can determine the minimum volume necessary to obtain a sufficient number of cell-free fetal nucleic acids.


Sample Processing

In some instances, methods disclosed herein comprise isolating or purifying cell-free nucleic acid molecules from a biological sample. In some instances, methods disclosed herein comprise isolating or purifying nucleic cell-free fetal nucleic acid molecules from a biological sample. In some instances, methods disclosed herein comprise removing non-nucleic acid components from a biological sample described herein.


In some instances, isolating or purifying comprises reducing or removing unwanted non-nucleic acid components from a biological sample. In some instances, isolating or purifying comprises removing at least 5%, at least 10%, at least 20%, at least 30%, at least 40%, at least 50%, at least 60%, at least 70%, at least 80%, or at least 90% of unwanted non-nucleic acid components from a biological sample. In some instances, isolating or purifying comprises removing at least 95% of unwanted non-nucleic acid components from a biological sample. In some instances, isolating or purifying comprises removing at least 97% of unwanted non-nucleic acid components from a biological sample. In some instances, isolating or purifying comprises removing at least 98% of unwanted non-nucleic acid components from a biological sample. In some instances, isolating or purifying comprises removing at least 99% of unwanted non-nucleic acid components from a biological sample. In some instances, isolating or purifying comprises removing at least 95% of unwanted non-nucleic acid components from a biological sample. In some instances, isolating or purifying comprises removing at least 97% of unwanted non-nucleic acid components from a biological sample. In some instances, isolating or purifying comprises removing at least 98% of unwanted non-nucleic acid components from a biological sample. In some instances, isolating or purifying comprises removing at least 99% of unwanted non-nucleic acid components from a biological sample.


In some instances, methods disclosed herein comprise isolating or purifying nucleic acids from one or more non-nucleic acid components of a biological sample. Non-nucleic acid components may also be considered unwanted substances. Non-limiting examples of non-nucleic acid components include cells (e.g., blood cells), cell fragments, extracellular vesicles, lipids, proteins or a combination thereof. Additional non-nucleic acid components are described herein and throughout. It should be noted that while methods may comprise isolating/purifying nucleic acids, they may also comprise analyzing a non-nucleic acid component of a sample that is considered an unwanted substance in a nucleic acid purifying step. Isolating or purifying may comprise removing components of a biological sample that would inhibit, interfere with or otherwise be detrimental to the later process steps such as nucleic acid amplification or detection.


Isolating or purifying may be performed with a device or system disclosed herein. Isolating or purifying may be performed within a device or system disclosed herein. Isolating and/or purifying may occur with the use of a sample purifier disclosed herein. In some instances, isolating or purifying nucleic acids comprises removing non-nucleic acid components from a biological sample described herein. In some instances, isolating or purifying nucleic acids comprises discarding non-nucleic acid components from a biological sample. In some instances, isolating or purifying comprises collecting, processing and analyzing the non-nucleic acid components. In some instances, the non-nucleic acid components may be considered biomarkers because they provide additional information about the subject.


In some instances, isolating or purifying nucleic acids comprise lysing a cell. In some instances, isolating or purifying nucleic acids avoids lysing a cell. In some instances, isolating or purifying nucleic acids does not comprise lysing a cell. In some instances, isolating or purifying nucleic acids does not comprise an active step intended to lyse a cell. In some instances, isolating or purifying nucleic acids does not comprise intentionally lysing a cell. Intentionally lysing a cell may include mechanically disrupting a cell membrane (e.g., shearing). Intentionally lysing a cell may include contacting the cell with a lysis reagent. Exemplary lysis reagents are described herein.


In some instances, isolating or purifying nucleic acids comprises lysing and performing sequence specific capture of a target nucleic acid with “bait” in a solution followed by binding of the “bait” to solid supports such as magnetic beads, e.g. Legler et al., Specific magnetic bead-based capture of free fetal DNA from maternal plasma, Transfusion and Apheresis Science 40 (2009), 153-157. In some instances, methods comprise performing sequence specific capture in the presence of a recombinase or helicase. Use of a recombinase or helicase may avoid the need for heat denaturation of a nucleic acid and speed up the detection step.


In some instances, isolating or purifying comprises separating components of a biological sample disclosed herein. By way of non-limiting example, isolating or purifying may comprise separating plasma from blood. In some instances, isolating or purifying comprises centrifuging the biological sample. In some instances, isolating or purifying comprises filtering the biological sample in order to separate components of a biological sample. In some instances, isolating or purifying comprises filtering the biological sample in order to remove non-nucleic acid components from the biological sample. In some instances, isolating or purifying comprises filtering the biological sample in order to capture nucleic acids from the biological sample.


In some instances, the biological sample is blood and isolating or purifying a nucleic acid comprises obtaining or isolating plasma from blood. Obtaining plasma may comprise separating plasma from cellular components of a blood sample. Obtaining plasma may comprise centrifuging the blood, filtering the blood, or a combination thereof. Obtaining plasma may comprise allowing blood to be subjected to gravity (e.g., sedimentation). Obtaining plasma may comprise subjecting blood to a material that wicks a portion of the blood away from non-nucleic acid components of the blood. In some instances, methods comprise subjecting the blood to vertical filtration. In some instances, methods comprise subjecting the blood to a sample purifier comprising a filter matrix for receiving whole blood, the filter matrix having a pore size that is prohibitive for cells to pass through, while plasma can pass through the filter matrix uninhibited. Such vertical filtration and filter matrices are described for devices disclosed herein.


In some instances, isolating or purifying comprises subjecting a biological sample, or a fraction thereof, or a modified version thereof, to a binding moiety. The binding moiety may be capable of binding to a component of a biological sample and removing it to produce a modified sample depleted of cells, cell fragments, nucleic acids or proteins that are unwanted or of no interest. In some instances, isolating or purifying comprises subjecting a biological sample to a binding moiety to reduce unwanted substances or non-nucleic acid components in a biological sample. In some instances, isolating or purifying comprises subjecting a biological sample to a binding moiety to produce a modified sample enriched with target cell, target cell fragments, target nucleic acids or target proteins. By way of non-limiting example, isolating or purifying may comprise subjecting a biological sample to a binding moiety for capturing placenta educated platelets, which may contain fetal DNA or RNA fragments. The resulting cell-bound binding moieties can be captured/enriched for with antibodies or other methods, e.g., low speed centrifugation.


Isolating or purifying may comprise capturing an extracellular vesicle or extracellular microparticle in the biological sample with a binding moiety. In some instances, the extracellular vesicle contains at least one of DNA and RNA. In some instances, the extracellular vesicle is fetal/placental in origin. Methods may comprise capturing an extracellular vesicle or extracellular microparticle in the biological sample that comes from a maternal cell. In some instances, methods disclosed herein comprise capturing and discarding an extracellular vesicle or extracellular microparticle from a maternal cell to enrich the sample for fetal/placental nucleic acids.


In some instances, methods comprise capturing a nucleosome in a biological sample and analyzing nucleic acids attached to the nucleosome. In some instances, methods comprise capturing an exosome in a biological sample and analyzing nucleic acids attached to the exosome. Capturing nucleosomes and/or exosomes may preclude the need for a lysis step or reagent, thereby simplifying the method and reducing time from sample collection to detection.


In some instances, methods comprise subjecting a biological sample to a cell-binding moiety for capturing placenta educated platelets, which may contain fetal DNA or RNA fragments. Capturing may comprise contacting the placenta educated platelets with a binding moiety (e.g., an antibody for a cell surface marker), subjecting the biological sample to low speed centrifugation, or a combination thereof. In some instances, the binding moiety is attached to a solid support disclosed herein, and methods comprise separating the solid support from the rest of the biological sample after the binding moiety has made contact with the biological sample.


In some instances, methods disclosed herein comprise removing unwanted non-nucleic acid components from a biological sample. In some instances, methods disclosed herein comprise removing and discarding non-nucleic acid components from a biological sample. Non-limiting examples of non-nucleic acid components include cells (e.g., blood cells), cell fragments, extracellular vesicles, lipids, proteins or a combination thereof. In some instances, removing non-nucleic acid components may comprise centrifuging the biological sample. In some instances, removing non-nucleic acid components may comprise filtering the biological fluid sample. In some instances, removing non-nucleic acid components may comprise contacting the biological sample with a binding moiety described herein.


In some embodiments, methods disclosed herein comprise purifying nucleic acids in a sample. In some instances, purifying does not comprise washing the nucleic acids with a wash buffer. In some instances, the nucleic acids are cell-free fetal nucleic acids. In some embodiments, purifying comprises capturing the nucleic acids with a nucleic acid capturing moiety to produce captured nucleic acids. Non-limiting examples of nucleic acid capturing moieties are silica particles and paramagnetic particles. In some embodiments, purifying comprises passing the sample containing the captured nucleic acids through a hydrophobic phase (e.g., a liquid or wax). The hydrophobic phase retains impurities in the sample that would otherwise inhibit further manipulation (e.g., amplification, sequencing) of the nucleic acids.


In some instances, methods disclosed herein comprise removing nucleic acid components from a biological sample described herein. In some instances, the removed nucleic acid components are discarded. By way of non-limiting example, methods may comprise analyzing only DNA. Thus, RNA is unwanted and creates undesirable background noise or contamination to the DNA. In some instances, methods disclosed herein comprise removing RNA from a biological sample. In some instances, methods disclosed herein comprise removing mRNA from a biological sample. In some instances, methods disclosed herein comprise removing microRNA from a biological sample. In some instances, methods disclosed herein comprise removing maternal RNA from a biological sample. In some instances, methods disclosed herein comprise removing DNA from a biological sample. In some instances, methods disclosed herein comprise removing maternal DNA from a biological sample of a pregnant subject. In some instances, removing nucleic acid components comprises contacting the nucleic acid components with an oligonucleotide capable of hybridizing to the nucleic acid, wherein the oligonucleotide is conjugated, attached or bound to a capturing device (e.g., bead, column, matrix, nanoparticle, magnetic particle, etc.). In some instances, the removed nucleic acid components are discarded.


In some instances, removing nucleic acid components comprises separating the nucleic acid components on a gel by size. For example, circulating cell-free fetal DNA fragments are generally less than 200 base pairs in length. In some instances, methods disclosed herein comprise removing cell-free DNA from the biological sample. In some instances, methods disclosed herein comprise capturing cell-free DNA from the biological sample. In some instances, methods disclosed herein comprise selecting cell-free DNA from the biological sample. In some instances, the cell-free DNA has a minimum length. In some instances, the minimum length is about 50 base pairs. In some instances, the minimum length is about 100 base pairs. In some instances, the minimum length is about 110 base pairs. In some instances, the minimum length is about 120 base pairs. In some instances, the minimum length is about 140 base pairs. In some instances, the cell-free DNA has a maximum length. In some instances, the maximum length is about 180 base pairs. In some instances, the maximum length is about 200 base pairs. In some instances, the maximum length is about 220 base pairs. In some instances, the maximum length is about 240 base pairs. In some instances, the maximum length is about 300 base pairs. Size based separation would be useful for other categories of nucleic acids having limited size ranges, which are well known in the art (e.g., microRNAs).


In some instances, processing comprises enriching fetal trophoblasts containing fetal genomic DNA of interest in the biological sample. In some instances the fetal trophoblasts are enriched by morphology (e.g., size) or a marker antigens (e.g., cell surface antigens), or both. In some cases, enrichment of trophoblasts is performed using the isolation by size of epithelial tumor cells (ISET) method. In some cases, enrichment of trophoblasts in a biological sample comprises contacting the biological sample with an antibody or antigen-binding fragment specific to a cell-surface antigen of a trophoblast. Non-limiting examples of trophoblast cell-surface antigens include tropomyosin-1 (Tropl), tropomyosin-2 (Trop2), cyto and syncytio-trophoblast marker, GB25, human placental lactogen (HPL), and alpha human chorionic gonadotrophin (alpha HCG). In some instances, purifying or isolating the fetal trophoblasts comprises using fluoresce-activated cell sorting (FACS), column chromatography, or magnetic sorting (e.g., Dynabeads). In some instances, the fetal genetic information is extracted from the enriched and/or purified trophoblasts, using any suitable DNA extraction method, such as those described herein.


Amplifying Nucleic Acids

In some instances, methods disclosed herein comprise amplifying at least one nucleic acid (e.g., cell-free nucleic acid, such as cell-free DNA or cell-free RNA) in a sample to produce at least one amplification product. The at least one nucleic acid may be a cell-free nucleic acid. The sample may be a biological sample disclosed herein or a fraction or portion thereof. In some instances, methods comprise producing a copy of the nucleic acid in the sample and amplifying the copy to produce the at least one amplification product. In some instances, methods comprise producing a reverse transcript of the nucleic acid in the sample and amplifying the reverse transcript to produce the at least one amplification product.


In some instances, methods comprise performing whole genome amplification. In some instances, methods do not comprise performing whole genome amplification. The term, “whole genome amplification” may refer to amplifying all of the cell-free nucleic acids in a biological sample. The term, “whole genome amplification” may refer to amplifying at least 90% of the cell-free nucleic acids in a biological sample. Whole genome may refer to multiple genomes. Whole genome amplification may comprise amplifying cell-free nucleic acids from a biological sample of a subject, wherein the biological sample comprises cell-free nucleic acids from the subject and a foreign tissue. For example, whole genome amplification may comprise amplifying cell-free nucleic acids from both a subject (a host genome) and an organ or tissue that has been transplanted into the subject (a donor genome). Also by way of non-limiting example, whole genome amplification may comprise amplifying cell-free nucleic acids from a biological sample of a pregnant subject, wherein the biological sample comprises cell-free nucleic acids from the pregnant subject and her fetus. Whole genome amplification may comprise amplifying cell-free nucleic acids from a biological sample of a subject having cancer, wherein the biological sample comprises cell-free nucleic acids from benign tissue of the subject and a tumor in the subject. Whole genome amplification may comprise amplifying cell-free nucleic acids from a biological sample of a subject having an infection, wherein the biological sample comprises cell-free nucleic acids from the subject and a pathogen.


In some instances, methods disclosed herein comprise amplifying a nucleic acid, wherein amplifying comprises performing an isothermal amplification of the nucleic acid. Non-limiting examples of isothermal amplification are as follows: loop-mediated isothermal amplification (LAMP), strand displacement amplification (SDA), helicase dependent amplification (HDA), nicking enzyme amplification reaction (NEAR), and recombinase polymerase amplification (RPA).


Any appropriate nucleic acid amplification method known in the art is contemplated for use in the devices and methods described herein. In some instances, isothermal amplification is used. In some instances, amplification is isothermal with the exception of an initial heating step before isothermal amplification begins. A number of isothermal amplification methods, each having different considerations and providing different advantages, are known in the art and have been discussed in the literature, e.g., by Zanoli and Spoto, 2013, “Isothermal Amplification Methods for the Detection of Nucleic Acids in Microfluidic Devices,” Biosensors 3: 18-43, and Fakruddin, et al., 2013, “Alternative Methods of Polymerase Chain Reaction (PCR),” Journal of Pharmacy and Bioallied Sciences 5(4): 245-252, each incorporated herein by reference in its entirety. In some instances, any appropriate isothermic amplification method is used. In some instances, the isothermic amplification method used is selected from: Loop Mediated Isothermal Amplification (LAMP); Nucleic Acid Sequence Based Amplification (NASBA); Multiple Displacement Amplification (MDA); Rolling Circle Amplification (RCA); Helicase Dependent Amplification (HDA); Strand Displacement Amplification (SDA); Nicking Enzyme Amplification Reaction (NEAR); Ramification Amplification Method (RAM); and Recombinase Polymerase Amplification (RPA).


In some instances, the amplification method used is LAMP (see, e.g., Notomi, et al., 2000, “Loop Mediated Isothermal Amplification” NAR 28(12): e63 i-vii, and U.S. Pat. No. 6,410,278, “Process for synthesizing nucleic acid” each incorporated by reference herein in its entirety). LAMP is a one-step amplification system using auto-cycling strand displacement deoxyribonucleic acid (DNA) synthesis. In some instances, LAMP is carried out at 60-65° C. for 45-60 min in the presence of a thermostable polymerase, e.g., Bacillus stearothermophilus (Bst) DNA polymerase I, deoxyribonucleotide triphosphate (dNTPs), specific primers and the target DNA template. In some instances, the template is RNA and a polymerase having both reverse transcriptase activity and strand displacement-type DNA polymerase activity, e.g., Bca DNA polymerase, is used, or a polymerase having reverse transcriptase activity is used for the reverse transcriptase step and a polymerase not having reverse transcriptase activity is used for the strand displacement-DNA synthesis step.


In some instances, the amplification method is Nucleic Acid Sequence Based Amplification (NASBA). NASBA (also known as 3SR, and transcription-mediated amplification) is an isothermal transcription-based RNA amplification system. Three enzymes (avian myeloblastosis virus reverse transcriptase, RNase H and T7 DNA dependent RNA polymerase) are used to generate single-stranded RNA. In certain cases NASBA can be used to amplify DNA. The amplification reaction is performed at 41° C., maintaining constant temperature, typically for about 60 to about 90 minutes (see, e.g., Fakruddin, et al., 2012, “Nucleic Acid Sequence Based Amplification (NASBA) Prospects and Applications,” Int. J. of Life Science and Pharma Res. 2(1):L106-L121, incorporated by reference herein).


In some instances, the NASBA reaction is carried out at about 40° C. to about 42° C. In some instances, the NASBA reaction is carried out at 41° C. In some instances, the NASBA reaction is carried out at at most about 42° C. In some instances, the NASBA reaction is carried out at about 40° C. to about 41° C., about 40° C. to about 42° C., or about 41° C. to about 42° C. In some instances, the NASBA reaction is carried out at about 40° C., about 41° C., or about 42° C.


In some instances, the amplification method is Strand Displacement Amplification (SDA). SDA is an isothermal amplification method that uses four different primers. A primer containing a restriction site (a recognition sequence for HincII exonuclease) is annealed to the DNA template. An exonuclease-deficient fragment of Eschericia coli DNA polymerase 1 (exo-Klenow) elongates the primers. Each SDA cycle consists of (1) primer binding to a displaced target fragment, (2) extension of the primer/target complex by exo-Klenow, (3) nicking of the resultant hemiphosphothioate HincII site, (4) dissociation of HincII from the nicked site and (5) extension of the nick and displacement of the downstream strand by exo-Klenow.


In some instances, methods comprise contacting DNA in a sample with a helicase. In some instances, the amplification method is Helicase Dependent Amplification (HDA). HDA is an isothermal reaction because a helicase, instead of heat, is used to denature DNA.


In some instances, the amplification method is Multiple Displacement Amplification (MDA). The MDA is an isothermal, strand-displacing method based on the use of the highly processive and strand-displacing DNA polymerase from bacteriophage 029, in conjunction with modified random primers to amplify the entire genome with high fidelity. It has been developed to amplify all DNA in a sample from a very small amount of starting material. In MDA 029 DNA polymerase is incubated with dNTPs, random hexamers and denatured template DNA at 30° C. for 16 to 18 hours and the enzyme must be inactivated at high temperature (65° C.) for 10 min. No repeated recycling is required, but a short initial denaturation step, the amplification step, and a final inactivation of the enzyme are needed.


In some instances, the amplification method is Rolling Circle Amplification (RCA). RCA is an isothermal nucleic acid amplification method which allows amplification of the probe DNA sequences by more than 109 fold at a single temperature, typically about 30° C. Numerous rounds of isothermal enzymatic synthesis are carried out by 029 DNA polymerase, which extends a circle-hybridized primer by continuously progressing around the circular DNA probe. In some instances, the amplification reaction is carried out using RCA, at about 28° C. to about 32° C.


Additional amplification methods can be found in the art that could be incorporated into devices and methods disclosed herein. Ideally, the amplification method is isothermal and fast relative to traditional PCR. In some instances, amplifying comprises performing an exponential amplification reaction (EXPAR), which is an isothermal molecular chain reaction in that the products of one reaction catalyze further reactions that create the same products. In some instances, amplifying occurs in the presence of an endonuclease. The endonuclease may be a nicking endonuclease. See, e.g., Wu et al., “Aligner-Mediated Cleavage of Nucleic Acids,” Chemical Science (2018). In some instances, amplifying does not require initial heat denaturation of target DNA. See, e.g., Toley et al., “Isothermal strand displacement amplification (iSDA): a rapid and sensitive method of nucleic acid amplification for point-of-care diagnosis,” The Analyst (2015). Pulse controlled amplification in an ultrafast amplification method developed by GNA Biosolutions GmbH.


In some instances, methods comprise performing multiple cycles of nucleic acid amplification with a pair of primers. The number of amplification cycles is important because amplification may introduce a bias into the representation of regions. With ultra low input amounts, amplification is even more prone to create biases and hence increasing efficiency prior to amplification is important for high accuracy. Not all regions amplify with the same efficiency and therefore the overall representation may not be uniform which will impact the accuracy of the analysis. Usually fewer cycles are ideal if amplification is necessary at all. In some instances, methods comprise performing fewer than 30 cycles of amplification. In some instances, methods comprise performing fewer than 25 cycles of amplification. In some instances, methods comprise performing fewer than 20 cycles of amplification. In some instances, methods comprise performing fewer than 15 cycles of amplification. In some instances, methods comprise performing fewer than 12 cycles of amplification. In some instances, methods comprise performing fewer than 11 cycles of amplification. In some instances, methods comprise performing fewer than 10 cycles of amplification. In some instances, methods comprise performing at least 3 cycles of amplification. In some instances, methods comprise performing at least 5 cycles of amplification. In some instances, methods comprise performing at least 8 cycles of amplification. In some instances, methods comprise performing at least 10 cycles of amplification.


In some instances, the amplification reaction is carried for about 5 to about 90 minutes. In some instances, the amplification reaction is carried out for at least about 30 minutes. In some instances, the amplification reaction is carried out for at most about 90 minutes. In some instances, the amplification reaction is carried out for about 30 minutes to about 35 minutes, about 30 minutes to about 40 minutes, about 30 minutes to about 45 minutes, about 30 minutes to about 50 minutes, about 30 minutes to about 55 minutes, about 30 minutes to about 60 minutes, about 30 minutes to about 65 minutes, about 30 minutes to about 70 minutes, about 30 minutes to about 75 minutes, about 30 minutes to about 80 minutes, about 30 minutes to about 90 minutes, about 35 minutes to about 40 minutes, about 35 minutes to about 45 minutes, about 35 minutes to about 50 minutes, about 35 minutes to about 55 minutes, about 35 minutes to about 60 minutes, about 35 minutes to about 65 minutes, about 35 minutes to about 70 minutes, about 35 minutes to about 75 minutes, about 35 minutes to about 80 minutes, about 35 minutes to about 90 minutes, about 40 minutes to about 45 minutes, about 40 minutes to about 50 minutes, about 40 minutes to about 55 minutes, about 40 minutes to about 60 minutes, about 40 minutes to about 65 minutes, about 40 minutes to about 70 minutes, about 40 minutes to about 75 minutes, about 40 minutes to about 80 minutes, about 40 minutes to about 90 minutes, about 45 minutes to about 50 minutes, about 45 minutes to about 55 minutes, about 45 minutes to about 60 minutes, about 45 minutes to about 65 minutes, about 45 minutes to about 70 minutes, about 45 minutes to about 75 minutes, about 45 minutes to about 80 minutes, about 45 minutes to about 90 minutes, about 50 minutes to about 55 minutes, about 50 minutes to about 60 minutes, about 50 minutes to about 65 minutes, about 50 minutes to about 70 minutes, about 50 minutes to about 75 minutes, about 50 minutes to about 80 minutes, about 50 minutes to about 90 minutes, about 55 minutes to about 60 minutes, about 55 minutes to about 65 minutes, about 55 minutes to about 70 minutes, about 55 minutes to about 75 minutes, about 55 minutes to about 80 minutes, about 55 minutes to about 90 minutes, about 60 minutes to about 65 minutes, about 60 minutes to about 70 minutes, about 60 minutes to about 75 minutes, about 60 minutes to about 80 minutes, about 60 minutes to about 90 minutes, about 65 minutes to about 70 minutes, about 65 minutes to about 75 minutes, about 65 minutes to about 80 minutes, about 65 minutes to about 90 minutes, about 70 minutes to about 75 minutes, about 70 minutes to about 80 minutes, about 70 minutes to about 90 minutes, about 75 minutes to about 80 minutes, about 75 minutes to about 90 minutes, or about 80 minutes to about 90 minutes. In some instances, the amplification reaction is carried out for about 30 minutes, about 35 minutes, about 40 minutes, about 45 minutes, about 50 minutes, about 55 minutes, about 60 minutes, about 65 minutes, about 70 minutes, about 75 minutes, about 80 minutes, or about 90 minutes.


In some instances, methods disclosed herein comprise amplifying a nucleic acid at least at one temperature. In some instances, methods disclosed herein comprise amplifying a nucleic acid at a single temperature (e.g., isothermal amplification). In some instances, methods disclosed herein comprise amplifying a nucleic acid, wherein the amplifying occurs at not more than two temperatures. Amplifying may occur in one step or multiple steps. Non-limiting examples of amplifying steps include double strand denaturing, primer hybridization, and primer extension.


In some instances, at least one step of amplifying occurs at room temperature. In some instances, all steps of amplifying occur at room temperature. In some instances, at least one step of amplifying occurs in a temperature range. In some instances, all steps of amplifying occur in a temperature range. In some instances, the temperature range is about 0° C. to about 100° C. In some instances, the temperature range is about 15° C. to about 100° C. In some instances, the temperature range is about 25° C. to about 100° C. In some instances, the temperature range is about 35° C. to about 100° C. In some instances, the temperature range is about 55° C. to about 100° C. In some instances, the temperature range is about 65° C. to about 100° C. In some instances, the temperature range is about 15° C. to about 80° C. In some instances, the temperature range is about 25° C. to about 80° C. In some instances, the temperature range is about 35° C. to about 80° C. In some instances, the temperature range is about 55° C. to about 80° C. In some instances, the temperature range is about 65° C. to about 80° C. In some instances, the temperature range is about 15° C. to about 60° C. In some instances, the temperature range is about 25° C. to about 60° C. In some instances, the temperature range is about 35° C. to about 60° C. In some instances, the temperature range is about 15° C. to about 40° C. In some instances, the temperature range is about −20° C. to about 100° C. In some instances, the temperature range is about −20° C. to about 90° C. In some instances, the temperature range is about −20° C. to about 50° C. In some instances, the temperature range is about −20° C. to about 40° C. In some instances, the temperature range is about −20° C. to about 10° C. In some instances, the temperature range is about 0° C. to about 100° C. In some instances, the temperature range is about 0° C. to about 40° C. In some instances, the temperature range is about 0° C. to about 30° C. In some instances, the temperature range is about 0° C. to about 20° C. In some instances, the temperature range is about 0° C. to about 10° C. In some instances, the temperature range is about 15° C. to about 100° C. In some instances, the temperature range is about 15° C. to about 90° C. In some instances, the temperature range is about 15° C. to about 80° C. In some instances, the temperature range is about is about 15° C. to about 70° C. In some instances, the temperature range is about 15° C. to about 60° C. In some instances, the temperature range is about 15° C. to about 50° C. In some instances, the temperature range is about 15° C. to about 30° C. In some instances, the temperature range is about 10° C. to about 30° C. In some instances, methods disclose herein are performed at room temperature, not requiring cooling, freezing or heating. In some instances, amplifying comprises contacting the sample with random oligonucleotide primers. In some instances, amplifying comprises contacting cell-free nucleic acid molecules disclosed herein with random oligonucleotide primers. In some instances, amplifying comprises contacting cell-free fetal nucleic acid molecules disclosed herein with random oligonucleotide primers. In some instances, amplifying comprises contacting the tagged nucleic acid molecules disclosed herein with random oligonucleotide primers. Amplifying with a plurality of random primers generally results in non-targeted amplification of multiple nucleic acids of different sequences or an overall amplification of most nucleic acids in a sample.


In some instances, amplifying comprises targeted amplification (e.g., selector method (described in U.S. Pat. No. 6,558,928), molecular inversion probes). In some instances, amplifying a nucleic acid comprises contacting a nucleic acid with at least one primer having a sequence corresponding to a target chromosome sequence. Exemplary chromosome sequences are disclosed herein. In some instances, amplifying comprises contacting the nucleic acid with at least one primer having a sequence corresponding to a non-target chromosome sequence. In some instances, amplifying comprises contacting the nucleic acid with not more than one pair of primers, wherein each primer of the pair of primers comprises a sequence corresponding to a sequence on a target chromosome disclosed herein. In some instances, amplifying comprises contacting the nucleic acid with multiple sets of primers, wherein each of a first pair in a first set and each of a pair in a second set are all different.


In some instances, amplifying comprises contacting the sample with at least one primer having a sequence corresponding to a sequence on a target chromosome disclosed herein. In some instances, amplifying comprises contacting the sample with at least one primer having a sequence corresponding to a sequence on a non-target chromosome disclosed herein. In some instances, amplifying comprises contacting the sample with not more than one pair of primers, wherein each primer of the pair of primers comprises a sequence corresponding to a sequence on a target chromosome disclosed herein. In some instances, amplifying comprises contacting the sample with multiple sets of primers, wherein each of a first pair in a first set and each of a pair in a second set are all different.


In some instances, amplifying comprises multiplexing (nucleic acid amplification of a plurality of nucleic acids in one reaction). In some instances, multiplexing comprises contacting nucleic acids of the biological sample with a plurality of oligonucleotide primer pairs. In some instances, multiplexing comprising contacting a first nucleic acid and a second nucleic acid, wherein the first nucleic acid corresponds to a first sequence and the second nucleic acid corresponds to a second sequence. In some instances, the first sequence and the second sequence are the same. In some instances, the first sequence and the second sequence are different. In some instances, amplifying does not comprise multiplexing. In some instances, amplifying does not require multiplexing. In some instance, amplifying comprises nested primer amplification. Methods may comprise multiplex PCR of multiple regions, wherein each region comprises a single nucleotide polymorphism (SNP). Multiplexing may occur in a single tube. In some instances, methods comprise multiplex PCR of more than 100 regions wherein each region comprises a SNP. In some instances, methods comprise multiplex PCR of more than 500 regions wherein each region comprises a SNP. In some instances, methods comprise multiplex PCR of more than 1000 regions wherein each region comprises a SNP. In some instances, methods comprise multiplex PCR of more than 2000 regions wherein each region comprises a SNP. In some instances, methods comprise multiplex PCR of more than 300 regions wherein each region comprises a SNP.


In some instances, methods comprise amplifying a nucleic acid in the sample, wherein amplifying comprises contacting the sample with at least one oligonucleotide primer, wherein the at least one oligonucleotide primer is not active or extendable until it is in contact with the sample. In some instances, amplifying comprises contacting the sample with at least one oligonucleotide primer, wherein the at least one oligonucleotide primer is not active or extendable until it is exposed to a selected temperature. In some instances, amplifying comprises contacting the sample with at least one oligonucleotide primer, wherein the at least one oligonucleotide primer is not active or extendable until it is contacted with an activating reagent. By way of non-limiting example, the at least one oligonucleotide primer may comprise a blocking group. Using such oligonucleotide primers may minimize primer dimers, allow recognition of unused primer, and/or avoid false results caused by unused primers. In some instances, amplifying comprises contacting the sample with at least one oligonucleotide primer comprising a sequence corresponding to a sequence on a target chromosome disclosed herein.


In some instances, methods disclosed herein comprise the use of one or more tags. The use of one or more tags may increase at least one of the efficiency, speed and accuracy of methods disclosed herein. In some instances, the oligonucleotide primer comprises a tag, wherein the tag is not specific to a target sequence. Such a tag may be referred to as a universal tag. In some instances, methods comprise tagging a target sequence, or fragment thereof, in the sample with a tag that is not specific to the target sequence. In some instances, the tag that is not specific to a sequence on a human chromosome. Alternatively or additionally, methods comprise contacting the sample with a tag and at least one oligonucleotide primer comprising a sequence corresponding to a target sequence, wherein the tag is separate from the oligonucleotide primer. In some instances, the tag is incorporated in an amplification product produced by extension of the oligonucleotide primer after it hybridizes to the target sequence. The tag may be an oligonucleotide, a small molecule, or a peptide. In some instances, the tag does not comprise a nucleotide. In some instances, the tag does not comprise an oligonucleotide. In some instances, the tag does not comprise an amino acid. In some instances, the tag does not comprise a peptide. In some instances, the tag is not sequence specific. In some instances, the tag comprises a generic sequence that does not correspond to any particular target sequence. In some instances, the tag is detectable when an amplification product is produced, regardless of the sequence amplified. In some instances, at least one of the oligonucleotide primer and tag comprises a peptide nucleic acid (PNA). In some instances, at least one of the oligonucleotide primer and tag comprises a locked nucleic acid (LNA).


In some instances, methods disclosed herein comprise the use of a plurality of tags, thereby increasing at least one of the accuracy of the method, speed of the method and information obtained by the method. In some instances, methods disclosed herein comprise the use of a plurality of tags, thereby decreasing the volume of sample required to obtain a reliable result. In some instances, the plurality of tags comprises at least one capture tag. In some instances, the plurality of tags comprises at least one detection tag. In some instances, the plurality of tags comprises a combination of least one capture tag and at least one detection tag. A capture tag is generally used to isolate or separate a specific sequence or region from other regions. A typical example for a capture tag is biotin (that can be captured using streptavidin coated surfaces for example). Examples of detection tags are digoxigenin and a fluorescent tag. The detection tag may be detected directly (e.g., laser irradiation and/or measuring emitted light) or indirectly through an antibody that carries or interacts with a secondary detection system such as a luminescent assay or enzymatic assay. In some instances, the plurality of tags comprises a combination of least one capture tag (a tag used to isolate an analyte) and at least one detection tag (a tag used to detect the analyte). In some instance, a single tag acts as a detection tag and a capture tag.


In some instances, methods comprise contacting the at least one circulating cell-free nucleic acid in the sample with a first tag and a second tag, wherein the first tag comprises a first oligonucleotide that is complementary to a sense strand of the circulating cell-free nucleic acid, and the second capture tag comprises a second oligonucleotide that is complementary to an antisense strand of the circulating cell-free nucleic acid. In some instances, methods comprise contacting the at least one circulating cell-free nucleic acid in the sample with a first tag and a second tag, wherein the first tag carries the same label as the second tag. In some instances, methods comprise contacting the at least one circulating cell-free nucleic acid in the sample with a first tag and a second tag, wherein the first tag carries a different label than the second tag. In some instances, the tags are the same and there is a single qualitative or quantitative signal that is the aggregate of all probes/regions detected. In some instances, the tags are different. One tag may be used to purify and one tag may be used to detect. In some instances, a first oligonucleotide tag is specific to a region (e.g., cfDNA fragment) and carries a fluorescent label and a second oligonucleotide is specific to an adjacent region and carries the same fluorescent label because only the aggregate signal is desired. In other instances, a first oligonucleotide tag is specific to a region (e.g., cfDNA fragment) and carries a fluorescent label and a second oligonucleotide is specific to an adjacent region and carries a different fluorescent label to detect two distinct regions.


In some instances, methods comprise detecting an amplification product, wherein the amplification product is produced by amplifying at least a portion of a target chromosome disclosed herein, or fragment thereof. The portion or fragment of the target chromosome may comprise at least 5 nucleotides. The portion or fragment of the target chromosome may comprise at least about 10 nucleotides. The portion or fragment of the target chromosome may comprise at least about 15 nucleotides. In some instances, detecting amplification products disclosed herein does not comprise tagging or labeling the amplification product. In some instances, methods detect the amplification product based on its amount. For example, the methods may detect an increase in the amount of double stranded DNA in the sample. In some instances, detecting the amplification product is at least partially based on its size. In some instances, the amplification product has a length of about 50 base pairs to about 500 base pairs.


In some instances, detecting the amplification product comprises contacting the amplification product with a tag. In some instances, the tag comprises a sequence that is complementary to a sequence of the amplification product. In some instances, the tag does not comprise a sequence that is complementary to a sequence of the amplification product. Non-limiting examples of tags are described in the foregoing and following disclosure.


In some instances, detecting the amplification product, whether tagged or not tagged, comprises subjecting the amplification product to a signal detector or assay assembly of a device, system, or kit disclosed herein. In some instances, methods comprise comprises amplifying and detecting on an assay assembly of a device, system, or kit disclosed herein. In some instances, the assay assembly comprises amplification reagents. In some instances, methods comprise applying an instrument or reagent to an assay assembly (e.g., lateral flow assay) disclosed herein to control the flow of a biological sample, solution, or combination thereof, through the lateral flow assay. In some instances, the instrument is a vacuum, a pipet, a pump, or a combination thereof.


Sequencing

In some instances, methods disclosed herein comprise sequencing a nucleic acid. The nucleic acid may be a nucleic acid disclosed herein, such as a tagged nucleic acid, an amplified nucleic acid, a cell-free nucleic acid, a cell-free fetal nucleic acid, a nucleic acid having a sequence corresponding to a target chromosome, a nucleic acid having a sequence corresponding to a region of a target chromosome, a nucleic acid having a sequence corresponding to a non-target chromosome, or a combination thereof. In some instances, the nucleic acid is DNA. In some instances, the nucleic acid is RNA. In some instances, the nucleic acid comprises DNA. In some instances, the nucleic acid comprises RNA.


In some instances, sequencing comprises targeted sequencing. In some instances, sequencing comprises whole genome sequencing. In some instances, sequencing comprises targeted sequencing and whole genome sequencing. In some instances, whole genome sequencing comprises massive parallel sequencing, also referred to in the art as next generation sequencing or second generation sequencing. In some instances, whole genome sequencing comprises random massive parallel sequencing. In some instances, sequencing comprises random massive parallel sequencing of target regions captured from a whole genome library.


In some instances, methods comprise sequencing amplified nucleic acids disclosed herein. In some instances, amplified nucleic acids are produced by targeted amplification (e.g., with primers specific to target sequences of interest). In some instances, amplified nucleic acids are produced by non-targeted amplification (e.g., with random oligonucleotide primers). In some instances, methods comprise sequencing amplified nucleic acids, wherein the sequencing comprises massive parallel sequencing.


In some instances, methods comprise performing a genome sequence alignment using an algorithm. By way of non-limiting example, the algorithm may be designed to recognize a chromosome copy number. The algorithm may be designed to reveal an observed number of sequence reads associated with each relevant allele at various SNP loci. The algorithm may use parental genotypes and crossover frequency data to create monosomic, disomic and trisomic fetal genotypes at measured loci in silico, which are then used to predict sequencing data for each genotype. Using a Bayesian model, the sequencing data with the maximum likelihood is selected as the copy number and fetal fraction and the likelihood is the calculated accuracy. Different probability distributions may be expected for each of the two possible alleles for each SNP and compared the observed alleles. This is described by Zimmermann et al., in Prenat Diagn (2012) 32:1233-1241. However, Zimmermann et al. believed that samples containing less than a 4.0% fetal fraction could not be informative and that a volume of at least 20 ml of blood was necessary to get enough cell-free DNA to perform this type of analysis. In contrast, the methods of the instant application may employ this analysis with samples with less than a 4% fetal fraction and samples that do not require nearly as much sample.


Library Preparation

In some instances, methods disclosed herein comprise modifying cell-free nucleic acids in the biological sample to produce a library of cell-free nucleic acids for detection. In some instances, methods comprise modifying cell-free nucleic acids for nucleic acid sequencing. In some instances, methods comprise modifying cell-free nucleic acids for detection, wherein detection does not comprise nucleic acid sequencing. In some instances, methods comprise modifying cell-free nucleic acids for detection, wherein detection comprises counting tagged cell-free nucleic acids based on an occurrence of tag detection. In some instances, methods disclosed herein comprise modifying cell-free nucleic acids in the biological sample to produce a library of cell-free nucleic acids, wherein the method comprises amplifying the cell-free nucleic acids. In some instances, modifying occurs before amplifying. In some instances, modifying occurs after amplifying.


In some instances, modifying the cell-free nucleic acids comprises repairing ends of cell-free nucleic acids that are fragments of a nucleic acid. By way of non-limiting example, repairing ends may comprise restoring a 5′ phosphate group, a 3′ hydroxy group, or a combination thereof to the cell-free nucleic acid. In some instances, repairing comprises 5′-phosphorylation, A-tailing, gap filling, closing nick sites or a combination thereof. In some instances, repairing may comprise removing overhangs. In some instances, repairing may comprise filling in overhangs with complementary nucleotides. In some instances, modifying the cell-free nucleic acids for preparing a library comprises use of an adapter. The adapter may also be referred to herein as a sequencing adapter. In some instances, the adapter aids in sequencing. Generally, the adapter comprises an oligonucleotide. By way of non-limiting example, the adapter may simplify other steps in the methods, such as amplifying, purification and sequencing because it is a sequence that is universal to multiple, if not all, cell-free nucleic acids in a sample after modifying. In some instances, modifying the cell-free nucleic acids comprises ligating an adapter to the cell-free nucleic acids. Ligating may comprise blunt ligation. In some instances, modifying the cell-free nucleic acids comprises hybridizing an adapter to the nucleic acids. In some instances, the sequencing adaptor comprises a hairpin or stem-loop adaptor. In some instances, modifying the cell-free nucleic acids comprises hybridizing a hairpin or stem-loop adaptor to the nucleic acids, thereby generating a circular library product that is sequenced or analyzed. In some instances, the sequencing adaptor comprises a blocked 5′ end leaving a nick at the 3′ end. Advantages of this configuration include, but are not limited to, an increase in library efficiency and reduction of unwanted byproducts such as adaptor dimers. In further instances the adaptor has a cleavable replication stop to linearize templates.


The efficiency of library preparation steps (e.g., end repair, tailing, and ligation of adaptors) and amplifying may benefit from the addition of crowding agents to the sample or the amplifying reaction. Enzymatic processes in their natural environments (e.g., DNA replication in a cell) often occur in a crowded environment. Some of these enzymatic processes are more efficient in a crowded environment. For example, a crowded environment may enhance the activity of DNA helicase and the sensitivity of DNA polymerase. Thus, crowding agents can be added to mimic the crowded environment. The crowding agent may be a polymer. The crowding agent may be a protein. The crowding agent may be a polysaccharide. Non-limiting examples of crowding agents are polyethylene glycol, dextran and Ficoll. Concentrations that mimic crowding in vivo are often desirable. For example, 4% (40 mg/ml) PEG 1 kDa provides an approximate crowding effect found in vivo. In some instances, the concentration of the crowding agent is about 2% to about 20% w/v in the amplification reaction. In some instances, the concentration of the crowding agent is about 2% to about 15% w/v in the amplification reaction. In some instances, the concentration of the crowding agent is about 2% to about 10% w/v in the amplification reaction. In some instances, the concentration of the crowding agent is about 2% to about 8% w/v in the amplification reaction. In some instances, the concentration of the crowding agent is about 3% to about 6% w/v in the amplification reaction.


In some instances, modifying the cell-free nucleic acids for preparing a library comprises use of a tag. The tag may also be referred to herein as a barcode. In some instances, methods disclosed herein comprise modifying cell-free nucleic acids with a tag that corresponds to a chromosomal region of interest. In some instances, methods disclosed herein comprise modifying cell-free nucleic acids with a tag that is specific to a chromosomal region that is not of interest. In some instances, methods disclosed herein comprise modifying a first portion of cell-free nucleic acids with a first tag that corresponds to at least one chromosomal region that is of interest and a second portion of cell-free nucleic acids with a second tag that corresponds to at least one chromosomal region that is not of interest. In some instances, modifying the cell-free nucleic acids comprises ligating a tag to the cell-free nucleic acids. Ligating may comprise blunt ligation. In some instances, modifying the cell-free nucleic acids comprises hybridizing a tag to the nucleic acids. In some instances, the tags comprise oligonucleotides. In some instances, the tags comprise a non-oligonucleotide marker or label that can be detected by means other than nucleic acid analysis. By way of non-limiting example, a non-oligonucleotide marker or label could comprise a fluorescent molecule, a nanoparticle, a dye, a peptide, or other detectable/quantifiable small molecule.


In some embodiments, the tagging of (c) comprises: (a) generating ligation competent cell-free DNA by one or more steps comprising: (i) generating a blunt end of the cell-free DNA, In some embodiments, a 5′ overhang or a 3′ recessed end is removed using one or more polymerase and one or more exonuclease; (ii) dephosphorylating the blunt end of the cell-free DNA; (iii) contacting the cell-free DNA with a crowding reagent thereby enhancing a reaction between the one or more polymerases, one or more exonucleases, and the cell-free DNA; or (iv) repairing or remove DNA damage in the cell-free DNA using a ligase; and (b) ligating the ligation competent cell-free DNA to adaptor oligonucleotides by contacting the ligation competent cell-free DNA to adaptor oligonucleotides in the presence of a ligase, crowding reagent, and/or a small molecule enhancer. In some embodiments, the one or more polymerases comprises T4 DNA polymerase or DNA polymerase I. In some embodiments, the one or more exonucleases comprises T4 polynucleotide kinase or exonuclease III. In some embodiments, the ligase comprises T3 DNA ligase, T4 DNA ligase, T7 DNA ligase, Taq Ligase, Ampligase, E. coli Ligase, or Sso7-ligase fusion protein. In some embodiments, the crowding reagent comprises polyethylene glycol (PEG), glycogen, or dextran, or a combination thereof. In some embodiments, the small molecule enhancer comprises dimethyl sulfoxide (DMSO), polysorbate 20, formamide, or a diol, or a combination thereof. In some embodiments, ligating in (b) comprises blunt end ligating, or single nucleotide overhang ligating. In some embodiments, the adaptor oligonucleotides comprise Y shaped adaptors, hairpin adaptors, stem loop adaptors, degradable adaptors, blocked self-ligating adaptors, or barcoded adaptors, or a combination thereof.


In some instances, modifying the cell-free nucleic acids for preparing a library comprises use of a sample index, also simply referred to herein as an index. By way of non-limiting example, the index may comprise an oligonucleotide, a small molecule, a nanoparticle, a peptide, a fluorescent molecule, a dye, or other detectable/quantifiable moiety. In some instances, a first group of cell-free nucleic acids from a first biological sample are labeled with a first index, and a first group of cell-free nucleic acids from a first biological sample are labeled with a second index, wherein the first index and the second index are different. Thus, multiple indexes allow for distinguishing cell-free nucleic acids from multiple samples when multiple samples are analyzed at once. In some instances, methods disclose amplifying cell-free nucleic acids wherein an oligonucleotide primer used to amplify the cell-free nucleic acids comprises an index.


While DNA loss can occur at every step of DNA isolation and analysis, the highest loss typically appears at the step of library preparation. Traditional methods show losses of 80% to 90% of material. Often this loss is compensated by a subsequent amplification step to bring the concentration of DNA up to the necessary level required for next generation sequencing, but the amplification cannot compensate for a loss of information that occurred during the prior steps. A library suffering a loss of 80% of initial DNA in the sample can be described as a library with a 20% efficiency or an efficiency of 0.2. In some instances, methods disclosed herein comprise achieving a library with an efficiency of at least about 0.2, at least about 0.3, at least about 0.4, at least about 0.5, at least about 0.6 or at least about 0.8. In some instances, methods disclosed herein comprise producing a library with an efficiency of at least about 0.4. In some instances, methods disclosed herein comprise producing a library with an efficiency of at least about 0.5. Methods that produce a library with such efficiencies may achieve these efficiencies by using crowding agents and repairing cell-free DNA fragment ends, ligation methods, purification methods, cycling parameters and stoichiometric ratios as described herein. In some instances, methods of library preparation do not require nucleic acid amplification of the cell-free nucleic acids. In some cases, a library of cell-free nucleic acids is generated using an in vitro CRISPR-Cas-targeted cleavage process, whereby guide RNAs (gRNAs) are binding to complementary DNA target sites and in combination with, for example, Cas9 can produce a blunt-ended, double-stranded break or single stranded nicks, if engineered proteins are used. This DNA break can be used to modify the cleaved DNA and make it competent for ligation and/or amplification processes and subsequent analysis of these target regions. Thus, CRISPR is used as a target enrichment process for DNA analysis or sequencing. Non-limiting examples of natural and engineered CRISPR-Cas combinations useful for this purpose include, Cas9, Cas12, Cascade and Cas13, or subtypes thereof. In some cases, the Cas enzyme is a Cas orthologue, such as those described in Adrian Pickar-Oliver et al., The next generation of CRISPR-Cas technologies and applications, Nat Rev Mol Cell Biol. 2019 August; 20(8):490-507, which is incorporated here in its entirety.


In a further example, the gRNAs are engineered to increase target fragmentation and decreasing off-target fragmentation, thereby increasing library efficiency without amplification. In some cases, the library is treated with an exonuclease, thereby depleting non-target DNA fragments and enriching target fragments in the library.


Endonucleases described herein, in some cases, function by initiating a single or a double stranded DNA break. In some cases, the single stranded or double stranded DNA breaks are immediately adjacent to, within, or to either side of the guide RNA binding sites. In this manner, the endonucleases described herein can be designed to provide a specific solution to the analytic question at hand. In some instances, library preparation is mediated by a transposase enzyme that fragments double stranded DNA and ligates a synthetic tag on the 3′ and 5′ ends of the fragment (e.g., oligonucleotides). In some cases, the process is tagmentation-based library construction. In some cases, the transposase operates with a “cut-and-paste” mechanism. In some cases, the transposase is Tn5 transposase, or a variant thereof. In some cases, the tag is an oligonucleotide that is about 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, or 15 base pairs in length. In some cases, the tag is a free synthetic ME adaptor, such as those provided in NExtera DNA kits (Illumina). In some cases, the fragmented DNA is amplified by methods described herein.


Detecting Genetic Information

In general, methods disclosed herein comprise detecting a biomarker, an analyte or a modified form thereof. In some instances, methods comprise detecting nucleic acids. In some instances, methods comprise detecting cell-free nucleic acids. In some instances, methods comprise detecting a tag of a nucleic acid. In some instances, methods comprise detecting an amplicon of a nucleic acid. Alternatively or additionally, methods comprise detecting a non-nucleic acid component. By way of non-limiting example, the non-nucleic acid component may be selected from a protein, a peptide, a lipid, a fatty acid, a sterol, a phospholipid, a carbohydrate, a viral component, a microbial component, and a combination thereof. In the instance of a viral component or a microbial component, methods may comprise releasing, purifying, and/or amplifying a nucleic acid from a virus or bacteria before detecting.


Detecting may comprise sequencing a nucleic acid of interest. Detecting may comprise detecting a tag on a nucleic acid of interest. Detecting may comprise detecting a tag on a biomarker of interest. The biomarker may be an epigenetic modification. The biomarker may be an epigenetic profile (plurality of epigenetic modifications). The biomarker may be an epigenetically modified nucleic acid. Detecting may comprise bisulfite sequencing. Detecting may comprise performing a chromatin immunoprecipitation (ChIP) assay. Detecting may comprise sequencing a tag on a biomarker of interest.


Detecting may comprise amplifying, as described herein. For example, amplifying may comprise qPCR in which a signal is generated based on the presence or absence of a target analyte. In some instances, amplifying comprises PCR. In some instances, amplifying does not comprise PCR. In some instances, amplifying comprises rolling circle amplification (RCA). In some instances, cfDNA is contacted with a DNA ligase and probes designed to hybridize to cfDNA. In some instances, cfDNA is first cleaved (e.g., subjected to a restriction enzyme) to produce cfDNA fragments and the cfDNA fragments are contacted with the ligase and probes. The ligase creates circularized cfDNA labeled with probes. Optionally a backbone oligo is used to circularize the cfDNA or cfDNA fragments. These circularized fragments are replicated by RCA to produce concatamers. The probes can be recognized with a detectable oligonucleotide (e.g., fluorescent) and imaged.


Methods may comprise detecting a genetic mutation in a nucleic acid of a biological sample. Methods may comprise detecting a plurality of genetic mutations in a nucleic acid of a biological sample. Methods may comprise detecting a genetic mutation in each of a plurality of nucleic acids of a biological sample. Methods may comprise detecting a plurality of genetic mutations in a plurality of nucleic acids of a biological sample.


Methods may comprise detecting an epigenetic modification of a nucleic acid of a biological sample. In some instances, detecting the epigenetic modification comprises performing bisulfite sequencing. In some instances, detecting the epigenetic modification comprises performing a chromatin immunoprecipitation (ChIP) assay. In some instances, the epigenetic modification is a heritable alteration. In some instances, the epigenetic modification is an alteration that allows a cell to affect transcription in response to one or more environmental stimuli. By way of non-limiting example, the epigenetic modification may be a methylation of a cytosine or adenine residue. In some instances, the epigenetic modification is an absence of a methyl group. Typically methylations promote silencing of a gene. Epigenetic modifications also include acetylation, methylation, ubiquitination and phosphorylation of histones. The epigenetic modification may promote, inhibit, prevent or reduce a biological process (e.g., an immune response, cellular proliferation). Methods may comprise detecting a plurality of epigenetic modifications of a nucleic acid of a biological sample. Methods may comprise detecting an epigenetic modification of each of a plurality of nucleic acids of a biological sample. Methods may comprise detecting an epigenetic modification of a plurality of nucleic acids of a biological sample. Methods may comprise performing a genome wide analysis of epigenetic modifications to identify differentially methylated regions between a test sample and a control/reference sample.


Methods may comprise detecting one or more epigenetic modifications that is specific to a tissue. For instances, tissues have distinct methylation profiles that can be used to track the origin of cell-free nucleic acids. This may be useful in determining where a cell-free nucleic acid originated. By way of non-limiting example, the epigenetic modification may be specific to the brain and a cell-free nucleic acid bearing that epigenetic modification may be indicative of a neurodegenerative disease or a brain tumor. Methods may further comprise testing, biopsying, imaging, or treating a tissue if such a cell-free nucleic acid is detected. Methods may comprise detecting one or more epigenetic modifications that is specific to only two tissues. Methods may comprise detecting one or more epigenetic modifications that is specific to fewer than three tissues. Methods may comprise detecting one or more epigenetic modifications that is specific to fewer than five tissues.


Methods may comprise detecting a detectable label or detectable signal of a nucleic acid or non-nucleic acid component. Methods may comprise detecting a detectable label or detectable signal of a binding moiety (e.g., small molecule, peptide, aptamer, antibody, or antigen binding fragment thereof) that binds the nucleic acid or non-nucleic acid component. By way of non-limiting example, the detectable label or signal may be a fluorescent molecule, a bioluminescent molecule, a luminescent molecule, a radioactive signal, a magnetic signal, an electric signal, or a dye. For example, methods may comprise detecting an interaction between the binding moiety and a protein of interest. By way of non-limiting example, detecting may comprise performing IPCR or PLA.


Detecting may comprise viewing an interface of a device or system disclosed herein where the result of a test is displayed. See, e.g., FIG. 4 and FIGS. 5A-E. Detecting may comprise viewing a color appearance or fluorescent signal on a lateral flow device. Detecting may comprise receiving a result of a test on a device disclosed herein. Detecting may comprise receiving a result of a test on a mobile device, computer, notepad or other electronic device in communication with a device of system disclosed herein.


Generally, the methods, kits, systems and devices disclosed herein are capable of providing genetic information in a short amount of time. In some instances, methods disclosed herein can be performed in less than about 1 minute. In some instances, methods disclosed herein can be performed in less than about 2 minutes. In some instances, methods disclosed herein can be performed in less than about 5 minutes. In some instances, methods disclosed herein can be performed in less than about 10 minutes. In some instances, methods disclosed herein can be performed in less than about 15 minutes. In some instances, methods disclosed herein can be performed in less than about 20 minutes. In some instances, methods disclosed herein can be performed in less than about 30 minutes. In some instances, methods disclosed herein can be performed in less than about 45 minutes. In some instances, methods disclosed herein can be performed in less than about 60 minutes. In some instances, methods disclosed herein can be performed in less than about 90 minutes. In some instances, methods disclosed herein can be performed in less than about 2 hours. In some instances, methods disclosed herein can be performed in less than about 3 hours. In some instances, methods disclosed herein can be performed in less than about 4 hours.


In some instances, methods disclosed herein require minimal technical training. In some instances, methods disclosed herein do not require any technical training. In some instances, methods disclosed herein require only that an individual practicing the methods disclosed herein follow a simple protocol of transferring and mixing samples and solutions. For instance, methods disclosed herein may be used by the pregnant subject in her home without the assistance of a technician or medical provider. In some instances, methods disclosed herein can be performed by a user with no medical training or technical training. In some instances, methods, kits, systems and devices disclosed herein simply require that a user add a biological sample to the system or device and view a result to obtain genetic information.


Methods of Detecting Disease or Condition in a Subject

Methods may comprise detecting the presence of a disease or condition based on the detecting. Methods may comprise detecting the risk of a disease or condition based on the detecting. Methods may comprise detecting the status of a disease or condition based on the detecting. Methods may comprise monitoring the status of a disease or condition based on the detecting. Methods may comprise administering a therapy based on the detecting. Methods may comprise modifying the dose of a drug that is being administered to the subject based on the detecting. Methods may comprise monitoring the response of a subject to a therapy based on the detecting. For example, the disease may be a cancer and the therapy may be a chemotherapy. Other cancer therapies include, but are not limited to antibodies, antibody-drug conjugates, antisense molecules, engineered T cells, and radiation. Methods may comprise further testing a subject based on the detecting. For example, the disease may be cancer and further testing may include, but is not limited to imaging (e.g., CAT-SCAN, PET-SCAN), and performing a biopsy.


In some instances, methods disclosed herein comprise detecting that there is a fetal aneuploidy of at least one target chromosome. In some instances, methods disclosed herein comprise detecting that there is a fetal aneuploidy of the at least one target chromosome when a quantity of sequencing reads is detected in a sample disclosed herein. In some instances, the quantity of sequencing reads corresponds to sequences from a chromosome or chromosome region that is known to present aneuploidy in the human population, as described herein.


In some instances, methods disclosed herein comprise detecting that there is a fetal aneuploidy of the at least one target chromosome when a ratio of sequencing reads corresponding to the at least one target chromosome to sequencing reads corresponding to the at least one non-target chromosome is different from a respective ratio in a control biological sample from a control pregnant subject with a euploid fetus. In some instances, methods disclosed herein comprise detecting that there is a fetal aneuploidy of the at least one target chromosome because a ratio of sequencing reads corresponding to the at least one target chromosome to sequencing reads corresponding to the at least one non-target chromosome is different from a respective ratio in a control biological sample from a control pregnant subject with a euploid fetus. In some instances, methods disclosed herein comprise detecting that there is not a fetal aneuploidy of the at least one target chromosome because a ratio of sequencing reads corresponding to the at least one target chromosome to sequencing reads corresponding to the at least one non-target chromosome is not different from a respective ratio in a control biological sample from a control pregnant subject with a euploid fetus.


In some instances, the sequencing reads corresponding to the at least one target chromosome comprises sequencing reads corresponding to a chromosome region of the at least one target chromosome. In some instances, the sequencing reads corresponding to the at least one non-target chromosome comprises sequencing reads corresponding to a chromosome region of the non-target chromosome. In some instances, the chromosome region is at least about 10 base pairs in length. In some instances, the chromosome region is at least about 20 base pairs in length. In some instances, the chromosome region is at least about 50 base pairs in length.


In some instances, the at least one target chromosome is at least one of chromosome 13, chromosome 16, chromosome 18, chromosome 21, chromosome 22, chromosome X, or chromosome Y. In some instances, the at least one target chromosome is at least one of chromosome 13, chromosome 18, and chromosome 21. In some instances, the at least one target chromosome is at least one of chromosome 13, chromosome 18, chromosome 21, and chromosome X. In some instances, the at least one target chromosome is at least one of chromosome 13, chromosome 18, chromosome 21, and chromosome Y. In some instances, the at least one target chromosome is at least one of chromosome 13, chromosome 18, chromosome 21, chromosome X, and chromosome Y. In some instances, the at least one target chromosome is chromosome 13. In some instances, the at least one target chromosome is chromosome 16. In some instances, the at least one target chromosome is chromosome 18. In some instances, the at least one target chromosome is chromosome 21. In some instances, the target chromosome is chromosome 22. In some instances, the at least one target chromosome is a sex chromosome. In some instances, the at least one target chromosome is chromosome X. In some instances, the at least one target chromosome is chromosome Y.


In some instances, the at least one non-target chromosome is at least one of a chromosome other than chromosome 13, chromosome 16, chromosome 18, chromosome 21, chromosome 22, chromosome X, or chromosome Y. In some instances, the at least one non-target chromosome is not chromosome13, chromosome 16, chromosome 18, chromosome 21, chromosome 22, chromosome X, or chromosome Y. In some instances, the at least one non-target chromosome is selected from chromosome 1, chromosome 2, chromosome 3, chromosome 4, chromosome 5, chromosome 6, chromosome 7, chromosome 8, chromosome 9, chromosome 10, chromosome 11, chromosome 12, chromosome 14, chromosome 15, chromosome 17, chromosome 19, and chromosome 20. In some instances, the non-target chromosome is chromosome 1. In some instances, the at least one non-target chromosome is chromosome 2. In some instances, the at least one non-target chromosome is chromosome 3. In some instances, the non-target chromosome is chromosome 4. In some instances, the at least one non-target chromosome is chromosome 5. In some instances, the at least one non-target chromosome is chromosome 6. In some instances, the at least one non-target chromosome is chromosome 7. In some instances, the at least one non-target chromosome is chromosome 8. In some instances, the at least one non-target chromosome is chromosome 9. In some instances, the at least one non-target chromosome is chromosome 10. In some instances, the at least one non-target chromosome is chromosome 11. In some instances, the at least one non-target chromosome is chromosome 12. In some instances, the at least one non-target chromosome is chromosome 14. In some instances, the at least one non-target chromosome is chromosome 15. In some instances, the at least one non-target chromosome is chromosome 17. In some instances, the at least one non-target chromosome is chromosome 19. In some instances, the at least one non-target chromosome is chromosome 20.


In some instances, the at least one target chromosome is chromosome 13, and the at least one non-target chromosome is a chromosome other than chromosome 13. In some instances, the at least one target chromosome is chromosome 16, and the at least one non-target chromosome is a chromosome other than chromosome 16. In some instances, the at least one target chromosome is chromosome 18, and the at least one non-target chromosome is a chromosome other than chromosome 18. In some instances, the at least one target chromosome is chromosome 21, and the at least one non-target chromosome is a chromosome other than chromosome 21. In some instances, the at least one target chromosome is chromosome 22, and the at least one non-target chromosome is a chromosome other than chromosome 22. In some instances, the at least one target chromosome is chromosome X, and the at least one non-target chromosome is a chromosome other than chromosome X. In some instances, the at least one target chromosome is chromosome Y, and the at least one non-target chromosome is a chromosome other than chromosome Y.


In some instances, methods disclosed herein comprise detecting that the fetus of the pregnant subject has a genetic abnormality. In some instances, the genetic abnormality is due to insertion of at least one nucleotide in a target chromosomal region. In some instances, the genetic abnormality is due to deletion of at least one nucleotide in a target chromosomal region. In some instances, the genetic abnormality is due to translocation of nucleotide between a first target chromosomal region and a second chromosomal target region. Generally, the first target chromosomal region and a second chromosomal target region are located on different chromosomes.


In some instances, the target chromosomal region is defined by a minimal length. In some instances, the target chromosomal region is at least about 50 base pairs in length. In some instances, the target chromosomal region is at least about 100 base pairs in length. In some instances, the target chromosomal region is at least about 200 base pairs in length. In some instances, the target chromosomal region is at least about 300 base pairs in length. In some instances, the target chromosomal region is at least about 500 base pairs in length. In some instances, the target chromosomal region is at least about 1000 base pairs in length.


In some instances, the target chromosomal region is defined by a maximum length. In some instances, the target chromosomal region is as long as about 100,000 base pairs. In some instances, the target chromosomal region is as long as about 500,000 base pairs. In some instances, the target chromosomal region is as long as about 1,000,000 base pairs. In some instances, the target chromosomal region is as long as about 10,000,000 base pairs. In some instances, the target chromosomal region is as long as about 100,000,000 base pairs. In some instances, the target chromosomal region is as long as about 200,000,000 base pairs.


In some instances, the genetic abnormality is a copy number variation. In some instances, the copy number variation comprises a deletion of a gene on at least one chromosome. In some instances, the copy number variation comprises a duplication of a gene on at least one chromosome. In some instances, the copy number variation comprises a triplication of a gene on at least one chromosome. In some instances, the copy number variation comprises more than three copies of the gene. In some instances, the copy number variation comprises a duplication of a non-protein coding sequence on at least one chromosome. In some instances, the copy number variation comprises a triplication of a non-coding region on at least one chromosome. In some instances, the copy number variation comprises a duplication of a non-coding region on at least one chromosome.


In some instances, the genetic abnormality results in at least about 0.001% of a chromosomal arm being duplicated. In some instances, the genetic abnormality results in at least about 0.01% of a chromosomal arm being duplicated. In some instances, the genetic abnormality results in at least about 0.1% of a chromosomal arm being duplicated. In some instances, the genetic abnormality results in at least about 1% of a chromosomal arm being duplicated. In some instances, the genetic abnormality results in at least about 10% of a chromosomal arm being duplicated. In some instances, at least about 20% of a chromosomal arm is duplicated. In some instances, at least about 30% of a chromosomal arm is duplicated. In some instances, at least about 50% of a chromosomal arm is duplicated. In some instances, at least about 70% of a chromosomal arm is duplicated. In some instances, at least about 90% of a chromosomal arm is duplicated. In some instances, an entire chromosomal arm is duplicated.


In some instances, the genetic abnormality results in at least about 0.001% of a chromosomal arm being deleted. In some instances, the genetic abnormality results in at least about 0.01% of a chromosomal arm being deleted. In some instances, the genetic abnormality results in at least about 0.1% of a chromosomal arm being deleted. In some instances, the genetic abnormality results in at least about 1% of a chromosomal arm being deleted. In some instances, the genetic abnormality results in at least about 10% of a chromosomal arm being deleted. In some instances, at least about 20% of a chromosomal arm is deleted. In some instances, at least about 30% of a chromosomal arm is deleted. In some instances, at least about 50% of a chromosomal arm is deleted. In some instances, at least about 70% of a chromosomal arm is deleted. In some instances, at least about 90% of a chromosomal arm is deleted. In some instances, an entire chromosomal arm is deleted.


In some instances, methods comprise detecting that the fetus has a genetic abnormality when a quantity of sequencing reads corresponding to the target chromosomal region are detected, wherein the quantity is indicative of the genetic abnormality.


In some instances methods disclosed herein comprise sequencing nucleic acids. In some instances, the nucleic acids are cell-free nucleic acids. In some instances, the nucleic acids comprise cell-free fetal nucleic acids. In some instances, the nucleic acids are cell-free fetal nucleic acids. In some instances methods disclosed herein comprise producing at least a minimum amount of sequencing reads. In some instances, the minimum amount of sequencing reads is about 100. In some instances, the minimum amount of sequencing reads is about 1000. In some instances, the minimum amount of sequencing reads is about 2000. In some instances, the minimum amount of sequencing reads is about 3000. In some instances, the minimum amount of sequencing reads is about 4000. In some instances, the minimum amount of sequencing reads is about 5000. In some instances, the minimum amount of sequencing reads is about 6000. In some instances, the minimum amount of sequencing reads is about 7000. In some instances, the minimum amount of sequencing reads is about 8000. In some instances, the minimum amount of sequencing reads is about 9000. In some instances, the minimum amount of sequencing reads is about 10,000.


In some instances, methods comprise detecting that the fetus has a genetic abnormality when a ratio of (1) sequencing reads corresponding to the target chromosomal region to (2) sequencing reads corresponding to the at least one non-target chromosomal region is different from a respective ratio in a control biological sample from a control pregnant subject with a fetus not having the genetic abnormality. In some instances, methods comprise detecting that the fetus has a genetic abnormality because a ratio of (1) sequencing reads corresponding to the target chromosomal region to (2) sequencing reads corresponding to the at least one non-target chromosomal region is different from a respective ratio in a control biological sample from a control pregnant subject with a fetus not having the genetic abnormality. In some instances, methods comprise detecting that the fetus does not have a genetic abnormality when a ratio of (1) sequencing reads corresponding to the target chromosomal region to (2) sequencing reads corresponding to the at least one non-target chromosomal region is not different from a respective ratio in a control biological sample from a control pregnant subject with a fetus not having the genetic abnormality. In some instances the chromosomal region and the non-target chromosomal region are on the same chromosome. In some instances the chromosomal region and the non-target chromosomal region are on different chromosomes.


In some instances, genetic information is detected with a certain degree of accuracy. Non-limiting examples of genetic information include fetal aneuploidy, genetic abnormality, presence/quantity of tumor DNA, and presence/quantity of transplanted organ/tissue DNA. In some instances, genetic information is detected with at least about 95% accuracy. In some instances, genetic information is detected with at least about 96% accuracy. In some instances, genetic information is detected with at least about 97% accuracy. In some instances, genetic information is detected with at least about 98% accuracy. In some instances, genetic information is detected with at least about 99% accuracy. In some instances, genetic information is detected with at least about 99.5% accuracy. In some instances, genetic information is detected with at least about 99.9% accuracy. In some instances, genetic information is detected with at least about 99.99% accuracy.


Reads from each chromosome are roughly represented according to the length of the chromosome. Most reads are obtained from chromosome 1, while the fewest reads from an autosome will originate from chromosome 21. A common method for detecting a trisomic sample is to measure the percentage of reads originating from a chromosome in a population of euploid samples. Next, a mean and a standard deviation for this set of chromosome percentage values are calculated. A cutoff value is determined by adding three standard deviations to the mean. If a new sample has a chromosome percentage value above the cutoff value, an overrepresentation of that chromosome can be assumed, which is often consistent with a trisomy of the chromosome. A prophetic example of detecting an over presentation of a chromosome is presented in Example 13.


In some instances, fetal aneuploidy is detected when the ratio of (1) sequencing reads corresponding to the at least one target chromosome to (2) sequencing reads corresponding to the at least one non-target chromosome differs from a respective ratio in a control biological sample from a control pregnant subject with a euploid fetus by at least about 0.1%. In some instances, the ratios differ by at least 1%.


In some instances, the control pregnant subject is a euploid pregnant subject. In some instances the control is a mean or median value from a group of pregnant subjects. In some instances the control is a mean or median value from a pool of plasma samples from pregnant subjects. In some instances, the control is a similarly obtained value from an artificial mixture of nucleic acids mimicking a pregnant subject with a euploid fetus. In some instances, the control pregnant subject is a euploid pregnant subject carrying a fetus with a euploid chromosome set. In some instances, the control pregnant subject does not have a genetic abnormality, e.g., copy number variation. In some instances, the fetus carried by the control pregnant subject does not have a genetic abnormality, e.g., copy number variation. In some instances, the control pregnant subject does not have a genetic abnormality in a target chromosome disclosed herein. In some instances, the fetus carried by the control pregnant subject does not have a genetic abnormality in a target chromosome disclosed herein. In some instances, at least one of the control pregnant subject and her fetus has an aneuploidy. In some instances, at least one of the control pregnant subject and her fetus has a genetic abnormality disclosed herein. In some instances, at least one of the control pregnant subject and her fetus has a genetic abnormality in a target chromosome disclosed herein. In some instances, methods disclosed herein comprise use of a respective ratio in a control biological sample from a control pregnant population. In some instances, the respective ratio is from a respective mean ratio in the control pregnant population. In some instances, the respective ratio is from a respective median ratio in the control pregnant population.


Methods of Detecting Fetal Gender

Disclosed herein, in some embodiments, are method comprising obtaining a biological sample from the female subject, wherein the biological sample contains at least one cell free fetal nucleic acid comprising a sequence unique to a Y chromosome. In some instances, the sequence unique to the Y chromosome is indicative that the fetus is a male. In some instances, methods disclosed herein comprise obtaining a biological sample from the female subject, wherein the biological sample contains about 1 to about 5 cell free fetal nucleic acids comprising a sequence unique to a Y chromosome. In some instances, methods disclosed herein comprise obtaining a biological sample from the female subject, wherein the biological sample contains about 1 to about 15 cell free fetal nucleic acids comprising a sequence unique to a Y chromosome. In some instances, methods disclosed herein comprise obtaining a biological sample from the female subject, wherein the biological sample contains about 1 to about 25 cell free fetal nucleic acids comprising a sequence unique to a Y chromosome. In some instances, methods disclosed herein comprise obtaining a biological sample from the female subject, wherein the biological sample contains about 1 to about 100 cell free fetal nucleic acids comprising a sequence unique to a Y chromosome. In some instances, methods disclosed herein comprise obtaining a biological sample from the female subject, wherein the biological sample contains about 5 to about 100 cell free fetal nucleic acids comprising a sequence unique to a Y chromosome.


By way of non-limiting example, methods may comprise obtaining a fluid sample from a female pregnant subject with a handheld device, wherein the volume of the fluid sample is not greater than about 300 μL; sequencing at least one cell free nucleic acid in the fluid sample with the handheld device; detecting the presence or absence of a sequence corresponding to a Y chromosome through a display in the handheld device, thereby determining a gender of a fetus in the female pregnant subject; and communicating, with the handheld device, the gender to another subject. In some instances, the volume of the biological sample is not greater than about 120 μl. In some instances, the methods comprise detecting sequencing reads corresponding to the Y chromosome.


Also by way of non-limiting example, methods may comprise obtaining a biological sample from a female subject, wherein the volume of the biological sample is not greater than about 1200; contacting the sample with an oligonucleotide primer comprising a sequence corresponding to a Y chromosome for amplifying at least one circulating cell free nucleic acid in the sample; detecting an absence of an amplification product, thereby indicating that the fetus is female. Obtaining, contacting and detecting may occur with a single device.


In some instances, devices, systems and kits disclosed herein comprise at least one nucleic acid amplification reagent and at least one oligonucleotide primer capable of amplifying a first sequence in a genome and a second sequence in a genome, wherein the first sequence and the second sequence are similar, and wherein the first sequence is physically distant enough from the second sequence such that the first sequence is present on a first cell-free nucleic acid of the subject and the second sequence is present on a second cell-free nucleic acid of the subject. In some instances, the at least two sequences are immediately adjacent. In some instances, the at least two sequences are separated by at least one nucleotide. In some instances, the at least two sequences are separated by at least two nucleotides. In some instances, the at least two sequences are separated by at least about 5, at least about 10, at least about 15, at least about 20, at least about 30, at least about 40, at least about 50, or at least about 100 nucleotides. In some instances, the at least two sequences are at least about 50% identical. In some instances, the at least two sequences are at least about 60% identical, at least about 60% identical, at least about 60%, at least about 70%, at least about 80%, at least about 90%, at least about 95%, at least about 99%, or 100% identical. In some instances, the first sequence and the second sequence are each at least 10 nucleotides in length. In some instances, the first sequence and the second sequence are each at least about 10, at least about 15, at least about 20, at least about 30, at least about 50, or at least about 100 nucleotides in length. In some instances, the first sequence and the second sequence are on the same chromosome. In some instances, the first sequence is on a first chromosome and the second sequence is on a second chromosome. In some instances, the first sequence and the second sequence are in functional linkage. For example, all CpG sites in the promotor region of gene AOX1 show the same hypermethylation in prostate cancer, so these sites are in functional linkage because they functionally carry the same information but are located one or more nucleotides apart.


In some instances, devices, systems and kits disclosed herein comprise at least one of an oligonucleotide probe or oligonucleotide primer that is capable of annealing to a strand of a cell-free nucleic acid, wherein the cell-free nucleic acid comprises a sequence corresponding to a region of interest or a portion thereof. In some instances, the region of interest is a region of a Y chromosome. In some instances, the region of interest is a region of an X chromosome. In some instances, the region of interest is a region of an autosome. In some instances, the region of interest, or portion thereof, comprises a repeat sequence as described herein that is present in a genome more than once.


In some instances, a region of interest disclosed herein is about 10 nucleotides to about 1,000,000 nucleotides in length. In some instances, the region of interest is at least 10 nucleotides in length. In some instances, the region of interest is at least 100 nucleotides in length. In some instances, the region is at least 1000 nucleotides in length. In some instances, the region of interest is about 10 nucleotides to about 500,000 nucleotides in length. In some instances, the region of interest is about 10 nucleotides to about 300,000 nucleotides in length. In some instances, the region of interest is about 100 nucleotides to about 1,000,000 nucleotides in length. In some instances, the region of interest is about 100 nucleotides to about 500,000 nucleotides in length. In some instances, the region of interest is about 100 nucleotides to about 300,000 base pairs in length. In some instances, the region of interest is about 1000 nucleotides to about 1,000,000 nucleotides in length. In some instances, the region of interest is about 1000 nucleotides to about 500,000 nucleotides in length. In some instances, the region of interest is about 1000 nucleotides to about 300,000 nucleotides in length. In some instances, the region of interest is about 10,000 nucleotides to about 1,000,000 nucleotides in length. In some instances, the region of interest is about 10,000 nucleotides to about 500,000 nucleotides in length. In some instances, the region of interest is about 10,000 nucleotides to about 300,000 nucleotides in length. In some instances, the region of interest is about 300,000 nucleotides in length.


In some instances, the sequence corresponding to the region of interest is at least about 5 nucleotides in length. In some instances, the sequence corresponding to the region of interest is at least about 8 nucleotides in length. In some instances, the sequence corresponding to the region of interest is at least about 10 nucleotides in length. In some instances, the sequence corresponding to the region of interest is at least about 15 nucleotides in length. In some instances, the sequence corresponding to the region of interest is at least about 20 nucleotides in length. In some instances, the sequence corresponding to the region of interest is at least about 50 nucleotides in length. In some instances, the sequence corresponding to the region of interest is at least about 100 nucleotides in length. In some instances, the sequence is about 5 nucleotides to about 1000 nucleotides in length. In some instances, the sequence is about 10 nucleotides to about 1000 nucleotides in length. In some instances, the sequence is about 10 nucleotides to about 500 nucleotides in length. In some instances, the sequence is about 10 nucleotides to about 400 nucleotides in length. In some instances, the sequence is about 10 nucleotides to about 300 nucleotides in length. In some instances, the sequence is about 50 nucleotides to about 1000 nucleotides in length. In some instances, the sequence is about 50 nucleotides to about 500 nucleotides in length.


In some instances, devices, systems and kits disclosed herein comprise at least one of an oligonucleotide probe and oligonucleotide primer that is capable of annealing to a strand of a cell-free nucleic acid, wherein the cell-free nucleic acid comprises a sequence corresponding to a sub-region of interest disclosed herein. In some instances, the sub-region is represented by a sequence that is present in the region of interest more than once. In some instances, the sub-region is about 10 to about 1000 nucleotides in length. In some instances, the sub-region is about 50 to about 500 nucleotides in length. In some instances, the sub-region is about 50 to about 250 nucleotides in length. In some instances, the sub-region is about 50 to about 150 nucleotides in length. In some instances, the sub-region is about 100 nucleotides in length.


In some instances, devices, systems and kits disclosed herein comprise at least one oligonucleotide primer, wherein the oligonucleotide primer has a sequence complementary to or corresponding to a Y chromosome sequence. In some instances, devices, systems and kits disclosed herein comprise a pair of oligonucleotide primers, wherein the pair of oligonucleotide primers have sequences complementary to or corresponding to a Y chromosome sequence. In some instances, devices, systems and kits disclosed herein comprise at least one oligonucleotide primer, wherein the oligonucleotide primer comprises a sequence complementary to or corresponding to a Y chromosome sequence. In some instances, devices, systems and kits disclosed herein comprise a pair of oligonucleotide primers, wherein the pair of oligonucleotide primers comprise sequences complementary to or corresponding to a Y chromosome sequence. In some instances, devices, systems and kits disclosed herein comprise at least one oligonucleotide primer, wherein the oligonucleotide primer consists of a sequence complementary to or corresponding to a Y chromosome sequence. In some instances, devices, systems and kits disclosed herein comprise a pair of oligonucleotide primers, wherein the pair of oligonucleotide primers consists of sequences complementary to or corresponding to a Y chromosome sequence. In some instances, the sequence(s) complementary to or corresponding to a Y chromosome sequence is at least 75% identical to a wild-type human Y chromosome sequence. In some instances, the sequence(s) complementary to or corresponding to a Y chromosome sequence is at least 80% identical to a wild-type human Y chromosome sequence. In some instances, the sequence(s) complementary to or corresponding to a Y chromosome sequence is at least 85% identical to a wild-type human Y chromosome sequence. In some instances, the sequence(s) complementary to or corresponding to a Y chromosome sequence is at least 80% identical to a wild-type human Y chromosome sequence. In some instances, the sequence(s) complementary to or corresponding to a Y chromosome sequence is at least 90% identical to a wild-type human Y chromosome sequence. In some instances, the sequence(s) complementary to or corresponding to a Y chromosome sequence is at least 95% identical to a wild-type human Y chromosome sequence. In some instances, the sequence(s) complementary to or corresponding to a Y chromosome sequence is at least 97% identical to a wild-type human Y chromosome sequence. In some instances, the sequence(s) complementary to or corresponding to a Y chromosome sequence is 100% identical to a wild-type human Y chromosome sequence.


Detecting Single Nucleotide Polymorphisms

In some instances, methods described herein comprise (a) obtaining a biological sample comprising cell-free fetal nucleic acids from a pregnant subject; (b) optionally, enriching the cell-free fetal nucleic acids of that may be in a mixed sample, ex vivo; (c) optionally, amplifying the cell-free nucleic acids, ex vivo; (d) preferentially enriching specific loci in the cell-free fetal nucleic acids, ex vivo; (e) measuring the cell-free nucleic acids, ex vivo to generate genotypic data; and (f) analyzing the genotypic data obtained on a computer, and ex vivo. In some embodiments, the genotype data comprises a chromosomal abnormality. In some embodiments, the genotype data comprises one or more genetic mutations or natural variations in a genome. In some embodiments, the analyzing step (f) is performed by calculating the maximum likelihood technique. In some embodiments, the maximum likelihood techniques uses the allelic distribution associated with each hypothesis to estimate the likelihood of the data conditioned on each hypothesis. In some instances, the cell-free nucleic acids comprise one or more single nucleotide polymorphisms (SNPs) or indels, or a combination thereof.


In some instances, methods disclosed herein employ the following devices, systems and kits.


Devices, Systems, and Kits

Aspects disclosed herein provide devices, systems and kits for obtaining genetic information from a biological sample. As described herein, devices, systems and kits disclosed herein allow a user to collect and test a biological sample at a location of choice to detect the presence and/or quantity of a target analyte in the sample. In some instances, devices, systems and kits disclosed herein are used in the foregoing methods. In some instances, devices, systems and kits disclosed herein comprise a sample purifier that removes at least one component (e.g., cell, cell fragment, protein) from a biological sample of a subject; a nucleic acid sequencer for sequencing at least one nucleic acid in the biological sample; and a nucleic acid sequence output for relaying sequence information to a user of the device, system or kit.


In general, devices, systems, and kits of the present disclosure, integrate multiple functions, e.g., purification, amplification, and detection of the target analyte (e.g., including amplification products thereof), and combinations thereof. In some instances, the multiple functions are carried out within a single assay assembly unit or a single device. In some instances, all of the functions occur outside of the single unit or device. In some instances, at least one of the functions occurs outside of the single unit or device. In some instances, only one of the functions occurs outside of the single unit or device. In some instances, the sample purifier, nucleic acid amplification reagent, oligonucleotide, and detection reagent or component are housed in a single device. In general, devices, systems, and kits of the present disclosure comprise a display, a connection to a display, or a communication to a display for relaying information about the biological sample to one or more people.


In some instances, devices, systems and kits comprise an additional component disclosed herein. Non-limiting examples of an additional component include a sample transportation compartment, a sample storage compartment, a sample and/or reagent receptacle, a temperature indicator, an electronic port, a communication connection, a communication device, a sample collection device, and a housing unit. In some instances, the additional component is integrated with the device. In some instances, the additional component is not integrated with the device. In some instances, the additional component is housed with the sample purifier, nucleic acid amplification reagent, oligonucleotide, and detection reagent or component in a single device. In some instances, the additional component is not housed within the single device.


In some instances, devices, systems and kits disclosed herein comprise components to obtain a sample, extract cell-free nucleic acids, and purify cell-free nucleic acids. In some instances, devices, systems and kits disclosed herein comprise components to obtain a sample, extract cell-free nucleic acids, purify cell-free nucleic acids, and prepare a library of the cell-free nucleic acids. In some instances, devices, systems and kits disclosed herein comprise components to obtain a sample, extract cell-free nucleic acids, purify cell-free nucleic acids, and sequence cell-free nucleic acids. In some instances, devices, systems and kits disclosed herein comprise components to obtain a sample, extract cell-free nucleic acids, purify cell-free nucleic acids, prepare a library of the cell-free nucleic acids, and sequence the cell-free nucleic acids. By way of non-limiting example, components for obtaining a sample are a transdermal puncture device and a filter for obtaining plasma from blood. Also, by way of non-limiting example, components for extracting and purifying cell-free nucleic acids comprise buffers, beads and magnets. Buffers, beads and magnets may be supplied at volumes appropriate for receiving a general sample volume from a finger prick (e.g., 50-150 μl of blood).


In some instances, devices, systems and kits comprise a receptacle for receiving the biological sample. The receptacle may be configured to hold a volume of a biological sample between 1 μl and 1 ml. The receptacle may be configured to hold a volume of a biological sample between 1 μl and 500 μl. The receptacle may be configured to hold a volume of a biological sample between 1 μl and 200 μl. The receptacle may have a defined volume that is the same as a suitable volume of sample for processing and analysis by the rest of the device/system components. This would preclude the need for a user of the device, system or kit to measure out a specified volume of the sample. The user would only need to fill the receptacle and thereby be assured that the appropriate volume of sample had been delivered to the device/system. In some instances, devices, systems and kits do not comprise a receptacle for receiving the biological sample. In some instances, the sample purifier receives the biological sample directly. Similar to the description above for the receptacle, the sample purifier may have a defined volume that is suitable for processing and analysis by the rest of the device/system components. In general, devices, systems, and kits disclosed herein are intended to be used entirely at point of care. However, in some instances, the user may want to preserve or send the analyzed sample to another location (e.g., lab, clinic) for additional analysis or confirmation of results obtained at point of care. By way of non-limiting example, the device/system may separate plasma from blood. The plasma may be analyzed at point of care and the cells from the blood shipped to another location for analysis. In some instances, devices, systems and kits comprise a transport compartment or storage compartment for these purposes. The transport compartment or storage compartment may be capable of containing a biological sample, a component thereof, or a portion thereof. The transport compartment or storage compartment may be capable of containing the biological sample, portion thereof, or component thereof, during transit to a site remote to the immediate user. The transport compartment or storage compartment may be capable of containing cells that are removed from a biological sample, so that the cells can be sent to a site remote to the immediate user for testing. Non-limiting examples of a site remote to the immediate user may be a laboratory or a clinic when the immediate user is at home. In some instances, the home does not have a machine or additional device to perform an additional analysis of the biological sample. The transport compartment or storage compartment may be capable of containing a product of a reaction or process that result from adding the biological sample to the device. In some instances, the product of the reaction or process is a nucleic acid amplification product or a reverse transcription product. In some instances, the product of the reaction or process is a biological sample component bound to a binding moiety described herein. The biological sample component may comprise a nucleic acid, a cell fragment, an extracellular vesicle, a protein, a peptide, a sterol, a lipid, a vitamin, or glucose, any of which may be analyzed at a remote location to the user. In some instances, the transport compartment or storage compartment comprises an absorption pad, a paper, a glass container, a plastic container, a polymer matrix, a liquid solution, a gel, a preservative, or a combination thereof. An absorption pad or a paper may be useful for stabilizing and transporting a dried biological fluid with a protein or other biomarker for screening.


In some instances, devices and systems disclosed herein provide for analysis of cell-free nucleic acids (e.g., circulating RNA and/or DNA) and non-nucleic acid components of a sample. Analysis of both cell-free nucleic acids and non-nucleic acid components may both occur at a point of need. In some instances, systems and devices provide an analysis of cell-free nucleic acids at a point of need and preservation of at least a portion or component of the sample for analysis of non-nucleic acid components at a site remote from the point of need. In some instances, systems and devices provide an analysis of non-nucleic acid components at a point of need and preservation of at least a portion or component of the sample for analysis of cell-free nucleic acids at a site remote from the point of need. These devices and systems may be useful for carrier testing and detecting inherited diseases, such as those disclosed herein.


In some instances, the transport compartment or storage compartment comprises a preservative. The preservative may also be referred to herein as a stabilizer or biological stabilizer. In some instances, the device, system or kit comprises a preservative that reduces enzymatic activity during storage and/or transportation. In some instances, the preservative is a whole blood preservative. Non-limiting examples of whole blood preservatives, or components thereof, are glucose, adenine, citric acid, trisodium citrate, dextrose, sodium di-phosphate, and monobasic sodium phosphate. In some instances, the preservative comprises EDTA. EDTA may reduce enzymatic activity that would otherwise degrade nucleic acids. In some instances, the preservative comprises formaldehyde. In some instances, the preservative is a known derivative of formaldehyde. Formaldehyde, or a derivative thereof, may cross link proteins and therefore stabilize cells and prevent cell lysis.


Generally, devices and systems disclosed herein are portable for a single person. In some instances, devices and systems are handheld. In some instances, devices and systems have a maximum length, maximum width or maximum height. In some instances, devices and systems are housed in a single unit having a maximum length, maximum width or maximum height. In some instances the maximum length is not greater than 12 inches. In some instances the maximum length is not greater than 10 inches. In some instances the maximum length is not greater than 8 inches. In some instances the maximum length is not greater than 6 inches. In some instances the maximum width is not greater than 12 inches. In some instances the maximum width is not greater than 10 inches. In some instances the maximum width is not greater than 8 inches. In some instances the maximum width is not greater than 6 inches. In some instances the maximum width is not greater than 4 inches. In some instances the maximum height is not greater than 12 inches. In some instances the maximum height is not greater than 10 inches. In some instances the maximum height is not greater than 8 inches. In some instances the maximum height is not greater than 6 inches. In some instances the maximum height is not greater than 4 inches. In some instances the maximum height is not greater than 2 inches. In some instances the maximum height is not greater than 1 inch.


Sample Collection

In some instances, devices, systems and kits disclosed herein comprise a sample collector. In some instances, the sample collector is provided separately from the rest of the device, system or kit. In some instances, the sample collector is physically integrated with the device, system or kit, or a component thereof. In some instances, the sample collector is integrated with a receptacle described herein. In some instances, the sample collector may be a cup, tube, capillary, or well for applying the biological fluid. In some instances, the sample collector may be a cup for applying urine. In some instances, the sample collector may comprise a pipet for applying urine in the cup to the device, system or kit. In some instances, the sample collector may be a capillary integrated with a device disclosed herein for applying blood. In some instances, the sample collector may be tube, well, pad or paper integrated with a device disclosed herein for applying saliva. In some instances, the sample collector may be pad or paper for applying sweat.


In some instances, devices, systems and kits disclosed herein comprise a transdermal puncture device. Non-limiting examples of transdermal puncture devices are needles and lancets. In some instances, the sample collector comprises the transdermal puncture device. In some instances, devices, systems and kits disclosed herein comprise a microneedle, microneedle array or microneedle patch. In some instances, devices, systems and kits disclosed herein comprise a hollow microneedle. By way of non-limiting example, the transdermal puncture device is integrated with a well or capillary so that as the subject punctures their finger, blood is released into the well or capillary where it will be available to the system or device for analysis of its components. In some instances, the transdermal puncture device is a push button device with a needle or lancet in a concave surface. In some instances, the needle is a microneedle. In some instances, the transdermal puncture device comprises an array of microneedles. By pressing an actuator, button or location on the non-needle side of the concave surface, the needle punctures the skin of the subject in a more controlled manner than a lancet. Furthermore, the push button device may comprise a vacuum source or plunger to help draw blood from the puncture site.


In some instances, devices, systems and kits disclosed herein comprise a device that does not require transdermal puncture, for e.g., lysing the tight junctions of the skin such that fluid containing the reliable genetic information.


Sample Processor and Purifier

Disclosed herein are devices, systems and kits that comprise a sample processor, wherein the sample processor modifies a biological sample to remove a component of the sample or separate the sample into multiple fractions (e.g., blood cell fraction and plasma or serum). The sample processor may comprise a sample purifier, wherein the sample purifier is configured to remove an unwanted substance or non-target component of a biological sample, thereby modifying the sample. Depending on the source of the biological sample, unwanted substances can include, but are not limited to, proteins (e.g., antibodies, hormones, enzymes, serum albumin, lipoproteins), free amino acids and other metabolites, microvesicles, nucleic acids, lipids, electrolytes, urea, urobilin, pharmaceutical drugs, mucous, bacteria, and other microorganisms, and combinations thereof. In some instances, the sample purifier separates components of a biological sample disclosed herein. In some instances, sample purifiers disclosed herein remove components of a sample that would inhibit, interfere with or otherwise be detrimental to the later process steps such as nucleic acid amplification or detection. In some instances, the resulting modified sample is enriched for target analytes. This can be considered indirect enrichment of target analytes. Alternatively or additionally, target analytes may be captured directly, which is considered direct enrichment of target analytes.


In some instances, the biological sample comprises fetal trophoblasts, that in some cases, contain the genetic information of a fetus (e.g., RNA, DNA). In some instances, the sample processor is configured to enrich the fetal trophoblast in the biological sample, such as by morphology (e.g., size) or marker antigens (e.g., cell surface antigens). In some cases, the sample processor is configured to enrich the trophoblasts using isolation by size of epithelial tumor cells (ISET) method. In some cases, the sample processor is configured to enrich the trophoblasts in the biological sample by contacting the biological sample with an antibody or antigen-binding fragment specific to a cell-surface antigen of a trophoblast. Non-limiting examples of trophoblast cell-surface antigens include tropomyosin-1 (Tropl), tropomyosin-2 (Trop2), cyto and syncytio-trophoblast marker, GB25, human placental lactogen (HPL), and alpha human chorionic gonadotrophin (alpha HCG). The sample purifier, in some cases, is configured to purify or isolate fetal trophoblasts from the biological sample using fluoresce-activated cell sorting (FACS), column chromatography, or magnetic sorting (e.g., Dynabeads). The fetal genomic DNA is extracted from the enriched and/or purified trophoblasts by the sample purifier, using any suitable DNA extraction methods such as those described herein.


In some instances, the sample processor is configured to process the fetal trophoblasts by (1) isolating the trophoblasts from the biological sample; (2) lysing the isolated trophoblasts; (3) lysing the isolated fetal trophoblasts; and (4) purifying the genomic DNA from the isolated fetal trophoblasts. In some instances, fetal nuclei are treated with a DNAase prior to lysing or isolation. In a non-limiting example, the biological sample contain fetal and maternal cells (e.g., trophoblasts) are centrifuged and resuspended in media. Next, the cells are mechanically separated using a magnetic separation procedure (e.g., magnetic nanoparticles conjugated to a cell surface antigen-specific monoclonal antibody). Cells are washed and suspended in media. Maternal cells (e.g., cell-surface antigen negative) are separated from magnetized (cell-surface antigen positive) fetal trophoblast cells using a DynaMag™ Spin magnet (Life Technologies). The fetal trophoblast cells are washed multiple times using a magnet to remove residual maternal cells. The isolated fetal trophoblast cells are resuspended in a solution. Isolated fetal trophoblast cells are lysed by addition of a lysing buffer, followed by centrifugation at low speed to pellet intact fetal trophoblast cell nuclei. The supernatant is removed and the nuclei are washed multiple times. Genomic DNA is extracted from the fetal trophoblast cell nuclei by addition of 25 microliters of 3× concentrated DNA extraction buffer to the fetal trophoblast cell nuclei, and incubated for about 3 hours. Optionally the DNA is still further purified, for example using commercial DNA purification and concentration kits.


In some instances, the sample purifier comprises a separation material for removing unwanted substances other than patient cells from the biological sample. Useful separation materials may include specific binding moieties that bind to or associate with the substance. Binding can be covalent or noncovalent. Any suitable binding moiety known in the art for removing a particular substance can be used. For example, antibodies and fragments thereof are commonly used for protein removal from samples. In some instances, a sample purifier disclosed herein comprises a binding moiety that binds a nucleic acid, protein, cell surface marker, or microvesicle surface marker in the biological sample. In some instances, the binding moiety comprises an antibody, antigen binding antibody fragment, a ligand, a receptor, a peptide, a small molecule, or a combination thereof.


In some instances, sample purifiers disclosed herein comprise a filter. In some instances, sample purifiers disclosed herein comprise a membrane. Generally the filter or membrane is capable of separating or removing cells, cell particles, cell fragments, blood components other than cell-free nucleic acids, or a combination thereof, from the biological samples disclosed herein.


In some instances, the sample purifier facilitates separation of plasma or serum from cellular components of a blood sample. In some instances, the sample purifier facilitates separation of plasma or serum from cellular components of a blood sample before starting a molecular amplification reaction or a sequencing reaction. Plasma or serum separation can be achieved by several different methods such as centrifugation, sedimentation or filtration. In some instances, the sample purifier comprises a filter matrix for receiving whole blood, the filter matrix having a pore size that is prohibitive for cells to pass through, while plasma or serum can pass through the filter matrix uninhibited. In some instances, the filter matrix combines a large pore size at the top with a small pore size at the bottom of the filter, which leads to very gentle treatment of the cells preventing cell degradation or lysis, during the filtration process. This is advantageous because cell degradation or lysis would result in release of nucleic acids from blood cells or maternal cells that would contaminate target cell-free nucleic acids. Non-limiting examples of such filters include Pall Vivid™ GR membrane, Munktell Ahlstrom filter paper (see, e.g., WO2017017314), TeraPore filters.


In some instances devices, systems, and kits disclosed herein employ vertical filtration, driven by capillary force to separate a component or fraction from a sample (e.g., plasma from blood). By way of non-limiting example, vertical filtration may comprise gravitation assisted plasma separation. A high-efficiency superhydrophobic plasma separator is described, e.g., by Liu et al., A High Efficiency Superhydrophobic Plasma Separation, Lab Chip 2015.


The sample purifier may comprise a lateral filter (e.g., sample does not move in a gravitational direction or the sample moves perpendicular to a gravitational direction). The sample purifier may comprise a vertical filter (e.g., sample moves in a gravitational direction). The sample purifier may comprise vertical filter and a lateral filter. The sample purifier may be configured to receive a sample or portion thereof with a vertical filter, followed by a lateral filter. The sample purifier may be configured to receive a sample or portion thereof with a lateral filter, followed by a vertical filter. In some instances, a vertical filter comprises a filter matrix. In some instances, the filter matrix of the vertical filter comprises a pore with a pore size that is prohibitive for cells to pass through, while plasma can pass the filter matrix uninhibited. In some instances, the filter matrix comprises a membrane that is especially suited for this application because it combines a large pore size at the top with a small pore size at the bottom of the filter, which leads to very gentle treatment of the cells preventing cell degradation during the filtration process.


In some instances, the sample purifier comprises an appropriate separation material, e.g., a filter or membrane, that removes unwanted substances from a biological sample without removing cell-free nucleic acids. In some instances, the separation material separates substances in the biological sample based on size, for example, the separation material has a pore size that excludes a cell but is permeable to cell-free nucleic acids. Therefore, when the biological sample is blood, the plasma or serum can move more rapidly than a blood cell through the separation material in the sample purifier, and the plasma or serum containing any cell-free nucleic acids permeates the holes of the separation material. In some instances, the biological sample is blood, and the cell that is slowed and/or trapped in the separation material is a red blood cell, a white blood cell, or a platelet. In some instances, the cell is from a tissue that contacted the biological sample in the body, including, but not limited to, a bladder or urinary tract epithelial cell (in urine), or a buccal cell (in saliva). In some instances, the cell is a bacterium or other microorganism.


In some instances, the sample purifier is capable of slowing and/or trapping a cell without damaging the cell, thereby avoiding the release of cell contents including cellular nucleic acids and other proteins or cell fragments that could interfere with subsequent evaluation of the cell-free nucleic acids. This can be accomplished, for example, by a gradual, progressive reduction in pore size along the path of a lateral flow strip or other suitable assay format, to allow gentle slowing of cell movement, and thereby minimize the force on the cell. In some instances, at least 95%, at least 98%, at least 99%, or up to 100% of the cells in a biological sample remain intact when trapped in the separation material. In addition to or independently of size separation, the separation material can trap or separate unwanted substances based on a cell property other than size, for example, the separation material can comprise a binding moiety that binds to a cell surface marker. In some instances, the binding moiety is an antibody or antigen binding antibody fragment. In some instances, the binding moiety is a ligand or receptor binding protein for a receptor on a blood cell or microvesicle.


In some instances, systems and devices disclosed herein comprise a separation material that moves, draws, pushes, or pulls the biological sample through the sample purifier, filter and/or membrane. In some instances, the material is a wicking material. Examples of appropriate separation materials used in the sample purifier to remove cells include, but are not limited to, polyvinylidene difluoride, polytetrafluoroethylene, acetylcellulose, nitrocellulose, polycarbonate, polyethylene terephthalate, polyethylene, polypropylene, glass fiber, borosilicate, vinyl chloride, silver. Suitable separation materials may be characterized as preventing passage of cells. In some instances, the separation material is not limited as long as it has a property that can prevent passage of the red blood cells. In some instances, the separation material is a hydrophobic filter, for example a glass fiber filter, a composite filter, for example Cytosep (e.g., Ahlstrom Filtration or Pall Specialty Materials, Port Washington, N.Y.), or a hydrophilic filter, for example cellulose (e.g., Pall Specialty Materials). In some instances, whole blood can be fractionated into red blood cells, white blood cells and serum components for further processing according to the methods of the present disclosure using a commercially available kit (e.g., Arrayit Blood Card Serum Isolation Kit, Cat. ABCS, Arrayit Corporation, Sunnyvale, Calif.).


In some instances the sample purifier comprises at least one filter or at least one membrane characterized by at least one pore size. In some instances, the sample purifier comprises multiple filters and/or membranes, wherein the pore size of at least a first filter or membrane differs from a second filter or membrane. In some instances, at least one pore size of at least one filter/membrane is about 0.05 microns to about 10 microns. In some instances, the pore size is about 0.05 microns to about 8 microns. In some instances, the pore size is about 0.05 microns to about 6 microns. In some instances, the pore size is about 0.05 microns to about 4 microns. In some instances, the pore size is about 0.05 microns to about 2 microns. In some instances, the pore size is about 0.05 microns to about 1 micron. In some instances, at least one pore size of at least one filter/membrane is about 0.1 microns to about 10 microns. In some instances, the pore size is about 0.1 microns to about 8 microns. In some instances, the pore size is about 0.1 microns to about 6 microns. In some instances, the pore size is about 0.1 microns to about 4 microns. In some instances, the pore size is about 0.1 microns to about 2 microns. In some instances, the pore size is about 0.1 microns to about 1 micron.


In some instances, the sample purifier is characterized as a gentle sample purifier. Gentle sample purifiers, such as those comprising a filter matrix, a vertical filter, a wicking material, or a membrane with pores that do not allow passage of cells, are particularly useful for analyzing cell-free nucleic acids. For example, prenatal applications of cell-free fetal nucleic acids in maternal blood are presented with the additional challenge of analyzing cell-free fetal nucleic acids in the presence of cell-free maternal nucleic acids, the latter of which create a large background signal to the former. By way of non-limiting example, a sample of maternal blood may contain about 500 to 750 genome equivalents of total cell-free DNA (maternal and fetal) per milliliter of whole blood when the sample is obtained without cell lysis or other cell disruption caused by the sample collection method. The fetal fraction in blood sampled from pregnant women may be around 10%, about 50 to 75 genome equivalents per ml. The process of obtaining cell-free nucleic acids usually involves obtaining plasma from the blood. If not performed carefully, maternal white blood cells may be destroyed, releasing additional cellular nucleic acids into the sample, creating a lot of background noise to the fetal cell-free nucleic acids. The typical white cell count is around 4*10{circumflex over ( )}6 to 10*10{circumflex over ( )}6 cells per ml of blood and therefore the available nuclear DNA is around 4,000 to 10,000 times higher than the overall cell-free DNA (cfDNA). Consequently, even if only a small fraction of maternal white blood cells is destroyed, releasing nuclear DNA into the plasma, the fetal fraction is reduced dramatically. For example, a white cell degradation of 0.01% may reduce the fetal fraction from 10% to about 5%. Devices, systems, and kits disclosed herein aim to reduce these background signals.


In some instances, the sample processor is configured to separate blood cells from whole blood. In some instances, the sample processor is configured to isolate plasma from whole blood. In some instances, the sample processor is configured to isolate serum from whole blood. In some instances, the sample processor is configured to isolate plasma or serum from less than 1 milliliter of whole blood. In some instances, the sample processor is configured to isolate plasma or serum from less than 1 milliliter of whole blood. In some instances, the sample processor is configured to isolate plasma or serum from less than 500 μL of whole blood. In some instances, the sample processor is configured to isolate plasma or serum from less than 4004 of whole blood. In some instances, the sample processor is configured to isolate plasma or serum from less than 300 μL of whole blood. In some instances, the sample processor is configured to isolate plasma or serum from less than 200 μL of whole blood. In some instances, the sample processor is configured to isolate plasma or serum from less than 150 μL of whole blood. In some instances, the sample processor is configured to isolate plasma or serum from less than 100 μL of whole blood.


In some instances, devices, systems and kits disclosed herein comprise a binding moiety for producing a modified sample depleted of cells, cell fragments, nucleic acids or proteins that are unwanted or of no interest. In some instances, devices, systems and kits disclosed herein comprise a binding moiety for reducing cells, cell fragments, nucleic acids or proteins that are unwanted or of no interest, in a biological sample. In some instances, devices, systems and kits disclosed herein comprise a binding moiety for producing a modified sample enriched with target cell, target cell fragments, target nucleic acids or target proteins.


In some instances, devices, systems and kits disclosed herein comprise a binding moiety capable of binding a nucleic acid, a protein, a peptide, a cell surface marker, or microvesicle surface marker. In some instances, devices, systems and kits disclosed herein comprise a binding moiety for capturing an extracellular vesicle or extracellular microparticle in the biological sample. In some instances, the extracellular vesicle contains at least one of DNA and RNA. In some instances, devices, systems and kits disclosed herein comprise reagents or components for analyzing DNA or RNA contained in the extracellular vesicle. In some instances, the binding moiety comprises an antibody, antigen binding antibody fragment, a ligand, a receptor, a protein, a peptide, a small molecule, or a combination thereof.


In some instances, devices, systems and kits disclosed herein comprise a binding moiety capable of interacting with or capturing an extracellular vesicle that is released from a cell. In some instances, the cell is a fetal cell. In some instances, the cell is a placental cell. The fetal cell or the placental cell may be circulating in a biological fluid (e.g., blood) of a female pregnant subject. In some instances, the extracellular vesicle is released from an organ, gland or tissue. By way of non-limiting example, the organ, gland or tissue may be diseased, aging, infected, or growing. Non-limiting examples of organs, glands and tissues are brain, liver, heart, kidney, colon, pancreas, muscle, adipose, thyroid, prostate, breast tissue, and bone marrow.


By way of non-limiting example, devices, systems and kits disclosed herein may be capable of capturing and discarding an extracellular vesicle or extracellular microparticle from a maternal sample to enrich the sample for fetal/placental nucleic acids. In some instances, the extracellular vesicle is fetal/placental in origin. In some instances, the extracellular vesicle originates from a fetal cell. In some instances, the extracellular vesicle is released by a fetal cell. In some instances, the extracellular vesicle is released by a placental cell. The placental cell may be a trophoblast cell. In some instances, devices, systems and kits disclosed herein comprise a cell-binding moiety for capturing placenta educated platelets, which may contain fetal DNA or RNA fragments. These can be captured/enriched for with antibodies or other methods (low speed centrifugation). In such instances, the fetal DNA or RNA fragments may be analyzed as described herein to detect or indicate chromosomal information (e.g., gender). Alternatively or additionally, devices, systems and kits disclosed herein comprise a binding moiety for capturing an extracellular vesicle or extracellular microparticle in the biological sample that comes from a maternal cell.


In some instances, the binding moiety is attached to a solid support, wherein the solid support can be separated from the rest of the biological sample or the biological sample can be separated from the solid support, after the binding moiety has made contact with the biological sample. Non-limiting examples of solid supports include a bead, a nanoparticle, a magnetic particle, a chip, a microchip, a fibrous strip, a polymer strip, a membrane, a matrix, a column, a plate, or a combination thereof.


Devices, systems and kits disclosed herein may comprise a cell lysis reagent. Non-limiting examples of cell lysis reagents include detergents such as NP-40, sodium dodecyl sulfate, and salt solutions comprising ammonium, chloride, or potassium. Devices, systems and kits disclosed herein may have a cell lysis component. The cell lysis component may be structural or mechanical and capable of lysing a cell. By way of non-limiting example, the cell lysis component may shear the cells to release intracellular components such as nucleic acids. In some instances, devices, systems and kits disclosed herein do not comprise a cell lysis reagent. Some devices, systems and kits disclosed herein are intended to analyze cell-free nucleic acids.


Nucleic Acid Amplification

Generally, devices, systems and kits disclosed herein are capable of amplifying a nucleic acid. Often devices, systems and kits disclosed herein comprise a DNA polymerase. In some instances, the devices, systems and kits disclosed herein comprise a reverse transcriptase enzyme to produce complementary DNA (cDNA) from RNA in biological samples disclosed herein, wherein the cDNA can be amplified and/or analyzed similarly to genomic DNA as described herein. Devices, systems and kits disclosed herein also often contain a crowding agent which can increase the efficiency enzymes like DNA polymerases and helicases. Crowding agents may increase an efficiency of a library, as described elsewhere herein. The crowding agent may comprise a polymer, a protein, a polysaccharide, or a combination thereof. Non-limiting examples of crowding agents that may be used in devices, systems and kits disclosed herein are dextran, poly(ethylene glycol) and dextran.


A traditional polymerase chain reaction requires thermocycling. This would be possible, but inconvenient for a typical at-home user without a thermocycler machine. In some instances, devices, systems and kits disclosed herein are capable of amplifying a nucleic acid without changing the temperature of the device or system or a component thereof. In some instances, devices, systems and kits disclosed herein are capable of amplifying a nucleic acid isothermally. Non-limiting examples of isothermal amplification are as follows: loop-mediated isothermal amplification (LAMP), strand displacement amplification (SDA), helicase dependent amplification (HDA), nicking enzyme amplification reaction (NEAR), and recombinase polymerase amplification (RPA). Thus, devices, systems and kits disclosed herein may comprise reagents necessary to carry out an isothermal amplification. Non-limiting examples of isothermal amplification reagents include recombinase polymerases, single-strand DNA-binding proteins, and strand-displacing polymerases. Generally, isothermal amplification using recombinase polymerase amplification (RPA) employs three core enzymes, recombinase, single-strand DNA-binding protein, and strand-displacing polymerase, to (1) pair oligonucleotide primers with homologous sequence in DNA, (2) stabilize displaced DNA strands to prevent primer displacement, and (3) extend the oligonucleotide primer using a strand displacing DNA polymerase. Using paired oligonucleotide primers, exponential DNA amplification can take place with incubation at room temperature (optimal at 37° C.).


In some instances, devices, systems and kits disclosed herein are capable of amplifying a nucleic acid at a temperature. In some instances, devices, systems and kits disclosed herein are capable of amplifying a nucleic acid at not more than two temperatures. In some instances, devices, systems and kits disclosed herein are capable of amplifying a nucleic acid at not more than three temperatures. In some instances, devices, systems and kits disclosed herein only require initially heating one reagent or component of the device, system or kit.


In some instances, devices, systems and kits disclosed herein are capable of amplifying a nucleic acid at a range of temperatures. In some instances, the range of temperatures is about −50° C. to about 100° C. In some instances, the range of temperatures is about −50° C. to about 90° C. In some instances, the range of temperatures is about −50° C. to about 80° C. In some instances, the range of temperatures is about is about −50° C. to about 70° C. In some instances, the range of temperatures is about −50° C. to about 60° C. In some instances, the range of temperatures is about −50° C. to about 50° C. In some instances, the range of temperatures is about −50° C. to about 40° C. In some instances, the range of temperatures is about −50° C. to about 30° C. In some instances, the range of temperatures is about −50° C. to about 20° C. In some instances, the range of temperatures is about −50° C. to about 10° C. In some instances, the range of temperatures is about 0° C. to about 100° C. In some instances, the range of temperatures is about 0° C. to about 90° C. In some instances, the range of temperatures is about 0° C. to about 80° C. In some instances, the range of temperatures is about is about 0° C. to about 70° C. In some instances, the range of temperatures is about 0° C. to about 60° C. In some instances, the range of temperatures is about 0° C. to about 50° C. In some instances, the range of temperatures is about 0° C. to about 40° C. In some instances, the range of temperatures is about 0° C. to about 30° C. In some instances, the range of temperatures is about 0° C. to about 20° C. In some instances, the range of temperatures is about 0° C. to about 10° C. In some instances, the range of temperatures is about 15° C. to about 100° C. In some instances, the range of temperatures is about 15° C. to about 90° C. In some instances, the range of temperatures is about 15° C. to about 80° C. In some instances, the range of temperatures is about is about 15° C. to about 70° C. In some instances, the range of temperatures is about 15° C. to about 60° C. In some instances, the range of temperatures is about 15° C. to about 50° C. In some instances, the range of temperatures is about 15° C. to about 40° C. In some instances, the range of temperatures is about 15° C. to about 30° C. In some instances, the range of temperatures is about 10° C. to about 30° C. In some instances, devices, systems, kits disclosed herein, including all components thereof, and all reagents thereof, are completely operable at room temperature, not requiring cooling, freezing or heating.


In some instances, at least a portion of the devices, systems and kits disclosed herein operate at about 20° C. to about 50° C. In some instances, at least a portion of the devices, systems, and kits disclosed herein operate at about 37° C. In some instances, at least a portion of the devices, systems and kits disclosed herein operate at about 42° C. In some instances, the devices, systems and kits disclosed herein are advantageously operated at room temperature. In some instances, at least a portion of the devices, systems and kits disclosed herein are capable of amplifying a nucleic acid isothermally at about 20° C. to about 30° C. In some instances, at least a portion of the devices, systems and kits disclosed herein are capable of amplifying a nucleic acid isothermally at about 23° C. to about 27° C.


In some instances, devices, systems, kits, and methods disclosed herein comprise a hybridization probe with an abasic site, a fluorophore and quencher to monitor amplification. Exonuclease III may be included to cleave the abasic site and release the quencher to allow fluorescent excitation. In some instances, amplification products are detected or monitored via lateral flow by attaching a capture molecule (e.g. Biotin) to one of the amplification primers and labeling a hybridization primer with a 5′-antigenic molecule (e.g. fluorescein derivative FAM) for capture to allow for detection. As such, in some instances, devices, systems, kits, and methods disclosed herein provide for detection of nucleic acids and amplification products on a lateral flow device. Lateral flow devices are described herein.


In some instances, devices, systems and kits disclosed herein comprise at least one nucleic acid amplification reagent and at least one oligonucleotide primer capable of amplifying a first sequence in a genome and a second sequence in a genome, wherein the first sequence and the second sequence are similar, and wherein the first sequence is physically distant enough from the second sequence such that the first sequence is present on a first cell-free nucleic acid of the subject and the second sequence is present on a second cell-free nucleic acid of the subject. In some instances, the at least two sequences are immediately adjacent. In some instances the at least two sequences are separated by at least one nucleotide. In some instances, the at least two sequences are separated by at least two nucleotides. In some instances, the at least two sequences are separated by at least about 5, at least about 10, at least about 15, at least about 20, at least about 30, at least about 40, at least about 50, or at least about 100 nucleotides. In some instances, the at least two sequences are at least about 50% identical. In some instances, the at least two sequences are at least about 60% identical, at least about 60% identical, at least about 60%, at least about 70%, at least about 80%, at least about 90%, at least about 95%, at least about 99%, or 100% identical. In some instances, the first sequence and the second sequence are each at least 10 nucleotides in length. In some instances, the first sequence and the second sequence are each at least about 10, at least about 15, at least about 20, at least about 30, at least about 50, or at least about 100 nucleotides in length. In some instances, the first sequence and the second sequence are on the same chromosome. In some instances, the first sequence is on a first chromosome and the second sequence is on a second chromosome. In some instances, the first sequence and the second sequence are in functional linkage. For example, all CpG sites in the promotor region of gene AOX1 show the same hypermethylation in prostate cancer, so these sites are in functional linkage because they functionally carry the same information but are located one or more nucleotides apart.


In some instances, devices, systems and kits disclosed herein comprise at least one of an oligonucleotide probe or oligonucleotide primer that is capable of annealing to a strand of a cell-free nucleic acid, wherein the cell-free nucleic acid comprises a sequence corresponding to a region of interest or a portion thereof. In some instances, the region of interest is a region of a Y chromosome. In some instances, the region of interest is a region of an X chromosome. In some instances, the region of interest is a region of an autosome. In some instances, the region of interest, or portion thereof, comprises a repeat sequence as described herein that is present in a genome more than once. In some instances, the region of interest is about 10 nucleotides to about 1,000,000 nucleotides in length. In some instances, the region of interest is at least 10 nucleotides in length. In some instances, the region of interest is at least 100 nucleotides in length. In some instances, the region is at least 1000 nucleotides in length. In some instances, the region of interest is about 10 nucleotides to about 500,000 nucleotides in length. In some instances, the region of interest is about 10 nucleotides to about 300,000 nucleotides in length. In some instances, the region of interest is about 100 nucleotides to about 1,000,000 nucleotides in length. In some instances, the region of interest is about 100 nucleotides to about 500,000 nucleotides in length. In some instances, the region of interest is about 100 nucleotides to about 300,000 base pairs in length. In some instances, the region of interest is about 1000 nucleotides to about 1,000,000 nucleotides in length. In some instances, the region of interest is about 1000 nucleotides to about 500,000 nucleotides in length. In some instances, the region of interest is about 1000 nucleotides to about 300,000 nucleotides in length. In some instances, the region of interest is about 10,000 nucleotides to about 1,000,000 nucleotides in length. In some instances, the region of interest is about 10,000 nucleotides to about 500,000 nucleotides in length. In some instances, the region of interest is about 10,000 nucleotides to about 300,000 nucleotides in length. In some instances, the region of interest is about 300,000 nucleotides in length.


In some instances, the sequence corresponding to the region of interest is at least about 5 nucleotides in length. In some instances, the sequence corresponding to the region of interest is at least about 8 nucleotides in length. In some instances, the sequence corresponding to the region of interest is at least about 10 nucleotides in length. In some instances, the sequence corresponding to the region of interest is at least about 15 nucleotides in length. In some instances, the sequence corresponding to the region of interest is at least about 20 nucleotides in length. In some instances, the sequence corresponding to the region of interest is at least about 50 nucleotides in length. In some instances, the sequence corresponding to the region of interest is at least about 100 nucleotides in length. In some instances, the sequence is about 5 nucleotides to about 1000 nucleotides in length. In some instances, the sequence is about 10 nucleotides to about 1000 nucleotides in length. In some instances, the sequence is about 10 nucleotides to about 500 nucleotides in length. In some instances, the sequence is about 10 nucleotides to about 400 nucleotides in length. In some instances, the sequence is about 10 nucleotides to about 300 nucleotides in length. In some instances, the sequence is about 50 nucleotides to about 1000 nucleotides in length. In some instances, the sequence is about 50 nucleotides to about 500 nucleotides in length.


In some instances, devices, systems and kits disclosed herein comprise at least one of an oligonucleotide probe and oligonucleotide primer that is capable of annealing to a strand of a cell-free nucleic acid, wherein the cell-free nucleic acid comprises a sequence corresponding to a sub-region of interest disclosed herein. In some instances, the sub-region is represented by a sequence that is present in the region of interest more than once. In some instances, the sub-region is about 10 to about 1000 nucleotides in length. In some instances, the sub-region is about 50 to about 500 nucleotides in length. In some instances, the sub-region is about 50 to about 250 nucleotides in length. In some instances, the sub-region is about 50 to about 150 nucleotides in length. In some instances, the sub-region is about 100 nucleotides in length.


Any appropriate nucleic acid amplification method known in the art is contemplated for use in the devices and methods described herein. In some instances, isothermal amplification is used. In some instances, amplification is isothermal with the exception of an initial heating step before isothermal amplification begins. A number of isothermal amplification methods, each having different considerations and providing different advantages, are known in the art and have been discussed in the literature, e.g., by Zanoli and Spoto, 2013, “Isothermal Amplification Methods for the Detection of Nucleic Acids in Microfluidic Devices,” Biosensors 3: 18-43, and Fakruddin, et al., 2013, “Alternative Methods of Polymerase Chain Reaction (PCR),” Journal of Pharmacy and Bioallied Sciences 5(4): 245-252, each incorporated herein by reference in its entirety. In some instances, any appropriate isothermic amplification method is used. In some instances, the isothermic amplification method used is selected from: Loop Mediated Isothermal Amplification (LAMP); Nucleic Acid Sequence Based Amplification (NASBA); Multiple Displacement Amplification (MDA); Rolling Circle Amplification (RCA); Helicase Dependent Amplification (HDA); Strand Displacement Amplification (SDA); Nicking Enzyme Amplification Reaction (NEAR); Ramification Amplification Method (RAM); and Recombinase Polymerase Amplification (RPA).


In some instances, the amplification method used is LAMP (see, e.g., Notomi, et al., 2000, “Loop Mediated Isothermal Amplification” NAR 28(12): e63 i-vii, and U.S. Pat. No. 6,410,278, “Process for synthesizing nucleic acid” each incorporated by reference herein in its entirety). LAMP is a one-step amplification system using auto-cycling strand displacement deoxyribonucleic acid (DNA) synthesis. In some instances, LAMP is carried out at 60-65° C. for 45-60 min in the presence of a thermostable polymerase, e.g., Bacillus stearothermophilus (Bst) DNA polymerase I, deoxyribonucleotide triphosphate (dNTPs), specific primers and the target DNA template. In some instances, the template is RNA and a polymerase having both reverse transcriptase activity and strand displacement-type DNA polymerase activity, e.g., Bca DNA polymerase, is used, or a polymerase having reverse transcriptase activity is used for the reverse transcriptase step and a polymerase not having reverse transcriptase activity is used for the strand displacement-DNA synthesis step.


In some instances, the amplification reaction is carried out using LAMP, at temperature or a range in temperatures described herein. In some instances, the amplification reaction is carried out using LAMP, for an amount of time or a range in times described herein.


In some instances, the amplification method is Nucleic Acid Sequence Based Amplification (NASBA). NASBA (also known as 3SR, and transcription-mediated amplification) is an isothermal transcription-based RNA amplification system. Three enzymes (avian myeloblastosis virus reverse transcriptase, RNase H and T7 DNA dependent RNA polymerase) are used to generate single-stranded RNA. In certain cases NASBA can be used to amplify DNA. The amplification reaction is performed at 41° C., maintaining constant temperature, typically for about 60 to about 90 minutes (see, e.g., Fakruddin, et al., 2012, “Nucleic Acid Sequence Based Amplification (NASBA) Prospects and Applications,” Int. J. of Life Science and Pharma Res. 2(1):L106-L121, incorporated by reference herein).


In some instances, the NASBA reaction is carried out at about 40° C. to about 42° C. In some instances, the NASBA reaction is carried out at 41° C. In some instances, the NASBA reaction is carried out at most at about 42° C. In some instances, the NASBA reaction is carried out at about 40° C. to about 41° C., about 40° C. to about 42° C., or about 41° C. to about 42° C. In some instances, the NASBA reaction is carried out at about 40° C., about 41° C., or about 42° C.


In some instances, the amplification reaction is carried out using NASBA, for an amount of time or range of times described herein.


In some instances, the amplification method is Strand Displacement Amplification (SDA). SDA is an isothermal amplification method that uses four different primers. A primer containing a restriction site (a recognition sequence for HincII exonuclease) is annealed to the DNA template. An exonuclease-deficient fragment of Eschericia coli DNA polymerase 1 (exo-Klenow) elongates the primers. Each SDA cycle consists of (1) primer binding to a displaced target fragment, (2) extension of the primer/target complex by exo-Klenow, (3) nicking of the resultant hemiphosphothioate HincII site, (4) dissociation of HincII from the nicked site and (5) extension of the nick and displacement of the downstream strand by exo-Klenow.


In some instances, the amplification method is Multiple Displacement Amplification (MDA). The MDA is an isothermal, strand-displacing method based on the use of the highly processive and strand-displacing DNA polymerase from bacteriophage Ø29, in conjunction with modified random primers to amplify the entire genome with high fidelity. It has been developed to amplify all DNA in a sample from a very small amount of starting material. In MDA Ø29 DNA polymerase is incubated with dNTPs, random hexamers and denatured template DNA at 30° C. for 16 to 18 hours and the enzyme must be inactivated at high temperature (65° C.) for 10 min. No repeated recycling is required, but a short initial denaturation step, the amplification step, and a final inactivation of the enzyme are needed.


In some instances, the amplification method is Rolling Circle Amplification (RCA). RCA is an isothermal nucleic acid amplification method which allows amplification of the probe DNA sequences by more than 109 fold at a single temperature, typically about 30° C. Numerous rounds of isothermal enzymatic synthesis are carried out by Ø29 DNA polymerase, which extends a circle-hybridized primer by continuously progressing around the circular DNA probe. In some instances, the amplification reaction is carried out using RCA, at about 28° C. to about 32° C.


In some instances, devices, systems and kits disclosed herein comprise at least one oligonucleotide primer, wherein the oligonucleotide primer has a sequence complementary to or corresponding to a Y chromosome sequence. In some instances, devices, systems and kits disclosed herein comprise a pair of oligonucleotide primers, wherein the pair of oligonucleotide primers have sequences complementary to or corresponding to a Y chromosome sequence. In some instances, devices, systems and kits disclosed herein comprise at least one oligonucleotide primer, wherein the oligonucleotide primer comprises a sequence complementary to or corresponding to a Y chromosome sequence. In some instances, devices, systems and kits disclosed herein comprise a pair of oligonucleotide primers, wherein the pair of oligonucleotide primers comprise sequences complementary to or corresponding to a Y chromosome sequence. In some instances, devices, systems and kits disclosed herein comprise at least one oligonucleotide primer, wherein the oligonucleotide primer consists of a sequence complementary to or corresponding to a Y chromosome sequence. In some instances, devices, systems and kits disclosed herein comprise a pair of oligonucleotide primers, wherein the pair of oligonucleotide primers consists of sequences complementary to or corresponding to a Y chromosome sequence. In some instances, the sequence(s) complementary to or corresponding to a Y chromosome sequence is at least 75% homologous to a wild-type human Y chromosome sequence. In some instances, the sequence(s) complementary to or corresponding to a Y chromosome sequence is at least 80% homologous to a wild-type human Y chromosome sequence. In some instances, the sequence(s) complementary to or corresponding to a Y chromosome sequence is at least 85% homologous to a wild-type human Y chromosome sequence. In some instances, the sequence(s) complementary to or corresponding to a Y chromosome sequence is at least 80% homologous to a wild-type human Y chromosome sequence. In some instances, the sequence(s) complementary to or corresponding to a Y chromosome sequence is at least 90% homologous to a wild-type human Y chromosome sequence. In some instances, the sequence(s) complementary to or corresponding to a Y chromosome sequence is at least 95% homologous to a wild-type human Y chromosome sequence. In some instances, the sequence(s) complementary to or corresponding to a Y chromosome sequence is at least 97% homologous to a wild-type human Y chromosome sequence. In some instances, the sequence(s) complementary to or corresponding to a Y chromosome sequence is 100% homologous to a wild-type human Y chromosome sequence.


Nucleic Acid Ligator

In some instances, devices, systems and kits disclosed herein are capable of preparing a library of nucleic acids for detection. In instances, the nucleic acids are optionally amplified by the nucleic acid amplifier and/or amplification reagents described herein. In some instances, the devices or systems described herein comprise a nucleic acid ligator capable of producing a library-competent target nucleic acid for detection.


In some embodiments, the nucleic acid ligator comprises a ligation formulation for producing ligation-competent target nucleic acids (e.g., cell-free nucleic acids). The ligation formulation comprises one or more of: (i) one or more exonucleases adapted to generate a blunt end of the target cell-free nucleic acid and remove a 5′ overhang or a 3′ recessed end of the blunt end of the target cell-free nucleic acids; (ii) a blunt end cell-free nucleic acid dephosphorylating agent; (iii) a crowding reagent; (iv) a nucleic acid damage repair agent; or (v) a nucleic acid ligase. In some cases, the ligation formulation comprises two or more of (i)-(v), three or more of (i)-(v), or all four of (i)-(v). The nucleic acid ligator also comprises one or more adaptor oligonucleotides ligated to the ligation-competent target cell-free nucleic acid.


In some embodiments, the nucleic acid damage repair agent comprises one or more polymerases. In some cases, the one or more polymerases comprises T4 DNA polymerase or DNA polymerase I. In some embodiments, the one or more exonucleases comprises T4 polynucleotide kinase or exonuclease III. In some embodiments, the ligase comprises T3 DNA ligase, T4 DNA ligase, T7 DNA ligase, Taq Ligase, Ampligase, E. coli Ligase, or Sso7-ligase fusion protein. In some embodiments, the crowding reagent comprises polyethylene glycol (PEG), glycogen, or dextran, or a combination thereof. In some embodiments, the small molecule enhancer comprises dimethyl sulfoxide (DMSO), polysorbate 20, formamide, or a diol, or a combination thereof. In some embodiments, the ligator is engineered to perform blunt end ligating, or single nucleotide overhang ligating. In some embodiments, the adaptor oligonucleotides comprise Y shaped adaptors, hairpin adaptors, stem loop adaptors, degradable adaptors, blocked self-ligating adaptors, or barcoded adaptors, or a combination thereof.


In some cases, the ligation formulation comprises an endonuclease, such as a clustered regularly interspaced short palindromic repeats (CRISPR)-Cas combination. In some instances, the Cas enzyme Cas9, Cas12, Cascade and Cas13, or subtypes thereof. In some cases, the Cas enzyme is a Cas orthologue, such as those described in Nat Rev Mol Cell Biol. 2019 August; 20(8):490-507. In some cases, the ligation formulation comprises a transposase. In some instances, the transposase has a “cut-and-paste” mechanism, such as the Tn5 transposase, or a variant thereof.


Nucleic Acid Detector

In some instances, devices, systems and kits disclosed herein comprise a nucleic acid detector. In some instances, the nucleic acid detector comprises a nucleic acid sequencer. In some instances, devices, systems and kits disclosed herein are configured to amplify nucleic acids and sequence the resulting amplified nucleic acids from the nucleic acid amplifier, or the ligation-competent target nucleic acids from the nucleic acid ligator, or both. In some instances, devices, systems and kits disclosed herein are configured to sequence nucleic acids without amplifying nucleic acids. In some instances, devices, systems and kits disclosed herein comprise a nucleic acid sequencer, but do not comprise a nucleic acid amplifying reagent or nucleic acid amplifying component. In some instances, the nucleic acid sequencer comprises a signal detector that detects a signal that reflects successful amplification or unsuccessful amplification. In some instances, the nucleic acid sequencer is the signal detector. In some instances, the signal detector comprises the nucleic acid sequencer.


In some instances, the nucleic acid sequencer has a communication connection with an electronic device that analyzes sequencing reads from the nucleic acid sequencer. In some instances the communication connection is hard wired. In some instances the communication connection is wireless. For example, a mobile device app or computer software, such as those disclosed herein, may receive the sequencing reads, and based on the sequencing reads, display or report genetic information about the sample (e.g., presence of a disease/infection, response to a drug, genetic abnormality or mutation of a fetus).


In some instances, the nucleic acid sequencer comprises a nanopore sequencer. In some instances, the nanopore sequencer comprises a nanopore. In some instances, the nanopore sequencer comprises a membrane and solutions that create a current across the membrane and drive movement of charged molecules (e.g., nucleic acids) through the nanopore. In some instances, the nanopore sequencer comprises a transmembrane protein, a portion thereof, or a modification thereof. In some instances, the transmembrane protein is a bacterial protein. In some instances, the transmembrane protein is not a bacterial protein. In some instances, the nanopore is synthetic. In some instances, the nanopore performs solid state nanopore sequencing. In some instances, the nanopore sequencer is described as pocket-sized, portable, or roughly the size of a cell phone. In some instances, the nanopore sequencer is configured to sequence at least one of RNA and DNA. Non-limiting examples of nanopore sequencing devices include Oxford Nanopore Technologies MinION and SmidgION nanopore sequencing USB devices. Both of these devices are small enough to be handheld. Nanopore sequencing devices and components are further described in reviews by Howorka (Nat Nanotechnol. 2017 Jul. 6; 12(7):619-630), and Garrido-Cardenas et al. (Sensors (Basel). 2017 Mar. 14; 17(3)), both incorporated herein by reference. Other non-limiting examples of nanopore sequencing devices are offered by Electronic Biosciences, Two Pore Guys, Stratos, and Agilent (technology originally from Genia).


In some instances, the nucleic acid detector comprises reagents and components required for bisulfate sequencing to detect epigenetic modifications. For instance, a long region with many methylation markers can be fragmented. Here, each fragment carrying a methylation marker can be an independent signal. Signals from all the fragments are sufficient in combination to obtain useful genetic information.


In some instances, the nucleic acid detector does not comprise a nucleic acid sequencer. In some instances, the nucleic acid detector is configured to count tagged nucleic acids, wherein the nucleic acid detector quantifies a collective signal from one or more tags.


Capture and Detection

In some instances, devices, systems and kits disclosed herein comprise at least one of a nucleic acid detector, capture component, signal detector, a detection reagent, or a combination thereof, for detecting a nucleic acid in the biological sample. In some instances, the capture component and the signal detector are integrated. In some instances, the capture component comprises a solid support. In some instances the solid support comprises a bead, a chip, a strip, a membrane, a matrix, a column, a plate, or a combination thereof.


In some instances, devices, systems and kits disclosed herein comprise at least one probe for an epigenetically modified region of a chromosome or fragment thereof. In some instances, the epigenetic modification of the epigenetically modified region of a chromosome is indicative of gender or a marker of gender. In some instances, devices, systems and kits disclosed herein comprise at least one probe for a paternally inherited sequence that is not present in the maternal DNA. In some instances, devices, systems and kits disclosed herein comprise at least one probe for a paternally inherited single nucleotide polymorphism. In some instances, the chromosome is a Y chromosome. In some instances, the chromosome is an X chromosome. In some instances, the chromosome is a Y chromosome. In some instances, the chromosome is an autosome. In some instances, the probe comprises a peptide, an antibody, an antigen binding antibody fragment, a nucleic acid or a small molecule.


In some instances, devices, systems and kits comprise a sample purifier disclosed herein and a capture component disclosed herein. In some instances, the sample purifier comprises the capture component. In some instances, the sample purifier and the capture component are integrated. In some instances, the sample purifier and the capture component are separate.


In some instances, the capture component comprises a binding moiety described herein. In some instances, the binding moiety is present in a lateral flow assay. In some instances, the binding moiety is added to the sample before the sample is added to the lateral flow assay. In some instances, the binding moiety comprises a signaling molecule. In some instances, the binding moiety is physically associated with a signaling molecule. In some instances, the binding moiety is capable of physically associating with a signaling molecule. In some instances, the binding moiety is connected to a signaling molecule. Non-limiting examples of signaling molecules include a gold particle, a fluorescent particle, a luminescent particle, and a dye molecule. In some instances the capture component comprises a binding moiety that is capable of interacting with an amplification product described herein. In some instances the capture component comprises a binding moiety that is capable of interacting with a tag on an amplification product described herein.


In some instances, devices, systems and kits disclosed herein comprise a detection system. In some instances, the detection system comprises a signal detector. Non-limiting examples of a signal detector include a fluorescence reader, a colorimeter, a sensor, a wire, a circuit, a receiver. In some instances, the detection system comprises a detection reagent. Non-limiting examples of a detection reagent include a fluorophore, a chemical, a nanoparticle, an antibody, and a nucleic acid probe. In some instances, the detection system comprises a pH sensor and a complementary metal-oxide semiconductor, which can be used to detect changes in pH. In some instances, production of an amplification product by devices, systems, kits or methods disclosed herein changes the pH, thereby indicating genetic information.


In some instances, the detection system comprises a signal detector. In some instances, the signal detector is a photodetector that detects photons. In some instances, the signal detector detects fluorescence. In some instances, the signal detector detects a chemical or compound. In some instances, the signal detector detects a chemical that is released when the amplification product is produced. In some instances, the signal detector detects a chemical that is released when the amplification product is added to the detection system. In some instances, the signal detector detects a compound that is produced when the amplification product is produced. In some instances, the signal detector detects a compound that is produced when the amplification product is added to the detection system.


In some instances, the signal detector detects an electrical signal. In some instances, the signal detector comprises an electrode. In some instances, the signal detector comprises a circuit a current, or a current generator. In some instances, the circuit or current is provided by a gradient of two or more solutions or polymers. In some instances, the circuit or current is provided by an energy source (e.g., battery, cell phone, wire from electrical outlet). In some instances, nucleic acids, amplification products, chemicals or compounds disclosed herein provide an electrical signal by disrupting the current and the signal detector detects the electrical signal.


In some instances, the signal detector detects light. In some instances, the signal detector comprises a light sensor. In some instances, the signal detector comprises a camera. In some instances, the signal detector comprises a cell phone camera or a component thereof.


In some instances, the signal detector comprises a nanowire that detects the charge of different bases in nucleic acids. In some instances, the nanowire has a diameter of about 1 nm to about 99 nm. In some instances, the nanowire has a diameter of about 1 nm to about 999 nm. In some instances, the nanowire comprises an inorganic molecule, e.g., nickel, platinum, silicon, gold, zinc, graphene, or titanium. In some instances, the nanowire comprises an organic molecule (e.g., a nucleotide).


In some instances, the detection system comprises an assay assembly, wherein the assay assembly is capable of detecting a target analyte (e.g., nucleic acid amplification product). In some instances, the assay assembly comprises a lateral flow strip, also referred to herein and in the field, as a lateral flow assay, lateral flow test or lateral flow device. In some instances, a lateral flow assay provides a fast, inexpensive, and technically simple method to detect amplification products disclosed herein. Generally, lateral flow assays disclosed herein comprise a porous material or porous matrix that transports a fluid, and a detector that detects the amplification product when it is present. The porous material may comprise a porous paper, a polymer structure, a sintered polymer, or a combination thereof. In some instances, the lateral flow assay transports the biological fluid or portion thereof (e.g., plasma of blood sample). In some instances, the lateral flow assay transports a solution containing the biological fluid or portion thereof. For instance, methods may comprise adding a solution to the biological fluid before or during addition of the sample to the device or system. The solution may comprise a salt, a polymer, or any other component that facilitates transport of the sample and or amplification product through the lateral flow assay. In some instances, nucleic acids are amplified after they have traveled through the lateral flow strip.


In some instances, devices, the detection system comprises a lateral flow device, wherein the lateral flow device comprises multiple sectors or zones, wherein each desired function can be present in a separate sector or zone. In general, in a lateral flow device, a liquid sample, e.g., a body fluid sample as described herein, containing the target analyte moves with or without the assistance of external forces through sectors or zones of the lateral flow device. In some instances, the target analyte moves without the assistance of external forces, e.g., by capillary action. In some instances, the target analyte moves with assistance of external forces, e.g., by facilitation of capillary action by movement of the lateral flow device. Movement can comprise any motion caused by external input, e.g., shaking, turning, centrifuging, applying an electrical field or magnetic field, applying a pump, applying a vacuum, or rocking of the lateral flow device.


In some instances, the lateral flow device is a lateral flow test strip, comprising zones or sectors that are situated laterally, e.g., behind or ahead of each other. In general, a lateral flow test strip allows accessibility of the functional zones or sectors from each side of (e.g., above and below) the test strip as a result of exposure of a large surface area of each functional zone or sector. This facilitates the addition of reagents, including those used in sample purification, or target analyte amplification, and/or detection.


Any suitable lateral flow test strip detection format known to those of skill in the art is contemplated for use in an assay assembly of the present disclosure. Lateral flow test strip detection formats are well known and have been described in the literature. Lateral flow test strip assay formats are generally described by, e.g., Sharma et al., (2015) Biosensors 5:577-601, incorporated by reference herein in its entirety. Detection of nucleic acids using lateral flow test strip sandwich assay formats is described by, e.g., U.S. Pat. No. 9,121,849, “Lateral Flow Assays,” incorporated by reference herein in its entirety. Detection of nucleic acids using lateral flow test strip competitive assay formats is described by, e.g., U.S. Pat. No. 9,423,399, “Lateral Flow Assays for Tagged Analytes,” incorporated by reference herein in its entirety.


In some instances, a lateral flow test strip detects the target analyte in a test sample using a sandwich format, a competitive format, or a multiplex detection format. In a traditional sandwich assay format, the detected signal is directly proportional to the amount of the target analyte present in the sample, so that increasing amounts of the target analyte lead to increasing signal intensity. In traditional competitive assay formats, the detected signal has an inverse relationship with the amount of analyte present, and increasing amounts of analyte lead to decreasing signal intensity.


In a lateral flow sandwich format, also referred to as a “sandwich assay,” the test sample typically is applied to a sample application pad at one end of a test strip. The applied test sample flows through the test strip, from the sample application pad to a conjugate pad located adjacent to the sample application pad, where the conjugate pad is downstream in the direction of sample flow. In some instances, the conjugate pad comprises a labeled, reversibly-immobilized probe, e.g., an antibody or aptamer labeled with, e.g., a dye, enzyme, or nanoparticle. A labeled probe-target analyte complex is formed if the target analyte is present in the test sample. This complex then flows to a first test zone or sector (e.g., a test line) comprising an immobilized second probe which is specific to the target analyte, thereby trapping any labeled probe-target analyte complex. In some instances, the intensity or magnitude of signal, e.g., color, at the first test zone or sector is used to indicate the presence or absence, quantity, or presence and quantity of target analyte in the test sample. A second test zone or sector can comprise a third probe that binds to excess labeled probe. If the applied test sample comprises the target analyte, little or no excess labeled probe will be present on the test strip following capture of the target analyte by the labeled probe on the conjugate pad. Consequently, the second test zone or sector will not bind any labeled probe, and little or no signal (e.g., color) at the second test zone or sector is expected to be observed. The absence of signal at the second test zone or sector thus can provide assurance that signal observed in the first test zone or sector is due to the presence of the target analyte.


In some instances, devices and systems disclosed herein comprise a sandwich assay. In some instances, the sandwich assay is configured to receive a biological sample disclosed herein and retain sample components (e.g., nucleic acids, cells, microparticles). In some instances, the sandwich assay is configured to receive a flow solution that flushes non-nucleic acid components of the biological sample (e.g., proteins, cells, microparticles), leaving nucleic acids of the biological sample behind. In some instances, the sandwich assay comprises a membrane that binds nucleic acids to help retain the nucleic acids when the flow solution is applied. Non-limiting examples of a membrane the binds nucleic acids includes chitosan modified nitrocellulose.


Similarly, in a lateral flow competitive format a test sample is applied to a sample application pad at one end of a test strip, and the target analyte binds to a labeled probe to form a probe-target analyte complex in a conjugate pad downstream of the sample application pad. In the competitive format, the first test zone or sector typically comprises the target analyte or an analog of the target analyte. The target analyte in the first test zone or sector binds any free labeled probe that did not bind to the test analyte in the conjugate pad. Thus, the amount of signal observed in the first test zone or sector is higher when there is no target analyte in the applied test sample than when target analyte is present. A second test zone or sector comprises a probe that specifically binds to the probe-target analyte complex. The amount of signal observed in this second test zone or sector is higher when the target analyte is present in the applied test sample.


In a lateral flow test strip multiplex detection format, more than one target analyte is detected using the test strip through the use of additional test zones or sectors comprising, e.g., probes specific for each of the target analytes.


In some instances, the lateral flow device is a layered lateral flow device, comprising zones or sectors that are present in layers situated medially, e.g., above or below each other. In some instances, one or more zones or sectors are present in a given layer. In some instances, each zone or sector is present in an individual layer. In some instances, a layer comprises multiple zones or sectors. In some instances, the layers are laminated. In a layered lateral flow device, processes controlled by diffusion and directed by the concentration gradient are possible driving forces. For example, multilayer analytical elements for fluorometric assay or fluorometric quantitative analysis of an analyte contained in a sample liquid are described in EP0097952, “Multilayer analytical element,” incorporated by reference herein.


A lateral flow device can comprise one or more functional zones or sectors. In some instances, the test assembly comprises 1 to 20 functional zones or sectors. In some instances, the functional zones ore sectors comprise at least one sample purification zone or sector, at least one target analyte amplification zone or sector, at least one target analyte detection zone or sector, and at least one target analyte detection zone or sector.


In some instances, the target analyte is a nucleic acid sequence, and the lateral flow device is a nucleic acid lateral flow assay. In some instances, devices, systems and kits disclosed herein comprise a nucleic acid lateral flow assay, wherein the nucleic acid lateral flow assay comprises nucleic acid amplification function. In some instances, target nucleic acid amplification that is carried out by the nucleic acid amplification function takes place prior to, or at the same time as, detection of the amplified nucleic acid species. In some instances, detection comprises one or more of qualitative, semi-quantitative, or quantitative detection of the presence of the target analyte.


In some instances, devices, systems and kits disclosed herein comprise an assay assembly wherein a target nucleic acid analyte is amplified in a lateral flow test strip to generate a labeled amplification product, or an amplification product that can be labeled after amplification. In some instances, a label is present on one or more amplification primers, or subsequently conjugated to one or more amplification primers, following amplification. In some instances, at least one target nucleic acid amplification product is detected on the lateral flow test strip. For example, one or more zones or sectors on the lateral flow test strip may comprise a probe that is specific for a target nucleic acid amplification product.


In some instances, the devices, systems and kits disclosed herein comprise a detector, wherein the detector comprises a graphene biosensor. Graphene biosensors are described, e.g., by Afsahi et al., in the article entitled, “Novel graphene-based biosensor for early detection of Zika virus infection, Biosensor and Bioelectronics,” (2018) 100:85-88.


In some instances, a detector disclosed herein comprises a nanopore, a nanosensor, or a nanoswitch. For instance, the detector may be capable of nanopore sequencing, a method of transporting a nucleic acid through a nanpore based on an electric current across a membrane, the detector measuring disruptions in the current corresponding to specific nucleotides. A nanoswitch or nanosensor undergoes a structural change upon exposure to the detectable signal. See, e.g., Koussa et al., “DNA nanoswitches: A quantitative platform for gel-based biomolecular interaction analysis,” (2015) Nature Methods, 12(2): 123-126.


In some instances, the detector comprises a rapid multiplex biomarker assay where probes for an analyte of interest are produced on a chip that is used for real-time detection. Thus, there is no need for a tag, label or reporter. Binding of analytes to these probes causes a change in a refractive index that corresponds to a concentration of the analyte. All steps may be automated. Incubations may be not be necessary. Results may be available in less than an hour (e.g., 10-30 minutes). A non-limiting example of such a detector is the Genalyte Maverick Detection System.


Additional Tests

In some instances, devices, systems and kits disclosed herein comprise additional features, reagents, tests or assays for detection or analysis of biological components besides nucleic acids. By way of non-limiting example, the biological component may be selected from a peptide, a lipid, a fatty acid, a sterol, a carbohydrate, a viral component, a microbial component, and a combination thereof. The biological component may be an antibody. The biological component may be an antibody produced in response to a peptide in the subject. These additional assays may be capable of detecting or analyzing biological components in the small volumes or sample sizes disclosed herein and throughout. An additional test may comprise a reagent capable of interacting with a biological component of interest. Non-limiting examples of such reagents include antibodies, peptides, oligonucleotides, aptamers, and small molecules, and combinations thereof. The reagent may comprise a detectable label. The reagent may be capable of interacting with a detectable label. The reagent may be capable of providing a detectable signal.


Additional tests may require one or more antibodies. For instance, the additional test may comprise reagents or components that provide for performing Immuno-PCR (IPCR). IPCR is a method wherein a first antibody for a protein of interest is immobilized and exposed to a sample. If the sample contains the protein of interest, it will be captured by the first antibody. The captured protein of interest is then exposed to a second antibody that binds the protein of interest. The second antibody has been coupled to a polynucleotide that can be detected by real-time PCR. Alternatively or additionally, the additional test may comprise reagents or components that provide for performing a proximity ligation assay (PLA), wherein the sample is exposed to two antibodies specific for a protein of interest, each antibody comprising an oligonucleotide. If both antibodies bind to the protein of interest, the oligonucleotides of each antibody will be close enough to be amplified and/or detected.


In some instances, devices, systems and kits disclosed herein comprise a pregnancy test to confirm the subject is pregnant. In some instances, devices, systems and kits disclosed herein comprise a test for presence of a Y chromosome or absence of a Y chromosome (gender test). In some instances, devices, systems and kits disclosed herein comprise a test for gestational age.


In some instances, devices, systems, and kits disclosed herein comprise a test for multiple pregnancies, e.g., twins or triplets. In some instances, methods disclosed herein quantify (absolute or relative) the total amount of fetal nucleic acids in a maternal sample, and the amount of sequences represented by the various autosomes, X and Y chromosomes to detect if one, both or all fetuses are male or female, euploid or aneuploid, etc.


In some instances, devices, systems and kits disclosed herein comprise a pregnancy test for indicating, detecting or verifying the subject is pregnant. In some instances the pregnancy test comprises a reagent or component for measuring a pregnancy related factor. By way of non-limiting example, the pregnancy related factor may be human chorionic gonadotropin protein (hCG) and the reagent or component for hCG comprising an anti-hCG antibody. Also by way of non-limiting example, the pregnancy related factor may be an hCG transcript and the reagent or component for measuring the hCG transcript is an oligonucleotide probe or primer that hybridizes to the hCG transcript. In some instances, the pregnancy related factor is heat shock protein 10 kDa protein 1, also known as early-pregnancy factor (EPF).


In some instances, devices, systems and kits disclosed herein are capable of conveying the age of the fetus. For example, a signal may be generated from the device or system, wherein the level of the signal corresponds to the amount of hCG in the sample from the subject. This level or strength of the signal may be translated or equivocated with a numerical value representing the amount of hCG in the sample. The amount of hCG may indicate an approximate age of the fetus.


In some instances, devices, systems and kits disclosed herein provide an indication or verification of pregnancy, an indication or verification of gestational age, and an indication or verification of gender. In some instances, devices, systems and kits disclosed herein provide an indication of pregnancy, gestational age, and/or gender with at least about 90% confidence (e.g., 90% of the time, the indication is accurate). In some instances, devices, systems and kits disclosed herein provide an indication of pregnancy, gestational age, and/or gender with at least about 95% confidence. In some instances, devices, systems and kits disclosed herein provide an indication of pregnancy, gestational age, and/or gender with at least about 99% confidence.


Performance Parameters

In some instances, the devices, systems and kits disclosed herein are operable at one or more temperatures. In some instances, the temperature of a component or reagent of the device system, or kit needs to be altered in order for the device system, or kit to be operable. Generally, devices, systems and kits are considered “operable” when they are capable of providing information conveyed by biomarkers (e.g., RNA/DNA, peptides) in the biological sample. In some instances, temperature(s) at which the devices, systems, kits, components thereof, or reagents thereof are operable are obtained in a common household. By way of non-limiting example, temperature(s) obtained in a common household may be provided by room temperature, a refrigerator, a freezer, a microwave, a stove, an electric hot pot, hot/cold water bath, or an oven.


In some instances, devices, systems, kits, components thereof, or reagents thereof, as described herein, are operable at a single temperature. In some instances, devices, systems, kits, components thereof, or reagents thereof, as described herein, only require a single temperature to be operable. In some instances, devices, systems, kits, components thereof, or reagents thereof, as described herein, only require two temperatures to be operable. In some instances, devices, systems, kits, components thereof, or reagents thereof, as described herein, only require three temperatures to be operable.


In some instances, devices, systems, kits disclosed herein comprises a heating device or a cooling device to allow a user to obtain the at least one temperature. Non-limiting examples of heating devices and cooling devices are pouches or bag of material that can be cooled in a refrigerator or freezer, or microwaved or boiled on a stove top, or plugged into an electrical socket, and subsequently applied to devices disclosed herein or components thereof, thereby transmitting heat to the device or component thereof or cooling the device or component thereof. Another non-limiting example of a heating device is an electrical wire or coil that runs through the device or portion thereof. The electrical wire or coil may be activated by external (e.g. solar, outlet) or internal (e.g., battery, cell phone) power to convey heat to the device or portion thereof. In some instances, devices, systems, kits disclosed herein comprise a thermometer or temperature indicator to assist a user with assessing a temperature within the range of temperatures. Alternatively, or additionally, the user employs a device in a typical home setting (e.g., thermometer, cell phone, etc.) to assess the temperature.


In some instances, temperature at which the devices, systems, kits, components thereof, or reagents thereof are operable at a range of temperatures or at least one temperature that falls within a range of temperatures. In some instances, the range of temperatures is about −50° C. to about 100° C. In some instances, the range of temperatures is about −50° C. to about 90° C. In some instances, the range of temperatures is about −50° C. to about 80° C. In some instances, the range of temperatures is about is about −50° C. to about 70° C. In some instances, the range of temperatures is about −50° C. to about 60° C. In some instances, the range of temperatures is about −50° C. to about 50° C. In some instances, the range of temperatures is about −50° C. to about 40° C. In some instances, the range of temperatures is about −50° C. to about 30° C. In some instances, the range of temperatures is about −50° C. to about 20° C. In some instances, the range of temperatures is about −50° C. to about 10° C. In some instances, the range of temperatures is about 0° C. to about 100° C. In some instances, the range of temperatures is about 0° C. to about 90° C. In some instances, the range of temperatures is about 0° C. to about 80° C. In some instances, the range of temperatures is about is about 0° C. to about 70° C. In some instances, the range of temperatures is about 0° C. to about 60° C. In some instances, the range of temperatures is about 0° C. to about 50° C. In some instances, the range of temperatures is about 0° C. to about 40° C. In some instances, the range of temperatures is about 0° C. to about 30° C. In some instances, the range of temperatures is about 0° C. to about 20° C. In some instances, the range of temperatures is about 0° C. to about 10° C. In some instances, the range of temperatures is about 15° C. to about 100° C. In some instances, the range of temperatures is about 15° C. to about 90° C. In some instances, the range of temperatures is about 15° C. to about 80° C. In some instances, the range of temperatures is about is about 15° C. to about 70° C. In some instances, the range of temperatures is about 15° C. to about 60° C. In some instances, the range of temperatures is about 15° C. to about 50° C. In some instances, the range of temperatures is about 15° C. to about 40° C. In some instances, the range of temperatures is about 15° C. to about 30° C. In some instances, the range of temperatures is about 10° C. to about 30° C. In some instances, devices, systems, kits disclosed herein, including all components thereof, and all reagents thereof, are completely operable at room temperature, not requiring cooling, freezing or heating.


In some instances, devices, systems and kits disclosed herein detect components of the biological sample or products thereof (e.g., amplification products, conjugation products, binding products) within a time range of receiving the biological sample. In some instances, detecting occurs via a signaling molecule described herein. In some instances, the time range is about one second to about one minute. In some instances, the time range is about ten seconds to about one minute. In some instances, the time range is about ten seconds to about one minute. In some instances, the time range is about thirty seconds to about one minute. In some instances, the time range is about 10 seconds to about 2 minutes. In some instances, the time range is about 10 seconds to about 3 minutes. In some instances, the time range is about 10 seconds to about 5 minutes. In some instances, the time range is about 10 seconds to about 10 minutes. In some instances, the time range is about 10 seconds to about 15 minutes. In some instances, the time range is about 10 seconds to about 20 minutes. In some instances, the time range is about 30 seconds to about 2 minutes. In some instances, the time range is about 30 seconds to about 5 minutes. In some instances, the time range is about 30 seconds to about 10 minutes. In some instances, the time range is about 30 seconds to about 15 minutes. In some instances, the time range is about 30 seconds to about 20 minutes. In some instances, the time range is about 30 seconds to about 30 minutes. In some instances, the time range is about 1 minute to about 2 minutes. In some instances, the time range is about 1 minute to about 3 minutes. In some instances, the time range is about 1 minute to about 5 minutes. In some instances, the time range is about 1 minute to about 10 minutes. In some instances, the time range is about 1 minute to about 20 minutes. In some instances, the time range is about 1 minute to about 30 minutes. In some instances, the time range is about 5 minutes to about 10 minutes. In some instances, the time range is about 5 minutes to about 15 minutes. In some instances, the time range is about 5 minutes to about 20 minutes. In some instances, the time range is about 5 minutes to about 30 minutes. In some instances, the time range is about 5 minutes to about 60 minutes. In some instances, the time range is about 30 minutes to about 60 minutes. In some instances, the time range is about 30 minutes to about 2 hours. In some instances, the time range is about 1 hour to about 2 hours. In some instances, the time range is about 1 hour to about 4 hours.


In some instances, devices, systems and kits disclosed herein detect a component of the biological sample or a product thereof (e.g., amplification product, conjugation product, binding product) in less than a given amount of time. In some instances, devices, systems and kits disclosed herein provide an analysis of a component of a biological sample or product thereof in less than a given amount of time. In some instances, the amount of time is less than 1 minute. In some instances, the amount of time is less than 5 minutes. In some instances, the amount of time is less than 10 minutes. In some instances, the amount of time is 15 minutes. In some instances, the amount of time is less than 20 minutes. In some instances, the amount of time is less than 30 minutes. In some instances, the amount of time is less than 60 minutes. In some instances, the amount of time is less than 2 hours. In some instances, the amount of time is less than 8 hours.


Communication & Information Storage

In general, devices, systems and kits disclosed herein comprise a nucleic acid information output. The nucleic acid information output is configured to communicate genetic information from the sample to the user. In some instances, the nucleic acid information output comprises a communication connection or interface so that genetic information obtained can be shared with others not physically present (e.g., family member, physician, or genetic counselor). The communication connection or interface may also allow for input from other sources. In some instances, devices, systems and kits disclosed herein comprise an interface for receiving information based on the genetic information obtained. The interface or communication connection may also receive non-genetic information from the user (e.g., medical history, medical conditions, age, weight, heart rate, blood pressure, physical activity, etc.). The interface or communication connection may also receive information provided by someone or something other than the user. By way of non-limiting example, this includes web-based information, information from a medical practitioner, and information from an insurance company. In some instances, devices, systems and kits disclosed herein comprise an interface for communicating information based on the genetic information obtained. In some instances, the interface provides a description of a genetic or chromosomal abnormality. In some instances, the interface provides a list of local contacts, such as doctors, support groups, stores and service providers, which support families of children with a genetic or chromosomal abnormality. In some instances, the interface provides an online listing of products or services that would be useful to children with a genetic or chromosomal abnormality. In some instances, devices, systems and kits disclosed herein comprise an information storage unit, e.g., a computer chip. In some instances, the devices, systems and kits disclosed herein comprise means to store genetic information securely. For example, devices, systems and kits disclosed herein may comprise a data chip or a connection (wired or wireless) to a hard drive, server, database or cloud. Non-limiting examples of interfaces for devices and systems disclosed herein are shown in FIG. 4B and FIGS. 5A-E.


In some instances, the devices, systems and kits disclosed herein are capable of collecting, encrypting, and/or storing information from users in a secure manner. Non-limiting examples of such information include health information, information from their wearables, other tests they have done or will do, demographic information etc.


In some instances, the devices, systems and kits disclosed herein are capable of communicating information about biomarkers in the biological sample to a communication device. In some instances the communication device is capable of being connected to the internet (e.g., via port or wireless connection). In some instances the communication device is connected to the internet. In some instances the communication device is not connected to the internet. In some instances, devices, systems and kits disclosed herein are capable of communicating information about biomarkers in the biological sample through the communication device to the internet. Non-limiting examples of communication devices are cell phones, electronic notepads, and computers.


In some instances, devices, systems and kits disclosed herein comprise a communication connection or a communication interface. In some embodiments, the communication interface provides a wired interface. In further embodiments, the wired communications interface utilizes Universal Serial Bus (USB) (including mini-USB, micro-USB, USB Type A, USB Type B, and USB Type C), IEEE 1394 (FireWire), Thunderbolt, Ethernet, and optical interconnect.


In some embodiments, the communication interface provides a wireless interface. See, e.g., FIGS. 5A-E. In further embodiments, the wireless communications interface utilizes a wireless communications protocol such as infrared, near-field communications (NFC) (including RFID), Bluetooth, Bluetooth Low Energy (BLE), ZigBee, ANT, IEEE 802.11 (Wi-Fi), Wireless Local Area Network (WLAN), Wireless Personal Area Network (WPAN), Wireless Wide Area Network (WWAN), WiMAX, IEEE 802.16 (Worldwide Interoperability for Microwave Access (WiMAX)), or 3G/4G/LTE/5G cellular communication methods.


In some embodiments, devices, systems, kits, and methods described herein include a digital processing device, or use of the same. In further embodiments, the digital processing device includes one or more hardware central processing units (CPUs) or general purpose graphics processing units (GPGPUs) that carry out the device's functions. In still further embodiments, the digital processing device further comprises an operating system configured to perform executable instructions. In some embodiments, the digital processing device includes a communication interface (e.g., network adapter) for communicating with one or more peripheral devices, one or more distinct digital processing devices, one or more computing systems, one or more computer networks, and/or one or more communications networks.


In some embodiments, the digital processing device is communicatively coupled to a computer network (“network”) with the aid of the communication interface. Suitable networks include, a personal area network (PAN), a local area networks (LAN), a wide area network (WAN), an intranet, an extranet, the Internet (providing access to the World Wide Web) and combinations thereof. The network in some cases is a telecommunication and/or data network. The network, in various cases, includes one or more computer servers, which enable distributed computing, such as cloud computing. The network, in some cases and with the aid of the device, implements a peer-to-peer network, which enables devices coupled to the device to behave as a client or a server.


In accordance with the description herein, suitable digital processing devices include, by way of non-limiting examples, server computers, desktop computers, laptop computers, notebook computers, sub-notebook computers, netbook computers, netpad computers, set-top computers, media streaming devices, handheld computers, Internet appliances, fitness trackers, smart watches, mobile smartphones, tablet computers, and personal digital assistants. Those of skill in the art will recognize that many smartphones are suitable for use in the system described herein. Those of skill in the art will also recognize that select televisions, video players, and digital music players with optional computer network connectivity are suitable for use in the system described herein. Suitable tablet computers include those with booklet, slate, and convertible configurations, known to those of skill in the art.


In some embodiments, the digital processing device includes an operating system configured to perform executable instructions. The operating system is, for example, software, including programs and data, which manages the device's hardware and provides services for execution of applications. Those of skill in the art will recognize that suitable server operating systems include, by way of non-limiting examples, FreeBSD, OpenBSD, NetBSD®, Linux, Apple® Mac OS X Server®, Oracle® Solaris Windows Server®, and Novell® NetWare®. Those of skill in the art will recognize that suitable personal computer operating systems include, by way of non-limiting examples, Microsoft® Windows®, Apple® Mac OS X®, UNIX®, and UNIX-like operating systems such as GNU/Linux®. In some embodiments, the operating system is provided by cloud computing. Those of skill in the art will also recognize that suitable mobile smart phone operating systems include, by way of non-limiting examples, Nokia® Symbian® OS, Apple® iOS®, Research In Motion® BlackBerryOS®, Google® Android®, Microsoft® Windows Phone® OS, Microsoft® Windows Mobile® OS, Linux®, and Palm® WebOS®. Those of skill in the art will also recognize that suitable media streaming device operating systems include, by way of non-limiting examples, Apple TV®, Roku®, Boxee®, GoogleTV®, GoogleChromecast®, AmazonFire®, and Samsung® HomeSync®. In some instances, the operating system comprises an Internet of Things (IoT) device. Non-limiting examples of an IoT device include Amazon's Alexa®, Microsoft's Cortana®, Apple Home Pod®, and GoogleSpeaker®. In some instances, devices, systems, and kits disclosed herein comprise a virtual reality and/or augmented reality system.


In some embodiments, devices, systems, and kits disclosed herein comprise a storage and/or memory device. The storage and/or memory device is one or more physical apparatuses used to store data or programs on a temporary or permanent basis. In some embodiments, the device is volatile memory and requires power to maintain stored information. In some embodiments, the device is non-volatile memory and retains stored information when the digital processing device is not powered. In further embodiments, the non-volatile memory comprises flash memory. In some embodiments, the non-volatile memory comprises dynamic random-access memory (DRAM). In some embodiments, the non-volatile memory comprises ferroelectric random access memory (FRAM). In some embodiments, the non-volatile memory comprises phase-change random access memory (PRAM). In other embodiments, the device is a storage device including, by way of non-limiting examples, CD-ROMs, DVDs, flash memory devices, magnetic disk drives, magnetic tapes drives, optical disk drives, and cloud computing based storage. In further embodiments, the storage and/or memory device is a combination of devices such as those disclosed herein.


In some embodiments, the digital processing device includes a display to send visual information to a user. In some embodiments, the display is a liquid crystal display (LCD). In further embodiments, the display is a thin film transistor liquid crystal display (TFT-LCD). In some embodiments, the display is an organic light emitting diode (OLED) display. In various further embodiments, on OLED display is a passive-matrix OLED (PMOLED) or active-matrix OLED (AMOLED) display. In some embodiments, the display is a plasma display. In other embodiments, the display is a video projector. In yet other embodiments, the display is a head-mounted display in communication with the digital processing device, such as a VR headset.


In some embodiments, the digital processing device includes an input device to receive information from a user. In some embodiments, the input device is a keyboard. In some embodiments, the input device is a pointing device including, by way of non-limiting examples, a mouse, trackball, track pad, joystick, game controller, or stylus. In some embodiments, the input device is a touch screen or a multi-touch screen. In other embodiments, the input device is a microphone to capture voice or other sound input. In other embodiments, the input device is a video camera or other sensor to capture motion or visual input. In further embodiments, the input device is a Kinect, Leap Motion, or the like. In still further embodiments, the input device is a combination of devices such as those disclosed herein.


Mobile Application

In some embodiments, devices, systems, kits, and methods disclosed herein comprise a digital processing device, or use of the same, wherein the digital processing device is provided with executable instructions in the form of a mobile application. In some embodiments, the mobile application is provided to a mobile digital processing device at the time it is manufactured. In other embodiments, the mobile application is provided to a mobile digital processing device via the computer network described herein. Mobile applications disclosed herein may be configured to locate, encrypt, index, and/or access information. Mobile applications disclosed herein may be configured to acquire, encrypt, create, manipulate, index, and peruse data.


Referring to FIG. 5A, in a particular embodiment, a mobile application is configured to connect with, communicate with, and receive genetic information and other information from the devices, systems and kits disclosed herein. FIG. 5A is a diagram depicting various functions that the mobile application optionally provides to users. In this embodiment, the mobile application optionally provides: 1) a personalized, tailored user experience (UX) based on the personal information and preferences of the user; 2) an interactive text-, audio-, and/or video-driven instructional experience to inform the user how to utilize the devices, systems, and kits; 3) a content platform that provides the user with access to articles, news, media, games, and the like; and 4) tools for tracking and sharing information, test results, and events.


Referring to FIG. 5B, in a particular embodiment, the mobile application optionally includes an interactive interface providing a step-by-step walkthrough to guide a user through use of the devices, systems and kits disclosed herein. In various embodiments, the interactive walkthrough includes text, images, animations, audio, video, and the like to inform and instruct the user.


Referring to FIG. 5C, in a particular embodiment, the mobile application optionally includes a home screen allowing a user to access the mobile application functionality disclosed herein. In this embodiment, the home screen includes a personalized greeting as well as interface elements allowing the user to start a test, view current and historic test results, share test results, and interact with a larger community of users.


Referring to FIG. 5D, in a particular embodiment, the mobile application optionally includes a progress diagram informing a user of the status of a process for connecting to a device, system, or kit disclosed herein. In this embodiment, the diagram shows all the steps and indicates the current step. The steps are: 1) pair with the device via, for example, Bluetooth; 2) detect a sample in the device; and 3) wait for the sample to be processed. In some embodiments, the diagram is interactive, animated, or augmented with media or other content.


Referring to FIG. 5E, in a particular embodiment, the mobile application optionally includes a social sharing screen allowing a user to access features to share test results. Many services, platforms, and networks are suitable for sharing test results and other information and events. Suitable social networking and sharing platforms include, by way of non-limiting examples, Facebook, YouTube, Twitter, LinkedIn, Pinterest, Google Plus+, Tumblr, Instagram, Reddit, VK, Snapchat, Flickr, Vine, Meetup, Ask.fm, Classmates, QQ, WeChat, Swarm by Foursquare, Kik, Yik Yak, Shots, Periscope, Medium, Soundcloud, Tinder, WhatsApp, Snap Chat, Slack, Musical.ly, Peach, Blab, Renren, Sina Weibo, Renren, Line, and Momo. In some embodiments, the test results are shared by SMS, MMS or instant message. In some embodiments, the test results are shared by email.


In some embodiments, the mobile application optionally includes a home screen allowing a user to access additional features such as a blog and timeline of important information and events related to the test results, which is optionally shared. In various embodiments, suitable information and events include those pertaining to clinical trial outcomes, newly marketed therapeutics, nutrition, exercise, fetal development, health, etc. In this embodiment, the home screen further includes access to user preferences and settings.


In some instances, devices and systems disclosed herein are in communication with the mobile application. The mobile application may provide for obtaining a Patient ID and electronic health record (EHR), arranging device shipment (to and/or from a user), online ordering of test results. The mobile application may provide for tracking a device or a portion thereof (e.g., shipping/storage compartment), or information obtained with the device, from one point to another. Various points may be selected from shipping, home, sample processing laboratory, and physician's office.


In view of the disclosure provided herein, a mobile application is created by techniques known to those of skill in the art using hardware, languages, and development environments known to the art. Those of skill in the art will recognize that mobile applications are written in several languages. Suitable programming languages include, by way of non-limiting examples, C, C++, C#, Objective-C, Java™, Javascript, Pascal, Object Pascal, Python™, Ruby, VB.NET, WML, and XHTML/HTML with or without CSS, or combinations thereof.


Suitable mobile application development environments are available from several sources. Commercially available development environments include, by way of non-limiting examples, AirplaySDK, alcheMo, Appcelerator®, Celsius, Bedrock, Flash Lite, .NET Compact Framework, Rhomobile, and WorkLight Mobile Platform. Other development environments are available without cost including, by way of non-limiting examples, Lazarus, MobiFlex, MoSync, and Phonegap. Also, mobile device manufacturers distribute software developer kits including, by way of non-limiting examples, iPhone and iPad (iOS) SDK, Android™ SDK, BlackBerry® SDK, BREW SDK, Palm® OS SDK, Symbian SDK, webOS SDK, and Windows® Mobile SDK.


Those of skill in the art will recognize that several commercial forums are available for distribution of mobile applications including, by way of non-limiting examples, Apple® App Store, Google® Play, Chrome WebStore, BlackBerry® App World, App Store for Palm devices, App Catalog for webOS, Windows® Marketplace for Mobile, Ovi Store for Nokia® devices, and Samsung® Apps.


Aspects Related to Devices, Systems, Kits and Methods

The following aspects are related to devices, systems, kits and methods disclosed herein. Devices, systems, kits and methods disclosed herein are generally designed to process and analyze cell-free nucleic acids in biological samples of female subjects. The following descriptions of cell-free nucleic acids, biological samples, and subjects may aid in understanding the utility of devices, systems, kits and methods disclosed herein.


Diseases and Conditions

Disclosed herein are devices, systems, kits and methods for detecting the presence, absence, or severity of a disease or condition in a subject. In some instances, the disease or condition is due to a genetic mutation. The genetic mutation may be inherited (e.g., the mutation was present in an ancestor or relative). The genetic mutation may be a spontaneous mutation (e.g., an error in DNA replication or repair). The genetic mutation may be due to exposure to an environmental factor (e.g., UV light, carcinogen). By way of non-limiting example, the genetic mutation may be selected from a frameshift mutation, an insertion mutation, a deletion mutation, a substitution mutation, a single nucleotide polymorphism, a copy number variation, and a chromosomal translocation.


In some instances, the disease or condition is due to an environmental factor (e.g., carcinogen, diet, stress, pathogen). In some instances, the environmental factor causes a genetic mutation. In other instances, the environmental factor does not cause a genetic mutation. In some instances, the environmental factor causes a change in one or more epigenetic modifications in a subject relative to a healthy individual. In some instances, the environmental factor causes a change in one or more epigenetic modifications in a subject relative to that of the subject at an earlier time point.


Tissue or Cell-Type Specific Disease

Devices, systems, kits and methods disclosed herein may be used to detect or monitor a disease or condition that affects one or more tissues, organs or cell types. The disease or condition may cause a release of nucleic acids from one or more tissues, organs or cell types. The disease or condition may increase a release of nucleic acids from one or more tissues, organs or cell types relative to a corresponding release occurring in a healthy individual. A tissue may be classified as epithelial, connective, muscle, or nervous tissue. Non-limiting examples of tissues are adipose, muscle, connective tissue, mammary tissue, and bone marrow. Non-limiting examples of organs are brain, thymus, thyroid, lung, heart, spleen, liver, kidney, pancreas, stomach, small intestine, large intestine, colon, prostate, ovary, uterus, and urinary bladder. Non-limiting examples of cell types are endothelial cells, vascular smooth muscle cells, cardiomyocytes, hepatocytes, pancreatic beta cells, adipocytes, neurons, endometrial cells, immune cells (T cells, B cells, dendritic cells, monocytes, macrophages, Kupffer cells, microglia).


Organ Transplant Monitoring

Devices, systems, kits and methods disclosed herein may be used to detect or monitor general health. Devices, systems, kits and methods disclosed herein may be used to detect or monitor fitness. Devices, systems, kits and methods disclosed herein may be used to detect or monitor the health of an organ transplant recipient and/or the health of the transplanted organ.


Proliferative Disease (Cancer)

In the oncology field, liquid biopsy is a viable alternative to tissue-based biopsy methods in many cases. In particular, liquid biopsy is advantageous when the procedure is too costly, presents an unjustifiable risk to the patient, is inconvenient for the patient, or impractical as is the case in metastatic disease, neurological diseases and in monitoring settings, where there is no tissue to be biopsied.


Disclosed herein, in some embodiments, are devices, systems, kits and methods useful for early cancer detection (screening), disease monitoring and characterization, and determining a disease burden, deriving a precision treatment regimen, as shown in Example 15.


The disease or condition may comprise an abnormal cell growth or proliferation. The disease or condition may comprise leukemia. Non-limiting types of leukemia include acute lymphoblastic leukemia (ALL), chronic lymphocytic leukemia (CLL), acute myelogenous leukemia (AML), chronic myelogenous leukemia (CML), and hairy cell leukemia (HCL). The disease or condition may comprise a lymphoma. The lymphoma may be a non-Hodgkin's lymphoma (e.g., B cell lymphoma, diffuse large B-cell lymphoma, T cell lymphoma, Waldenstrom macroglobulinemia) or a Hodgkin's lymphoma. The disease or condition may comprise a cancer. The cancer may be breast cancer. The cancer may be lung cancer. The cancer may be esophageal cancer. The cancer may be pancreatic cancer. The cancer may be ovarian cancer. The cancer may be uterine cancer. The cancer may be cervical cancer. The cancer may be testicular cancer. The cancer may be prostate cancer. The cancer may be bladder cancer. The cancer may be colon cancer. The cancer may be a sarcoma. The cancer may be an adenocarcinoma. The cancer may be isolated, that is it has not spread to other tissues besides the organ or tissue where the cancer originated. The cancer may be metastatic. The cancer may have spread to neighboring tissues. The cancer may have spread to cells, tissues or organs in physical contact with the organ or tissue where the cancer originated. The cancer may have spread to cells, tissues or organs not in physical contact with the organ or tissue where the cancer originated. The cancer may be in an early stage, such as Stage 0 (abnormal cell with the potential to become cancer) or Stage 1 (small and confined to one tissue). The cancer may be intermediate, such as Stage 2 or Stage 3, grown into tissues and lymph nodes in physical contact with the tissue of the original tumor. The cancer may be advanced, such as Stage 4 or Stage 5, wherein the cancer has metastasized to tissues that are distant (e.g., not adjacent or in physical contact) to the tissue of the original tumor. In some instances, the cancer is not advanced. In some instances, the cancer is not metastatic. In some instances, the cancer is metastatic.


Autoimmune Disease

The disease or condition may comprise an autoimmune disorder. Autoimmune and immune disorders include, but are not limited to, type 1 diabetes, rheumatoid arthritis, psoriasis, multiple sclerosis, lupus, inflammatory bowel disease, Addison's Disease, Graves Disease, Crohn's Disease and Celiac disease.


Metabolic Disease

The disease or condition may comprise a metabolic disorder. Metabolic conditions and disease, include, but are not limited to obesity, a thyroid disorder, hypertension, type 1 diabetes, type 2 diabetes, non-alcoholic steatohepatitis, coronary artery disease, and atherosclerosis.


The disease or condition may comprise a cardiovascular condition. Non-limiting examples of cardiovascular conditions are atherosclerosis, myocardial infarction, pericarditis, myocarditis, ischemic stroke, hypertensive heart disease rheumatic heart disease, cardiomyopathy, congenital heart disease, valvular heart disease, carditis, aortic aneurysms, peripheral artery disease, thromboembolic disease, and venous thrombosis.


Neurological Disease

The disease or condition may comprise a neurological disorder. The neurological disorder may comprise a neurodegenerative disease. Non-limiting examples of neurodegenerative and neurological disorders are Alzheimer's disease, Parkinson's disease, Huntington's disease, Spinocerebellar ataxia, amyotrophic lateral sclerosis (ALS), motor neuron disease, chronic pain, and spinal muscular atrophy. Devices, systems, kits and methods disclosed herein may be used to test for, detect, and/or monitor a psychiatric disorder in a subject and/or a response to a drug to treat the psychiatric disorder.


Infectious Disease

The disease or condition may comprise an infection. The disease or condition may be caused by an infection. The disease or condition may be exacerbated by an infection. The infection may be a viral infection. The infection may be a bacterial infection. The infection may be a fungal infection.


Aging

The disease or condition may be associated with aging. Disease and conditions associated with aging include, but are not limited to, cancer, osteoporosis, dementia, macular degeneration, metabolic conditions, and neurodegenerative disorders.


Blood Disease or Disorder

The disease or condition may be a blood disorder. Non-limiting examples of blood disorders are anemia, hemophilia, blood clotting and thrombophilia. For example, detecting thrombophilia may comprise detecting a polymorphism present in a gene selected from Factor V Leiden (FVL), prothrombin gene (PT G20210A), and methylenetetrahydrofolate reductase (MTHFR).


Allergy or Intolerance

The disease or condition may be an allergy or intolerance to a food, liquid or drug. By way of non-limiting example, a subject can be allergic or intolerant to lactose, wheat, soy, dairy, caffeine, alcohol, nuts, shellfish, and eggs. A subject could also be allergic or intolerant to a drug, a supplement or a cosmetic. In some instances, methods comprise analyzing genetic markers that are predictive of skin type or skin health.


In some instances, the condition is associated with an allergy. In some instances, the subject is not diagnosed with a disease or condition, but is experiencing symptoms that indicate a disease or condition is present. In other instances, the subject is already diagnosed with a disease or condition, and the devices, systems, kits and methods disclosed herein are useful for monitoring the disease or condition, or an effect of a drug on the disease or condition.


Chromosomal Abnormalities

Disclosed herein are devices, systems, kits and methods for detecting chromosomal abnormalities. Those of skill in the field may also refer to chromosomal abnormalities as chromosomal aberrations. In some instances, the chromosomal abnormality is a chromosomal duplication. In some instances, the chromosomal abnormality is a chromosomal deletion. In some instances, the chromosomal abnormality is deletion of an arm of a chromosome. In some instances, the chromosomal abnormality is a partial deletion of an arm of a chromosome. In some instances, the chromosomal abnormality comprises at least one copy of a gene. In some instances, the chromosomal abnormality is due to a breakage of a chromosome. In some instances, the chromosomal abnormality is due to a translocation of a portion of a first chromosome to a portion of a second chromosome.


Many known chromosomal abnormalities results in chromosomal disorders. Thus, the devices, systems, kits and methods disclosed herein may be used for detecting chromosomal disorders. By way of non-limiting example, chromosomal disorders include Down's syndrome (trisomy 21), Edward's syndrome (trisomy 18), Patau syndrome (trisomy 13), Cri du chat syndrome (partial deletion of short arm of chromosome 5), Wolf-Hirschhorn syndrome (deletion of short arm of chromosome 4), Jacobsen syndrome (deletion of long arm of chromosome 11), diGeorge's syndrome (small deletion of chromosome 22), Klinefelter's syndrome (presence of additional X chromosome in males), and Turner syndrome (presence of only a single X chromosome in females).


Biological Samples

Disclosed herein are devices, systems, kits and methods for analyzing cell-free nucleic acids in a biological sample. Non-limiting examples of biological samples include samples of whole blood, plasma, serum, saliva, urine, sweat, tears, vaginal fluid, and cervical fluid or biopsy. In some instances, the biological sample comprises whole blood. In some instances, the biological sample is an environmental sample that contains biological matter. For instance, the biological sample may be a food sample or water sample that contains a virus, bacteria or a fragment/particle thereof.


Methods, systems and kits described herein generally detect and quantify cell-free nucleic acids. For this reason, biological samples described herein are generally biological fluids that are substantially acellular or can be modified to be acellular biological fluids. Samples from subjects, by way of non-limiting example, may be blood from which cells are removed, plasma, serum, urine, saliva, or vaginal fluid. For instance, the cell-free nucleic acid may be circulating in the bloodstream of the subject, and therefore the detection reagent may be used to detect or quantify the marker in a blood or serum sample from the subject. The terms “plasma” and “serum” are used interchangeably herein, unless otherwise noted. However, in some cases they are included in a single list of sample species to indicate that both are covered by the description or claim. In some instances, the biological fluid does not comprise amniotic fluid.


In some instances, devices, systems, kits and methods disclosed herein are capable of removing cells from a biological sample. The resulting sample may be referred to as a cell-depleted sample. The cell-depleted sample may have at least 95% fewer whole, intact cells than the biological sample. The cell-depleted sample may have at least 90% fewer whole, intact cells than the biological sample. The cell-depleted sample may have at least 80% fewer whole, intact cells than the biological sample. The cell-depleted sample may have at least about 75%, at least about 70%, at least about 60%, at least about 50%, at least about 40%, or at least about 25% fewer whole, intact cells than the biological sample. The cell-depleted sample may be completely free of any whole, intact cells.


Blood obtained from capillaries (e.g., blood vessels of extremities like fingers, toes) may be referred to herein as “capillary blood.” Blood obtained from veins (e.g., arm, middle of hand) may be referred to herein as “venous blood.” Common veins for venipuncture to obtain venous blood are the median cubital vein, cephalic vein, basilic vein, and dorsal metacarpal veins. In some instances, the biological sample comprises capillary blood. In some instances, the biological sample consists essentially of capillary blood. In some instances, the biological sample consists of capillary blood. In some embodiments, the biological sample does not comprise venous blood. In some instances, the biological sample comprises plasma. In some instances, the biological sample consists essentially of plasma. In some instances, the biological sample consists of plasma. In some instances, the biological sample comprises serum. In some instances, the biological sample consists essentially of serum. In some instances, the biological sample consists of serum. In some instances, the biological sample comprises urine. In some instances, the biological sample consists essentially of urine. In some instances, the biological sample consists of urine. In some instances, the biological sample comprises saliva. In some instances, the biological sample consists essentially of saliva. In some instances, the biological sample consists of saliva. In some instances, the biological fluid comprises vaginal fluid. In some instances, the biological fluid consists essentially of vaginal fluid. In some instances, the biological fluid consists of vaginal fluid. In some instances, the vaginal fluid is obtained by performing a vaginal swab of the pregnant subject. In some instances, the biological fluid comprises interstitial fluid. In some instances, the biological fluid consists essentially of interstitial fluid. In some instances, the biological fluid comprises synovial fluid. In some instances, the biological fluid consists essentially of synovial fluid. In some instances, the biological fluid comprises fluid from a liquid biopsy. In some instances, the biological fluid consists essentially of fluid from a liquid biopsy. An example of a liquid biopsy is obtaining blood from a cancer patient and testing for nucleic acids that have been released into the blood stream from a tumor or cancer cells. Nucleic acids may be released from tumor or cancer cells due to necrosis, apoptosis, autophagy, and cancer therapies that cause death/damage to cancer cells.


In some instances, the biological sample is whole blood. Generally, the devices, systems, kits, and methods disclosed herein are capable of analyzing cell-free nucleic acids from very small samples of whole blood. In some instances, the small sample of whole blood maybe obtained with a finger prick, such as performed with a lancet or pin/needle. In some instances, the small sample of whole blood maybe obtained without a phlebotomy.


In some instances, the devices, systems, kits, and methods disclosed herein require at least about 1 μL of blood to provide a test result with at least about 95% confidence or accuracy. In some instances, the devices, systems, kits, and methods disclosed herein require at least about 10 μL of blood to provide a test result with at least about 95% confidence or accuracy. In some instances, the devices, systems, kits, and methods disclosed herein require at least about 20 μL of blood to provide a test result with at least about 95% confidence or accuracy. In some instances, the devices, systems and kits disclosed herein require at least about 30 μL of blood to provide a test result with at least about 95% confidence or accuracy. In some instances, the devices, systems and kits disclosed herein require at least about 40 μL of blood to provide a test result with at least about 95% confidence or accuracy. In some instances, the devices, systems and kits disclosed herein require at least about 50 μL of blood to provide a test result with at least about 95% confidence or accuracy. In some instances, the devices, systems and kits disclosed herein require at least about 60 μL of blood to provide a test result with at least about 95% confidence or accuracy. In some instances, the devices, systems and kits disclosed herein require at least about 70 μL of blood to provide a test result with at least about 95% confidence or accuracy.


In some instances, the devices, systems and kits disclosed herein require at least about 1 μL of blood to provide a test result with at least about 99% confidence or accuracy. In some instances, the devices, systems and kits disclosed herein require at least about 10 μL of blood to provide a test result with at least about 99% confidence or accuracy. In some instances, the devices, systems and kits disclosed herein require at least about 20 μL of blood to provide a test result with at least about 99% confidence or accuracy. In some instances, the devices, systems and kits disclosed herein require at least about 30 μL of blood to provide a test result with at least about 99% confidence or accuracy. In some instances, the devices, systems and kits disclosed herein require at least about 40 μL of blood to provide a test result with at least about 99% confidence or accuracy. In some instances, the devices, systems and kits disclosed herein require at least about 60 μL of blood to provide a test result with at least about 99% confidence or accuracy. In some instances, the devices, systems and kits disclosed herein require at least about 80 μL of blood to provide a test result with at least about 99% confidence or accuracy. In some instances, the devices, systems and kits disclosed herein require at least about 100 μL of blood to provide a test result with at least about 90% confidence or accuracy.


In some instances, the method comprise obtaining only about 1 μL to about 500 μL of blood to provide a test result with at least about 95% confidence or accuracy. In some instances, the method comprise obtaining only about 10 μL to about 200 μL of blood to provide a test result with at least about 95% confidence or accuracy. In some instances, the method comprise obtaining only about 15 μL to about 150 μL of blood to provide a test result with at least about 95% confidence or accuracy. In some instances, the method comprise obtaining only about 20 μL to about 100 μL of blood to provide a test result with at least about 95% confidence or accuracy. In some instances, the devices, systems and kits disclosed herein require only about 20 μL to about 100 μL of blood to provide a test result with at least about 98% confidence or accuracy. In some instances, the devices, systems and kits disclosed herein require only about 20 μL to about 100 μL of blood to provide a test result with at least about 99% confidence or accuracy. In some instances, the devices, systems and kits disclosed herein require only about 20 μL to about 100 μL of blood to provide a test result with about 99.5% confidence or accuracy. In some instances, the devices, systems and kits disclosed herein require only about 20 μL to about 100 of blood to provide a test result with about 99.9% confidence or accuracy.


In some instances, the biological sample is plasma or serum. Plasma or serum makes up roughly 55% of whole blood. In some instances, the devices, systems, kits, and methods disclosed herein require at least about 1 μL of plasma or serum to provide a test result with at least about 95% confidence or accuracy. In some instances, the devices, systems, kits, and methods disclosed herein require at least about 10 μL of plasma or serum to provide a test result with at least about 95% confidence or accuracy. In some instances, the devices, systems and kits disclosed herein require at least about 20 μL of plasma or serum to provide a test result with at least about 95% confidence or accuracy. In some instances, the devices, systems and kits disclosed herein require at least about 30 μL of plasma or serum to provide a test result with at least about 95% confidence or accuracy. In some instances, the devices, systems and kits disclosed herein require at least about 40 μL of plasma or serum to provide a test result with at least about 95% confidence or accuracy. In some instances, the devices, systems and kits disclosed herein require at least about 50 μL of plasma or serum to provide a test result with at least about 95% confidence or accuracy. In some instances, the devices, systems and kits disclosed herein require at least about 10 μL of plasma or serum to provide a test result with at least about 99% confidence or accuracy. In some instances, the devices, systems and kits disclosed herein require at least about 20 μL of plasma or serum to provide a test result with at least about 99% confidence or accuracy. In some instances, the devices, systems and kits disclosed herein require at least about 30 μL of plasma or serum to provide a test result with at least about 99% confidence or accuracy. In some instances, the devices, systems and kits disclosed herein require at least about 40 μL of plasma or serum to provide a test result with at least about 99% confidence or accuracy. In some instances, the devices, systems and kits disclosed herein require at least about 50 μL of plasma or serum to provide a test result with at least about 99% confidence or accuracy. In some instances, the devices, systems and kits disclosed herein require only about 10 μL to about 50 μL of plasma or serum to provide a test result with at least about 95% confidence or accuracy. In some instances, the devices, systems and kits disclosed herein require only about 20 μL to about 60 μL of plasma or serum to provide a test result with at least about 95% confidence or accuracy. In some instances, the devices, systems and kits disclosed herein require only about 10 μL to about 50 μL of plasma or serum to provide a test result with at least about 99% confidence or accuracy.


In some instances, the biological sample is saliva. In some instances, the devices, systems, kits, and methods disclosed herein require at least about 100 μL of saliva to provide a test result with at least about 95% confidence or accuracy. In some instances, the devices, systems, kits, and methods disclosed herein require at least about 200 μL of saliva to provide a test result with at least about 95% confidence or accuracy. In some instances, the devices, systems, kits, and methods disclosed herein require at least about 500 μL of saliva to provide a test result with at least about 95% confidence or accuracy. In some instances, the devices, systems, kits, and methods disclosed herein require at least about 1 ml of saliva to provide a test result with at least about 95% confidence or accuracy. In some instances, the devices, systems, kits, and methods disclosed herein require at least about 2 ml of saliva to provide a test result with at least about 95% confidence or accuracy. In some instances, the devices, systems, kits, and methods disclosed herein require at least about 3 ml of saliva to provide a test result with at least about 95% confidence or accuracy.


In some instances, the biological sample is vaginal fluid. In some instances, the devices, systems, kits, and methods disclosed herein require at least about 50 μL of vaginal fluid to provide a test result with at least about 95% confidence or accuracy. In some instances, the devices, systems, kits, and methods disclosed herein require at least about 100 μL of vaginal fluid to provide a test result with at least about 95% confidence or accuracy. In some instances, the devices, systems, kits, and methods disclosed herein require at least about 200 μL of vaginal fluid to provide a test result with at least about 95% confidence or accuracy. In some instances, the devices, systems, kits, and methods disclosed herein require at least about 500 μL of vaginal fluid to provide a test result with at least about 95% confidence or accuracy. In some instances, the devices, systems, kits, and methods disclosed herein require at least about 1 ml of vaginal fluid to provide a test result with at least about 95% confidence or accuracy. In some instances, the devices, systems, kits, and methods disclosed herein require at least about 2 ml of vaginal fluid to provide a test result with at least about 95% confidence or accuracy. In some instances, the devices, systems, kits, and methods disclosed herein require at least about 3 ml of vaginal fluid to provide a test result with at least about 95% confidence or accuracy.


In some instances, biological samples disclosed herein comprise cell-free nucleic acids wherein a fraction of the cell-free nucleic acids are from a foreign tissue or an abnormal tissue. The cell-free nucleic acids in the fraction may be referred to as “foreign cell-free nucleic acids” or “foreign cell-free nucleic acids.” By way of non-limiting example, the foreign or abnormal tissue may comprise a tissue or organ that has been transplanted into the subject. The foreign or abnormal tissue may be referred to as donor tissue and the subject may be referred to as host tissue. Also by way of non-limiting example, an abnormal tissue may comprise a tumor. In some instances, the fraction is a fraction of all (total) cell-free nucleic acids in the biological sample, wherein the fraction comprises the foreign or abnormal cell-free nucleic acids. In some instances, the fraction consists essentially of the foreign or abnormal cell-free nucleic acids. In some instances, the foreign or abnormal cell-free nucleic acids comprise DNA. In some instances, the foreign or abnormal cell-free nucleic acids comprise RNA. In some instances, the foreign or abnormal cell-free nucleic acids consist essentially of DNA. In some instances, the foreign or abnormal cell-free nucleic acids consist essentially of RNA.


The fraction of cell-free nucleic acids that are from a foreign or abnormal tissue may be characterized as a percentage of the total cell-free nucleic acids in a sample. In some instances, the fraction of the cell-free nucleic acids that are from a foreign or abnormal tissue is less than 25%. In some instances, the fraction of the cell-free nucleic acids that are from a foreign or abnormal tissue is less than 20%. In some instances, the fraction is less than 15%. In some instances, the fraction is less than 10%. In some instances, the fraction is less than 8%. In some instances the fraction is less than 6%. In some instances, the fraction is less than 5%. In some instances, the fraction is less than 4%. In some instances, the fraction is less than 2%. In some instances, the fraction is at least 1%. In some instances, the fraction is about 1.5% to about 15%. In some instances, the fraction is about 2% to about 12%. In some instances, the fraction is about 4% to about 10%. In some instances, the fraction is about 4% to about 9%. In some instances, the fetal fraction is about 4% to about 8%. In some instances, the fetal fraction is about 1% to about 5%. In some instances, the fetal fraction is about 1% to about 4%.


In some instances, biological samples disclosed herein comprise cell-free nucleic acids wherein a fraction of the cell-free nucleic acids are from a fetus. This fraction may be referred to as a fetal fraction. In some instances, the fetal fraction is a fraction of all (total) nucleic acids in the biological sample, wherein the fraction consists of fetal nucleic acids. In some instances, the nucleic acids and/or fetal nucleic acids comprise DNA. In some instances, the nucleic acids and/or fetal nucleic acids comprise RNA. In some instances, the nucleic acids and/or fetal nucleic acids consist essentially of DNA. In some instances, the nucleic acids and/or fetal nucleic acids comprise DNA and RNA. In some instances, the fetal fraction is about 1.5% to about 15% of the total cell-free nucleic acids in the biological sample. In some instances, the fetal fraction is about 2% to about 12% of the total cell-free nucleic acids in the biological sample. In some instances, the fetal fraction is about 4% to about 10% of the total cell-free nucleic acids in the biological sample. In some instances, the fetal fraction is about 4% to about 9% of the total cell-free nucleic acids in the biological sample. In some instances, the fetal fraction is about 4% to about 8% of the total cell-free nucleic acids in the biological sample. In some instances, the fetal fraction is about 1% to about 5% of the total cell-free nucleic acids in the biological sample. In some instances, the fetal fraction is about 1% to about 4% of the total cell-free nucleic acids in the biological sample. In some instances, at least a portion of fetal nucleic acids are from the fetus. In some instances, at least a portion of the fetal nucleic acids are from the placenta. In some instances, at least a portion of fetal nucleic acids are from the fetus and at least a portion of the fetal nucleic acids are from the placenta. In some instances, the fetal nucleic acids are only from the fetus. In some instances, the fetal nucleic acids are only from the placenta. In some instances, the fetal nucleic acids are all nucleic acids from the fetus and the placenta. In some instances, the fetal nucleic acids are not from a maternal tissue or maternal fluid. In some instances, the maternal tissue is a maternal tissue other than the placenta. In some instances, the maternal fluid is a maternal fluid other than the amniotic fluid.


In some instances, methods disclosed herein comprise modifying the biological fluid to make the biological sample compatible with amplifying or sequencing. In some instances, methods disclosed herein may comprise adding a buffer, salt, protein, or nucleic acid to the biological sample. By way of non-limiting example, EDTA may be added to a blood sample to prevent coagulation. For simplicity, such a modified biological sample is still referred to as the ‘biological sample.’


Cell-Free Nucleic Acids

In some instances, the compositions and methods of the instant disclosure are useful for evaluating a cell-free nucleic acid in a biological sample. The cell-free nucleic acid could be from an animal. The cell-free nucleic acid could be from a mammal. The cell-free nucleic acid could be from a human subject. The cell-free nucleic acid could be from a plant. The cell-free nucleic acid could be from a pathogen. The cell-free nucleic acid could be from a pathogen that is present in the biological sample, wherein the biological sample is from an animal. The cell-free nucleic acid could be from a pathogen that is present in the biological sample, wherein the biological sample is from a human subject. The pathogen may comprise bacteria or a component thereof. The pathogen may be a virus or a component thereof. The pathogen may be a fungus or a component thereof.


In some instances, the cell-free nucleic acid is DNA (cf-DNA). In some instances, the cell-free nucleic acid is genomic DNA. In some instances, the cell-free nucleic acid is RNA (cf-RNA). In some instances, the cell-free nucleic acid is a nucleic acid from a cell of a fetus, referred to herein as a cell-free fetal nucleic acid. In some instances, the cell-free fetal nucleic acid is cell-free fetal DNA (cff-DNA) or cell-free fetal RNA (cff-RNA). In some instances, the cf-DNA or cff-DNA is genomic DNA. In some instances, the cell-free nucleic acid is in the form of complementary DNA (cDNA), generated by reverse transcription of a cf-RNA or cff-RNA. In some instances, the cf-DNA comprises mitochondrial DNA. In some instances, the cf-RNA or cff-RNA is a messenger RNA (mRNA), a microRNA (miRNA), mitochondrial RNA, or a natural antisense RNA (NAS-RNA). In some instances, the cell-free nucleic acid sequence comprises an RNA molecule or a fragmented RNA molecule (RNA fragments) selected from: small interfering RNA (siRNA), a microRNA (miRNA), a pre-miRNA, a pri-miRNA, a mRNA, a pre-mRNA, a viral RNA, a viroid RNA, a virusoid RNA, circular RNA (circRNA), a ribosomal RNA (rRNA), a transfer RNA (tRNA), a pre-tRNA, a long non-coding RNA (lncRNA), a small nuclear RNA (snRNA), a circulating RNA, a cell-free RNA, an exosomal RNA, a vector-expressed RNA, an RNA transcript, and combinations thereof. In some instances, the cell-free nucleic acid is a mixture of maternal and fetal nucleic acid. A cell-free fetal nucleic acid that circulates in the maternal bloodstream can be referred to as a “circulating cell-free nucleic acid” or a “circulatory extracellular DNA.” In some instances, the cell-free nucleic acid comprises epigenetic modifications. In some instances, the cell-free nucleic acid comprises a pattern of epigenetic modifications that corresponds to gender or other genetic information of interest. In some instances, the cell-free nucleic acid comprises methylated cytosines. In some instances, the cell-free nucleic acid comprises a cytosine methylation pattern that corresponds to gender or other genetic information of interest.


In some instances, methods, devices, systems and kits disclosed herein are configured to detect or quantify cellular nucleic acids, such as nucleic acids from disrupted cells or lysed cells. In some instances, cellular nucleic acids are from cells that are intentionally disrupted or lysed. In some instances, cellular nucleic acids are from cells that are unintentionally disrupted or lysed. Methods, devices, systems and kits disclosed herein may be configured to analyze intentionally disrupted or lysed cells, but not unintentionally disrupted or lysed cells. In some instances, less than about 0.1% of the total nucleic acids in the biological sample are cellular nucleic acids. In some instances, less than about 1% of the total nucleic acids in the biological sample are cellular nucleic acids. In some instances, less than about 5% of the total nucleic acids in the biological sample are cellular nucleic acids. In some instances, less than about 10% of the total nucleic acids in the biological sample are cellular nucleic acids. In some instances, less than about 20% of the total nucleic acids in the biological sample are cellular nucleic acids. In some instances, less than about 30% of the total nucleic acids in the biological sample are cellular nucleic acids. In some instances, less than about 40% of the total nucleic acids in the biological sample are cellular nucleic acids. In some instances, less than about 50% of the total nucleic acids in the biological sample are cellular nucleic acids. In some instances, less than about 60% of the total nucleic acids in the biological sample are cellular nucleic acids. In some instances, less than about 70% of the total nucleic acids in the biological sample are cellular nucleic acids. In some instances, less than about 80% of the total nucleic acids in the biological sample are cellular nucleic acids. In some instances, less than about 90% of the total nucleic acids in the biological sample are cellular nucleic acids. In some instances, devices, systems, kits and methods comprise an experimental control or use thereof. In some instances, the experimental control comprises a nucleic acid, a protein, a peptide, an antibody, an antigen binding antibody fragment, a binding moiety. In some instances, the experimental control comprises a signal for detecting the experimental control. Non-limiting examples of signals are fluorescent molecules, dye molecules, nanoparticles, and colorimetric indicators. In some instances, the experimental control comprises a cell-free nucleic acid. In some instances, the cell-free nucleic acid comprises a cell-free fetal nucleic acid. In some instances, the cell-free nucleic acid comprises a maternal cell-free nucleic acid. In some instances, the cell-free nucleic acid comprises a maternal cell-free nucleic acid (e.g., to assess the amount of cellular disruption/lysis that occurs during sample processing). In some instances, the cell-free nucleic acid comprises a sequence corresponding to an autosome. In some instances, the cell-free nucleic acid comprises a sequence corresponding to sex chromosome. In some instances, the cell-free nucleic acid comprises a sequence corresponding to a chromosome that is possibly aneuploidy (e.g., chromosome 13, 16, 18, 21, 22, X, Y). In some instances, the cell-free nucleic acid comprises a sequence corresponding to a chromosome that is very unlikely to be aneuploidy (e.g., chromosome 1-12, 14, 15, 17, 19, or 20).


In some instances, the biological sample comprises a maternal body fluid sample. In some instances, the maternal bodily fluid sample comprises blood, e.g., whole blood, a peripheral blood sample, or a blood fraction (plasma, serum). In some instances, the maternal body fluid sample comprises sweat, tears, sputum, urine, ear flow, lymph, saliva, cerebrospinal fluid, bone marrow suspension, vaginal fluid, transcervical lavage, brain fluid, ascites, or milk. In some instances, the maternal body fluid sample comprises secretions of the respiratory, intestinal and genitourinary tracts, amniotic fluid, or a leukophoresis sample. In some instances, the biological fluid sample is a maternal body fluid sample that is can be obtained easily by non-invasive procedures, e.g., blood, plasma, serum, sweat, tears, sputum, urine, ear flow, or saliva. In some instances, the sample is a combination of at least two body fluid samples. In some instances, the cell-free fetal nucleic acid originates from the maternal placenta, e.g., from apoptosed placental cells. In some instances, the biological sample is placental blood.


In some instances, a nucleic acid evaluated or analyzed by devices, systems, kits, and methods disclosed herein has a preferable length. In some instances, the nucleic acid is a cell-free fetal DNA fragment. In some instances, the cell-free fetal DNA fragment is from a Y chromosome. In some instances, the nucleic acid is about 15 bp to about 500 bp in length. In some instances, the nucleic acid is about 50 bp in length to about 200 bp in length. In some instances, the nucleic acid is at least about 15 bp in length. In some instances, the nucleic acid is at most about 500 bp in length. In instances, the nucleic acid is about 15 bp in length to about 50 bp in length, about 15 bp in length to about 75 bp in length, about 15 bp in length to about 100 bp in length, about 15 bp in length to about 150 bp in length, about 15 bp in length to about 200 bp in length, about 15 bp in length to about 250 bp in length, about 15 bp in length to about 300 bp in length, about 15 bp in length to about 350 bp in length, about 15 bp in length to about 400 bp in length, about 15 bp in length to about 450 bp in length, about 15 bp in length to about 500 bp in length, about 50 bp in length to about 75 bp in length, about 50 bp in length to about 100 bp in length, about 50 bp in length to about 150 bp in length, about 50 bp in length to about 200 bp in length, about 50 bp in length to about 250 bp in length, about 50 bp in length to about 300 bp in length, about 50 bp in length to about 350 bp in length, about 50 bp in length to about 400 bp in length, about 50 bp in length to about 450 bp in length, about 50 bp in length to about 500 bp in length, about 75 bp in length to about 100 bp in length, about 75 bp in length to about 150 bp in length, about 75 bp in length to about 200 bp in length, about 75 bp in length to about 250 bp in length, about 75 bp in length to about 300 bp in length, about 75 bp in length to about 350 bp in length, about 75 bp in length to about 400 bp in length, about 75 bp in length to about 450 bp in length, about 75 bp in length to about 500 bp in length, about 100 bp in length to about 150 bp in length, about 100 bp in length to about 200 bp in length, about 100 bp in length to about 250 bp in length, about 100 bp in length to about 300 bp in length, about 100 bp in length to about 350 bp in length, about 100 bp in length to about 400 bp in length, about 100 bp in length to about 450 bp in length, about 100 bp in length to about 500 bp in length, about 150 bp in length to about 200 bp in length, about 150 bp in length to about 250 bp in length, about 150 bp in length to about 300 bp in length, about 150 bp in length to about 350 bp in length, about 150 bp in length to about 400 bp in length, about 150 bp in length to about 450 bp in length, about 150 bp in length to about 500 bp in length, about 200 bp in length to about 250 bp in length, about 200 bp in length to about 300 bp in length, about 200 bp in length to about 350 bp in length, about 200 bp in length to about 400 bp in length, about 200 bp in length to about 450 bp in length, about 200 bp in length to about 500 bp in length, about 250 bp in length to about 300 bp in length, about 250 bp in length to about 350 bp in length, about 250 bp in length to about 400 bp in length, about 250 bp in length to about 450 bp in length, about 250 bp in length to about 500 bp in length, about 300 bp in length to about 350 bp in length, about 300 bp in length to about 400 bp in length, about 300 bp in length to about 450 bp in length, about 300 bp in length to about 500 bp in length, about 350 bp in length to about 400 bp in length, about 350 bp in length to about 450 bp in length, about 350 bp in length to about 500 bp in length, about 400 bp in length to about 450 bp in length, about 400 bp in length to about 500 bp in length, or about 450 bp in length to about 500 bp in length. In some instances, the nucleic acid is about 15 bp in length, about 50 bp in length, about 75 bp in length, about 100 bp in length, about 150 bp in length, about 200 bp in length, about 250 bp in length, about 300 bp in length, about 350 bp in length, about 400 bp in length, about 450 bp in length, or about 500 bp in length.


The sizes of the cell-free nucleic acids evaluated using the device, systems, kits and methods of the present disclosure can vary depending upon, e.g., the particular body fluid sample used. For example, cell-free DNA sequences have been observed to be shorter than maternal cell-free DNA sequences, and both cell-free DNA and maternal cell-free DNA to be shorter in urine than in plasma samples.


In some instances, the cell-free DNA sequences evaluated in urine range from about 20 bp to about 300 bp in length. In some instances, the cell-free DNA sequences evaluated in a urine sample are about 15 bp in length to about 300 bp in length. In some instances, the cell-free DNA sequences evaluated in a urine sample are at least about 15 bp in length. In some instances, the cell-free DNA sequences evaluated in a urine sample are at most about 300 bp in length. In some instances, the cell-free DNA sequences evaluated in a urine sample are about 15 bp in length to about 20 bp in length, about 15 bp in length to about 30 bp in length, about 15 bp in length to about 60 bp in length, about 15 bp in length to about 90 bp in length, about 15 bp in length to about 120 bp in length, about 15 bp in length to about 150 bp in length, about 15 bp in length to about 180 bp in length, about 15 bp in length to about 210 bp in length, about 15 bp in length to about 240 bp in length, about 15 bp in length to about 270 bp in length, about 15 bp in length to about 300 bp in length, about 20 bp in length to about 30 bp in length, about 20 bp in length to about 60 bp in length, about 20 bp in length to about 90 bp in length, about 20 bp in length to about 120 bp in length, about 20 bp in length to about 150 bp in length, about 20 bp in length to about 180 bp in length, about 20 bp in length to about 210 bp in length, about 20 bp in length to about 240 bp in length, about 20 bp in length to about 270 bp in length, about 20 bp in length to about 300 bp in length, about 30 bp in length to about 60 bp in length, about 30 bp in length to about 90 bp in length, about 30 bp in length to about 120 bp in length, about 30 bp in length to about 150 bp in length, about 30 bp in length to about 180 bp in length, about 30 bp in length to about 210 bp in length, about 30 bp in length to about 240 bp in length, about 30 bp in length to about 270 bp in length, about 30 bp in length to about 300 bp in length, about 60 bp in length to about 90 bp in length, about 60 bp in length to about 120 bp in length, about 60 bp in length to about 150 bp in length, about 60 bp in length to about 180 bp in length, about 60 bp in length to about 210 bp in length, about 60 bp in length to about 240 bp in length, about 60 bp in length to about 270 bp in length, about 60 bp in length to about 300 bp in length, about 90 bp in length to about 120 bp in length, about 90 bp in length to about 150 bp in length, about 90 bp in length to about 180 bp in length, about 90 bp in length to about 210 bp in length, about 90 bp in length to about 240 bp in length, about 90 bp in length to about 270 bp in length, about 90 bp in length to about 300 bp in length, about 120 bp in length to about 150 bp in length, about 120 bp in length to about 180 bp in length, about 120 bp in length to about 210 bp in length, about 120 bp in length to about 240 bp in length, about 120 bp in length to about 270 bp in length, about 120 bp in length to about 300 bp in length, about 150 bp in length to about 180 bp in length, about 150 bp in length to about 210 bp in length, about 150 bp in length to about 240 bp in length, about 150 bp in length to about 270 bp in length, about 150 bp in length to about 300 bp in length, about 180 bp in length to about 210 bp in length, about 180 bp in length to about 240 bp in length, about 180 bp in length to about 270 bp in length, about 180 bp in length to about 300 bp in length, about 210 bp in length to about 240 bp in length, about 210 bp in length to about 270 bp in length, about 210 bp in length to about 300 bp in length, about 240 bp in length to about 270 bp in length, about 240 bp in length to about 300 bp in length, or about 270 bp in length to about 300 bp in length. In some instances, the cell-free DNA sequences evaluated in a urine sample are about 15 bp in length, about 20 bp in length, about 30 bp in length, about 60 bp in length, about 90 bp in length, about 120 bp in length, about 150 bp in length, about 180 bp in length, about 210 bp in length, about 240 bp in length, about 270 bp in length, or about 300 bp in length.


In some instances, the cell-free DNA sequences evaluated in a plasma or serum sample are at least about 20 bp in length. In some instances, the cell-free DNA sequences evaluated in a plasma or serum sample are at least about 40 bp in length. In some instances, the cell-free DNA sequences evaluated in a plasma or serum sample are at least about 80 bp in length. In some instances, the cell-free DNA sequences evaluated in a plasm or serum sample are at most about 500 bp in length. In some instances, the cell-free DNA sequences evaluated in plasma or serum range from about 100 bp to about 500 bp in length. In some instances, the cell-free DNA sequences evaluated in a plasma or serum sample are about 50 bp in length to about 500 bp in length. In some instances, the cell-free DNA sequences evaluated in a plasma or serum sample are about 80 bp in length to about 100 bp in length, about 80 bp in length to about 125 bp in length, about 80 bp in length to about 150 bp in length, about 80 bp in length to about 175 bp in length, about 80 bp in length to about 200 bp in length, about 80 bp in length to about 250 bp in length, about 80 bp in length to about 300 bp in length, about 80 bp in length to about 350 bp in length, about 80 bp in length to about 400 bp in length, about 80 bp in length to about 450 bp in length, about 80 bp in length to about 500 bp in length, about 100 bp in length to about 125 bp in length, about 100 bp in length to about 150 bp in length, about 100 bp in length to about 175 bp in length, about 100 bp in length to about 200 bp in length, about 100 bp in length to about 250 bp in length, about 100 bp in length to about 300 bp in length, about 100 bp in length to about 350 bp in length, about 100 bp in length to about 400 bp in length, about 100 bp in length to about 450 bp in length, about 100 bp in length to about 500 bp in length, about 125 bp in length to about 150 bp in length, about 125 bp in length to about 175 bp in length, about 125 bp in length to about 200 bp in length, about 125 bp in length to about 250 bp in length, about 125 bp in length to about 300 bp in length, about 125 bp in length to about 350 bp in length, about 125 bp in length to about 400 bp in length, about 125 bp in length to about 450 bp in length, about 125 bp in length to about 500 bp in length, about 150 bp in length to about 175 bp in length, about 150 bp in length to about 200 bp in length, about 150 bp in length to about 250 bp in length, about 150 bp in length to about 300 bp in length, about 150 bp in length to about 350 bp in length, about 150 bp in length to about 400 bp in length, about 150 bp in length to about 450 bp in length, about 150 bp in length to about 500 bp in length, about 175 bp in length to about 200 bp in length, about 175 bp in length to about 250 bp in length, about 175 bp in length to about 300 bp in length, about 175 bp in length to about 350 bp in length, about 175 bp in length to about 400 bp in length, about 175 bp in length to about 450 bp in length, about 175 bp in length to about 500 bp in length, about 200 bp in length to about 250 bp in length, about 200 bp in length to about 300 bp in length, about 200 bp in length to about 350 bp in length, about 200 bp in length to about 400 bp in length, about 200 bp in length to about 450 bp in length, about 200 bp in length to about 500 bp in length, about 250 bp in length to about 300 bp in length, about 250 bp in length to about 350 bp in length, about 250 bp in length to about 400 bp in length, about 250 bp in length to about 450 bp in length, about 250 bp in length to about 500 bp in length, about 300 bp in length to about 350 bp in length, about 300 bp in length to about 400 bp in length, about 300 bp in length to about 450 bp in length, about 300 bp in length to about 500 bp in length, about 350 bp in length to about 400 bp in length, about 350 bp in length to about 450 bp in length, about 350 bp in length to about 500 bp in length, about 400 bp in length to about 450 bp in length, about 400 bp in length to about 500 bp in length, or about 450 bp in length to about 500 bp in length. In some instances, the cell-free DNA sequences evaluated in a plasma or serum sample are about 80 bp in length, about 100 bp in length, about 125 bp in length, about 150 bp in length, about 175 bp in length, about 200 bp in length, about 250 bp in length, about 300 bp in length, about 350 bp in length, about 400 bp in length, about 450 bp in length, or about 500 bp in length.


Subjects

Disclosed herein are devices, systems, kits and methods for analyzing a biological component in a sample from a subject. The subject may be human. The subject may be non-human. The subject may be non-mammalian (e.g., bird, reptile, insect). In some instances, the subject is a mammal. In some instances, the mammal is female. In some instances, the subject is a human subject. In some instances, the mammal is a primate (e.g., human, great ape, lesser ape, monkey). In some instances, the mammal is canine (e.g., dog, fox, wolf). In some instances, the mammal is feline (e.g., domestic cat, big cat). In some instances, the mammal is equine (e.g., horse). In some instances, the mammal is bovine (e.g., cow, buffalo, bison). In some instances, the mammal is a sheep. In some instances, the mammal is a goat). In some instances, the mammal is a pig. In some instances, the mammal is a rodent (e.g., mouse, rat, rabbit, guinea pig).


In some instances, a subject described herein is affected by a disease or a condition. Devices, systems, kits and methods disclosed herein may be used to test for the disease or condition, detect the disease or condition, and/or monitor the disease or condition. Devices, systems, kits and methods disclosed herein may be used to test for the presence of inherited traits, monitor fitness, and detect family ties.


Devices, systems, kits and methods disclosed herein may be used to test for, detect, and/or monitor cancer in a subject. Non-limiting examples of cancers include breast cancer, prostate cancer, skin cancer, lung cancer, colorectal cancer/colon cancer, bladder cancer, pancreatic cancer, lymphoma, and leukemia.


Devices, systems, kits and methods disclosed herein may be used to test for, detect, and/or monitor an immune disorder or autoimmune disorder in a subject. Autoimmune and immune disorders include, but are not limited to, type 1 diabetes, rheumatoid arthritis, psoriasis, multiple sclerosis, lupus, inflammatory bowel disease, Addison's Disease, Graves Disease, Crohn's Disease and Celiac disease.


Devices, systems, kits and methods disclosed herein may be used to test for, detect, and/or monitor a disease or condition that is associated with aging of a subject. Disease and conditions associated with aging include, but are not limited to, cancer, osteoporosis, dementia, macular degeneration, metabolic conditions, and neurodegenerative disorders.


Devices, systems, kits and methods disclosed herein may be used to test for, detect, and/or monitor a blood disorder. Non-limiting examples of blood disorders are anemia, hemophilia, blood clotting and thrombophilia. For example, detecting thrombophilia may comprise detecting a polymorphism present in a gene selected from Factor V Leiden (FVL), prothrombin gene (PT G20210A), and methylenetetrahydrofolate reductase (MTHFR).


Devices, systems, kits and methods disclosed herein may be used to test for, detect, and/or monitor a neurological disorder or a neurodegenerative disorder in a subject. Non-limiting examples of neurodegenerative and neurological disorders are Alzheimer's disease, Parkinson's disease, Huntington's disease, Spinocerebellar ataxia, amyotrophic lateral sclerosis (ALS), motor neuron disease, chronic pain, and spinal muscular atrophy. Devices, systems, kits and methods disclosed herein may be used to test for, detect, and/or monitor a psychiatric disorder in a subject and/or a response to a drug to treat the psychiatric disorder.


Devices, systems, kits and methods disclosed herein may be used to test for, detect, and/or monitor a metabolic condition or disease. Metabolic conditions and disease, include, but are not limited to obesity, a thyroid disorder, hypertension, type 1 diabetes, type 2 diabetes, non-alcoholic steatohepatitis, coronary artery disease, and atherosclerosis.


Devices, systems, kits and methods disclosed herein may be used to test for, detect, and/or monitor an allergy or intolerance to a food, liquid or drug. By way of non-limiting example, a subject can be allergic or intolerant to lactose, wheat, soy, dairy, caffeine, alcohol, nuts, shellfish, and eggs. A subject could also be allergic or intolerant to a drug, a supplement or a cosmetic. In some instances, methods comprise analyzing genetic markers that are predictive of skin type or skin health.


In some instances, the condition is associated with an allergy. In some instances, the subject is not diagnosed with a disease or condition, but is experiencing symptoms that indicate a disease or condition is present. In other instances, the subject is already diagnosed with a disease or condition, and the devices, systems, kits and methods disclosed herein are useful for monitoring the disease or condition, or an effect of a drug on the disease or condition.


Disclosed herein are devices, systems, kits and methods for analyzing cell-free nucleic acids from a fetus in a maternal biological sample from a pregnant subject. Generally, the pregnant subject is a human pregnant subject. However, one of skill in the art would understand that the instant disclosure could be applied to other mammals, perhaps for breeding purposes on farms or in zoos. In some instances, the pregnant subject is euploid. In some instances, the pregnant subject comprises an aneuploidy. In some instances, the pregnant subject has a copy variation of a gene or portion thereof. In some instances, the pregnant subject has a genetic insertion mutation. In some instances, the pregnant subject has a genetic deletion mutation. In some instances, the pregnant subject has a genetic missense mutation. In some instances, the pregnant subject has a single nucleotide polymorphism. In some instances, the pregnant subject has a single nucleotide polymorphism. In some instances, the pregnant subject has translocation mutation resulting in a fusion gene. By way of non-limiting example, the BCR-ABL gene is a fusion gene that can be found on chromosome 22 of many leukemia patients. The altered chromosome 22 is referred to as the Philadelphia chromosome.


In some instances, the pregnant subject is about 2 weeks pregnant to about 42 weeks pregnant. In some instances, the pregnant subject is about 3 weeks pregnant to about 42 weeks pregnant. In some instances, the pregnant subject is about 4 weeks pregnant to about 42 weeks pregnant. In some instances, the pregnant subject is about 5 weeks pregnant to about 42 weeks pregnant. In some instances, the pregnant subject is about 6 weeks pregnant to about 42 weeks pregnant. In some instances, the pregnant subject is about 7 weeks pregnant to about 42 weeks pregnant. In some instances, the pregnant subject is about 8 weeks pregnant to about 42 weeks pregnant.


In some instances, the pregnant subject is at fewer than about 6 weeks, about 7 weeks, about 8 weeks, about 9 weeks, about 10 weeks, about 12 weeks, about 16 weeks, about 20 weeks, about 21 weeks, about 22 weeks, about 24 weeks, about 26 weeks, or about 28 weeks of gestation. In some instances, the pregnant subject is as few as 5 weeks pregnant. In some instances, the human subject is a pregnant human female who has reached at least about 5 weeks, at least about 6 weeks, at least about 7 weeks, or at least about 8 weeks of gestation. In some instances, the human subject is a pregnant human female who has reached at least about 5 to about 8 weeks of gestation. In some instances, the human subject is a pregnant human female who has reached at least about 5 to about 8, at least about 5 to about 12, at least about 5 to about 16, at least about 5 to about 20, at least about 6 to about 21, at least about 6 to about 22, at least about 6 to about 24, at least about 6 to about 26, at least about 6 to about 28, at least about 6 to about 9, at least about 6 to about 12, at least about 6 to about 16, at least about 6 to about 20, at least about 6 to about 21, at least about 6 to about 22, at least about 6 to about 24, at least about 6 to about 26, or at least about 6 to about 28 weeks of gestation. In some instances, the human subject is a pregnant human female who has reached at least about 7 to about 8, at least about 7 to about 12, at least about 7 to about 16, at least about 7 to about 20, at least about 7 to about 21, at least about 7 to about 22, at least about 7 to about 24, at least about 7 to about 26, at least about 7 to about 28, at least about 8 to about 9, at least about 8 to about 12, at least about 6 to about 16, at least about 8 to about 20, at least about 8 to about 21, at least about 6 to about 22, at least about 8 to about 24, at least about 8 to about 26, or at least about 8 to about 28 weeks of gestation. In some instances, gestation times are detected by measuring the gestation time from the first day of the last menstrual period.


In some instances, the biological sample is a maternal body fluid sample obtained from a pregnant subject, a subject suspected of being pregnant, or a subject that has given birth recently, e.g., within the past day. In some instances, the subject is a mammal. In some instances, the mammal is female. In some instances, the mammal is a primate (e.g., human, great ape, lesser ape, monkey), canine (e.g., dog, fox, wolf), feline (e.g., domestic cat, big cat), equine (e.g., horse), bovine (e.g., cow, buffalo, bison), ovine (e.g., sheep), caprine (e.g., goat) porcine (e.g., pig), a rhinoceros, or a rodent (e.g., mouse, rat, rabbit, guinea pig). In some instances, the subject is a pregnant human female in her first, second, or third trimester of pregnancy. In some instances, the human subject is a pregnant human female at fewer than about 6, about 7, about 8, about 9, about 10, about 11, about 12, about 13, about 14, about 15, about 16, about 17, about 18, about 19, about 20, about 21, about 22, about 23, about 24, about 25, about 26, about 27, about 28, about 29, about 30, about 31, about 32, about 33, about 34, about 35, about 36, about 37, about 38, about 39, or about 40 weeks gestation.


Paternity Testing

One application for methods, devices, and systems disclosed herein is non-invasive paternity testing. The determination of paternity of a child is important for several reasons, including to establish legal and social (economic) benefits for the child (e.g., social security, inheritance benefits, veteran's benefits), as well as accurately provide medical history to better manage the child's health. In addition, the determination of paternal contributors to offspring of other non-human organisms is important for elucidating the molecular ecology and evolution of certain species.


Current paternity tests involve invasive procedures such as amniocentesis or chorionic villus sampling that each poses various risks to the fetus. Non-invasive methods for determining paternity that have emerged, have the significant drawbacks of requiring several milliliters of peripheral blood from the pregnant subject, as well as a buccal swab, saliva, or several milliliters of peripheral blood from the alleged paternal father. These drawbacks often results in a need for a phlebotomist and very often a physician, which makes the process costly and time-consuming. Moreover, current non-invasive methods of determining paternity lack the desired level of privacy for many pregnant subjects and alleged paternal fathers, which in some cases deter use of such tests.


Disclosed herein, in some embodiments, are methods, systems and devices are for analyzing a biological sample obtained from child or a fetus to determine the paternity of the child or the fetus. In some embodiments, the methods, systems, and devices comprises obtaining a biological sample from a child, wherein the biological sample comprises genetic information of the child (e.g., DNA). In some embodiments, the methods, systems, and devices comprises obtaining a biological sample from a pregnant subject, wherein the biological sample comprises genetic information of the fetus (e.g., cell-free fetal DNA). In some embodiments, a biological sample is obtained from a subject suspected to be the father of the child or fetus.


In contrast to existing paternity tests, the methods, systems and devices disclosed herein minimize the amount of DNA (e.g., paternal DNA, child DNA, and/or fetal cell-free DNA) required for accurate paternity determinations, thereby avoiding the need for large sample volumes. In the case when the sample is blood obtained from the child, pregnant subject, or subject suspected of being the paternal father, a sufficient amount of blood may be obtained with a finger prick. Thus, methods and systems disclosed herein eliminate the need for a venipuncture, thereby providing for paternity tests at point of care with a significant reduction in cost of testing.


Devices, systems, kits and methods disclosed herein may obtain genetic information in the privacy of a home, without the need for laboratory equipment and without the risk of sample swapping. Genetic information can be detected in minutes or seconds with devices, systems, kits and methods disclosed herein. In addition, the devices, systems, kits and methods of determining the paternity of a child or fetus offer the advantage of being (1) minimally invasive, (2) applicable in home with little or no technical training; and (3) generally do not require complex or expensive equipment.


Disclosed herein are methods, devices, systems and kits for determining the paternity of a fetus, comprising: (a) obtaining a biological sample from a subject pregnant with a fetus, wherein the biological sample comprises fetal cell-free nucleic acids; (b) optionally tagging at least a portion of the fetal cell-free nucleic acids to produce a library of optionally tagged fetal cell-free nucleic acids; (c) optionally amplifying the optionally tagged cell-free nucleic acids; (d) sequencing at least a portion of the optionally tagged fetal cell-free nucleic acids; and (e) receiving paternal genotype information from an individual suspected to be a paternal father of the fetus; (f) comparing the paternal genotype information with the fetal cell-free nucleic acids to determine whether there is a match.


In some aspects, the devices, systems, kits and methods comprise (a) analyzing a biological sample obtained from a subject suspected of being a biological father of a child or fetus; (b) determining one or more paternal DNA signatures; (c) obtaining a biological sample from a subject who is a child, wherein the biological sample DNA; (d) optionally, amplifying the DNA; (e) tagging at least a portion of the DNA to produce a library of tagged DNA; (f) optionally, amplifying the tagged DNA; and (g) sequencing at least a portion of the tagged DNA; and (h) detecting the one or more paternal DNA signatures in the at least portion of the tagged DNA.


In some aspects, the devices, systems, kits and methods comprise (a) analyzing a biological sample obtained from a subject suspected of being a biological father of a child or fetus; (b) determining one or more paternal DNA signatures; (c) obtaining a biological sample from a pregnant subject, wherein the biological sample comprises cell-free fetal nucleic acids; (d) optionally, amplifying the cell-free nucleic acids; (e) tagging at least a portion of the cell-free nucleic acids to produce a library of tagged cell-free nucleic acids; (f) optionally, amplifying the tagged cell-free nucleic acids; and (g) sequencing at least a portion of the tagged cell-free nucleic acids; and (h) detecting the one or more paternal DNA signatures in the at least portion of the tagged cell-free nucleic acids.


In some aspects, the devices, systems, kits and methods comprise (a) analyzing a biological sample obtained from a subject suspected of being a biological father of a child or fetus; (b) determining one or more paternal DNA signatures; (c) obtaining a biological sample from a pregnant subject, wherein the biological sample contains up to about 109 cell-free fetal nucleic acid molecules; (d) sequencing at least a portion of the cell-free fetal nucleic acids to produce sequencing reads; (e) measuring sequencing reads corresponding to at least one target nucleic acid; (f) measuring, with greater than 98% accuracy, that the paternal DNA signature is present in the at least one target nucleic acid.


In some aspects, described herein are methods comprising: (a) analyzing a biological sample obtained from a subject suspected of being a biological father of a child or fetus; (b) determining one or more paternal DNA signatures; (c) obtaining a biological sample from a pregnant subject, wherein the biological sample contains up to about 109 cell-free fetal nucleic acid molecules; (d) sequencing at least 2000 of the cell-free fetal nucleic acids to produce sequencing reads; (e) measuring at least 1000 sequencing reads corresponding to at least one target nucleic; and (f) measuring, with greater than 98% accuracy, that there is the paternal DNA signature is present in the at least one target nucleic acid. These numbers of sequencing reads are sufficient even when the fraction of cell-free fetal nucleic acid molecules in the total cell-free nucleic acid molecules of the biological sample is low.


In some aspects, described herein are methods comprising: (a) analyzing a biological sample obtained from a subject suspected of being a biological father of a child or fetus; (b) determining one or more paternal DNA signatures; (c) obtaining a biological sample from a pregnant subject, wherein the biological sample contains up to about 109 cell-free fetal nucleic acid molecules; (d) sequencing at least 8000 of the cell-free fetal nucleic acids to produce sequencing reads; (e) measuring at least 4000 sequencing reads corresponding to at least one target nucleic; and (f) measuring, with greater than 98% accuracy, that there is the paternal DNA signature is present in the at least one target nucleic acid. These numbers of sequencing reads are sufficient even when the fraction of cell-free fetal nucleic acid molecules in the total cell-free nucleic acid molecules of the biological sample is low. In some instances, the methods comprise amplifying the tagged cell-free DNA fragments before sequencing the at least 8000 tagged cell-free nucleic acid molecules.


“DNA signature,” as used herein, refers to the nucleic acid composition of a biological sample that may be used to differentiate the source of the biological sample from another putative source. In some embodiments, the DNA signature comprises one or more genetic mutations or natural variations in the genome of the source of the biological sample. In some embodiments, the genetic mutations or natural variations comprise a single nucleotide variations (SNV) or indel (e.g., deletion or insertion of one or more nucleic acids). In some embodiments, the DNA signature is determined using a computer-implemented method comprising (a) calculating a probability that the subject suspected of being the biological father is the biological father by performing a statistical analysis. In some embodiments, the statistical analysis comprises a maximum likelihood estimation (MLE) or maximum a posteriori technique.


Non-Invasive Prenatal Testing (NIPT)

Assessment of the risk of fetal chromosomal or genetic aberrations is a standard of care in the management of pregnancies in many countries. Traditional methods for determining genetic information from a fetus while the mother is gestating are often inconclusive, have limited sensitivities early in the gestation period, are invasive (e.g., chorionic villus sampling (sampling of placental tissue), and pose a risk to the mother and fetus (e.g., amniocentesis). Further, access to traditional prenatal testing is limited, because they require medical providers with technical training in clinical settings.


Despite recent advances in non-invasive prenatal testing (NIPT), current NIPT methods require venipuncture (e.g., a phlebotomy) to obtain the large amounts (e.g., 16 milliliters) of maternal blood/plasma sufficient to achieve appropriate screening performance. Access to a phlebotomy requires medical personnel to perform, which operates to reduce access and increase costs of current NIPT tests for many mothers.


Disclosed herein, are methods, devices, systems and kits for performing NIPT tests using ultra-low volumes of the initial biological sample. In contrast to existing NIPT, the methods, systems and devices disclosed herein minimize the amount of cell-free fetal DNA required for accurate screening of fetal chromosomal aberrations, thereby avoiding the need for a phlebotomy. In the case when the sample is blood, a sufficient amount of blood may be obtained with a finger prick.


Devices, systems, kits and methods disclosed herein may be advantageously capable of obtaining genetic information at very early stages of gestation. Devices, systems, kits and methods disclosed herein may obtain genetic information of a fetus in the privacy of a home, without the need for laboratory equipment and without the risk of sample swapping. Genetic information can be detected in minutes or seconds with devices, systems, kits and methods disclosed herein.


In some aspects, disclosed herein are methods comprising: obtaining a biological sample from a pregnant subject, wherein the biological sample contains up to at least or about 10 genomic equivalents of cell-free fetal DNA; sequencing at least a portion of the cell-free fetal nucleic acid molecules to produce sequencing reads; measuring at least a portion of sequencing reads corresponding to at least one chromosomal region; and detecting a normal representation, an overrepresentation or an underrepresentation of the at least one chromosomal region. See Example 21. The methods described herein, in some cases, are performed with at least or about 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% accuracy. See Example 21.


In some instances, overrepresentation or an underrepresentation is relative to representation of a chromosome or chromosomal region in at least one control pregnant subject. In some instances, the at least one control pregnant subject is a pregnant euploid subject. In some instances, the at least one control pregnant subject is a pregnant aneuploid subject. In some instances, the at least one control pregnant subject is a pregnant subject with no chromosomal abnormalities. In some instances, the at least one control pregnant subject is a pregnant subject with at least one chromosomal abnormality. In some instances, the control pregnant subject has a euploid fetus. In some instances, the control pregnant subject has an aneuploid fetus. In some instances, the control pregnant subject has a fetus with no genetic abnormalities. In some instances, the control pregnant subject has a fetus with at least one genetic abnormality. In some instances, the at least one control pregnant subject comprises a plurality of pregnant subjects having the same presence or lack of chromosomal abnormalities.


In some instances, methods, devices, systems and kits disclosed herein utilize additional controls. In some instances, the control is a representation of a chromosome that is expected if a fetus is euploid. In some instances, the control is a representation of a chromosome that is expected if a fetus is aneuploid. In some instances, the control is a quantity of a chromosome that is expected if a fetus is euploid. In some instances, the control is a quantity of a chromosome that is expected if a fetus is aneuploid. In some instances, the control is a quantity of sequencing reads corresponding to a chromosome that is expected if a fetus is euploid. In some instances, the control is a quantity of sequencing reads corresponding to a chromosome that is expected if a fetus is aneuploid. In some instances, methods comprise analyzing and detecting genetic information in a control sample. In some instances, methods comprise analyzing and detecting genetic information in a control sample, and instead use a predetermined control reference value obtained from control reference data. This would be particularly useful when the pregnant subject performs analysis of her sample at home and does not have access to a control sample. However, often, the pregnant subject could also easily obtain a control sample (e.g., from a relative, spouse, friend). Furthermore, systems and methods, as described herein, provide for analyzing multiple samples simultaneously by indexing each sample.


EXAMPLES

The following examples are given for the purpose of illustrating various embodiments of the devices, systems and kits disclosed herein and are not meant to limit the present devices, systems and kits in any fashion. The present examples, along with the methods described herein are presently representative of preferred embodiments, are exemplary, and are not intended as limitations on the scope of the inventive concepts described herein. Changes therein and other uses which are encompassed within the spirit of the devices, systems and kits disclosed herein as defined by the scope of the claims will occur to those skilled in the art.


Example 1: Trisomy Detection in Ultra-Low (˜20 μl) Amounts of Maternal Blood

Trisomy detection relies on the accurate representation of genetic material originating on a chromosome compared to genetic material originating from other chromosomes. This ratio is compared to the distribution of ratios in the euploid population. A trisomy is called when the ratio of ((chr21/chr.all)-MEDIAN(chr21))/MAD(chr21) is statistically sufficiently different from that distribution.


While 10% fetal fraction is the median of a typical population at 9 weeks gestational age and above, not all samples will have fetal fraction levels as high as 10% and some might have even higher levels. A typical cutoff for fetal fraction is 4%. A model that takes the distribution of fetal fraction in a typical population into account and requires the more common cutoff values for specificity (99.9%) and sensitivity (99%) can help to illustrate the input requirements for this method. With around 5 million marker counts (sequence reads), this sensitivity can be accomplished. However, if one analyzes one marker per chromosome, this would require 30,000 cell equivalents, which is not feasible.


Methods and systems disclosed herein are based on the fact that each genome equivalent is essentially divided into 20 million cfDNA fragments through the process of apoptosis (3 billion base pairs per genome divided by 150 base pairs average size of cfDNA). The implication is that if every single molecule of cfDNA can be transferred from blood to sequencer, the equivalent of a quarter of a euploid genome is sufficient for analysis.


However, in reality every step in the process is impaired by various amounts of DNA loss. Therefore much higher amounts are being sampled and moved through the library generation and sequencing process. While DNA loss occurs at every step of the process, the highest loss typically appears at the step of library preparation. Traditional methods show losses of 80% to 90% of material. Often this loss is compensated by a subsequent amplification step (Universal PCR), to bring the concentration of DNA up to the necessary level required for next generation sequencing. While amplification is a good method to increase the overall nucleic acid material available for sequencing, under specific conditions the amplification cannot compensate for a loss of information that occurred during the prior steps. To understand the loss of information a simple thought experiment can help. Assume one starts with 1000 genome equivalents, which represents 20*109 cfDNA fragments. If one assumes an enormous loss and only two fragments are available for amplification. One fragment from the reference region and one from the target region. Two fragments alone are not sufficient to load sequencing equipment, but via amplification (PCR) each fragment can easily be copied billions of times. Now after amplification enough material is available to start the sequencing process but the information in the sample had been reduced to the information held in those two copies. And in this case the information is insufficient for classification of euploid and trisomic samples, because both sample type will show an indistinguishable 50% fraction.


Specifications for a typical next generation sequencer require that 5 μl of a 4 nM solution is diluted in 995 μl NaOH to make a 20 pM solution of which 600 μl are loaded on the sequencer. Consequently, a total of 1.2*1010 DNA fragments is needed, to create 20 million sequencing counts. As demonstrated above, 20 million counts are sufficient for 4 samples and therefore each sample has to contribute ˜3*109DNA fragments. (Because each genome equivalent contributes 20 million DNA fragments a total of 150 genome equivalents would be needed when no loss and no amplification occurs.) This is outlined in FIG. 6.


Typical NIPT protocols start with a high amount of cfDNA (6000 genome equivalents), which allows for a high amount of loss during the library preparation. The material is then amplified and highly diluted to be suitable for sequencing. The problem with typical NIPT protocols is that high amount of loss during library preparation that are subsequently highly diluted lead to an inaccurate representation of the genetic material originating on a chromosome.


For example, a typical sample contains 1500 genome equivalents of cfDNA in ml of blood plasma. A regular blood draw of 8 to 10 ml of blood yields around 4 ml of plasma, resulting in 6000 available genome equivalents of cfDNA. Assuming typical numbers for DNA extraction efficiency (90%) and library preparation efficiency (10%) about 540 genome equivalents moved into amplification (typically 8 to 10 cycles, here for the example 1000 fold amplification). After amplification a total of 540000 genome equivalents or 1.08*1013 DNA fragments are available for sequencing. More than 1000 fold dilution is performed to adjust the amplified library to the required 4 nM. See Table 1.









TABLE 1







Standard 8-10 ml blood draw













cfDNA





4 ml plasma @
Genome
cfDNA




1500 GE/ml
Equivalents
fragments
efficiency
















Blood Draw
6000
1.20E+11




DNA Extraction
5400
1.08E+11
0.9



Library Prep
540
1.08E+10
0.1



Amplification
540000
1.08E+13
1000



Normalization
150
3.00E+09
0.0003



and Multiplexing






Denaturation
90
1.80E+09
0.6



Sequencing
0.25
5.00E+06
0.003










This data might mistakenly imply that because of the vast excess of DNA fragments created in the process, one could simply be scaled down the reactions to accommodate a blood volume of less than 100 μl. However, because of the aforementioned loss in information this is not possible. See Table 2. Performing a simulation at lower limit of fetal fraction (4%) that takes into account the losses during DNA Extraction (efficiency 90%) and library preparation (efficiency 10%) as well as the PCR amplification (˜10 cycles) shows that sensitivity decreases below 25 (inflection point at 10) copies of input DNA material. Sensitivity at 10 copies is reduced to 89% and at 5 copies to 81%, both values would not be acceptable in a market that requires ˜95% theoretical sensitivity for samples at 4% fetal fraction. See FIG. 7.









TABLE 2







Scale down Standard protocol to 20 μl blood draw













cfDNA





4 ml plasma @
Genome
cfDNA




1500 GE/ml
Equivalents
fragments
efficiency
















Blood Draw
10
2.00E+08




DNA Extraction
9
1.80E+08
0.9



Library Prep
1.8
3.60E+07
0.1



Amplification
1800
3.60E+10
1000



Normalization
150
3.00E+09
0.0833



and Multiplexing






Denaturation
90
1.80E+09
0.6



Sequencing
0.25
5.00E+06
0.003










In contrast, the present disclosure provides methods, systems, and devices that increase the library preparation efficiency, preventing a high amount of loss during library preparation and obviating the need for overamplification and high dilutions. Thus, the present disclosure solves the problems associated with typical NIPT protocols, by maintaining an accurate representation of genetic material originating on the chromosome. Increasing library efficiency and decreasing amplification according the present embodiments (e.g., with crowding agents, end-repair), results in more genetic information that is preserved and sensitivities above 95% even at copy numbers (genome equivalents) below 5. See Table 3 and FIG. 7.









TABLE 3







Protocol with increased library efficiency and low amplification











cfDNA




20 ul plasma @
Genome
cfDNA



1500 GEml
Equivalents
fragments
efficiency













Blood Draw
10
2.00E+08



DNA Extraction
9
1.80E+08
0.9


Library Prep
4.5
9.00E+07
0.5


Amplification
450
9.00E+09
100


Normalization
150
3.00E+09
0.33


and Multiplexing





Denaturation
90
1.80E+09
0.6


Sequencing
0.25
5.00E+06
0.003









Example 2. Viability of Low Coverage Whole Genome Sequencing-by-Synthesis Using Ultra-Low Input Amounts of ccfDNA (1-20 Genome Equivalents)

Male whole blood (10 ml) was collected via venous puncture into a Streck cell-free DNA BCT and processed to plasma by double-spin centrifugation as follows:

    • Spin 1-1330 rpm for 20 minutes, no brake
    • Spin 2-3300 rpm for 10 minutes


Plasma was stored at 4° C. or −80° C. until use. Circulating cell-free DNA was extracted from the plasma using the 4 ml protocol for the Qiagen Circulating Nucleic Acid Extraction Kit per manufacturer's protocol with elution in 55 μl of EB. Genome equivalents for each sample were determined using a SRY/RNase P Taqman biplex qPCR assay (Life Technologies) on a Quantstudio 6 real-time instrument. DNA libraries were prepared using the NEBNext Ultra II DNA Library Prep Kit with the NEBNext Multiplex Oligos for Illumina (Index Set Primers 1) (New England Biolabs). Template ccfDNA for library preparation was titrated 1:5 from 96 GEs down to 1 GE per library. Libraries were generated using reduced volumes to account for the stoichiometry of the lower template amounts. The volumes used depended on the input amount of template. Library preparation consisted of:

  • 1. End-repair, 5′-phosphorylation and A-tailing with incubation at 20° C. for 30 minutes followed by 65° C. for 30 minutes.
  • 2. Adaptor ligation with incubation at 20° C. for 15 minutes followed by cleavage of the ligated adaptor loop with incubation at 37° C. for 15 minutes. Adaptors were diluted 1:25 to a 0.6 μM working concentration. The cleaved, adaptor-ligated library was then subjected to bead-based purification using SPRISelect beads. The volume of beads was increased to 116 μl to further enhance binding of highly-fragmented, low concentration ccfDNA following adaptor ligation.
  • 3. Library amplification/indexing with initial denaturation at 98° C. for 1 minute followed by 13 cycles of 98° C. denaturation for 10 seconds and annealing/extension at 65° C. for 75 seconds with final extension at 65° C. for 5 minutes. Amplified library was then purified using SPRISelect beads (45 μl).


All libraries were sized and characterized using Agilent Bioanalyzer 2100 with a High-Sensitivity DNA Chip (Agilent Technologies). Concentrations were determined using Qubit v3.0 (Life Technologies) for library dilutions prior to sequencing. Each library was normalized to a concentration of 2 nM and pooled for denaturation and dilution prior to sequencing. Sequencing-by-synthesis was conducted using an Illumina NextSeq 550 at a loading concentration of 1.5 pM. Seventy-five cycle paired-end sequencing (2×75) was conducted for each index/sample. In general, each sample generated approximately 4 million passed-filter reads in each direction. All sequencing data (fasta.gz files) was aligned against the human reference genome build hg38 using Bowtie with alignment parameters “−k 1-n 0”. For further analysis, the human genome was divided into consecutive 50,000 basepair regions, also called 50 kb bins, and the fraction of the base “G” and “C” was calculated for each bin with an accuracy up to 3 decimals. For each bin the aligned sequence reads that start in a bin were counted. For further analysis the data was reduced by filtering out bins not on chromosomes 1 to 22 (e.g. chromosomes X and Y were excluded). After this filtering, a Loess regression between GC content and read count per bin was performed and the median bin count was calculated. The Loess regression provided an expected bin count for each GC content value, also called the expected value. This expected value was divided by the median bin count to get a correction factor. The measured bin count was then divided by the correction factor resulting in a GC corrected bin count and the median of the GC corrected bin count was calculated. All 50 kb bins were divided by the median GC corrected bin count to yield GC normalized bin counts and for each bin a median and median absolute deviation (MAD) was calculated. Bins with a low MAD and a median around the expected value of 1 were selected (bins with MAD>=0.25 or Median <0.7 or Median >1.3 were filtered out).


Electropherograms of libraries were generated from decreasing amounts of ccfDNA input and showed total library product decreases with input but adaptor dimer amounts do not increase significantly. See FIG. 8A-8C. The y-axis shows relative fluorescence units (intensity) and the x-axis shows time in seconds. The primary peak at 70 seconds is the desired 300 bp library product. From FIG. 8A to FIG. 8B to FIG. 8C, input genome equivalents are titrated 1:5 from 20 GEs. Input down to 1 GE generated sufficient library for viable sequencing-by-synthesis with acceptable sequencing metrics compared to other euploid samples with much higher template input.


Example 3. Detection of Low Fraction Y-Chromosome (2.5% or Greater) Using Low Coverage Whole Genome Sequencing-by-Synthesis with Ultra-Low Input Amounts of ccfDNA (10 Genome Equivalents) Isolated from Capillary Blood

Female or male whole blood was collected by finger-tip capillary bed puncture using a contact-activated lancet (BD Microtainer) and blood collection into a SAFE-T_FILL capillary collection device (KABE Labortechnik, GMBH). Capillary blood was processed to plasma by double-spin centrifugation as follows:

    • Spin 1-1330 rpm for 20 minutes
    • Spin 2-3300 rpm for 10 minutes


Plasma was stored at 4° C. until use. Male plasma was spiked into female plasma at varying percentages ranging from 2.5%-20% by volume. Circulating cell-free DNA was then extracted from the plasma using a modified protocol for 10 ul of plasma with the MagMax Cell-Free DNA Isolation Kit (Life Technologies). Isolation consisted of the following steps:

  • 1. Incubation of plasma with Proteinase K (volume dependent on starting input) at 60° C. for 20 minutes.
  • 2. Lysis/binding of plasma to DynaBeads MyOne Silane paramagnetic beads (2.5-5 ul) with binding for 10 minutes at room temperature.
  • 3. Washing of the bead/ccfDNA complex (volume dependent on starting input).
  • 4. Rinse bead/ccfDNA complex with 80% ethanol (volume dependent on starting input).
  • 5. Elution of ccfDNA from beads (volume dependent on starting input) with incubation at room temperature for 2 minutes.


Genome equivalents for each sample were estimated to be 1 GE/μl of plasma based on previous extractions at volumes ranging from 10 μl-4000 μl and published data. All of the eluted ccfDNA was used as input for library generation. DNA libraires were prepared using the NEBNext Ultra II DNA Library Prep Kit with the NEBNext Multiplex Oligos for Illumina (Index Set Primers 1) (New England Biolabs). Libraries were generated using reduced volumes to account for the stoichiometry of the lower template amounts. The volumes used depended on the input amount of template. Library preparation consisted of:

  • 1. End-repair, 5-phophphorylation and A-tailing with incubation at 20° C. for 30 minutes followed by 65° C. for 30 minutes.
  • 2. Adaptor ligation with incubation at 20° C. for 15 minutes followed by cleavage of the ligated adaptor loop with incubation at 37° C. for 15 minutes. Adaptors were diluted 1:25 to a 0.6 μM working concentration. The cleaved, adaptor-ligated library was then subjected to bead-based purification using SPRISelect beads. The volume of beads was increased to 116 μl to further enhance binding of highly-fragmented, low concentration ccfDNA following adaptor ligation.
  • 3. Library amplification/indexing with initial denaturation at 98° C. for 1 minute followed by 13 cycles of 98° C. denaturation for 10 seconds and annealing/extension at 65° C. for 75 seconds with final extension at 65° C. for 5 minutes. Amplified library was then purified using SPRISelect beads (45 ul).


All libraries were sized and characterized using Agilent Bioanalyzer 2100 with a High-Sensitivity DNA Chip (Agilent Technologies). Concentrations were determined using Qubit v3.0 (Life Technologies) for library dilutions prior to sequencing. Each library was normalized to a concentration of 2 nM and pooled for denaturation and dilution prior to sequencing. Sequencing-by-synthesis was conducted using an Illumina NextSeq 550 at a loading concentration of 1.5 pM. Seventy-five cycle paired-end sequencing (2×75) was conducted for each index/sample. In general, each sample generated approximately 4 million passed-filter. All sequencing data (fasta.gz files) was aligned against the human reference genome build hg38 using Bowtie with alignment parameters “−k 1-n 0”. For further analysis, the human genome was divided into consecutive 50,000 basepair regions, also called 50 kb bins, and the fraction of the base “G” and “C” was calculated for each bin with an accuracy up to 3 decimals. For each bin the aligned sequence reads that start in a bin were calculated. For further analysis the data was reduced by filtering out bins not on chromosomes 1 to 22 (e.g. chromosomes X and Y were excluded). After this filtering, a Loess regression between GC content and read count per bin was performed and the median bin count was calculated. The Loess regression provided an expected bin count for each GC content value, also called the expected value. This expected value was divided by the median bin count to get a correction factor. The measured bin count was then divided by the correction factor resulting in a GC corrected bin count and the median of the GC corrected bin count was calculated. All 50 kb bins were divided by the median GC corrected bin count to yield GC normalized bin counts and for each bin a median and median absolute deviation (MAD) was calculated. Bins with a low MAD and a median around the expected value of 1 were selected (bins with MAD>=0.25 or Median <0.7 or Median >1.3 were filtered out). Specifically for the calculation of Y chromosome representation, LOESS regression was performed for bins originating on chromosome Y. See FIG. 10 and FIG. 11. FIG. 10 shows detection of low fraction Y-chromosome (2.5% or greater) using low coverage Whole Genome Sequencing-by-Synthesis with ultra-low input amounts of cfDNA isolated from capillary blood/plasma mixtures of female and male DNA. With the ultra-low amounts of cfDNA in mixtures of female/male plasma derived from capillary blood collected by a finger prick, we still a corresponding increase of chromosome Y representation with increasing amounts of male capillary blood derived plasma. FIG. 11 shows a cfDNA fragment size distribution comparison between cfDNA from capillary blood and venous blood based on paired end sequencing data. Size profiles of cfDNA from ultra-low amounts of plasma derived from venous blood and capillary blood look similar. A percentage representation of sequence reads originating from chromosome Y was calculated by summing up all GC normalized values for bins originating on chromosome Y and dividing by the sum of all GC normalized values, excluding those originating from chromosome 21 and 19.


Example 4. Detection of Low Fraction Y-Chromosome (2.5% or Greater) Using Low Coverage Whole Genome Sequencing-by-Synthesis with Ultra-Low Input Amounts of ccfDNA (10 Genome Equivalents)

Female or male whole blood (10 ml) was collected by venous puncture into a Streck cell-free DNA BCT and processed to plasma by double-spin centrifugation as follows:

    • Spin 1-1330 rpm for 20 minutes, no brake
    • Spin 2-3300 rpm for 10 minutes


Plasma was stored at 4° C. until use. Male plasma was spiked into female plasma at varying percentages ranging from 2.5%-20% by volume. Circulating cell-free DNA was then extracted from the plasma using a modified protocol for 10 μl or 20 μl of plasma with the MagMax Cell-Free DNA Isolation Kit (Life Technologies). Isolation consisted of the following steps:

    • 1. Incubation of plasma with Proteinase K (volume dependent on starting input) at 60° C. for 20 minutes.
    • 2. Lysis/binding of plasma to DynaBeads MyOne Silane paramagnetic beads (2.5-5 μl) with binding for 10 minutes at room temperature.
    • 3. Washing of the bead/ccfDNA complex (volume dependent on starting input).
    • 4. Rinse bead/ccfDNA complex with 80% ethanol (volume dependent on starting input).
    • 5. Elution of ccfDNA from beads (volume dependent on starting input) with incubation at room temperature for 2 minutes.


Genome equivalents for each sample were estimated to be 1 GE/ul of plasma based on previous extractions at volumes ranging from 10 ul-4000 ul and published data. All of the eluted ccfDNA was used as input for library generation. DNA libraries were prepared using the NEBNext Ultra II DNA Library Prep Kit with the NEBNext Multiplex Oligos for Illumina (Index Set Primers 1) (New England Biolabs). Libraries were generated using reduced volumes to account for the stoichiometry of the lower template amounts. The volumes used depended on the input amount of template. Library preparation consisted of:

    • 1. End-repair, 5-phophphorylation and A-tailing with incubation at 20° C. for 30 minutes followed by 65° C. for 30 minutes.
    • 2. Adaptor ligation with incubation at 20° C. for 15 minutes followed by cleavage of the ligated adaptor loop with incubation at 37° C. for 15 minutes. Adaptors were diluted 1:25 to a 0.6 uM working concentration. The cleaved, adaptor-ligated library was then subjected to bead-based purification using SPRISelect beads. The volume of beads was increased to 116 ul to further enhance binding of highly-fragmented, low concentration ccfDNA following adaptor ligation.
    • 3. Library amplification/indexing with initial denaturation at 98° C. for 1 minute followed by 13 cycles of 98° C. denaturation for 10 seconds and annealing/extension at 65° C. for 75 seconds with final extension at 65° C. for 5 minutes. Amplified library was then purified using SPRISelect beads (45 ul).


All libraries were sized and characterized using Agilent Bioanalyzer 2100 with a High-Sensitivity DNA Chip (Agilent Technologies). Concentrations were determined using Qubit v3.0 (Life Technologies) for library dilutions prior to sequencing. Each library was normalized to a concentration of 2 nM and pooled for denaturation and dilution prior to sequencing. Sequencing-by-synthesis was conducted using an Illumina NextSeq 550 at a loading concentration of 1.5 pM. Seventy-five cycle paired-end sequencing (2×75) was conducted for each index/sample. In general, each sample generated approximately 4 million passed-filter. All sequencing data (fasta.gz files) was aligned against the human reference genome build hg38 using Bowtie with alignment parameters “−k 1−n 0”. For further analysis, the human genome was divided into consecutive 50,000 basepair regions, also called 50 kb bins, and the fraction of the base “G” and “C” was calculated for each bin with an accuracy up to 3 decimals. For each bin aligned sequence reads that start in a bin were counted. For further analysis the data was reduced by filtering out bins not on chromosomes 1 to 22 (e.g. chromosomes X and Y were excluded). After this filtering, a Loess regression between GC content and read count per bin was performed and the median bin count was calculated. The Loess regression provided an expected bin count for each GC content value, also called the expected value. This expected value was divided by the median bin count to get a correction factor. The measured bin count was then divided by the correction factor resulting in a GC corrected bin count and the median of the GC corrected bin count was calculated. All 50 kb bins were divided by the median GC corrected bin count to yield GC normalized bin counts and for each bin a median and median absolute deviation (MAD) was calculated. Bins with a low MAD and a median around the expected value of 1 were selected (bins with MAD>=0.25 or Median <0.7 or Median >1.3 were filtered out).


Specifically for the calculation of Y chromosome representation, a LOESS regression was also performed for bins originating on chromosome Y. A percentage representation of sequence reads originating from chromosome Y was calculated by summing up all GC normalized values for bins originating on chromosome Y and dividing by the sum of all GC normalized values, excluding those originating from chromosome 21 and 19. See FIG. 9. FIG. 9 shows detection of low fraction Y-chromosome (2.5% or greater) using low coverage Whole Genome Sequencing-by-Synthesis with ultra-low amounts of cfDNA (10 genome equivalents) isolated from venous blood. Male plasma was mixed into female plasma at fixed amounts to create female/male plasma mixtures. cfDNA was extracted from the plasma mixtures and sequenced. The representation of chromosome Y was determined to show that with increasing amount of male plasma mixed into female plasma a corresponding increase in chromosome Y representation can still be detected precisely from ultra-low input amounts of cfDNA.


Example 5. Detection of Fetal Chromosomal Aneuploidy Using Low Coverage Whole Genome Sequencing-by-Synthesis with Ultra-Low Input Amounts of ccfDNA (10 Genome Equivalents)

Whole blood (10 ml) was collected via venous puncture into a Streck cell-free DNA BCT and processed to plasma by double-spin centrifugation. Plasma was processed fresh, stored at 4° C. or −80° C. until use. Circulating cell-free DNA was then extracted from the plasma using a modified protocol for 1.2 ml of plasma with paramagnetic beads. Isolation consisted of the following steps:

  • 1. Incubation of plasma with Proteinase K, glycogen and Lysis Buffer and beads at room temperature for 20 minutes for lysis/binding.
  • 2. Washing of the bead/ccfDNA complex.
  • 3. Elution of ccfDNA from beads (42 μl) with incubation at 55° C. temperature for 10 minutes.


Extracted DNA was quantified for upstream applications. All samples were normalized to 33 pg (10 GEs) total input per library. DNA libraries were prepared using the NEBNext Ultra II DNA Library Prep Kit with the NEBNext Multiplex Oligos for Illumina (Index Set Primers 1) (New England Biolabs). Libraries were generated using reduced volumes to account for the stoichiometry of the lower template amounts. The volumes used depended on the input amount of template. Library preparation consisted of:

  • 1. End-repair, 5-phophphorylation and A-tailing with incubation at 20° C. for 30 minutes followed by 65° C. for 30 minutes.
  • 2. Adaptor ligation with incubation at 20° C. for 15 minutes followed by cleavage of the ligated adaptor loop with incubation at 37° C. for 15 minutes. Adaptors were diluted 1:25 to a 0.6 uM working concentration. The cleaved, adaptor-ligated library was then subjected to bead-based purification using SPRISelect beads. The volume of beads was increased to 116 ul to further enhance binding of highly-fragmented, low concentration ccfDNA following adaptor ligation.
  • 3. Library amplification/indexing with initial denaturation at 98° C. for 1 minute followed by 13 cycles of 98° C. denaturation for 10 seconds and annealing/extension at 65° C. for 75 seconds with final extension at 65° C. for 5 minutes. Amplified library was then purified using SPRISelect beads (45 ul).


All libraries were sized and characterized using Agilent Bioanalyzer 2100 with a High-Sensitivity DNA Chip (Agilent Technologies). Concentrations were determined using Qubit v3.0 (Life Technologies) for library dilutions prior to sequencing. Each library was normalized to a concentration of 2 nM and pooled for denaturation and dilution prior to sequencing. Sequencing-by-synthesis was conducted using an Illumina NextSeq 550 at a loading concentration of 1.5 pM. Seventy-five cycle paired-end sequencing (2×75) was conducted for each index/sample. In general, each sample generated approximately 4 million passed-filter. All sequencing data (fasta.gz files) was aligned against the human reference genome build hg38 using Bowtie with alignment parameters “−k 1-n 0”. For further analysis, the human genome was divided into consecutive 50,000 basepair regions, also called 50 kb bins, and the fraction of the base “G” and “C” was calculated for each bin with an accuracy up to 3 decimals. For each bin the aligned sequence reads that start in a bin were counted. For further analysis the data was reduced by filtering out bins not on chromosomes 1 to 22 (e.g. chromosomes X and Y were excluded). After this filtering, a Loess regression between GC contend and read count per bin was performed and the median bin count was calculated. The Loess regression provided an expected bin count for each GC content value, also called the expected value. This expected value was divided by the median bin count to get a correction factor. The measured bin count was then divided by the correction factor resulting in a GC corrected bin count and the median of the GC corrected bin count was calculated. All 50 kb bins were divided by the median GC corrected bin count to yield GC normalized bin counts and for each bin a median and median absolute deviation (MAD) was calculated. Bins with a low MAD and a median around the expected value of 1 were selected (bins with MAD>=0.25 or Median <0.7 or Median >1.3 were filtered out).


From the reduced and normalized data, all sequence bins originating on chromosome 21 were identified. The percentage representation of sequence reads originating from chromosome 21 was calculated by summing up all GC normalized values for bins originating on chromosome 21 and dividing the sum by the sum of all GC normalized values excluding GC normalized values of bins originating from chromosome 21 and 19 (as well as other chromosomes already excluded in earlier steps, e.g. X and Y, chromosomes other than 1-22). The median and MAD of the chromosomes 21 representation were then calculated from a set of known euploid samples (reference samples). For each sample the median chromosome 21 representation was subtracted from the sample specific chromosome 21 representation resulting in a sample specific difference. This sample specific difference was divided by the chromosome 21 representation MAD, providing a value referred to as the Z-score. Test samples were then classify based on their Z-score, where samples with a Z-score of 3 and higher were classified as trisomic and samples with a Z-score of less than 3 were classified as euploid. See FIG. 12. The reference sample set used consisted of 36 sequencing results overall. 20 were obtained from one male individual. Libraries were generated with various amounts of ultra-low input amounts of circulating cell-free DNA (cfDNA): 2 sequencing libraries were generated from 1 Genomes Equivalent (GE) of cfDNA input amount (˜3.5 pg of cfDNA); 2 sequencing libraries were generated from 4 GE of cfDNA (˜14 pg of cfDNA); 4 sequencing libraries were generated from 10 GE of cfDNA (˜35 pg of cfDNA); 2 sequencing libraries at 19 GE of cfDNA; 3 sequencing libraries at 25 GE of cfDNA; 3 sequencing libraries at 50 GE of cfDNA; 1 sequencing library at 96 GE of cfDNA; 2 sequencing libraries at 100 GE of cfDNA; 1 sequencing library at 2000 GE of cfDNA.


Data was also analyzed to establish a reference sample independent method to determine the presence of a fetal trisomy from ultra-low input circulating cell-free DNA from blood of a pregnant woman. All sequencing data (fasta.gz files) was aligned against the human reference genome build hg38 using Bowtie with alignment parameters “−k 1−n 0”. For further analysis, the human genome was divided into consecutive 50,000 basepair regions, also called 50 kb bins, and the fraction of the base “G” and “C” was calculated for each bin with an accuracy up to 3 decimals. For each bin the aligned sequence reads that start in a bin were counted.


For further analysis, data was reduced by filtering out bins not on chromosomes 1 to 22 (e.g. chromosomes X and Y were excluded). After this filtering, a Loess regression between GC contend and read count per bin was performed and the median bin count was calculated. The Loess regression provided an expected bin count for each GC content value, also called the expected value. This expected value was divided by the median bin count to get a correction factor. The measured bin count was then divided by the correction factor resulting in a GC corrected bin count and the median of the GC corrected bin count was calculated.


All 50 kb bins were divided by the median GC corrected bin count to yield GC normalized bin counts and for each bin a median and median absolute deviation (MAD) was calculated. Bins with a low MAD and a median around the expected value of 1 were selected (bins with MAD>=0.25 or Median <0.7 or Median >1.3 were filtered out).


To detect a potential chromosomal aberration (e.g., trisomy), for each test sample all bins that originate from one chromosome were selected and a correction factor was subtracted. Specific correction values used in this analysis were: Chr1 0.018246891, Chr2 0.020434185, Chr3 0.011982353, Chr4 0.001049686, Chr5 0.020581150, Chr6 0.009152075, Chr7 0.005677261, Chr8 0.022754399, Chr9 0.015059119, Chr10 0.021188753, Chr1 1 0.017143964, Chr12 0.007069202 Chr13 0.002157471, Chr14 0.010356892, Chr15 0.019037573, Chr16 0.009929239, Chr17 0.004990359, Chr18 0.023177486, Chr19-0.063998368, Chr20 0.042335516, Chr21 0.00498782, Chr22 0.025008553.


Then the number of bins originating from chromosome 21 in this filtered set (657 in this data set) was counted and then the same amount of bins (657) were selected randomly from all available bins such that they contain bins originating from various different chromosomes. Then a percentage representation was calculated for this set of randomly selected bins. All GC normalized values were summed up for this set of randomly selected bins and divided by the sum of all GC normalized values. The last steps were repeated ten-thousand times and each value was stored. Then a percentage representation of sequence reads originating from chromosome 21 was calculated by summing up all GC normalized values for bins originating from chromosome 21 and this number was divided by the sum of all GC normalized values. It was calculated how many times the percentage representation of the bins originating from chromosome 21 was higher than the chromosome representation from the ten-thousand repeats of randomly selected bins. See FIG. 13. The sum divided by 10,000 is a value between 0 and 1, referred to as “percentile” herein. Samples were classified based on their percentile value: a value of ten-thousand (percentile 1) classifies the sample as a trisomy, a value lower than ten-thousand (percentile below 1) classifies the sample as euploid.


Example 6. In-Home Non-Invasive Prenatal Testing

A pregnant woman with a history of miscarriages suspects she is pregnant again and that she is probably about 6 weeks into gestation. She would like to know as soon as possible if she is actually pregnant and if the fetus has any genetic abnormalities that may put it at risk. She purchases a Non-Invasive Prenatal Testing device disclosed herein and takes it home. With the emotional support of her closest family members and friends present, she initiates the test by pressing her finger against a microneedle array in a well of the device. A nanopore sequencer in the device sequences a sufficient amount of nucleic acids in her blood sample (less than 109 fetal nucleic acids) to reveal desired genetic information in less than about one hour. A USB port or wireless technology relays the sequence information to an app on her phone or a website on her computer. The app or website employs software to obtain genetic information from the sequencing reads, revealing a panel of results for the woman to review. Alternatively, the device itself has software to read the sequences and produce a panel in a window of the device. The panel confirms the woman is pregnant and includes information about whether the fetus has a known chromosomal aberration (e.g., trisomy of chromosome 13, 16, 18, 21, 22, and/or X/Y) or other genetic abnormality. The panel also confirms she is pregnant and that she is expecting a boy.


Example 7: Non-Invasive Prenatal Testing with Microvolumes of Maternal Sample

Performing a simulation at lower limit of fetal fraction (4%) that takes into account the losses of standard methods during DNA Extraction (efficiency 90%) and library preparation (efficiency 10%) as well as the PCR amplification (˜10 cycles) shows that accuracy decreases below 25 (inflection point at 10) copies of input DNA material. Accuracy at 10 copies is reduced to 89% and at 5 copies to 81%, both values would not be acceptable in a market that requires ˜95% theoretical accuracy for samples at 4% fetal fraction. See FIG. 7 light grey line.


When increasing the library efficiency (to 50%, versus 10%) and decreasing the amplification, more information is preserved and sensitivities above 95% can be achieved even at copy numbers (genome equivalents) below 5. See FIG. 7, dark grey line.









TABLE 4







Workflow for obtaining fetal genetic information from 20 μl plasma










20 μl plasma @
cfDNA Genome
Total



1500 genome
Equivalents
cfDNA



equiv. per ml
(fetal + maternal)
fragments
Efficiency













Blood Draw
10
2.00E+08



DNA Extraction
9
1.80E+08
0.9


Library Prep
4.5
9.00E+07
0.5


Amplification
450
9.00E+09
100


Normalization
150
3.00E+09
0.33


and Multiplexing





Denaturation
90
1.80E+09
0.6


Sequencing
0.25
5.00E+06
0.003









Example 8: Analysis of Fetal Chromosomal Abnormality by Whole Genome Sequencing of Cell-Free DNA from Pregnant Women

180 pg of cell-free DNA was obtained from a biological fluid of a pregnant woman, an amount that is equivalent to the amount of cell-free DNA in about 100 μl of blood. The cell-free DNA was purified with a DNA repair kit and contained in a buffered solution to preserve its integrity.


In order to prepare the cell-free DNA for sequencing, ends of the cell-free DNA fragments were repaired with a DNA fragment end repair kit. Next, the repaired ends were ligated to adapters to produce adapter ligated DNA.


The adapter ligated DNA was purified by incubating the adapter ligated DNA with beads that can bind DNA. Using a magnet to trap the beads, the beads with the DNA were washed several times with an ethanol solution, before the adapter ligated DNA was eluted from the beads.


Cycled amplification of the adapter ligated DNA was performed with an initial denaturation step at 98° C. for 30 seconds, followed by 10 cycles of 98° C. for 10 seconds and 65° C. for 75 seconds, followed by a final extension at 65° C. for 5 minutes. Optionally, the adapter ligated DNA can be amplified with the use of an index primer, which can be useful in a case of running multiple samples on the same sequencer run. These were different from unique barcodes/tags introduced prior to library amplification. Similar to the adapter ligated DNA, the amplified DNA was purified with a bead and magnet system. The resulting purified amplified DNA was subjected to sequencing.


Sequencing was performed with a high throughput sequencing machine that generates millions of sequencing reads with read lengths of 30 to 500 base pairs. The indices allowed for obtaining sequencing reads from multiple sample simultaneously. Approximately 4 million reads were obtained per sample per sequencing run.


For each sample the following steps were performed:


Sequence alignment to detect the genomic origin of all sequence reads.


Subsets of the genome were put into non-overlapping bins of 50 kb length. GC content was calculated for each bin based on a reference genome. The number of sequence reads located in each of the bin regions was counted. A linear model for the relationship between GC-content and count of the bins was calculated according to y=ax+b (y: expected counts, a: slope, x: GC content, b: intercept). The count per bin was adjusted based on the linear model to reduce GC bias. For each bin the difference between the median count of all bins was calculated and the expected count value was subtracted from the linear fit. This difference was added to the observed count value for each respective bin. The percentage of sequence reads that originated from a chromosome of interest was calculated. In this example, the chromosome of interest was chromosome 21.


A set of reference samples was used to calculate the percentage of sequence reads that originate from chromosome 21 (referred to as ref p21). The median value (referred to as ref med) was calculated, along with the median absolute deviation (MAD) (referred to as ref mad) for the set of ref.p21 samples.


Similar values were measured in at least one test sample (same protocol as described above). The percentage of sequence reads that originate from chromosome 21, (referred to as test.p21) were calculated. For each sample the Z-score was calculated by calculating a difference between the test sample percentage of sequence reads that originate from chromosome 21 and the median of the reference (test.p21-refmed) and dividing this difference by the median absolute deviation of the reference set ([test.p21-refmed]/mad.ref). See FIG. 3 for results.


If it was found that the Z-score value was above a predetermined cut-off (typically the cutoff is equal to 3), the sample could be interpreted to have an overrepresentation of genomic material originating from chromosome 21. This overrepresentation was indicative for a fetal trisomy 21. Conversely, if the Z-score was below a predetermined cutoff, the sample could be interpreted to have a normal or underrepresentation of genomic material. This analysis could be applied to other chromosomes or chromosomal regions.


Example 9. Detecting Genetic Abnormalities by Sequencing Cell-Free Fetal Nucleic Acids in Maternal Plasma

A blood sample is collected from a pregnant subject. The pregnant subject may be as little as 5 weeks into gestation. In some cases, she is as little as 7 weeks into gestation. In some instances, the pregnant subject collects the blood herself by pricking her finger on a device at home. The pregnant subject sends her sample, either in the device or in a container to a laboratory that has sample processing and sequencing equipment. Alternatively, the device performs sample processing (e.g., purification, target enrichment) and/or sequencing, and thus, the pregnant subject does not need to send her sample to a laboratory. The finger prick obtains about 100 μl of blood, of which about 50 μl of plasma or serum is obtained. The 50 μl of plasma contains about 1.5×10{circumflex over ( )}8 of cell-free fetal nucleic acids, because the percentage of cell-free fetal nucleic acids in the total cell-free nucleic acids of the plasma sample at the time of sampling is on average 10%. In some instances, the fetal fraction is only 4%, and the 100 μl blood sample contains about 6×10{circumflex over ( )}7 of cell-free fetal nucleic acids. Because the percentage of cell-free fetal nucleic acids in the total cell-free nucleic acids of the plasma sample can be as low as 1%, the minimum volume of blood that should be obtained from the subject to ensure reliable information at any stage of pregnancy is about 2 μl.


Example 10. Detecting Genetic Abnormalities by Sequencing Cell-Free Fetal Nucleic Acids in Maternal Urine

A urine sample is collected from a pregnant subject. The pregnant subject may be as little as 5 weeks into gestation. In some cases, she is as little as 7 weeks into gestation. In some instances, the pregnant subject collects the urine herself at home. The pregnant subject sends her sample, either in the device or in a container to a laboratory that has sample processing and sequencing equipment. Alternatively, the pregnant subject puts the urine sample in a home device that performs sample processing (e.g., purification, target enrichment) and/or sequencing, and thus, the pregnant subject does not need to send her sample to a laboratory. In some cases, the urine sample has a volume of about 100 The 100 μl of urine contains about 8×10{circumflex over ( )}10 cell-free fetal nucleic acids, because the percentage of cell-free fetal nucleic acids in the total cell-free nucleic acids of the urine sample at the time of sampling is 4%, and the typical concentration of cell-free nucleic acids in urine is 8×10{circumflex over ( )}11 fragments per ml. In some instances, the fetal fraction is 4%, and the urine sample contains about 3.2×10{circumflex over ( )}9 cell-free fetal nucleic acids. Because the percentage of cell-free fetal nucleic acids in the total cell-free nucleic acids of the urine sample can be as low as 1%, the minimum volume of urine that should be obtained from the subject to ensure reliable information at any stage of pregnancy is about 2 μl.


Example 11. Detecting Genetic Abnormalities by Counting Cell-Free Fetal Nucleic Acids in Maternal Plasma in a Laboratory from a Home-Collected Sample

A blood sample is collected from a pregnant subject. The pregnant subject may be as little as 5 weeks into gestation. In some cases, she is as little as 7 weeks into gestation. In some instances, the pregnant subject collects capillary blood herself, for example, by pricking her finger, on a device at home. In some instances, the device separates the blood into plasma. The pregnant subject sends her blood (or plasma sample) in the device or a container to a laboratory that has reagents and equipment for sample processing, nucleic acid library preparation and sequencing. In some instances, library preparation involves tagging cell-free fetal nucleic acids with a label or signal that is counted or quantified. In some instances, the label or signal is connected to an oligonucleotide that hybridizes to specific cell-free fetal nucleic acids.


The amount of the specific cell-free fetal nucleic acids is translated into a quantity through the signal or label, and is detected by the pregnant subject, the device or a technician performing the analysis. The finger prick obtains about 100 μl of blood, of which about 50 μl of plasma or serum is obtained. The 50 μl of plasma contains about 1.5×10{circumflex over ( )}8 cell-free fetal nucleic acids, because the percentage of cell-free fetal nucleic acids in the total cell-free nucleic acids of the plasma sample at the time of sampling is about 10%. In some instances, the fetal fraction is about 4%, and the 100 μl blood sample contains about 6×10{circumflex over ( )}7 cell-free fetal nucleic acids. Because the percentage of cell-free fetal nucleic acids in the total cell-free nucleic acids of the plasma sample can be as low as 1%, the minimum volume of blood that should be obtained from the subject to ensure reliable information at any stage of pregnancy is about 2 μl.


Results of analysis in the lab are sent to the pregnant subject electronically.


Example 12. Detecting Genetic Abnormalities by Counting Cell-Free Fetal Nucleic Acids in Maternal Plasma in a Laboratory from a Home-Processed Sample

A blood sample is collected from a pregnant subject. The pregnant subject may be as little as 5 weeks into gestation. In some cases, she is as little as 7 weeks into gestation. In some instances, the pregnant subject collects the blood herself by pricking her finger on a device at home. The device performs sample processing (e.g., purification, target enrichment) and library preparation. Thus, the pregnant subject only need send her processed and prepared sample to a sequencing facility or facility capable of sequencing nucleic acids.


The amount of the specific cell-free fetal nucleic acids is translated into a quantity through the signal or label, and is detected by the pregnant subject, the device or a technician performing the analysis. The finger prick obtains about 100 μl of blood, of which about 50 μl of plasma or serum is obtained. The 50 μl of plasma contains about 1.5×10{circumflex over ( )}8 cell-free fetal nucleic acids, because the percentage of cell-free fetal nucleic acids in the total cell-free nucleic acids of the plasma sample at the time of sampling is about 10%. In some instances, the fetal fraction is about 4%, and the 100 μl blood sample contains about 6×10{circumflex over ( )}7 cell-free fetal nucleic acids. Because the percentage of cell-free fetal nucleic acids in the total cell-free nucleic acids of the plasma sample can be as low as 1%, the minimum volume of blood that should be obtained from the subject to ensure reliable information at any stage of pregnancy is about 2 μl.


Results of analysis in the lab are sent to the pregnant subject electronically.


Example 13. Detecting a Fetal Trisomy

Reads from each chromosome are roughly represented according to the length of the chromosome. Most reads are obtained from chromosome 1, while the fewest reads from an autosome will originate from chromosome 21. A common method for detecting a trisomic sample is to measure the percentage of reads originating from a chromosome in a population of euploid samples. Next a mean and a standard deviation for this set of chromosome percentage values are calculated. A cutoff value is determined by adding three standard deviations to the mean. If a new sample has a chromosome percentage value above the cutoff value, an overrepresentation of that chromosome can be assumed, which is often consistent with a trisomy of the chromosome.


For a pregnant subject with a euploid fetus, the average value for the percentage of reads obtained from chromosome 21 is 1.27% with a standard deviation of 0.01 percent. Therefore the cutoff to indicate a trisomy is 1.30%. This theoretical example shows a trisomy sample with a fetal fraction of 10% and a chromosome 21 percentage of 1.34. The sample is above the cutoff and would be correctly classified as a trisomy sample. Exemplary averages of chromosome percentages for all chromosomes in a euploid subject's sample with a euploid fetus, as well as percentages for all chromosomes in a euploid subject's sample with an aneuploid fetus is shown in Table 5.









TABLE 5







Average of chromosome percentages for chromosomes














Cutoff for






chromosome






percentage






to enable






trisomy






detection




Average of

based




chromosome

on mean




percentages
Standard
plus three
Example



for a
deviation of
standard
of trisomy



euploid
chromosome
deviations
21 sample


Chromosome
sample
percentages
method
percentages





 1
8.38
0.02
8.46
8.39


 2
8.51
0.02
8.59
8.48


 3
6.92
0.02
7.01
6.93


 4
6.27
0.03
6.39
6.22


 5
6.18
0.03
6.31
6.18


 6
5.88
0.02
5.96
5.87


 7
5.55
0.01
5.61
5.54


 8
5.13
0.02
5.20
5.13


 9
4.10
0.01
4.15
4.08


10
4.96
0.01
5.00
4.97


11
4.87
0.01
4.91
4.85


12
4.76
0.03
4.86
4.75


13
3.23
0.02
3.32
3.21


14
3.21
0.02
3.28
3.20


15
3.02
0.02
3.09
3.06


16
3.07
0.02
3.15
3.07


17
3.07
0.02
3.17
3.04


18
2.69
0.01
2.72
2.68


19
2.27
0.03
2.40
2.28


20
2.44
0.03
2.55
2.44


21
1.27
0.01
1.30
1.34


22
1.46
0.02
1.54
1.45


X
2.50
0.02
*
2.47


Y
0.25
0.01
*
0.24





* a similar cutoff is not available for sex chromosomes, because this exemplary method is only applicable to autosomes.






Example 14: Device for Analysis of Fetal Cell-Free Nucleic Acids from Maternal Blood

A device for separating plasma from whole blood for the purpose of analyzing cell-free nucleic acids comprises 6 layers. From bottom to top these are:


(1) Lower Adhesive Sheet


(2) Lower Separation Disc: 16 mm diameter disc of adhesive sheet material (polymer material that is inert to DNA or Plasma) with glue on the side facing the Lower Adhesive Sheet


(3) Polyethersulfone (PES) membrane, various sizes, typically between 6 and 16 mm, preferred design features 10 mm PES membrane. The membrane serves as wicking material which attracts the plasma from the filter through capillary force.


(4) Filter Disc (e.g., Pall Vivid™ Membrane), 16 mm diameter, rough side facing up, shiny side facing the PES membrane.


(5) Upper Separation Disc: same material as Lower Separation Disc, size 12 or 14 mm diameter, containing a 4 mm hole in the center. When using adhesive sheet material, now the glue side is facing up to meet the Upper Adhesive Sheet. The Upper Separation Disc is smaller than the Filter Disc in diameter. This allows the glue from the Upper Adhesive Sheet to interact with the edges of the Filter Disc and thereby sealing it at the edges.


(6) Upper Adhesive Sheet, a 6 mm hole is punched in the location where the center of the device will be located.


All layers are lined up at their center and then laminated using a standard office lamination machine.


The device is configured to perform the test described in Example 6.


Application of blood and filtration to the device occurs as follows:


100 μl of whole blood is applied to the center of the device through a hole in an Upper Adhesive Sheet and a hole in an Upper Separation Disc. The blood distributes centripetally throughout a Filter Disc by capillary forces. Plasma is also wicked through the Filter Disc into a PES membrane by capillary forces. After about two minutes, the maximum amount of plasma has been transferred into the PES membrane. The device or portion thereof with the PES membrane is shipped to a laboratory for DNA testing.


The PES membrane containing cell-free nucleic acids is recovered as follows:


The PES membrane is removed from the device. For example, the device is cut out around the edges of the PES membrane. The membrane separates easily from the Filter and the Lower Disc.


DNA is eluted from the membrane as follows:


The PES membrane containing the plasma is transferred into an Eppendorf tube (0.5 ml) and 100 μl of elution buffer are added (elution buffer can be H2O, EB buffer (QGEN), PBS, TE or others suitable for subsequent molecular analysis). After elution of the DNA from the membrane, the buffer, containing the eluted cfDNA, is subjected to genetic analysis which involves nucleic acid amplification, tagging, sequencing, or a combination thereof.


Example 15. Detecting Genomic Alteration Using Non-Invasive Ultra-Low Volume Liquid Biopsy Test to Monitor Cancer Disease Progression

Whole blood is collected by finger-tip capillary bed puncture using a contact-activated lancet (BD Microtainer) and blood collection into a SAFE-T_FILL capillary collection device (KABE Labortechnik, GMBH). Capillary blood is processed to plasma by double-spin centrifugation as follows:

    • 1. Spin 1-1330 rpm for 20 minutes
    • 2. Spin 2-3300 rpm for 10 minutes
    • 3. Plasma is stored at 4° C. until use.


Circulating cell-free DNA is then extracted from the plasma using a modified protocol for 10 ul of plasma with the MagMax Cell-Free DNA Isolation Kit (Life Technologies). Isolation consisted of the following steps:

    • 1. Incubation of plasma with Proteinase K (volume dependent on starting input) at 60° C. for 20 minutes.
    • 2. Lysis/binding of plasma to DynaBeads MyOne Silane paramagnetic beads (2.5-5 ul) with binding for 10 minutes at room temperature.
    • 3. Ishing of the bead/ccfDNA complex (volume dependent on starting input).
    • 4. Rinse bead/ccfDNA complex with 80% ethanol (volume dependent on starting input).
    • 5. Elution of ccfDNA from beads (volume dependent on starting input) with incubation at room temperature for 2 minutes.


Genome equivalents for each sample are estimated to be 1 GE/ul of plasma based on previous extractions at volumes ranging from 10 ul-4000 ul and published data. All of the eluted ccfDNA is used as input for library generation. DNA libraries are prepared using the NEBNext Ultra II DNA Library Prep Kit with the NEBNext Multiplex Oligos for Illumina (Index Set Primers 1) (New England Biolabs). Libraries are generated using reduced volumes to account for the stoichiometry of the lower template amounts. The volumes used depended on the input amount of template.


Library preparation consisted of:

    • 1. End-repair, 5-phophphorylation and A-tailing with incubation at 20° C. for 30 minutes followed by 65° C. for 30 minutes.
    • 2. Adaptor ligation with incubation at 20° C. for 15 minutes followed by cleavage of the ligated adaptor loop with incubation at 37° C. for 15 minutes. Adaptors are diluted 1:25 to a 0.6 uM working concentration. The cleaved, adaptor-ligated library is then subjected to bead-based purification using SPRISelect beads. The volume of beads is increased to 116 ul to further enhance binding of highly-fragmented, low concentration ccfDNA following adaptor ligation.
    • 3. Library amplification/indexing with initial denaturation at 98° C. for 1 minute followed by 13 cycles of 98° C. denaturation for 10 seconds and annealing/extension at 65° C. for 75 seconds with final extension at 65° C. for 5 minutes. Amplified library is then purified using SPRISelect beads (45 ul).


All libraries are sized and characterized using Agilent Bioanalyzer 2100 with a High-Sensitivity DNA Chip (Agilent Technologies). Concentrations are determined using Qubit v3.0 (Life Technologies) for library dilutions prior to sequencing. Each library is normalized to a concentration of 2 nM and pooled for denaturation and dilution prior to sequencing.


Sequencing-by-synthesis is conducted using an Illumina NextSeq 550 at a loading concentration of 1.5 pM. Seventy-five cycle paired-end sequencing (2×75) is conducted for each index/sample. In general, each sample generated approximately 20 million passed-filter.


All sequencing data (fasta.gz files) is aligned against the human reference genome build hg38 using Bowtie with alignment parameters “−k 1-n 0”. For further analysis, the human genome is divided into consecutive 50,000 basepair regions, also called 50 kb bins, and the fraction of the base “G” and “C” is calculated for each bin with an accuracy up to 3 decimals. For each bin we then counted how many of the aligned sequence reads start in a bin. For further analysis we reduced the data by filtering out bins not on chromosomes 1 to 22 (e.g. chromosomes X and Y are excluded). After this filtering, a Loess regression between GC contend and read count per bin is performed and the median bin count is calculated. The Loess regression provided an expected bin count for each GC content value, also called the expected value. This expected value is divided by the median bin count to get a correction factor. The measured bin count is then divided by the correction factor resulting in a GC corrected bin count and the median of the GC corrected bin count is calculated. All 50 kb bins are divided by the median GC corrected bin count to yield GC normalized bin counts and for each bin a median and median absolute deviation (MAD) is calculated. Bins with a low MAD and a median around the expected value of 1 are selected (bins with MAD>=0.25 or Median <0.7 or Median >1.3 are filtered out).



FIG. 15 shows capillary blood based circulating cell-free DNA sequencing data retrieved from a patient with advanced cancer prior to and after treatment. This sample shows a significant increase in the amount of reads from the p-arm of chromosome 14 (indicated in black). This overrepresentation of part of chromosome 14 is caused by a large contribution of circulating tumor DNA carrying extra chromosome 14 material into the overall circulating cell-free DNA. After surgical removal of the cancer tissue, a capillary blood sample is received and analyzed. The relative representation of chromosome 14 has normalized because the source of the circulating tumor DNA contributing extra chromosome 14 material, the cancer tissue, has been effectively removed by the surgery (the area indicated in black is now normally represented).


Example 16. Detecting Viral Genome Using Ultra-Low Liquid Biopsy Test

A capillary blood sample is taken from a subject through transdermal puncture of the pointer finger. The puncture site is first cleaned with water, followed by cleaning with an alcohol pad. The first blood drop obtained after puncture of the pointer finger is discarded. A total of 100 ul capillary blood is then collected through “milking” of the finger into a micro tube containing coagulant. The capillary blood is then immediately centrifuged to create plasma and minimize white blood cell lysis (to prevent further increase of background genomic DNA) and the plasma is carefully removed by pipetting.


DNA Extraction


Circulating cell-free DNA is extracted from the plasma (45 ul) using a modified MagMax bead protocol.


Library Preparation


Sequencing libraries are generated from the extracted cfDNA using the NEB Ultra 2 kit (provide details for protocol) using Indexing (detail) primers.


Sequencing Library


Sequencing libraries are sequenced on an Illumina NextSeq500 sequencer using paired end sequencing (add details on read-length and indexes). 25 million reads are obtained for the sample.

    • 1. Whole blood is collected by finger-tip capillary bed puncture using a contact-activated lancet (BD Microtainer) and blood collection into a SAFE-T_FILL capillary collection device (KABE Labortechnik, GMBH). Capillary blood is processed to plasma by double-spin centrifugation as follows:
    • 2. Spin 1-1330 rpm for 20 minutes
    • 3. Spin 2-3300 rpm for 10 minutes
    • 4. Plasma is stored at 4° C. until use.
    • 5. Circulating cell-free DNA is then extracted from the plasma using a modified protocol for 10 ul of plasma with the MagMax Cell-Free DNA Isolation Kit (Life Technologies). Isolation consisted of the following steps:
      • a. Incubation of plasma with Proteinase K (volume dependent on starting input) at 60° C. for 20 minutes.
      • b. Lysis/binding of plasma to DynaBeads MyOne Silane paramagnetic beads (2.5-5 ul) with binding for 10 minutes at room temperature.
      • c. Ishing of the bead/ccfDNA complex (volume dependent on starting input).
      • d. Rinse bead/ccfDNA complex with 80% ethanol (volume dependent on starting input).
      • e. Elution of ccfDNA from beads (volume dependent on starting input) with incubation at room temperature for 2 minutes.
    • 6. Genome equivalents for each sample are estimated to be 1 GE/ul of plasma based on previous extractions at volumes ranging from 10 ul-4000 ul and published data. All of the eluted ccfDNA is used as input for library generation. DNA libraries are prepared using the NEBNext Ultra II DNA Library Prep Kit with the NEBNext Multiplex Oligos for Illumina (Index Set Primers 1) (New England Biolabs). Libraries are generated using reduced volumes to account for the stoichiometry of the lower template amounts. The volumes used depended on the input amount of template.
    • 7. Library preparation consists of:
      • a. End-repair, 5-phophphorylation and A-tailing with incubation at 20° C. for 30 minutes followed by 65° C. for 30 minutes.
      • b. Adaptor ligation with incubation at 20° C. for 15 minutes followed by cleavage of the ligated adaptor loop with incubation at 37° C. for 15 minutes. Adaptors are diluted 1:25 to a 0.6 uM working concentration. The cleaved, adaptor-ligated library is then subjected to bead-based purification using SPRISelect beads. The volume of beads is increased to 116 ul to further enhance binding of highly-fragmented, low concentration ccfDNA following adaptor ligation.
      • c. Library amplification/indexing with initial denaturation at 98° C. for 1 minute followed by 13 cycles of 98° C. denaturation for 10 seconds and annealing/extension at 65° C. for 75 seconds with final extension at 65° C. for 5 minutes. Amplified library is then purified using SPRISelect beads (45 ul).


All libraries are sized and characterized using Agilent Bioanalyzer 2100 with a High-Sensitivity DNA Chip (Agilent Technologies). Concentrations are determined using Qubit v3.0 (Life Technologies) for library dilutions prior to sequencing. Each library is normalized to a concentration of 2 nM and pooled for denaturation and dilution prior to sequencing.


Sequencing-by-synthesis is conducted using an Illumina NextSeq 550 at a loading concentration of 1.5 pM. Seventy-five cycle paired-end sequencing (2×75) is conducted for each index/sample. In general, each sample generated approximately 35 million passed-filter.


Data processing: Sequencing data is demultiplexed by bcl2fastq v2.17.1.14 with default parameters. Reads are quality trimmed and reads shorter than 20bases are filtered with Trimmomatic v0.32. Pass filter reads are aligned against the human reference using Bowtie v2.2.4 and are set aside. Reads potentially originating from repeat region such as human microsatellite DNA are filtered out. Remaining reads are aligned against microorganism reference databases with BLAST v2.2.30. Reads showing high identity with any of the reference database sequences are retained. Reads aligning with mitochondrial DNA and PCR duplicates are removed based on alignment data.


Determination of relative amounts of microorganisms and Taxon abundance is based on alignment results (sequencing read amounts and alignment scores to compensate for potential mismatches between reference sequence data and sequence reads that are caused by genetic drift). Reference genomes for alignment for Homo sapiens and microorganisms are retrieved from the National Center for Biotechnology Information ftp site.


Expected Results


Circulating cell-free DNA is extracted from capillary blood obtained from a hospitalized patient with a bloodstream infection (BSI). The cfDNA sequencing data identified a high relative abundance of K. pneumoniae. Serial capillary blood samples taken over 10 days allowed monitoring treatment success of the patient with the antibiotic Ceftazidime. After treatment initiation at day 3, the relative abundance of K. pneumoniae reduced significantly with increased treatment time. FIG. 16 provides exemplary results from this experiment.


Example 17. Existing Un-Optimized Protocols for Ion Semiconductor Sequencing Technologies Fail to Adequately Represent Total and Fetal Cell-Free DNA Fraction in Maternal Sample

A standard protocol (e.g., library preparation and sequencing methodologies) for detecting cell-free DNA in a maternal sample and an optimized protocol, as described herein, were compared. Sequencing data from both protocols were analyzed in the context of trisomy detection in order to evaluate if the standard library preparation protocol would provide equivalent accuracy to the optimized protocols of the present disclosure.


In this study 8 cfDNA samples were analyzed, including 4 samples obtained from women carrying a euploid fetus and 4 samples obtained from women carrying a fetus with trisomy 21. These 8 samples were processed using two sets of experimental conditions. In the first set, an optimized library preparation kit was used (NEB Next Ultra II library kit) with optimized volumes and ratios for low input cfDNA amounts to create the sequencing libraries and a fluorescence-based next generation sequencer to perform the sequencing. In the second set, an un-optimized library preparation kit was used (NEB Next DNA Library Prep Set for IonTorrent kit) to create the sequencing libraries and the ion semiconductor sequencer to perform the sequencing. In both conditions, 10 genome equivalents (GE) of cfDNA were used as input to the library preparation process.


Methods


Circulating cell-free DNA was isolated from blood plasma using paramagnetic beads to capture the cfDNA. Briefly, plasma was separated from whole blood by centrifugation and lysed/bound to the beads in a solution of protease K, guanidine hydrochloride, beads and glycogen. The beads were then washed in three steps using Triton X-100, guandindine hydrochloride and sodium chloride. Elution of cfDNA was conducted with water containing sodium azide. All samples were then quantified to determine the yield of cfDNA for downstream testing.


Prior to sequencing library generation all samples were normalized to 10 GEs of cfDNA for input into the library reactions.


Method 1: Standard Protocol


Libraries were generated for the ion semiconductor sequencer using the NEBNext Fast DNA Library Prep Set for Ion Torrent with modifications to the standard protocol. Library generation consisted of end repair, Ion Torrent-specific adaptor ligation, reaction clean-up with Ampure XP beads, library amplification with Ion Torrent-specific primers, and purification of amplified library with Ampure XP beads and final elution of the amplified library. Adaptors were diluted 1:10 for all libraries, amplification was conducted with 15 cycles and all libraries were eluted in 25 ul of molecular-grade water. Following library generation all samples were sized and quantified using an Agilent Bioanalyzer 2100 high-sensitivity DNA chip. Quantification was then repeated using a ThermoFisher Qubit 3.0. Libraries were further size-selected to eliminate adaptor-dimer products from the sequencing process. Purity and concentration of the size-selected libraries were confirmed as above.


Ion torrent S5 sequencing template and chip generation were conducted using an Ion Chef with the Ion 540 Kit and Ion 540 chip. Runs generated approximately 100 million reads in general with a minimum of 20 million reads per sample in the data generated.


Method 2: Optimized for Low-Input Amounts


DNA libraries were prepared using the NEBNext Ultra II DNA Library Prep Kit with the NEBNext Multiplex Oligos for Illumina (Index Set Primers 1) (New England Biolabs). Libraries were generated using reduced volumes to account for the stoichiometry of the lower template amounts. The volumes used depended on the input amount of template. Library preparation consisted of:

    • 1. End-repair, 5-phophphorylation and A-tailing with incubation at 20° C. for 30 minutes followed by 65° C. for 30 minutes.
    • 2. Adaptor ligation with incubation at 20° C. for 15 minutes followed by cleavage of the ligated adaptor loop with incubation at 37° C. for 15 minutes. Adaptors were diluted 1:25 to a 0.6 uM working concentration. The cleaved, adaptor-ligated library was then subjected to bead-based purification using SPRISelect beads. The volume of beads was increased to 116 ul to further enhance binding of highly-fragmented, low concentration ccfDNA following adaptor ligation.
    • 3. Library amplification/indexing with initial denaturation at 98° C. for 1 minute followed by 13 cycles of 98° C. denaturation for 10 seconds and annealing/extension at 65° C. for 75 seconds with final extension at 65° C. for 5 minutes. Amplified library was then purified using SPRISelect beads (45 ul).


All libraries were sized and characterized using Agilent Bioanalyzer 2100 with a High-Sensitivity DNA Chip (Agilent Technologies). Concentrations were determined using Qubit v3.0 (Life Technologies) for library dilutions prior to sequencing. Each library was normalized to a concentration of 2 nM and pooled for denaturation and dilution prior to sequencing. Sequencing-by-synthesis was conducted using an Illumina NextSeq 550 at a loading concentration of 1.5 pM. Seventy-five cycle paired-end sequencing (2×75) was conducted for each index/sample. In general, each sample generated approximately 4 million passed-filter.


Based on the amount of input material (normalized to 10 genome equivalents of circulating cell-free DNA), the theoretical lower limit of cfDNA fragments that should be available for analysis is around 10M (or 0.5 GE). To have 10M cfDNA fragments available for sequencing requires that a higher number has to be sampled from blood, because most process steps during sample preparation will be accompanied with some sample loss. It is generally accepted that library preparation efficiency is one of the most affected/least efficient process steps. It is important to control how many cfDNA fragments participate in the reaction and ultimately are being sequenced. In short 1 GE is represented by about 20M cfDNA fragments (3B base pairs; 150 bp fragment length). When the efficiency from blood draw to adapter ligation is only 1%, then the starting material before PCR is only 200,000 cfDNA fragments. During the PCR step these 200,000 fragments can be amplified to a sufficient degree for next generation sequencing. When these 200,000 cfDNA fragments are sequenced 2M times, the majority of cfDNA fragments are sequenced multiple times. In contrast the same sample processed with an efficiency of 100% provides 20M potential cfDNA fragments for sequencing and at the same 2M sequence reads only a small subset will have been sequenced more than once.


The sequencing data was analyzed in the context of trisomy detection in order to evaluate if a standard library preparation protocol as previously used on a ion semiconductor sequencer would have been able to provide equivalent accuracy to methods optimized for ultra-low input amounts.


Median and Median Variances


The relationship between median bin count and median absolute deviation (MAD) per bin for the two data sets was explored. Median counts were positively correlated with MAD. In addition there is a subset of bins with higher MADs. This effect is present in the raw and the GC corrected data indicating that the higher MAD are not caused by GC bias introduced during processing, but instead represent true biological variation. Comparing the two library preparation/sequencing methods confirms previous observations (FIG. 14A, 14B). Median normalized GC corrected bin counts are similar between the two different datasets (p-value=0.31, t-test). Bin specific MADs are lower in the standard protocol dataset (p-value <2.2e-16, t-test), potentially indicating better performance in CNV classification for the standard protocol data. The lower bin specific median might be a result of the significantly higher sequence counts that were available in the standard protocol dataset.



FIGS. 14A-14B show the relationship between median bin count and median absolute deviation (MAD) per bin for the standard versus optimized protocol data sets. Median normalized GC corrected bin counts are similar between the two different datasets (p-value=0.31, t-test). Bin specific MADs are lower in the standard protocol dataset (p-value <2.2e-16, t-test), potentially indicating better performance in CNV classification for the standard protocol data. The lower bin specific median might be a result of the significantly higher sequence counts that were available in the standard protocol dataset.


Duplicates


The analysis of duplicate sequence reads was used to estimate the number of genome equivalents (and therefore cfDNA fragments) that were available for sequencing after library preparation. The calculation is complex and will be outlined hereafter. In theory, the amount of duplicate reads are dependent on: a) how many cfDNA fragments participated in the reaction and b) how many sequence reads are generated.


To calculate the expected value the expected lambda value for the Poisson distribution was determined, which is sequence reads/cfDNA fragments. The expected duplication rate is not simply the probability to observe two or more. Because we do not have a measure for 0 counts we need to exclude those. Hence our expected duplication rate is the probability to observe 2 or more counts over the probability to observe 1 or more counts [(1−P(0)−P(1))/(1−P(0))]. We can then use this matrix of expected values as a lookup table to identify the input genome equivalents by matching the number of sequence read to the duplication rate.

    • poom<−1-dpois(0, seq.count.vec/cpy.tmp) #P(>=1) probability one or more
    • peo<-dpois(1, seq.count.vec/cpy.tmp) #P(1) probability exactly one
    • ptom<-poom-peo #P(>=2) probability two or more
    • mat.dup.rate[i,]<-ptom/poom#/#(peo+ptom) #bit unclean could also be ptom/poom



FIG. 14C shows library preparation and sequencing with the standard protocol yields fewer Genome Equivalents for sequencing, as compared to the optimized protocol of the present disclosure (median for Life=1.355, median for ILMN=6.065).


A starting amount of 10 GE was used for library preparation of each sample. FIG. 14C. shows library preparation and sequencing with the standard protocol yields fewer Genome Equivalents for sequencing, as compared to the optimized protocol of the present disclosure (median for Life=1.355, median for ILMN=6.065).


The number of available cfDNA fragment is a determining factor for classification accuracy and this data shows standard processing with the standard protocol results in a significant reduction of available cfDNA fragments.



FIG. 14D shows optimized protocol data points in yellow, standard protocol points in blue


Chromosome Representation Percentages and Z-Score


The percentage representation of fragments originating from chromosome 21 over the representation of all qualifying autosomes (excluding chromosome 21 and 19) were calculated for both protocols. The percentage for chrY and chrX was also calculated. The percentage representation of the sex chromosomes can be used to determine the sex of the fetus. For male samples percentage of sex chromosome representation can also be used to estimate the fraction of cfDNA originating from the fetus (fetal fraction). For chromosome 21 we calculated a Z-score according to well established methods. The median and MAD for a set of euploid reference samples were calculated. Next, the difference in median for each sample from that reference median was calculated. Finally, the difference was divided by the reference MAD to derive the Z-score. A score greater than 3 indicates the presence of a trisomy 21).



FIG. 14E shows that the data derived from the standard protocol library preparation and sequencing is noisy and does not allow for an easy delineation of samples carrying a male versus female fetus.


However, the data from the optimized and more efficient library preparation and sequencing protocol of the present disclosure for chrY representation is clear and shows that the set comprises three (3) male and five (5) female samples. In addition, there is not a good consensus between the two data sets for chrY measurements. Consequently chrX representation was used for the estimation of fetal fraction in male samples for the remaining analysis.


Performance Comparison Between Standard Library Preparation and Sequencing Protocol Vs Optimized Library Preparation and Sequencing Protocol Data


After correction for outlier bins, the Z-score analysis shows that the optimized library preparation and ILMN sequencing data performed as expected. FIG. 14F shows that the standard protocol data showed good specificity (0 false positives, 100% specificity) but poor sensitivity (2 false negatives, 50% sensitivity). Both datasets contain exactly the same samples and were given the exact same amount of input material. The standard protocol data has significantly more sequence reads per sample. However, as noted above, the number of sequence reads does not necessarily correlate with an accurate representation of cell-free DNA in the original sample. Next, the relationship between available cfDNA fragments, fetal fraction, and Z-score, was


To explore the relationship between fetal fraction, copy numbers and Z-scores, the percentage representation for chr21 and chrY was calculated. These percentages were used to estimate the fraction of fetal genetic material in the sample (herein referred to as fetal fraction). Female samples will not have an elevated chrY representation. For those female samples that show chr21 overrepresentation a fetal fraction was calculated from the chr21 overrepresentation. Samples were identified as female if their chrY representation in the optimized protocol dataset was less than 8.2*10−4.



FIG. 14G shows plots indicating samples with a fetal trisomy (red) and euploid fetus (black).


After transforming the chromosome representation percentage measurements into fetal fraction estimates, the value for chrY, chrX and chr 21 were on the same scale. All male samples had a fetal fraction estimate available. Also all trisomy 21 had an estimation available. As seen before, the optimized protocol data clearly delineates between male/female and euploid/trisomic samples. The standard protocol data is noisy and does not allow for a clear separation. We then constructed a fetal fraction measurement that uses the chrX measure for all male samples and the chr21 measure for all female samples with Trisomy 21. Fetal fraction for female euploid samples was not available.



FIG. 14G shows a combined fetal fraction measurement for all samples correlated well with the observed effect introduced by chr21 using the standard protocol (left) as compared to the optimized protocol (right)).


Z-Scores, Copy Numbers and Fetal Fraction


The relationship between copy numbers, fetal fraction and Z-scores, was plotted. Euploid samples are distributed on the copy number/fetal fraction plane but their z-scores are not correlated to those parameters. This behavior is expected, but complicates the visualization. The protocol data is distinct from the standard protocol data with respect to copy numbers.



FIG. 14H shows that correctly classified samples (True Positives, TP) separate from incorrectly classified samples (False Negatives, FN) for both protocols. Also shown are more copy numbers resulting from the optimized protocol as compared to the standard protocol.


Using a computer simulation that takes into account sampling error at all stages of the library preparation process, we can build a model to predict performance for each combination of available cfDNA fragments and fetal fraction. At an estimated PCR efficiency of 90%, library efficiency of 5% and 36M sequence reads, the resulting line that indicates 50% sensitivity perfectly separates the True Positives from the False Negative samples (FIG. 14I).


CONCLUSION

Disclosed herein, is a demonstration that a standard library preparation and sequencing method that is not optimized for low-input amounts does leads to a reduced amount of copies of cell-free DNA as compared to the optimized protocol, when the same low input amount is used. A reduced copy number representation is a result of a higher noise in the chromosome representations and therefore lower performance in detection of aberrations.


Example 18. Determining a Baseline Fragment Length Profiles

There are artifacts during the blood collection and DNA extraction process that can result in lower fetal fraction. These artifacts include mechanical stress, contamination, and most notably, white blood cell (WBC) degradation. For example, blood sampled via finger prick comes in contact with the surface of the skin before being collected in a container. During that contact additional DNA from, for example, shed skin cells can be picked up and add to the overall pool of maternal DNA. Furthermore, the method of skin puncture is also very different between venous blood draw and finger prick. The needle in a venous blood draw is designed to slice through the skin and be positioned inside the vein. Cell that have been damaged during the skin puncture are unlikely to contribute significantly to the maternal background because the open tip of the needle is positioned away from the skin puncture. During a finger prick the sampled blood washes directly passed the punctured skin to be collected on the surface. During that passage it can collect the DNA from al the cells that have been damaged when puncturing the skin. While these DNA contaminations might be small they become impactful when small amounts of blood are collected. For example, assuming the same 10% fetal fraction and 1000 GE of cfDNA/ml plasma, we would expect a total of 10 GE cfDNA (9 maternal and 1 fetal) in a 20 ul capillary blood draw. In an example we assume that 10 cells are damaged and contribute to the blood collection. In a capillary blood draw of 20 ul the fetal fraction would drop from 10% to 5%.


DNA that has not undergone apoptosis, such as nucleosomal DNA released after mechanical cell damage or DNA for shed skin surface cells shows a different fragment length distribution. By analyzing size length profiles of cfDNA fragments derived from venous blood draw and comparing them to cfDNA fragment length profiles from other sources we can describe the impact of non-cfDNA. In the context of NIPT this non-cfDNA can be regarded as maternal background contamination, which leads to a reduction in fetal fraction.


Venous Blood


The length profiles of cfDNA, which has been extracted from venous blood, have been described in the art and are understood to a person of skill in the art to multiples of about 166-168 bp. This is explained by the process of apoptosis, where nucleosomal DNA is wrapped around nucleosomes and cut in between those nucleosomes. The resulting pattern is the characteristic size profile of cfDNA isolated from plasma (FIG. 17A). Genomic DNA that does not undergo apoptosis does not exhibit the characteristic size profile, but instead shows a size distribution biased towards smaller fragment lengths.


Capillary Blood


A standard capillary blood draw was performed, including a finger prick with a onetime use lancet to penetrate the skin at the lateral distal end of a finger, and collection of blood at the incision site in an EDTA collection tube. The plasma was separated within 20 min of collection and a standard cfDNA extraction was performed. Sequencing of the extracted DNA revealed a different fragment length distribution compared to cfDNA derived from venous blood (FIG. 17A). To make the fragment length distributions comparable between different samples, the counts for each length were divided by the mode of the distribution. FIG. 17B shows the profile for capillary blood, characterized by an overrepresentation in fragments smaller than 168 bp peak and fragments larger than the 168 bp.


To determine whether the collection process (e.g., either by the contamination of addition DNA or mechanical stress) was causing the overrepresentation in the short fragment lengths, DNA extracted from skin was analyzed.


DNA from Skin Surface


To measure DNA from the skin surface we applied water to the surface of a fingertip and subsequently collected it. During the application of water, DNA located on the surface is dissolved in the water. Following the water collection, the DNA was extracted and sequenced. The size profiles show the highest representation at small fragment lengths with a steady decrease towards longer fragment lengths (FIG. 18A). This fragmentation size pattern is representative for DNA that has not been protected from digestion by nucleosomes, such as nucleosomal DNA from shed skin cells.


DNA from Cell Damage


To model a sample containing cell-free DNA and damaged DNA (as a possible source of the fragments smaller than 168 bp peak and fragments larger than the 168 bp), we induced local hypoxemia to the distal end of the 5th digit to introduce cell damage. The component of DNA from damaged cells was estimated by inspecting the difference between the fragmentation length difference between a regular capillary blood draw and the capillary blood draw with induced cell damage. A comparative analysis of each fragment length bin was normalized to the representation of that bin in venous blood. The fragment length distributions of damaged cells show overrepresentation is maximized at the local minima of the fragment length distribution from venous blood. (FIGS. 18C and 18D).


Without being bound by a particular theory, the overrepresentation of fragment lengths in capillary blood compared to venous blood is consistent with assuming a mixture of regular cfDNA and non-apoptotic genomic DNA. The source of the non-apoptotic genomic DNA is likely cell damage introduced by perforation of the skin with a lancet.


Example 20. Exemplary Method for Reduction in Contamination

To investigate the effect of different collection methods on the contribution of non-apoptotic genomic DNA we compared a standard finger prick blood collection protocol to one that we have optimized. The standard protocol includes thorough cleaning of the fingertip with ethanol, puncture the skin with a onetime use lancet and collect the blood into an EDTA container (hereafter referred to as the “non-wiped” condition). In the optimized protocol an additional step is performed before the blood is collected. After the skin is punctured with the one time use lancet the first drop of blood is wiped away with gauze pad (hereafter referred to as the “wiped” condition). Only the blood following this first drop is collected in the EDTA container.


Method


The collected blood was processed into plasma and DNA extracted within 2 hours of collection. DNA quantity was assessed using real time PCR. Fragment length distributions were established by paired end sequencing on a ILMN Next-Seq. Venous blood was collected as a reference using a standard method.


DNA Quantity


The DNA quantity for samples collected with the non-wiped condition is approximately 50% higher compared to the wiped collection protocol. Higher DNA yields are generally regarded as favorable for NIPT analysis. However, the analysis of fragment length distributions revealed a stronger overrepresentation of fragments lengths indicative for cell damage in the non-wiped condition (FIG. 19).


Without being bound by any particular theory, wiping away the first drop of blood reduces the contribution of DNA derived from cell damage. Alternatively or in addition, solutions to the issue of DNA originating from damage and contamination, including: (1) capture methods that select against longer DNA fragments, (2) electrophoretic methods, (3) selection of library products by size, and (4) bioinformatics methods to account/remove or differentially analyze based on size information.


Example 21. Detection of Fetal Chromosomal Aneuploidy Using Low Coverage Whole Genome Sequencing-by-Synthesis with Ultra-Low Input Amounts of Circulating cfDNA (10 Genome Equivalents)

Whole blood is collected by finger-tip capillary bed puncture using a contact-activated lancet (BD Microtainer) and blood collection into a SAFE-T_FILL capillary collection device (KABE Labortechnik, GMBH). Capillary blood is processed to plasma by double-spin centrifugation as follows:

    • 1. Spin 1-1330 rpm for 20 minutes
    • 2. Spin 2-3300 rpm for 10 minutes


Plasma was processed fresh, stored at 4° C. or −80° C. until use.


Circulating cell-free DNA was then extracted from 10 ul of plasma using a paramagnetic beads protocol. Isolation consisted of the following steps:

    • 1. Incubation of plasma with Lysis Buffer and beads at room temperature for 10 minutes while being agitated for lysis/binding.
    • 2. Washing of the bead/ccfDNA complex with wash solution 1.
    • 3. Washing of the bead/ccfDNA complex with wash solution 2.
    • 4. Elution of ccfDNA from beads (20 μl) with incubation at room temperature for 5 minutes.


The extracted DNA was used as input for library generation. DNA libraries were prepared using the NEBNext Ultra II DNA Library Prep Kit with the NEBNext Multiplex Oligos for Illumina (Index Set Primers 1) (New England Biolabs). Library preparation consisted of:

    • 1. End-repair, 5-phophphorylation and A-tailing with incubation at 20° C. for 30 minutes followed by 65° C. for 30 minutes.
    • 2. Adaptor ligation with incubation at 20° C. for 15 minutes followed by cleavage of the ligated adaptor loop with incubation at 37° C. for 15 minutes. Adaptors were diluted 1:25 to a 0.6 uM working concentration. The cleaved, adaptor-ligated library was then subjected to bead-based purification using SPRISelect beads. The volume of beads was increased to 116 ul to further enhance binding of highly-fragmented, low concentration ccfDNA following adaptor ligation.
    • 3. Library amplification/indexing with initial denaturation at 98° C. for 1 minute followed by 15 cycles of 98° C. denaturation for 10 seconds and annealing/extension at 65° C. for 75 seconds with final extension at 65° C. for 5 minutes. Amplified library was then purified using SPRISelect beads (45 ul).


All libraries were sized and characterized using Agilent Bioanalyzer 2100 with a High-Sensitivity DNA Chip (Agilent Technologies). Concentrations were determined using Qubit v3.0 (Life Technologies) for library dilutions prior to sequencing. Each library was normalized to a concentration of 2 nM and pooled for denaturation and dilution prior to sequencing. Sequencing-by-synthesis was conducted using an Illumina NextSeq 550 at a loading concentration of 1.5 pM. Seventy-five cycle paired-end sequencing (2×75) was conducted for each index/sample. The median sequencing count for this sample set was 6.4M paired sequence reads passed-filter. All sequencing data (fasta.gz files) was aligned against the human reference genome build hg38 using Bowtie with alignment parameters “−k 1-n 0”. For further analysis, the human genome was divided into consecutive 50,000 basepair regions, also called 50 kb bins, and the fraction of the base “G” and “C” was calculated for each bin with an accuracy up to 3 decimals. For each bin the aligned sequence reads that start in a bin were counted. For further analysis the data was reduced by filtering out bins not on chromosomes 1 to 22 (e.g. chromosomes X and Y were excluded). After this filtering, a Loess regression between GC contend and read count per bin was performed and the median bin count was calculated. The Loess regression provided an expected bin count for each GC content value, also called the expected value. This expected value was divided by the median bin count to get a correction factor. The measured bin count was then divided by the correction factor resulting in a GC corrected bin count and the median of the GC corrected bin count was calculated. All 50 kb bins were divided by the median GC corrected bin count to yield GC normalized bin counts and for each bin a median and median absolute deviation (MAD) was calculated. Bins with a low MAD and a median around the expected value of 1 were selected (bins with MAD>=95th percentile of all MAD bin values were filtered out; bins with Median <=5th percentile or Median >=95th percentile of all Median bin values were filtered out).


From the reduced and normalized data, all sequence bins originating on chromosome 21 were identified. The percentage representation of sequence reads originating from chromosome 21 was calculated by summing up all GC normalized values for bins originating on chromosome 21 and dividing the sum by the sum of all GC normalized values excluding GC normalized values of bins originating from chromosome 21 and 19 (as well as other chromosomes already excluded in earlier steps, e.g. X and Y, chromosomes other than 1-22). The median and MAD of the chromosomes 21 representation were then calculated from a set of known euploid samples (reference samples). For each sample the median chromosome 21 representation (obtained from the reference set) was subtracted from the sample specific chromosome 21 representation resulting in a sample specific difference. This sample specific difference was divided by the chromosome 21 representation MAD (obtained from the reference set), providing a value referred to as the Z-score. Test samples were then classify based on their Z-score, where samples with a Z-score of 3.5 and higher were classified as trisomic and samples with a Z-score of less than 3.5 were classified as euploid. See FIG. 20. 16 samples were obtained from individuals assumed to carry a euploid pregnancy and 8 samples were obtained from women confirmed to carry a fetus with trisomy 21. All 16 samples assumed to be euploid were classified as euploid and all 8 samples with confirmed trisomy 21 were classified as trisomy 21. This resulted in a sensitivity of 100% and a specificity of 100% for this dataset.


While preferred embodiments of the devices, systems and kits disclosed herein have been shown and described herein, it will be obvious to those skilled in the art that such embodiments are provided by way of example only. Numerous variations, changes, and substitutions will now occur to those skilled in the art without departing from the devices, systems and kits disclosed herein. It should be understood that various alternatives to the embodiments of the devices, systems and kits disclosed herein may be employed in practicing the inventive concepts. It is intended that the following claims define the scope of the devices, systems and kits and that methods and structures within the scope of these claims and their equivalents be covered thereby.

Claims
  • 1. A method comprising: a) obtaining a biological sample from a subject, wherein the biological sample comprises a cell-free deoxyribonucleic acid (cfDNA), and wherein the biological sample has a volume of at most 120 microliters when it is obtained from the subject;b) tagging at least a portion of the cfDNA to produce tagged cfDNA by: a. generating ligation-competent cfDNA by one or more steps comprising: i. generating a blunt end of the cfDNA, wherein a 5′ overhang or a 3′ recessed end is removed using one or more polymerases and one or more exonucleases;ii. dephosphorylating the blunt end of the cfDNA;iii. contacting the cfDNA with a crowding reagent thereby enhancing a reaction between the one or more polymerases, one or more exonucleases, and the cfDNA; oriv. repairing or remove DNA damage in the cfDNA using a ligase; andb. ligating the ligation competent cfDNA to adaptor oligonucleotides by contacting the ligation competent cfDNA to the adaptor oligonucleotides in the presence of the ligase and one or more of a crowding reagent and a small molecule enhancer; andc) optionally, amplifying the tagged cfDNA; andd) sequencing at least a portion of the tagged cfDNA.
  • 2. The method of claim 1, wherein the volume is at most 100 microliters, when it is obtained from the subject.
  • 3. The method of claim 2, wherein the volume is at most 40 microliters when it is obtained from the subject.
  • 4. The method of claim 1, wherein the biological sample obtained from the subject is capillary blood.
  • 5. The method of claim 4, wherein the volume is at most 40 microliters when it is obtained from the subject.
  • 6. The method of claim 4, wherein the biological sample was obtained from the subject by a process of: a) inducing a first transdermal puncture to produce a first fraction of a biological sample;b) discarding the first fraction of the biological sample; andc) collecting a second fraction of the biological sample, thereby reducing or eliminating contamination of the biological sample due to white blood cell lysis.
  • 7. The method of claim 1, further comprising detecting a normal representation, an overrepresentation or an underrepresentation of at least one target sequence in the at least a portion of the tagged cfDNA.
  • 8. The method of claim 1, wherein the subject is pregnant with a fetus.
  • 9. The method of claim 8, wherein a component of the cfDNA is a fetal cfDNA component from the fetus.
  • 10. The method of claim 9, wherein the cfDNA in the biological sample is about 10 genome equivalents.
  • 11. The method of claim 9, further comprising analyzing genotype information from an individual and the fetal cfDNA component to determine whether the individual paternally contributed to the fetus by identifying a genotypic match between the fetal cfDNA component and the genotype information.
  • 12. The method of claim 1, comprising amplifying in (c), and wherein generating the ligation-competent cfDNA comprises: d) generating the blunt end of the cfDNA, wherein a 5′ overhang or a 3′ recessed end is removed using one or more polymerases and one or more exonucleases;e) dephosphorylating the blunt end of the cfDNA;f) contacting the cfDNA with a crowding reagent thereby enhancing a reaction between the one or more polymerases, one or more exonucleases, and the cfDNA; andg) repairing or remove DNA damage in the cfDNA using the ligase.
  • 13. The method of claim 1, wherein the cfDNA is selected from a tumor, transplanted tissue or organ, or one or more pathogens, in the subject.
  • 14. The method of claim 13, wherein the one or more pathogens comprises a bacterium or component thereof.
  • 15. The method of claim 13, wherein the one or more pathogens comprises a virus or a component thereof.
  • 16. The method of claim 13, wherein the one or more pathogens comprises a fungus or a component thereof.
  • 17. The method of claim 1, comprising amplifying in (c) by massively multiplexed amplification.
  • 18. The method of claim 17, wherein the massively multiplex amplification assay is isothermal amplification.
  • 19. The method of claim 17, wherein the massively multiplex amplification assay is massively multiplexed polymerase chain reaction (mmPCR).
  • 20. The method of claim 1, further comprising pooling two or more biological samples, each sample obtained from a different subject.
  • 21. The method of claim 1, further comprising contacting the biological sample with a white blood cell stabilizer after obtaining the biological sample from the subject.
  • 22. The method of claim 1, wherein the biological sample obtained from the subject was collected using a device configured to lyse intercellular junctions of an epidermis of the subject.
  • 23. The method of claim 1, wherein the tagging in (b) produces a library of tagged cfDNA with an efficiency of at least 0.5, when the library is prepared by: h) performing end-repair, 5′ phosphorylation and A-tailing with incubation at 20 degrees Celsius for 30 minutes followed by 65 degrees Celsius for 30 minutes;i) ligating the cfDNA to adaptor oligonucleotides with incubation at 20 degrees Celsius for 15 minutes;j) cleaving a ligated adaptor loop from the adaptor oligonucleotides with incubation at 37 degrees Celsius for 15 minutes, to produce ligation-competent cfDNA;k) amplifying the ligation-competent cfDNA by: i. denaturing the ligation-competent cfDNA at 98 degrees Celsius for 1 minute, followed by 13 cycles at 98 degrees Celsius for 10 seconds;ii. annealing the denatured ligation-competent cfDNA to one or more complementary primers from (i) at 65 degrees Celsius for 75 seconds; andiii. extending the ligation-competent cfDNA at 65 degrees Celsius for 5 minutes, to produce an amplified library of ligation-competent cfDNA; andl) purifying the amplified library of ligation-competent cfDNA using SPRI beads.
  • 24. A method comprising: a) obtaining a biological sample from a pregnant subject with a fetus, wherein the biological sample comprises a cell-free deoxyribonucleic acid (cfDNA), and wherein the biological sample has a volume that is not greater than about 120 microliters when obtained from the subject;b) contacting at least one cfDNA in the biological sample with an amplification reagent and a polynucleotide primer that anneals to a sequence corresponding to a sequence of interest to produce an amplification product; andc) detecting a presence or an absence of the amplification product.
  • 25. The method of claim 24, further comprising annealing a oligonucleotide probe with a detectable label to the at least one cfDNA.
  • 26. The method of a claim 25, further comprising detecting epigenetic modification of the cfDNA.
  • 27. The method of claim 26, wherein the epigenetic modification comprises methylation at a genetic locus of the cfDNA.
  • 28. The method of claim 24, wherein detecting a presence of the amplification product indicates a gender of the fetus.
  • 29. The method of claim 28, wherein a component of the cfDNA is from the fetus.
  • 30. The method of claim 24, further comprising contacting the biological sample with a white blood cell stabilizer following obtaining the biological sample from the subject.
  • 31. The method of claim 24, wherein the volume of the biological sample is not greater than 50 microliters.
  • 32. The method of claim 31, wherein the volume of the biological sample is between about 10 microliters and about 40 microliters.
  • 33. The method of claim 24, wherein the biological sample was collected by a process of: m) inducing a first transdermal puncture to produce a first fraction of a biological sample;n) discarding the first fraction of the biological sample; ando) collecting a second fraction of the biological sample, thereby reducing or eliminating contamination of the biological sample due to white blood cell lysis.
  • 34. A method of increasing a relative amount of a target nucleic acid in a biological sample obtained from a subject comprising: p) inducing a transdermal puncture at a site of the subject to produce a first fraction and a second fraction of a biological sample;q) discarding the first fraction of the biological sample; andr) collecting the second fraction of the biological sample, thereby reducing or eliminating contamination or nucleic acid damage of the biological sample, wherein the first fraction comprises a lower fraction of a target nucleic acid, as compared to a fraction of the target nucleic acid in the second fraction.
  • 35. The method of claim 34, further comprising cleaning the site before inducing the transdermal puncture, thereby removing or reducing unwanted contaminant.
  • 36. The method of claim 35, wherein the unwanted contaminant comprises DNA from the transdermal puncture site.
  • 37. The method of claim 34, wherein the nucleic acid damage comprises damage to non-apoptotic DNA in the biological sample.
  • 38. The method of claim 34, wherein the biological sample is capillary blood.
  • 39. The method of claim 34, further comprising detecting the target nucleic acid in the second fraction of the biological sample using an assay selected from massively multiplexed polymerase chain reaction (mmPCR) or nucleic acid sequencing.
  • 40. A device comprising: a) a sample collector for obtaining from a subject a biological sample comprising a volume of at most 120 microliters, wherein the biological sample comprises a target cell-free DNA (cfDNA);b) a sample purifier for removing a cell from the biological sample to produce a cell-depleted sample; andc) a nucleic acid detector configured to detect the target cfDNA in the cell-depleted sample.
  • 41. The device of claim 40, further comprising a nucleic acid ligator comprising: a) A ligation formulation for producing ligation-competent target cfDNA, the ligation formulation comprising one or more of: 1) one or more exonucleases adapted to generate a blunt end of the target cfDNA and remove a 5′ overhang or a 3′ recessed end of the blunt end of the target cfDNA;2) a blunt end cfDNA dephosphorylating agent;3) a crowding reagent;4) a DNA damage repair agent; or5) a DNA ligase; andb) one or more adaptor oligonucleotides ligated to the ligation-competent target cfDNA.
  • 42. The device of claim 40, further comprising a white blood cell stabilizer.
  • 43. The device of claim 40, wherein the nucleic acid detector is a massively multiplexed PCR device (mmPCR).
  • 44. The device of claim 40, wherein the ligation formulation comprises: s) the one or more exonucleases adapted to generate a blunt end of the target cfDNA and remove a 5′ overhang or a 3′ recessed end of the blunt end of the target cfDNA;t) the blunt end cfDNA dephosphorylating agent;u) the DNA damage repair agent; andv) the DNA ligase.
  • 45. The device of claim 40, wherein the nucleic acid detector comprises a nucleic acid sequencer or lateral flow strip.
  • 46. The device of claim 45, wherein the nucleic acid sequencer comprises a signal detector.
  • 47. The device of claim 40, wherein the sample purifier comprises a filter, and wherein the filter has a pore size of about 0.05 microns to about 2 microns.
  • 48. The device of claim 47, wherein the filter is a vertical filter.
  • 49. The device of claim 40, wherein the sample purifier comprises a binding moiety selected from an antibody, antigen binding antibody fragment, a ligand, a receptor, a peptide, a small molecule, and a combination thereof.
  • 50. The device of claim 40, wherein the sample collector is configured to lyse intercellular junctions of an epidermis of the subject to obtain the biological sample.
CROSS-REFERENCE

This international application claims the benefit of U.S. Provisional Application Ser. No. 62/824,757, filed Mar. 27, 2019, which is incorporated herein by reference in its entirety.

PCT Information
Filing Document Filing Date Country Kind
PCT/US2020/024638 3/25/2020 WO 00
Provisional Applications (1)
Number Date Country
62824757 Mar 2019 US