The instant application contains a Sequence Listing which has been submitted electronically in ASCII format and is hereby incorporated by reference in its entirety. Said ASCII copy, created on Jul. 8, 2021, is named 13617-004-999_SL.txt and is 3,618 bytes in size.
This application relates to devices and methods for diagnostics based on analysis of nucleic acids.
Physicians order diagnostic tests, devices, and procedures to identify the cause of their patients' symptoms. For any particular symptom or like indication of a disease or abnormality, a patient can undergo several different tests, ranging from a simple physical exam to extensive or invasive assays. The use of multiple tests can be time-consuming. Many tests require technical equipment, extensive training, and specialists to perform and interpret each test. As a result, the current state of medical testing is relatively expensive, complicated and inaccessible to millions of patients. Paradoxically, this practice potentially can lead to delayed diagnosis and care. For example, physicians who are pressed to ration tests, may forego ordering a test and miss a diagnosis. New approaches are needed to streamline diagnostic testing and improve accessibility.
An exemplary justification for ordering multiple tests is that some tests evaluate only one process at a time, and as a result, additional tests can be needed to evaluate several possible diagnoses (
As one example, for the symptom “chest pain” illustrated in
There is great interest in using nucleic acids as analytes in medical testing. Nucleic acids, e.g. deoxyribonucleic (DNA) and ribonucleic acids (RNA), are present in every form of life and can be used to distinguish different organisms. DNA and RNA are composed of long polymers of four molecules called nucleotides. These nucleotides differ by nitrogenous bases called cytidine (C), guanine (G), thymine/uracil (T or U), and adenine (A). DNA and RNA vary in nucleotide number and order. For instance, DNA polymers can be relatively short, e.g., can be 5 or fewer nucleotides long, or can be hundreds of millions of nucleotides long, or anywhere in between. The order of nucleotides differs in every organism and can be used to identify human vs. non-human DNA. Specific DNA sequence testing for pathogens is often highly diagnostic and potentially can overcome the difficulty of isolating slow-growing organisms such as fungi and atypical mycobacteria.
Nucleic acid sequences can also be used to identify the anatomical location or cell type of origin. Specific sequences of DNA, called genes, produce RNA, which are used as a template for the cell to produce new proteins and enzymes necessary for the cell's function. For example,
Additionally, because the cells in the human body perform their respective, different functions, the cells' proteins and corresponding RNAs can be used to identify different cell types. For example, RNAs made in the heart can be used to distinguish heart tissue from lung tissue. The production of RNA is a highly regulated process. During this process, specific areas of the genome, e.g., genes, can gather large molecules to produce RNA (e.g. RNA polymerase, transcription factors, elongation, splicing factors), control RNA splicing, modify DNA or RNA directly, and alter DNA accessibility, the latter of which can modify DNA packaging proteins called histones. The production of RNA from or the association of transcriptional machinery with DNA from these sites can be used as evidence for an active gene. Changes in gene activity can be associated with different cell types and cell responses to a number of conditions such as disease, cell damage, ischemia, nutritional changes, chemical or drug exposure, and the like. Thus, active genes, specific cell types, and different organisms potentially can be ascertained through the detection of specific DNA and RNA sequences and specific chemical modifications such as methylation. See, for example, Rando et al., “Genome-wide views of chromatin structure,” Annu. Rev. Biochem. 78: 245-271 (2009), the entire contents of which are incorporated by reference herein.
For example,
Methods for nucleic acid analysis such as high-throughput sequencing have improved immensely and are capable of detecting millions of DNA or RNA molecules in one assay and potentially identifying some or all known pathogens and genes. Success of these approaches are often measured by the torrent of data that has been obtained from nucleic acid sequencing and by the estimates that petabytes of new data will be generated annually through this methodology. Despite this enormous potential, nucleic acid-based testing remains highly specialized and restricted to narrow uses. Some obstacles include technical complexity or technical difficulties. For example, high-throughput sequencing can require a series of highly specialized personnel with non-interchangeable skills who are responsible for sample collection, nucleic acid extraction and preparation, sequencing, data transfer, sequence data conversion, and reporting. In addition, the amount of data produced from sequencing billions of molecules can be memory- and processor-intensive, making data transfer and analyses extremely challenging. Furthermore, the interpretation of sequence data can be based on artificial intelligence, supercomputer-based machine learning, and consortium-based discovery to organize and understand the sequence output. These complexities can create barriers for use of nucleic acid sequencing in the clinic, where the expertise and time necessary to generate or interpret vast amounts of data are unavailable to most providers and hospitals.
Embodiments of the present invention provide devices and methods for diagnostics based on analysis of nucleic acids.
Under one aspect, a method is provided for use in diagnosing a condition based on a symptom experienced by a subject and based on a first biological sample obtained from the subject, the first biological sample including nucleic acids, the method being executed by a device. The method can include, based on the symptom, preselecting a first set of the nucleic acids for analysis. The method also can include capturing by the device a first plurality of the nucleic acids of the first set that are present in the first biological sample. The method also can include, for each of the captured nucleic acids of the first plurality: quantifying by the device an amount of that captured nucleic acid that is present in the first biological sample; sequencing by the device that captured nucleic acid; and based on the sequence of that captured nucleic acid, identifying by the device an origin of that captured nucleic acid. The method also can include outputting by the device an indication of the quantified amount and the identified origin of at least one captured nucleic acid that is present in the first biological sample.
Optionally, preselecting the first set of the nucleic acids for analysis includes receiving by the device a first symptom-specific cartridge including a first set of complementary nucleic acids configured to capture the first set of the nucleic acids for analysis. Optionally, the method further includes, after the outputting step, removing the first symptom-specific cartridge from the device and receiving by the device a second symptom-specific cartridge including a second set of complementary nucleic acids. Optionally, the first set of complementary nucleic acids is different than the second set of complementary nucleic acids.
Additionally, or alternatively, the method optionally can include outputting by the device an indication of the quantified amount of each of the captured nucleic acids of the first plurality.
Additionally, or alternatively, the capturing can include separating extracellular nucleic acids in the first biological sample from intracellular nucleic acids in the first biological sample; and the quantifying and sequencing steps can be performed separately on the separated extracellular nucleic acids and on the intracellular nucleic acids. Optionally, the method includes outputting by the device an indication of the quantified amount of at least one of the extracellular nucleic acids and an indication of the quantified amount of at least one of the intracellular nucleic acids.
Additionally, or alternatively, the identifying by the device the origin of the captured nucleic acid can include comparing the sequence of that nucleic acid to sequences stored in a library stored in a computer-readable medium of the device. Optionally, the library stores nucleic acid sequences for a human and for a plurality of pathogens. Optionally, the output indicates the relative number of a pathogen per human cell.
Additionally, or alternatively, the method optionally includes receiving by the device a second biological sample obtained from the subject, the second biological sample being different from the first biological sample; and capturing by the device a second plurality of the nucleic acids of the first set that are present in the second biological sample. Optionally, for each of the captured nucleic acids of the second plurality, the method also can include quantifying by the device an amount of that captured nucleic acid that is present in the second biological sample; sequencing by the device that captured nucleic acid; and based on the sequence of that captured nucleic acid, identifying by the device an origin of that captured nucleic acid. Optionally, the outputting by the device further includes an indication of the quantified amount and the identified origin of at least one captured nucleic acid that is present in the second biological sample.
Additionally, or alternatively, the method optionally further can include outputting by the device an indication of at least one potential diagnosis for the subject and an indication of the likelihood of the at least one potential diagnosis based on the quantified amount and the identified origin of at least one captured nucleic acid that is present in the first biological sample.
Under another aspect, a device is provided for use in diagnosing a condition based on a symptom experienced by a subject and based on a first biological sample obtained from the subject, the first biological sample including nucleic acids. The device can include a first set of complementary nucleic acids configured to capture a first set of the nucleic acids, the first set of the nucleic acids being selected based on the symptom, the first set of complementary nucleic acids capturing a first plurality of the nucleic acids of the first set that are present in the first biological sample. The device also can include a nucleic acid quantifier configured to quantify an amount of each of the captured nucleic acids that is present in the first biological sample. The device also can include a nucleic acid sequencer configured to sequence each captured nucleic acid that is present in the first biological sample. The device also can include a processor coupled to the quantifier and to the sequencer and being suitably programmed to identify an origin of each captured nucleic acid based on the sequence of that captured nucleic acid. The device also can include an output module coupled to the processor, the processor further being suitably programmed to cause the output module to output an indication of the quantified amount and the identified origin of at least one captured nucleic acid that is present in the first biological sample.
Optionally, the device includes a receptacle configured to receive the first set of complementary nucleic acids within a first symptom-specific cartridge. Optionally, the first symptom-specific cartridge is removable from the receptacle and replaceable with a second symptom-specific cartridge including a second set of complementary nucleic acids. Optionally, the first set of complementary nucleic acids is different than the second set of complementary nucleic acids.
Additionally, or alternatively, the processor further can be suitably programmed to cause the output module to output an indication of the quantified amount of each of the captured nucleic acids of the first plurality.
Additionally, or alternatively, the device further can include a separator configured to separate extracellular nucleic acids in the first biological sample from intracellular nucleic acids in the first biological sample. Optionally, the nucleic acid quantifier and nucleic acid sequencer separately operate on the separated extracellular nucleic acids and on the intracellular nucleic acids. Optionally, the processor further is suitably programmed to cause the output module to output an indication of the quantified amount of at least one of the extracellular nucleic acids and an indication of the quantified amount of at least one of the intracellular nucleic acids.
Additionally, or alternatively, the device optionally further can include a computer-readable medium coupled to the processor. The processor optionally further can be suitably programmed to identify the origin of the captured nucleic acid based on comparing the sequence of that nucleic acid to sequences stored in a library stored in the computer-readable medium. Optionally, the library stores nucleic acid sequences for a human and for a plurality of pathogens. Optionally, the output indicates the relative number of a pathogen per human cell.
Additionally, or alternatively, the first set of complementary nucleic acids optionally can be configured to capture a second plurality of the nucleic acids of the first set that are present in a second biological sample obtained from the subject, the second biological sample being different from the first biological sample. Optionally, the nucleic acid quantifier further can be configured to quantify an amount of each of the captured nucleic acids that is present in the second biological sample. Optionally, the nucleic acid sequencer further can be configured to sequence each of the captured nucleic acids that is present in the second biological sample. Optionally, the processor further can be suitably programmed to identify an origin of each captured nucleic acid based on the sequence of the captured nucleic acid that is present in the second biological sample. Optionally, the processor further can be suitably programmed to cause the output module to output an indication of quantified amount and the identified origin of at least one captured nucleic acid that is present in the second biological sample.
Additionally, or alternatively, the processor optionally further can be suitably programmed to cause the output module to output an indication of at least one potential diagnosis for the subject and an indication of the likelihood of the at least one diagnosis based on the quantified amount and the identified origin of at least one captured nucleic acid that is present in the first biological sample.
Under yet another aspect, a database can be stored in a computer-readable medium. The database can store at least a plurality of symptoms, a nucleic acid sequence associated with each of the symptoms, a potential diagnosis associated with each of the symptoms, a laboratory test or a procedure for each of the symptoms, and an inferred value for each of the symptoms, the inferred value including a clinical inference based on a result of said laboratory test for the respective symptom.
Under another aspect, a method is provided of generating a database stored in a computer-readable medium. The method can include receiving, by a device, a plurality of medical documents, each document describing at least one symptom experienced by a respective patient, a laboratory test or a procedure performed on that patient, and a diagnosis associated with the at least one symptom experienced by that patient, the diagnosis being based on a result of the laboratory test performed on that patient. The method also can include, by the device, inferring values based on the symptoms, the laboratory tests, and the diagnoses described in the plurality of medical documents, each inferred value including a clinical inference based on a result of at least one of the laboratory tests for the respective symptom. The method also can include, by the device, identifying a nucleic acid test value associated with each of the inferred values. The method also can include, by the device, generating and storing in the computer-readable medium a plurality of database entries, each database entry of the plurality including a symptom, a laboratory test or a procedure performed on a patient having that symptom, at least one possible diagnosis associated with that symptom, an inferred value for that diagnosis, and a nucleic acid test value for that inferred value.
Optionally, the nucleic acid test value includes an RNA sequence or a DNA sequence. Additionally, or alternatively, the nucleic acid test values optionally include one or more specific nucleic acid sequences, one or more groups of nucleic acid sequences, one or more quantities of nucleic acid sequences, one or more patterns of nucleic acid sequences, or one or more contexts of nucleic acid sequences. Optionally, the one or more contexts of nucleic acid sequences include one or more associations of nucleic acid sequences with chemical modifications, proteins, other intramolecular or extramolecular nucleic acids, or intracellular or extracellular subcompartments.
Additionally, or alternatively, the plurality of medical documents optionally include standard medical codes describing at least some of the symptoms, laboratory tests or procedures, and diagnoses. Additionally, or alternatively, the plurality of medical documents further include physical findings, medications, or environmental exposures.
Under still another aspect, a method is provided for performing one or more nucleic acid tests based on one or more symptoms experienced by a patient. The method can include receiving by a device respective identifiers of the one or more symptoms experienced by the patient. The method also can include, by the device, submitting to a database a query based on the respective identifiers of each of the one or more symptoms. The database can include a computer-readable medium storing at least a plurality of symptoms, a nucleic acid sequence associated with each of the symptoms, a potential diagnosis associated with each of the symptoms, a laboratory test or a procedure for each of the symptoms, and inferred data for each of the symptoms, the inferred value including a clinical inference based on a result of said laboratory test for the respective symptom. The method also can include, by the device, receiving from the database a response to the query, the response including one or more nucleic acid tests based on the nucleic acid sequences respectively associated with the one or more symptoms identified in the query. The method also can include, by the device, outputting respective representations of the one or more nucleic acid tests. The method also can include receiving, by a receptacle of the device, a cartridge configured to perform at least one of the one or more nucleic acid tests.
Optionally, the method further includes, by the device, outputting a result of the at least one of the one or more nucleic acid tests. The result can include a count of RNA or DNA of the subject or of a pathogen in the subject, the RNA or DNA having the nucleic acid sequence associated with at least one of the one or more symptoms identified in the query.
Additionally, or alternatively, the response to the query can include a representation of a plurality of nucleic acid tests based on a plurality of nucleic acid sequences respectively associated with the one or more symptoms identified in the query. The cartridge optionally can be configured to perform each nucleic acid test of the plurality.
The method optionally can include receiving, by a receptacle of the device, at least one additional cartridge, the at least one additional cartridge being configured to perform at least one other of the nucleic acid tests.
Additionally, or alternatively, the method optionally can include performing by the device the at least one of the one or more nucleic acid tests. Optionally, the performing can include: quantifying by the device an amount of a first subset of the nucleic acids that are present in the biological sample, the first subset of the nucleic acids having a first origin; quantifying by the device an amount of a second subset of the nucleic acids that are present in the biological sample, the second subset of the nucleic acids having a second origin; and determining by the device at least one possible diagnosis based on the amount of the first subset of the nucleic acids and based on the amount of the second subset of the nucleic acids. The method optionally can include outputting by the device an indication of the at least one possible diagnosis. The method optionally, can include, by the device, receiving an indication of at least one of: a diagnosis made by the caregiver, a result of a laboratory test or a procedure performed on the subject, a symptomatic code, a site of injury, a cellular response, a host-immune response, a contribution of a non-human organism, or an origin of cells or symptoms. The method optionally can include transmitting by the device to the database the received indication for use in updating the database.
Optionally, the method further can include receiving by the device or by a second device respective identifiers of one or more symptoms experienced by a second patient. The symptoms experienced by the second patient can be the same as the symptoms experienced by the first patient. Optionally, the method further can include, by the device or by the second device, submitting to the updated database a second query based on the respective identifiers of each of the one or more symptoms. Optionally, the method further can include, by the device or by the second device, receiving from the updated database a response to the second query, the response including one or more updated nucleic acid tests based on the nucleic acid sequences respectively associated with the one or more symptoms identified in the second query. At least one of the one or more updated nucleic acid tests can be different than at least one of the one or more nucleic acid tests. Optionally, the method further can include, by the device or by the second device, outputting respective representations of the updated one or more nucleic acid tests. Optionally, the method further can include receiving, by the receptacle of the device or by a receptacle of the second device, a second cartridge configured to perform at least one of the updated one or more nucleic acid tests.
Under yet another aspect, a device is provided for performing one or more nucleic acid tests based on one or more symptoms experienced by a patient. The device can include an input module configured to receive respective identifiers of the one or more symptoms experienced by the patient. The device also can include a query module configured to submit to a database a query including the respective identifiers of each of the one or more symptoms. The database can include a computer-readable medium storing at least a plurality of symptoms, a nucleic acid sequence associated with each of the symptoms, a potential diagnosis associated with each of the symptoms, a laboratory test or a procedure for each of the symptoms, and inferred data for each of the symptoms, the inferred value including a clinical inference based on a result of said laboratory test for the respective symptom. The query module further can be configured to receive from the database a response to the query, the response including one or more nucleic acid tests based on the nucleic acid sequences respectively associated with the one or more symptoms identified in the query. The device further can include an output module configured to output respective representations of the one or more nucleic acid tests. The device further can include a receptacle configured to receive a cartridge configured to perform at least one of the one or more nucleic acid tests.
Optionally, the output module further can be configured to output a result of the at least one of the one or more nucleic acid tests, the result including a count of RNA or DNA of the subject or of a pathogen in the subject, the RNA or DNA having the nucleic acid sequence associated with at least one of the one or more symptoms identified in the query.
Additionally, or alternatively, the response to the query optionally can include a representation of plurality of nucleic acid tests based on a plurality of nucleic acid sequences respectively associated with the one or more symptoms identified in the query, the cartridge being configured to perform each nucleic acid test of the plurality.
Optionally, the receptacle of the device can be configured to receive least one additional cartridge, the at least one additional cartridge being configured to perform at least one other of the nucleic acid tests.
Additionally, or alternatively, the cartridge optionally can include a first nucleic acid capture module configured to capture a first subset of the nucleic acids that are present in the biological sample, the first subset of the nucleic acids having a first origin. The cartridge optionally further can include a second nucleic acid capture module configured to capture a second subset of the nucleic acids that are present in the biological sample, the second subset of the nucleic acids having a second origin. Optionally, the device further can include a nucleic acid quantifier configured to quantify a respective amount of each of the first and second subsets of captured nucleic acids. The device optionally further can include a diagnosis module configured to determine at least one possible diagnosis based on the amount of the first subset of the nucleic acids and based on the amount of the second subset of the nucleic acids. Optionally, the output module can be configured to output an indication of the at least one possible diagnosis. Optionally, the input module further can be configured to receive an indication of at least one of: a diagnosis, a result of a laboratory test or a procedure performed on the subject, a symptomatic code, a site of injury, a cellular response, a host-immune response, a contribution of a non-human organism, or an origin of cells or symptoms. Optionally, the query module further can be configured to transmit by the device to the database the received indication for use in updating the database.
Optionally, the input module further can be configured to receive respective identifiers of one or more symptoms experienced by a second patient. The symptoms experienced by the second patient can be the same as the symptoms experienced by the first patient. The query module optionally further can be configured to submit to the updated database a second query based on the respective identifiers of each of the one or more symptoms. The query module optionally further can be configured to receive from the updated database a response to the second query, the response including one or more updated nucleic acid tests based on the nucleic acid sequences respectively associated with the one or more symptoms identified in the second query. At least one of the one or more updated nucleic acid tests can be different than at least one of the one or more nucleic acid tests. The output module optionally further can be configured to output respective representations of the updated one or more nucleic acid tests. Optionally, the receptacle of the device further can be configured to receive a second cartridge configured to perform at least one of the updated one or more nucleic acid tests.
Under still another aspect, a method is provided for use in diagnosing a condition based on a symptom experienced by a subject and based on a biological sample obtained from the subject, the biological sample including nucleic acids, the method being executed by a device. The method can include, over a first period of time, quantifying by the device an amount of a first subset of the nucleic acids that are present in the biological sample, the first subset of the nucleic acids having a first origin. The method also can include, over the first period of time, quantifying by the device an amount of a second subset of the nucleic acids that are present in the biological sample, the second subset of the nucleic acids having a second origin that is different than the first origin. The method also can include outputting by the device an indication of the amount of the first subset of the nucleic acids quantified over the first period of time. The method also can include outputting by the device an indication of the amount of the second subset of the nucleic acids quantified over the first period of time.
Optionally, the method further includes, based on the amount of the first subset of the nucleic acids quantified over the first period of time, estimating by the device a first likelihood that the subject is suffering from a first condition. The method optionally further can include, based on the amount of the second subset of the nucleic acids quantified over the second period of time, estimating by the device a second likelihood that the subject is suffering from a second condition that is different than the first condition. The method optionally further can include outputting by the device an indication of the first likelihood and an indication of the second likelihood. Optionally, the method further includes, based on the amount of the first subset of the nucleic acids quantified over the first period of time, estimating by the device a first trajectory of an amount of the first subset of the nucleic acids over a second period of time. Optionally, the method further includes, based on the amount of the second subset of the nucleic acids quantified over the first period of time, estimating by the device a second trajectory of an amount of the second subset of the nucleic acids over the second period of time. Optionally, the method further includes outputting by the device an indication of the first trajectory and an indication of the second trajectory. Optionally, the method further includes, based on the first and second trajectories, estimating by the device a second time at which the first or second condition is sufficiently likely as to make a diagnosis that the patient is suffering from that condition; and outputting by the device an indication of the second time.
Additionally, or alternatively, the method further can include receiving by the device additional clinical information regarding the patient. The first and second likelihoods optionally can be further based on the received additional clinical information.
Additionally, or alternatively, the method optionally further can include, over a second period of time subsequent to the first period of time, quantifying by the device an amount of the first subset of the nucleic acids that are present in the biological sample. The method optionally further can include, over the second period of time, quantifying by the device an amount of the second subset of the nucleic acids that are present in the biological sample. The method optionally further can include outputting by the device an indication of the amount of the first subset of the nucleic acids quantified over the second period of time. The method optionally further can include outputting by the device an indication of the amount of the second subset of the nucleic acids quantified over the second period of time.
Additionally, or alternatively, the indications of the amounts of the first and second subsets of nucleic acids quantified over the first period of time optionally can include a histogram.
Additionally, or alternatively, the indication of the amount of the first subset of the nucleic acids over the first period of time optionally can include a number of first cell equivalents. The indication of the amount of the second subset of the nucleic acids over the first time can include a number of second cell equivalents. Optionally, the first origin can include a pathogen, and the number of first cell equivalents can represent a severity of infection of the subject by the pathogen. Additionally, or alternatively, the number of first cell equivalents or the number of second cell equivalents optionally can represent a severity of a condition from which the subject is suffering or clinical significance. Additionally, or alternatively, the number of first cell equivalents or the number of second cell equivalents optionally can represent a response to a treatment.
Additionally, or alternatively, the method optionally can include, based on the amount of the first subset of the nucleic acids quantified over the first period of time, ceasing quantifying by the device an amount of the first subset of the nucleic acids over a second period of time that is subsequent to the first period of time. The method further optionally can include, based on the ceasing, over the second period of time, quantifying by the device an amount of a third subset of the nucleic acids that are present in the biological sample, the third subset of the nucleic acids having a third origin that is different than the first origin and that is different than the second origin. The method further optionally can include outputting by the device an indication of the amount of the third subset of the nucleic acids quantified over the second period of time.
Optionally, the device includes a sequencer that quantifies the first subset of the nucleic acids over the first period of time and that is reassigned so as to quantify the third subset of the nucleic acids over the second period of time. Additionally, or alternatively, the ceasing optionally can be based on an estimation by the device of a first likelihood that the subject is suffering from a first condition, the estimation being based on the amount of the first subset of the nucleic acids quantified over the first period of time. Optionally, the ceasing further can be based on a comparison by the device of the estimation to a threshold.
Under yet another aspect, a device is provided for use in diagnosing a condition based on a symptom experienced by a subject and based on a biological sample obtained from the subject, the biological sample including nucleic acids. The device can include a first quantification module configured to quantify, over a first period of time, an amount of a first subset of the nucleic acids that are present in the biological sample, the first subset of the nucleic acids having a first origin. The device also can include a second quantification module configured to quantify, over the first period of time, an amount of a second subset of the nucleic acids that are present in the biological sample, the second subset of the nucleic acids having a second origin that is different than the first origin. The device also can include an output module configured to: output an indication of the amount of the first subset of the nucleic acids quantified over the first period of time, and to output an indication of the amount of the second subset of the nucleic acids quantified over the first period of time.
Optionally, the device further can include an estimation module configured to estimate, based on the amount of the first subset of the nucleic acids quantified over the first period of time, a first likelihood that the subject is suffering from a first condition. Optionally, the estimation module further can be configured to estimate, based on the amount of the second subset of the nucleic acids quantified over the second period of time, a second likelihood that the subject is suffering from a second condition that is different than the first condition. Optionally, the output module further can be configured to output an indication of the first likelihood and an indication of the second likelihood. Optionally, the estimation module further can be configured to estimate, based on the amount of the first subset of the nucleic acids quantified over the first period of time, a first trajectory of an amount of the first subset of the nucleic acids over a second period of time. Optionally, the estimation module further can be configured to estimate, based on the amount of the second subset of the nucleic acids quantified over the first period of time, a second trajectory of an amount of the second subset of the nucleic acids over the second period of time. Optionally, the output module further can be configured to output an indication of the first trajectory and an indication of the second trajectory. Optionally, the estimation module further is configured to estimate, based on the first and second trajectories, a second time at which the first or second condition is sufficiently likely as to make a diagnosis that the patient is suffering from that condition. Optionally, the output module further is configured to output an indication of the second time.
Additionally, or alternatively, the device optionally further can include an input interface configured to receive additional clinical information regarding the patient. The first and second likelihoods optionally further can be based on the received additional clinical information.
Additionally, or alternatively, the first quantification module optionally can be configured to quantify, over a second period of time subsequent to the first period of time, an amount of the first subset of the nucleic acids that are present in the biological sample. The second quantification module optionally can be configured to quantify, over the second period of time, an amount of a second subset of the nucleic acids that are present in the biological sample. Optionally, the output module can be configured to output an indication of the amount of the first subset of the nucleic acids quantified over the second period of time. Optionally, the output module can be configured to output an indication of the amount of the second subset of the nucleic acids quantified over the second period of time.
Additionally, or alternatively, the indications of the amounts of the first and second subsets of nucleic acids quantified over the first period of time optionally can include a histogram.
Additionally, or alternatively, the indication of the amount of the first subset of the nucleic acids over the first period of time optionally can include a number of first cell equivalents, and the indication of the amount of the second subset of the nucleic acids over the first time optionally can include a number of second cell equivalents. Optionally, the first origin includes a pathogen, and the number of first cell equivalents represents a severity of infection of the subject by the pathogen. Additionally, or alternatively, the number of first cell equivalents or the number of second cell equivalents optionally represents a severity of a condition from which the subject is suffering or clinical significance. Additionally, or alternatively, the number of first cell equivalents or the number of second cell equivalents optionally represents a response to a treatment.
Additionally, or alternatively, the first quantification module optionally can be configured to cease, based on the amount of the first subset of the nucleic acids quantified over the first period of time, quantifying an amount of the first subset of the nucleic acids over a second period of time that is subsequent to the first period of time. Optionally, the first quantification module can be configured to quantify, based on the ceasing, over the second period of time, an amount of a third subset of the nucleic acids that are present in the biological sample, the third subset of the nucleic acids having a third origin that is different than the first origin and that is different than the second origin. Optionally, the output module further can be configured to output an indication of the amount of the third subset of the nucleic acids quantified over the second period of time.
Additionally, or alternatively, the first quantification module optionally includes a sequencer that quantifies the first subset of the nucleic acids over the first period of time and that is reassigned so as to quantify the third subset of the nucleic acids over the second period of time. Additionally, or alternatively, the ceasing optionally can be based on an estimation by the device of a first likelihood that the subject is suffering from a first condition, the estimation being based on the amount of the first subset of the nucleic acids quantified over the first period of time. Optionally, the ceasing further can be based on a comparison by the device of the estimation to a threshold.
Under yet another aspect, a method is provided for use in assessing the quality of a biological sample obtained from a subject, the biological sample including nucleic acids, the method being executed by a device. The method can include quantifying by the device an amount of a first subset of the nucleic acids that are present in the biological sample, the first subset of the nucleic acids having an intracellular origin. The method further can include quantifying by the device an amount of a second subset of the nucleic acids that are present in the biological sample, the second subset of the nucleic acids having an extracellular origin. The method further can include outputting by the device an indication of the amount of the first subset of the nucleic acids. The method further can include outputting by the device an indication of the amount of the second subset of the nucleic acids. The relative amounts of the first and second subsets of the nucleic acids can indicate the quality of the biological sample.
Optionally, the method further can include outputting by the device an indication of an expected amount of the first subset of the nucleic acids in a normal biological sample and an indication of an expected amount of the second subset of the nucleic acids in a normal biological sample.
Under still another aspect, a device is provided for use in assessing the quality of a biological sample obtained from a subject, the biological sample including nucleic acids. The device can include a first quantification module configured to quantify an amount of a first subset of the nucleic acids that are present in the biological sample, the first subset of the nucleic acids having an intracellular origin. The device further can include a second quantification module configured to quantify an amount of a second subset of the nucleic acids that are present in the biological sample, the second subset of the nucleic acids having an extracellular origin. The device further can include an output module configured to output an indication of the amount of the first subset of the nucleic acids and to output an indication of the amount of the second subset of the nucleic acids. The relative amounts of the first and second subsets of the nucleic acids can indicate the quality of the biological sample.
Optionally, the output module further is configured to output an indication of an expected amount of the first subset of the nucleic acids in a normal biological sample and an indication of an expected amount of the second subset of the nucleic acids in a normal biological sample.
Embodiments of the present invention provide devices and methods for diagnostics based on analysis of nucleic acids. For example, provided herein is a diagnostic device that can simplify and automate nucleic acid testing in the clinic. In certain embodiments, this device can be or include a portable or stationary unit that can facilitate a physician's diagnosis of a cause of patient's symptom, such as an origin or type of tumor or a type of infection, and can provide other competencies, regardless of whether the patient is seen in a community clinic, an emergency room, inpatient hospital, or academic center. Physicians can employ such a device directly to test any suitable number of diagnoses, e.g., tens to hundreds of diagnoses, essentially simultaneously in their patients, without necessarily needing to transmit biological samples offsite. In some embodiments, the device can be configured so as to suitably interact with the physician and to assist in creating results reports that can support or exclude diagnoses, using information generated by the device. For example,
Overview: How Symptom-Based Testing Reduces Complexity of Nucleic Acid Analysis
Current sequencing devices are not practical for an outpatient setting due to technical and informational complexity. To circumvent these difficulties, certain embodiments of the present devices and methods can provide targeted detection of a limited number of nucleic acid sequences can improve sensitivity, accelerate identification, and interpretation.
It may not necessarily be known a priori which nucleic acid sequences are clinically informative for a diagnosing particular symptom. Certain embodiments of the present invention provide devices and methods to identify clinically informative sequences. For example, the present devices and methods can generate an external index, hereafter called inferred data. An index of inferred data can be initially created automatically (e.g., by a computer processor executing suitable software) or by humans (e.g., by one or more physician specialists), to define what types of clinical information can be inferred from existing laboratory tests and procedures. For example,
In addition to manual approaches to categorize current laboratory test and procedures, automated approaches can be employed using medical literature and internet. Electronic search for individual test and procedure names can identify the context of a test within a chapter or data in various text formats such as HyperText Markup Language (html). For example, in one nonlimiting example in the html format, “<B>Troponin</B>” and “Cardiac” may appear adjacent to one another or within quotes in a given document. In one approach, a frequency is generated by counting the number of times individual laboratory tests and procedures appear together with a list of clinically relevant terms representing anatomic site, pathogen, or host response. A non-limiting example of this approach is shown here:
A second automated approach performs text-based search among public and private medical records such as described above with reference to
As described in greater detail below with reference to
For example,
Illustratively, a database can be stored in a computer-readable medium, the database storing at least a plurality of symptoms, a nucleic acid sequence associated with each of the symptoms, a potential diagnosis associated with each of the symptoms, a laboratory test or a procedure for each of the symptoms, and an inferred value for each of the symptoms, the inferred value comprising a clinical inference based on a result of said laboratory test for the respective symptom, e.g., such as described herein with reference to
In some embodiments, a method of generating a database stored in a computer-readable medium can include receiving, by a device (e.g., by a suitably programmed processor), a plurality of medical documents. For example, the device can include a computer-readable medium (which can be the same as or different than the computer-readable medium in which the database is stored) in which the plurality of medical documents can be stored. Each document can describe at least one symptom experienced by a respective patient, a laboratory test or a procedure performed on that patient, and a diagnosis associated with the at least one symptom experienced by that patient, the diagnosis being based on a result of the laboratory test performed on that patient. The method also can include, by the device, inferring values based on the symptoms, the laboratory tests, and the diagnoses described in the plurality of medical documents, each inferred value comprising a clinical inference based on a result of at least one of the laboratory tests for the respective symptom, e.g., such as described herein with reference to
Optionally, the nucleic acid test value can include an RNA sequence or a DNA sequence. Additionally, or alternatively, the nucleic acid test values can include one or more specific nucleic acid sequences, one or more groups of nucleic acid sequences, one or more quantities of nucleic acid sequences, one or more patterns of nucleic acid sequences, or one or more contexts of nucleic acid sequences. Illustratively, the one or more contexts of nucleic acid sequences can include one or more associations of nucleic acid sequences with chemical modifications, proteins, other intramolecular or extramolecular nucleic acids, or intracellular or extracellular subcompartments. The plurality of medical documents can include standard medical codes describing at least some of the symptoms, laboratory tests or procedures, and diagnoses. Additionally, or alternatively, the plurality of medical documents further can include physical findings, medications, or environmental exposures.
In some embodiments, nucleic acid values (e.g., “AATGGGAACGGTAA” (SEQ ID NO: 4)) can be further evaluated for their uniqueness. For example, the nucleic acid value (“AATGGGAACGGTAA” (SEQ ID NO: 4)) can be found both in brain and heart tissue. A symptom-based approach can be used to determine if distinguishing between these two tissues may be necessary or useful in a given clinical scenario. For example, in the presence of the symptom ‘chest pain’, the detection of circulating “AATGGGAACGGTAA” (SEQ ID NO: 4) is more likely to be representative of damaged heart tissue than damaged brain tissue. As indicated above, specificity of nucleic acid sequences is likely to vary depending on the sample site, e.g., blood versus urine. Additional methods can be incorporated into the identification of specific patterns or quantities of nucleic acid sequences, use of quantitative differences, threshold cut-offs, rank orders, and other means. The output of this process is a set of nucleic acid sequences, which are non-overlapping with nucleic acid sequences present in competing diagnoses.
Illustratively, a method for performing one or more nucleic acid tests based on one or more symptoms experienced by a patient can include receiving by a device (e.g., by the instruments described herein with reference to
In some embodiments, the method further includes, by the device, outputting a result of the at least one of the one or more nucleic acid tests, the result comprising a count of RNA or DNA of the subject or of a pathogen in the subject, the RNA or DNA having the nucleic acid sequence associated with at least one of the one or more symptoms identified in the query. Additionally, or alternatively, the response to the query can include a representation of a plurality of nucleic acid tests based on a plurality of nucleic acid sequences respectively associated with the one or more symptoms identified in the query, the cartridge being configured to perform each nucleic acid test of the plurality. Additionally, or alternatively, the method further can include receiving, by a receptacle of the device, at least one additional cartridge, the at least one additional cartridge being configured to perform at least one other of the nucleic acid tests.
Under another aspect, a device (e.g., an instrument such as described herein with reference to
In some embodiments, the output module optionally further is configured to output a result of the at least one of the one or more nucleic acid tests, the result comprising a count of RNA or DNA of the subject or of a pathogen in the subject, the RNA or DNA having the nucleic acid sequence associated with at least one of the one or more symptoms identified in the query. Additionally, or alternatively, the response to the query can include a representation of plurality of nucleic acid tests based on a plurality of nucleic acid sequences respectively associated with the one or more symptoms identified in the query, the cartridge being configured to perform each nucleic acid test of the plurality. Additionally, or alternatively, the receptacle of the device further can be configured to receive least one additional cartridge, the at least one additional cartridge being configured to perform at least one other of the nucleic acid tests.
The above described exemplary approach highlights that the scope of nucleic acid testing and interpretation can be greatly narrowed based on a method of inferred data values and symptom-specific data structure. Furthermore, such a symptom-based methodology can address several major obstacles in nucleic acid testing and can facilitate the miniaturization and improvement of a nucleic acid diagnostic instrument.
Overview of Diagnostic Device and Exemplary Use Thereof
An overview of an exemplary diagnostic device as a portable or stationary unit to detect active genes and foreign DNA to test multiple diagnoses, according to some embodiments, is illustrated in
Component 5 can be electronically coupled to component 4 and can be configured so as to perform sequence identification and quantification, e.g., so as to perform symptom-specific DNA (component 5A), RNA (component 5B), and integrated analyses (component 5C). Illustratively, component 5 (also referred to herein as 5A/5B/5C) can include a processor and one or more computer-readable media storing instructions to cause the processor to perform one or more of the functions provided herein, and also storing information for use in performing nucleic acid analysis. For example, component 5 can include a nucleic acid quantifier configured to quantify an amount of each of the captured nucleic acids that is present in the first biological sample. For example, component 5 can include a processor coupled to the quantifier and to the sequencer and that is suitably programmed to identify an origin of each captured nucleic acid based on the sequence of that captured nucleic acid. In embodiments that include a separator, the processor can be suitably programmed to cause the display component 6 or other output module to output an indication of the quantified amount of at least one of the extracellular nucleic acids and an indication of the quantified amount of at least one of the intracellular nucleic acids. Note that the terms “component” and “module” can be used interchangeably herein.
Optionally, the device can include a computer-readable medium coupled to the processor, and the processor further can be suitably programmed to identify the origin of the captured nucleic acid based on comparing the sequence of that nucleic acid to sequences stored in a library stored in the computer-readable medium. Optionally, the library stores nucleic acid sequences for a human and for a plurality of pathogens. The output from the device can indicate the relative number of a pathogen per human cell.
Component 6 can be electronically coupled to components 5, 7, 8 and can be configured so as to interact with the user, e.g., can include display and input components. For example, component 6 can include a display that is coupled to the processor, and the processor (e.g., of component 5) can be suitably programmed to cause the display to output an indication of the quantified amount and the identified origin of at least one captured nucleic acid that is present in the first biological sample. Optionally, the processor can be suitably programmed to cause the display to output an indication of the quantified amount of each of the captured nucleic acids of the first plurality. As another example, the processor (e.g., of component 5) can be suitably programmed to cause the display to output an indication of at least one potential diagnosis for the subject and an indication of the likelihood of the at least one diagnosis based on the quantified amount and the identified origin of at least one captured nucleic acid that is present in the first biological sample.
Component 7 can be electronically coupled to components 5 and 8 and configured so as to provide connectivity to electronic medical records, a database such as described elsewhere herein, and external sequence analysis, e.g., can include a network module. Component 8 can be coupled to components 1-7 and 9 and configured to provide a power source, e.g., can include a rechargeable, solar, or other power source, or can be configured so as to connect to a standard AC power outlet. Component 9 includes portions of the device that can be exchanged, allowing for optimization, customization, or restoration. For example, component 9 can include exchangeable portions of the device for specific symptoms, and restoring reagents, such as some or all of components 1, 2, and 3. For example, the inset to
In some embodiments, the first set of complementary nucleic acids (e.g., of component 3) further captures a second plurality of nucleic acids of the first set that are present in a second biological sample obtained from the subject, the second biological sample being different from the first biological sample. The nucleic acid quantifier (e.g., of component 5) further can quantify an amount of each of the captured nucleic acids that is present in the second biological sample. The nucleic acid sequencer (e.g., of component 4) further can sequence each of the captured nucleic acids that is present in the second biological sample. The processor (e.g., of component 5) further can be suitably programmed so as to identify an origin of each captured nucleic acid based on the sequence of the captured nucleic acid that is present in the second biological sample. The processor further can be suitably programmed so as to cause the display (e.g., of component 7) to output the an indication of quantified amount and the identified origin of at least one captured nucleic acid that is present in the second biological sample.
Illustratively, to activate the device, the healthcare worker can input a physical identification or a touch code via Component 6 (
Further exemplary details of the components illustrated in
Component 1: Preparing Samples
In some embodiments, during use, the user can inject or deliver one or more biological samples, e.g., via a syringe, capillary tube, or pipette, into Component 1, which can include one or more sample ports of the device, such as illustrated in
Component 2: Separating Samples and Preparing Nucleic Acids
The process of receiving biological liquids, separation, lysis and nucleic acid processing can be achieved by adjoined components from existing methodologies and devices. For example,
In one nonlimiting example, sample reception, storage and subsequent separation (Component 2) can be achieved using known or customized microfluidic components, such as microfluidic ChipShop spiral or pillar particle and cell sorting chips (e.g., part numbers #18-1708-0382-01 or #19-1800-0261-01, commercially available from microfluidic ChipShop, Jena, Germany), micro-droplet separator, or other separation approaches. In some embodiments, compartments may be initially separated but can become open based on an electronic signal from a controller (e.g., Component 5) or natively in response to the presence of liquids. The separation of samples based on size, visual or other properties can allow the device to identify whether the analyte, e.g., DNA or RNA, originated from an intact cell or debris from a damaged cell. In some embodiments, following this stage, the analytes, e.g., DNA and RNA, can be present within liquid solution as a complex with cellular proteins or as free molecules. Further purification of protein-associated nucleic acids and enrichment of free nucleic acids can be performed, using well-known methods of immuno-purification of DNA such as commercially available Clontech EpiXplore ChIP assay kits (Clontech Laboratories, Inc., Mountain View, Calif.). Additionally, isolation of DNA and RNA can be performed using existing methods and reagents, such as Bioneer Accuprep (Bioneer Corporation, Daej eon, Korea), Qiagen AllPrep DNA/RNA FFPE Kit (Qiagen Inc., Valencia, Calif.), or Zymo Research ZR-Duet (Zymo Research Corporation, Irvine, Calif.) and electrical, heat or mechanical methods, such as ChipGenie (microfluidic ChipShop, Jena, Germany) to disrupt cells, release and isolate DNA and RNA.
Component 3 and 4: Selection of Clinically Relevant DNA and RNA Molecules to Assay
From the total population of DNA and RNA molecules, clinically informative sequences can be enriched using established methods, such as nucleotide capture (e.g. Agilent SureSelect (Agilent Technologies, Santa Clara, Calif.), Nextera Rapid Capture (Illumina, Inc., San Diego, Calif.), or the like) or targeted sequences using oligonucleotides. To select specific sequences, publicly available computational approaches such as the Wessim Whole Exome Sequencing SIMulator using in silico exome capture (a Python based simulator available for download from sak042.github.io/Wessim) can be used to predict and simulate capture oligonucleotides. Using such an approach, the device can target a select group of sequences that collectively facilitate distinguishing between multiple diagnostic possibilities for each clinical scenario. The types of sequences captured can depend on the clinical context and can be pre-defined by symptoms as discussed above with reference to
In some embodiments, component 4 can include known components to perform the sequencing of nucleic acids. Indeed, there are many ways to identify specific sequences. In one example, selected nucleic acids from Component 3 can be prepared for sequencing using methods well-known to molecular biologists, and can be sequenced using one or more known devices, such as Illumina Mi-Seq (Illumina, Inc., San Diego, Calif.), Life Technologies Ion Torrent (Life Technologies, Thermo Fisher Scientific Inc., Waltham, Mass.), or the like. In brief, such devices use DNA from the patient sample as a template to make new copies of DNA. This copy process can be monitored chemically or visually and is used to record the order in which individual nucleotides are added into the new copy of DNA. This order corresponds to the sequence of the DNA. Such sequence can be compared to sequences in a database, and the order of nucleotides in the sequence can reveal the identity and origin of the sequence, e.g., human, bacteria, fungus, virus, and subtypes of species, and individual specific differences, e.g., drug-resistance and states. Sequencing instruments (currently known as next generation sequencers) can allow for relatively large numbers of nucleic acids to be sequenced in parallel. This attribute can facilitate testing multiple molecules simultaneously and flexibility in their application.
In particular embodiments, the methods of the invention can be performed with next generation sequencing (NGS) using commercially available kits and instruments from companies such as the Life Technologies/Ion Torrent PGM or Proton (Life Technologies, Thermo Fisher Scientific Inc., Waltham, Mass.), the Illumina HiSEQ or MiSEQ (Illumina, Inc., San Diego, Calif.), and the Roche/454 next generation sequencing system (Roche Diagnostics Corporation, Basel, Switzerland). NGS technology is rapidly revolutionizing the fields of genomics molecular diagnostics, and personalized medicine through the increasingly efficient and economical generation of unprecedented volumes of data. See, e.g., the following references, the entire contents of each of which is incorporated by reference herein: Didelot et al., “Transforming clinical microbiology with bacterial genome sequencing,” Nature Rev. Genetics, 13: 601-612 (2012); Biesecker et al., “Next generation sequencing in the clinic: Are we ready?” Nature Rev. Genetics 13: 818-824 (2012); Martin et al., “Next-generation transcriptome assembly,” Nature Rev. Genetics 12: 671-682 (2011); Voelkerding et al., “Next-generation sequencing: From basic research to diagnostics,” Clin. Chem. 55: 641-658 (2009); Su et al., “Next-generation sequencing and its applications in molecular diagnostics,” Expert Rev. Mol. Diagn. 11: 333-343 (2011); Meyerson et al., “Advances in understanding cancer genomes through second-generation sequencing,” Nature Rev. Genetics 11: 685-696 (2010); and Zhang et al., “The impact of next-generation sequencing on genomics,” Journal of Genetics and Genomics=Yi chuan xue bao 38: 95-109 (2011).
Some commonly used NGS platforms are the 454 GS Junior (Roche Diagnostics Corporation, Basel, Switzerland), Ion Torrent (Life Technologies, Thermo Fisher Scientific Inc., Waltham, Mass.), and MiSeq (Illumina, Inc., San Diego, Calif.), which are “benchtop” sequencers designed for laboratory use. These platforms are capable of a wide range of sequencing applications due to their versatility in sample type, experiment scale, instrument protocol, and multiplexing options. See, for example, the following references, the entire contents of which are incorporated by reference herein: Liu et al., “Comparison of next-generation sequencing systems,” J. Biomedicine & Biotechnology, 2012: Article ID 251364, 11 pages (2012); Loman et al., “Performance comparison of benchtop high-throughput sequencing platforms,” Nature Biotechnol. 30: 434-439 (2012); Glenn, “Field guide to next-generation DNA sequencers,” Mol. Ecol. Resources 11: 759-769 (2011); and Quail et al., “A tale of three next generation sequencing platforms: comparison of Ion Torrent, Pacific Biosciences and Illumina MiSeq sequencers,” BMC Genomics 13: 341, 13 pages (2012). The 454 and Ion Torrent platforms use emulsion PCR to generate millions of DNA molecules with the same sequence from a single sample molecule attached to a polymer bead. The Illumina platforms use bridge PCR to amplify single surface-bound molecules to generate a cluster of molecules with the same sequence. Templates are then sequenced by a stepwise incorporation of nucleotides (e.g., Illumina Genome Analyzer, Roche Applied Science 454 Genome Sequencer) or short oligonucleotides (e.g., Applied Biosystems SOLID (Applied Biosystems, Thermo Fisher Scientific Inc., Waltham, Mass.)). Both the bridge PCR and emulsion PCR methods of parallel amplification require the ligation of adapter sequences to the ends of sample DNA molecules to create sequencing libraries that can bind to surface or bead-bound probes complementary to the adapters.
In addition, the analysis of nucleic acids can be performed using any technique known in the art including, without limitation, sequence analysis, and electrophoretic analysis. Non-limiting examples of sequence analysis include Maxam-Gilbert sequencing; Sanger sequencing; capillary array DNA sequencing; thermal cycle sequencing such as disclosed in Sears et al., “CircumVent thermal cycle sequencing and alternative manual and automated DNA sequencing protocols using the highly thermostable VentR (exo-) DNA polymerase,” Biotechniques, 13: 626-633 (1992), the entire contents of which are incorporated by reference herein; solid-phase sequencing such as disclosed in Zimmerman et al., “Fully automated Sanger sequencing protocol for double stranded DNA,” Methods Mol. Cell Biol., 3: 39-42 (1992), the entire contents of which are incorporated by reference herein; sequencing with mass spectrometry such as matrix-assisted laser desorption/ionization time-of-flight mass spectrometry (MALDI-TOF/MS) such as disclosed in Fu et al., “Sequencing exons 5 to 8 of the p53 gene by MALDI-TOF mass spectrometry,” Nat. Biotechnol, 16: 381-384 (1998), the entire contents of which are incorporated by reference herein; and sequencing by hybridization, such as disclosed in the following references, the entire contents of each of which are incorporated by reference herein: Chee et al., “Accessing genetic information with high-density DNA arrays,” Science, 274: 610-614 (1996); Drmanac et al., “DNA sequence determination by hybridization: a strategy for efficient large-scale sequencing,” Science, 260: 1649-1652 (1993); and Drmanac et al., “Accurate sequencing by hybridization for DNA diagnostics and individual genomics,” Nat. Biotechnol., 16:54-58 (1998). Non-limiting examples of electrophoretic analysis include slab gel electrophoresis such as agarose or polyacrylamide gel electrophoresis, capillary electrophoresis, and denaturing gradient gel electrophoresis.
As for the identification of specific DNA sequences, detection of active genes also can be based on highly specific RNA sequences. A target mRNA can be amplified by reverse transcribing the mRNA into cDNA, and then performing PCR (reverse transcription-PCR or RT-PCR). The reverse transcription step is typically primed using specific primers, random hexamers, or oligo-dT primers, depending on the circumstances and the goal of expression profiling. For example, extracted RNA can be reverse-transcribed using a GeneAmp® RNA PCR kit (Applied Biosystems, Thermo Fisher Scientific Inc., Waltham, Mass.) according to the manufacturer's instructions. The derived cDNA can then be used as a template in a subsequent PCR reaction.
RNA-seq is an emerging technology for surveying gene expression and transcriptome content by directly sequencing the mRNA molecules in a sample. RNA-seq can provide gene expression measurements and is regarded as an attractive approach to analyze a transcriptome in an unbiased and comprehensive manner.
Various methods for determining expression of mRNA, protein, or gene amplification include, but are not limited to, gene expression profiling, polymerase chain reaction (PCR) including quantitative real time PCR (qRT-PCR), RNA-Seq, FISH, microarray analysis, serial analysis of gene expression (SAGE), MassARRAY, proteomics, and immunohistochemistry (IHC).
Histone modifications have been implicated in the regulation of gene expression and genome function. Chromatin Immunoprecipitation followed by hybridization (Chip-on-CHIP) and Chip-sequencing (ChIP-Seq) can be used to determine the localization of this modification at specific genomic locations and to determine which genes are targeted and turned on and off in a variety of diseases and disorders.
Gene expression profiles can be readily obtained by any number of methods known in the art, for example, microarray analysis, individual gene or RNA screening (e.g., by PCR or real time PCR), diagnostic panels, mini chips, NanoString chips (nanoString Technologies, Seattle, Wash.), RNA-seq chips, protein chips, or ELISA tests.
Methods of measuring a level of a polypeptide gene product are known in the art and include assays that utilize a capture agent. In some embodiments, the capture agent is an antibody, antibody fragment, nucleic acid-based protein binding reagent, small molecule or variant thereof. In additional embodiments, the assay is an enzyme immunoassay (EIA), enzyme-linked immunosorbent assay (ELISA), and radioimmunoassay (RIA). In some embodiments, detection and/or quantification of one or more biomarkers further comprises mass spectrometry (MS). In yet further embodiments, the mass spectrometry is co-immunoprecitipation-mass spectrometry (co-IP MS), where coimmunoprecipitation, a technique suitable for the isolation of whole protein complexes is followed by mass spectrometric analysis.
Component 5: Analysis of Sequence Data from Multiple Sources
In some embodiments, component 5 can include a suitably programmed processor, which, responsive to instructions on a computer-readable medium, takes as input data from Component 4 (the sequencing instrument) and compares such data to a pre-computed and stored library of sequences such as described in greater detail above with reference to
As discussed above, nucleic acid information is obtained from multiple sources such as different sample sites (e.g., blood, urine, CSF), RNA vs. DNA, intact cells, circulating cell debris, association with specific molecules (e.g. modified histones) and others. Component 5 is able to recognize these different sources of nucleic acid data. In
A non-limiting example of this synchronization of multiple samples across multiple nucleic acid testing devices is shown below:
One exemplary arrangement for accommodating asynchronous sequencing of a biological source across multiple sequencers is depicted in
Illustratively, a method for use in diagnosing a condition based on a symptom experienced by a subject and based on a biological sample obtained from the subject, the biological sample including nucleic acids, the method being executed by a device (such as an instrument described herein with reference to
Optionally, the method can include, based on the amount of the first subset of the nucleic acids quantified over the first period of time, ceasing quantifying by the device an amount of the first subset of the nucleic acids over a second period of time that is subsequent to the first period of time. The method also can include, based on the ceasing, over the second period of time, quantifying by the device an amount of a third subset of the nucleic acids that are present in the biological sample, the third subset of the nucleic acids having a third origin that is different than the first origin and that is different than the second origin. The method also can include outputting by the device an indication of the amount of the third subset of the nucleic acids quantified over the second period of time.
Optionally, the device can include a sequencer that quantifies the first subset of the nucleic acids over the first period of time and that is reassigned so as to quantify the third subset of the nucleic acids over the second period of time. Additionally, or alternatively, the ceasing is based on an estimation by the device of a first likelihood that the subject is suffering from a first condition, the estimation being based on the amount of the first subset of the nucleic acids quantified over the first period of time. Optionally, the ceasing further can be based on a comparison by the device of the estimation to a threshold.
Under another aspect, a device (e.g., an instrument such as described herein with reference to
Optionally, the first quantification module is configured to cease, based on the amount of the first subset of the nucleic acids quantified over the first period of time, quantifying an amount of the first subset of the nucleic acids over a second period of time that is subsequent to the first period of time. The first quantification module can be configured to quantify, based on the ceasing, over the second period of time, an amount of a third subset of the nucleic acids that are present in the biological sample, the third subset of the nucleic acids having a third origin that is different than the first origin and that is different than the second origin. The output module further can be configured to output an indication of the amount of the third subset of the nucleic acids quantified over the second period of time. Additionally, or alternatively, the first quantification module can include a sequencer that quantifies the first subset of the nucleic acids over the first period of time and that is reassigned so as to quantify the third subset of the nucleic acids over the second period of time. Additionally, or alternatively, the ceasing can be based on an estimation by the device of a first likelihood that the subject is suffering from a first condition, the estimation being based on the amount of the first subset of the nucleic acids quantified over the first period of time. Additionally, or alternatively, the ceasing further can be based on a comparison by the device of the estimation to a threshold.
Component 5A: Sequence Analysis of DNA in Detecting Pathogens and Humans
In particular embodiments, the present devices and methods can be employed to detect a viral infection, a Gram positive bacterial infection or a Gram negative bacterial infection, a parasite or a fungus.
Component 5 uses data such as described above for several types of analyses, including species identification, estimation of cell and microbe number, disease predisposition, personalization of safe treatment options, detection of tissue and cell type damage, determine host cellular responses, identify pathogen response, determine which treatments the pathogens are sensitive to, and others. Below is a description of an exemplary manner in which the device analyzes and utilizes information from different nucleic acid analytes.
For DNA matches, any of several analyses can be performed to output specific types of clinical data to the physician. This DNA analysis process are described as component 5A, which can include a suitable processor, associated memory and database (which may be the same as component 5), which, responsive to instructions on a computer-readable medium, takes as input electronic data from the sequencer (Component 4) to analyze and output to the physician which genes or biomarkers are detected via Component 6.
Hypothetical “pneumonia” (species detection and cell number) Scenario 2 illustrated in
DNA analysis can also reveal patient genetic markers. Genetic markers are a type of variation in a patient's DNA sequence that is correlated with disease risk, an indication for a medication, or adverse drug event risk. For example,
Component 5B: Detects Affected Cells and Host Responses by Gene Detection
In some embodiments, component 5B can include a suitably programmed processor (which may be the same as the processor of component 5), which, responsive to instructions on a computer-readable medium, takes as input electronic data from the sequencer (Component 4) to analyze and output to the physician which genes or biomarkers are detected via Component 6. For example, component 5B can use electronic data to detect products of activated genes either from intact cells or from circulating cell debris, and uses this information to derive two major functions: which cells are present and what types of host responses are occurring. In some embodiments, gene detection, e.g., RNA or chromatin immunoprecipitation of DNA, can be used to determine which tissues are damaged and what the cellular responses are present. As described above, e.g., with reference to
Additionally, existing approaches align data to an encyclopedia of sequences representing one or more genomes. The location of where this alignment occurs within the encyclopedia can be used to obtain additional information. Although common, such a practice is in some ways analogous to scanning through 3 billion pages for a match and is computationally complex. In some embodiments, the present methods and devices can use a more targeted approach, such as so called targeted sequencing and alignment-free method, so as to as to scan through significantly fewer total number of possible sequences, potentially saving significant time and improving sensitivity. For further details regarding alignment free methods, see, e.g., Vinga et al., “Alignment-free sequence comparison-a review,” Bioinformatics, 19: 513-523 (2003), the entire contents of which are incorporated by reference herein.
Hypothetical “myocardial infarction” Scenario 2 illustrated in the left panel of
Detecting Host Response
In some embodiments, in addition to identifying which cells are present, the present device can be configured so as to report how cells respond to disease, infection, and changes in the environment. Physicians frequently order microscopic exam of blood, urine, and other biological fluids and other tests to identify the presence or absence of specific populations of immune cells. For example, in bacterial infections, acute inflammatory cells, e.g. neutrophils, when detected in the blood, urine or wound site are a sensitive indicator for a possible infection. In parasitic and drug hypersensitivity responses, “allergic”-type responses can be seen, e.g., increased eosinophils (eosinophilia). In viral responses, increased numbers of lymphocytes and different immune states are often noted. In other examples, the presence of different RBC stages, e.g., normoblasts or reticulocytes, is used as evidence of high turnover and hematopoietic disturbances. Thus, use of RNA counts allows the physician to the detection or shifts in the abundance of immune cells.
Other tests of host responses do not involve a shift in cell number but can be detected by RNA changes. For example, ferritin and transferrin receptor levels are sensitive assays to diagnose systemic low iron conditions. Response to low iron arises from activation of genes, whose products are used in iron transport and absorption. Other types of cell states can represent chronic injury, hypoxia, and hyperactivation. Thus, potentially important clinical information can be found in addition to the identification and quantity of a cell.
Component 5C: Integration of RNA and DNA Data
Component 5C can include a suitably programmed processor (which may be the same as the processor of component 5), which, responsive to instructions on a computer-readable medium, takes as input electronic data from DNA and RNA analysis in Component 5A and 5B to analyze and output to the physician via Component 6. In
Real-time results of RNA data from Component 5B are then coupled to the current DNA data from Component 5A. In the embodiment illustrated in the upper panel of
For example,
Embodiments of the invention can encompass both qualitative and quantitative detection of a nucleic acid in a biological sample. In this regard, qualitative detection can be useful, for example, for recognizing an infection of an individual. Thereby, one aspect is that false-negative or false-positive results be avoided. In addition to mere detection of the presence or absence of a nucleic acid in a sample, it-can be useful to determine the quantity of said nucleic acid. As an example, stage and severity of a viral disease may be assessed on the basis of the viral load. Further, monitoring of any therapy can use information on the quantity of a pathogen present in an individual in order to evaluate the therapy's success. For a quantitative assay, a quantitative standard nucleic acid can serve as a reference for determining the absolute quantity of a nucleic acid.
Component 6: Result Reporting and Real-Time Requirements
Component 6 includes, or operates as, an input-output interface such as a digital touch-screen and computer. Component 6 can be configured to output visual and interactive representations of data from Components 5A, 5B, and 5C. The description that follows provide exemplary types of visual outputs and interactions with the physician supported by the device. In addition to the real-time analytical reports described with reference to
In some embodiments, real-time reporting of read counts can be used to display which cell types, responses and non-human pathogens have been detected thus far. These types of RNA and DNA data can, in some respects, parallel traditional laboratory-based tests where analogous inferences can be produced. Because different biological samples can provide different information, different panels can be used to represent different biological sample sites and display current total, tissue-specific and pathogen-specific read counts. For example,
A similar metric used by laboratory assays is the proportion of cells within a biological sample. In the present devices and methods, an analogous metric can be exemplified by the differential complete blood count. In some embodiments, based on this assay, the percentages of several cell types found in blood can be reported to the physician. For example,
In some embodiments, an additional type of RNA and DNA data can be based on cell separating processes such as described further above with reference to
Note that some embodiments include displaying reports or results such as illustrated in
Illustratively, a method for use in assessing the quality of a biological sample obtained from a subject, the biological sample including nucleic acids, the method being executed by a device (e.g., an instrument such as described herein with reference to
Under another aspect, a device (e.g., an instrument such as described herein with reference to
In some embodiments, upon activation of the device and the entry of chief symptoms and site via input from the physician, an automated search of patient electronic records can begin to identify, and optionally to self-complete, key clinical determinants. These determinants can include, but are not limited to, one or more of the following: history of immunocompromised states, recent infections, recent procedures, or other existing conditions. In some embodiments, any suitable combination of such elements can be processed so as to generate a list of possible diagnoses (with modifiers) and a matrix containing expected results from one or more types of biological samples and past specificity and sensitivity for each diagnosis. In some embodiments, for various possible diagnoses, the physician can select test values and results from the instrument that potentially may support or exclude each such diagnosis. The product of this interaction is a documented logic tree, which the physician creates as a result report, nonlimiting examples of which are illustrated in
In the nonlimiting examples illustrated in
In various embodiments, the use of the result output also serves to provide several functions. One exemplary function is to allow the physician to use RNA and DNA results to infer the same types of diagnostic knowledge as traditional laboratory tests. This structured output of data along side with patient records are readily adopted into diagnostic algorithms already familiar to physicians and similarly trained health professionals. Another exemplary function is to allow the physician to highlight which views and results are most informative to the physician. In some embodiments, such a function can be generated through scoring choices and interactions used by the physician or alternatively, the physician can choose to store or flag views which document their diagnostic conclusions.
Another exemplary function displayed in
During the operation of the device, real-time display of RNA and DNA results are displayed for several clinical uses, including but not limited to determining the diagnosis, excluding diagnosis, viewing the status of the test, viewing the progress of the test, and creating reports. For example,
To portray results in a manner easily understood by the user, the number of counts is displayed in relation to a direct or indirect number of cells analyzed. The number of counts per cell is pre-established based on laboratory observations from known inputs of titrated cell numbers. For example, 1000 counts might be equivalent to 5 human lab reference cells. As illustrated in
In another example of representing real-time nucleic acid results (
An exemplary useful feature of displaying a trajectory of diagnostic likelihood based on a progression of nucleic acid results is that the remaining time necessary for diagnosis can be estimated. Given the rate of increasing likelihood vs. the number of nucleic acid counts, the amount of time can be calculated to reach a threshold of diagnostic completion, for example as illustrated in
In some embodiments, the device can present a template for the physician to view or document which diagnoses are the most likely and which diagnoses are highly unlikely based on what the current status of findings. In the nonlimiting example of chest pain, the instrument can present a list of diagnoses, of which the physician can mark or view their respective likelihood, inability to exclude, and the like. Illustratively, each diagnostic choice can trigger a set of questions, such as “was there evidence of cardiac damage?”, “sign of infection?”, and others. In response to the questions, the physician can draw upon “flagged” views to support his or her report created from outputs shown in
One non-limiting example of “flags” on reporting views is that the physician can create reports which to document and support their diagnostic claims. For example,
In some embodiments, some aspects of the diagnostic report can draw upon data or request additional data that is not generated by the device. For example, the current medical record can be automatically included as supporting or excluding evidence in the physician report. The physician can use a RNA or DNA result from the device as an alternative to a named laboratory test and vice versa. Thus, in certain embodiments, the generated report from the device incorporates both observations from the medical record and the device itself. In other embodiments, the device may accept and incorporate visual or electronic results such an exemplary chemical strip test or other complementary assays. These interactions can further support the use of specific RNA and DNA data types in replacement or in parallel with currently used laboratory tests.
Illustratively, a method for use in diagnosing a condition based on a symptom experienced by a subject and based on a biological sample obtained from the subject, the biological sample including nucleic acids, can be executed by a device (such as the instruments described herein with reference to
In some embodiments, the method optionally, can include, based on the amount of the first subset of the nucleic acids quantified over the first period of time, estimating by the device a first likelihood that the subject is suffering from a first condition. The method optionally can include, based on the amount of the second subset of the nucleic acids quantified over the second period of time, estimating by the device a second likelihood that the subject is suffering from a second condition that is different than the first condition. The method optionally can include outputting by the device an indication of the first likelihood and an indication of the second likelihood.
Additionally, or alternatively, the method optionally can include, based on the amount of the first subset of the nucleic acids quantified over the first period of time, estimating by the device a first trajectory of an amount of the first subset of the nucleic acids over a second period of time. The method optionally can include, based on the amount of the second subset of the nucleic acids quantified over the first period of time, estimating by the device a second trajectory of an amount of the second subset of the nucleic acids over the second period of time. The method optionally can include outputting by the device an indication of the first trajectory and an indication of the second trajectory. Optionally, the method can include, based on the first and second trajectories, estimating by the device a second time at which the first or second condition is sufficiently likely as to make a diagnosis that the patient is suffering from that condition; and outputting by the device an indication of the second time.
The method also, or alternatively, can include receiving by the device additional clinical information regarding the patient, wherein the first and second likelihoods further are based on the received additional clinical information.
Additionally, or alternatively, the method optionally can include, over a second period of time subsequent to the first period of time, quantifying by the device an amount of the first subset of the nucleic acids that are present in the biological sample. The method optionally can include, over the second period of time, quantifying by the device an amount of the second subset of the nucleic acids that are present in the biological sample. The method optionally can include outputting by the device an indication of the amount of the first subset of the nucleic acids quantified over the second period of time; and outputting by the device an indication of the amount of the second subset of the nucleic acids quantified over the second period of time.
The indications of the amounts of the first and second subsets of nucleic acids quantified over the first period of time optionally can include a histogram, e.g., such as described herein with reference to
In some embodiments, the indication of the amount of the first subset of the nucleic acids over the first period of time includes a number of first cell equivalents, and the indication of the amount of the second subset of the nucleic acids over the first time includes a number of second cell equivalents. Optionally, the first origin can include a pathogen, and the number of first cell equivalents can represent a severity of infection of the subject by the pathogen. Additionally, or alternatively, the number of first cell equivalents or the number of second cell equivalents represents a severity of a condition from which the subject is suffering or clinical significance. Additionally, or alternatively, the number of first cell equivalents or the number of second cell equivalents represents a response to a treatment.
Under another aspect, a device (e.g., an instrument such as described herein with reference to
Optionally, the device also can include an estimation module configured to estimate, based on the amount of the first subset of the nucleic acids quantified over the first period of time, a first likelihood that the subject is suffering from a first condition, e.g., can include components 5A and 5B. The estimation module further can be configured to estimate, based on the amount of the second subset of the nucleic acids quantified over the second period of time, a second likelihood that the subject is suffering from a second condition that is different than the first condition. The output module further can be configured to output an indication of the first likelihood and an indication of the second likelihood. Additionally, or alternatively, the estimation module optionally further can be configured to estimate, based on the amount of the first subset of the nucleic acids quantified over the first period of time, a first trajectory of an amount of the first subset of the nucleic acids over a second period of time. The estimation module optionally further can be configured to estimate, based on the amount of the second subset of the nucleic acids quantified over the first period of time, a second trajectory of an amount of the second subset of the nucleic acids over the second period of time. The output module further optionally can be configured to output an indication of the first trajectory and an indication of the second trajectory. Optionally, the estimation module further can be configured to estimate, based on the first and second trajectories, a second time at which the first or second condition is sufficiently likely as to make a diagnosis that the patient is suffering from that condition; and the output module further can be configured to output an indication of the second time.
Additionally, or alternatively, the device further can include an input interface configured to receive additional clinical information regarding the patient, wherein the first and second likelihoods further are based on the received additional clinical information.
Additionally, or alternatively, the first quantification module optionally can be configured to quantify, over a second period of time subsequent to the first period of time, an amount of the first subset of the nucleic acids that are present in the biological sample. The second quantification module optionally can be configured to quantify, over the second period of time, an amount of a second subset of the nucleic acids that are present in the biological sample. The output module optionally can be configured to output an indication of the amount of the first subset of the nucleic acids quantified over the second period of time; and can be configured to output an indication of the amount of the second subset of the nucleic acids quantified over the second period of time.
Additionally, or alternatively, the indications of the amounts of the first and second subsets of nucleic acids quantified over the first period of time optionally include a histogram, e.g., such as described herein with reference to
Optionally, in some embodiments, the indication of the amount of the first subset of the nucleic acids over the first period of time includes a number of first cell equivalents, and the indication of the amount of the second subset of the nucleic acids over the first time includes a number of second cell equivalents. Additionally, or alternatively, the first origin can include a pathogen, and the number of first cell equivalents can represent a severity of infection of the subject by the pathogen. Additionally, or alternatively, the number of first cell equivalents or the number of second cell equivalents represents a severity of a condition from which the subject is suffering or clinical significance. Additionally, or alternatively, the number of first cell equivalents or the number of second cell equivalents represents a response to a treatment.
An exemplary parameter in evaluating the significance of a result is understanding the evidence supporting a particular diagnostic solution. For example, the evidence can include recommendations from an established committee and publications, based on large controlled studies. These types of supporting evidence can be accessible through the device interface, e.g., via network module (Component 7). Another type of evidence that can grow over time is the increasing numbers of samples and results obtained by the aggregate of users of the device. This type of data can be displayed to the physician and incorporated with the generated report, and can include, for example, aggregate results such as the frequency of specific findings in other cases with confirmed diagnosis, e.g., discharge continuity of care or equivalent documents, improvement of condition in response to treatment, and the like.
In certain aspects, the generated report from the device can further comprise a risk score based on the expression information. In particular aspects, the risk score may be defined as a weighted sum of expression levels of biomarkers. For example, the risk score may be calculated based on a summation of the expression level of the selected biomarkers multiplied with a corresponding regression coefficient. The regression coefficient may be calculated according to a regression analysis of the correlation between the expression level of the biomarker genes and survival of a control group. To improve data processing efficiency, the risk score can be generated on a computer.
Longitudinal studies can be performed and yield consecutive reports as biomarkers can be repeatedly taken from patients at multiple points in time. In longitudinal studies, a small set of biomarkers is correlated to the disease progression and that biomarkers expressed at different stages can be of prognostic value with regard to therapy resistance.
Post-Diagnostic, Self-Learning of a Symptom-Based Diagnostic Device
Data from the device is another source of new knowledge. Also, as noted in earlier sections, data from the continued use of the device on multiple patients and by multiple physicians can be informative. For example, new data from users can be aggregated to quantify their concordance or discordance with specific interpretations of RNA and DNA data. Illustratively, these interpretations can occur at the inference level of which tissue was affected, what host response was present or what pathogens were present or at the diagnostic level. In some embodiments, at the inference value level, sequences that are identified as poorly concordant can be discarded from future devices or edited to improve sensitivity or specificity. In some embodiments, at the diagnostic level, the interpretation of discordant sequences can be altered to reflect supporting data. In both cases, the level of concordance can be reported with results to aid the physician in understanding the strength or weakness of each data point.
Some pre-computed sequences can be difficult to detect in practice for technical reasons. Other scenarios, e.g., different locations, hospital vs. clinic or in the United States vs. another country, potentially can have different prevalence or unique exposure to diseases, which are not initially pre-computed. For example, some types of infections can differ in frequency or in pathogen in different areas of the world. In a common type of wound, e.g., genital ulcers, the causes can differ in likelihood based upon factors such as geographic location, and diagnoses can range, e.g., from herpes simplex virus to syphilis or other pathogens.
Thus, in some embodiments, the geographic and context for each result can be taken into account, such as via global positioning systems, internet protocol address (IP), or other identifiers of location. Storage of data with biological sample site, geolocation, symptoms, and medical criteria metadata can facilitate self-learning.
In some embodiments, upon completion of diagnostic run of device, the device can be returned for re-charging and re-use. During this process, one or more types of data can be collected for post-diagnosis improvements, including, but not limited to, one or more of: electronic medical records from discharge diagnoses, physician interaction or contributed data, or raw biological material remaining in the device.
As another example, the remaining biological material can be used for further analysis. For example, non-targeted and targeted sequencing can be performed to identify new sequences, which can be more diagnostic, as described in greater detail below with reference to
So as to improve performance and accuracy of nucleic acid based testing, an output for modifications can be used. For example,
Based on the evaluation of device accuracy and sensitivity, a self-learning model such as illustrated in
For example,
Illustratively, a method for performing one or more nucleic acid tests based on one or more symptoms experienced by a patient includes receiving by a device (e.g., an instrument such as described herein with reference to
Optionally, the method further can include performing by the device the at least one of the one or more nucleic acid tests. The performing can include quantifying by the device an amount of a first subset of the nucleic acids that are present in the biological sample, the first subset of the nucleic acids having a first origin. The performing also can include quantifying by the device an amount of a second subset of the nucleic acids that are present in the biological sample, the second subset of the nucleic acids having a second origin. The method also can include determining by the device at least one possible diagnosis based on the amount of the first subset of the nucleic acids and based on the amount of the second subset of the nucleic acids. The method also can include outputting by the device an indication of the at least one possible diagnosis. The method also can include, by the device, receiving an indication of at least one of: a diagnosis made by the caregiver, a result of a laboratory test or a procedure performed on the subject, a symptomatic code, a site of injury, a cellular response, a host-immune response, a contribution of a non-human organism, or an origin of cells or symptoms. The method also can include transmitting by the device to the database the received indication for use in updating the database.
Optionally, the method further can include receiving by the device or by a second device respective identifiers of one or more symptoms experienced by a second patient, wherein the symptoms experienced by the second patient are the same as the symptoms experienced by the first patient. For example, although the database was updated using information provided by the previously mentioned device, the updated database subsequently can be accessed by the same device, or by a different device, in association with performing nucleic acid tests based on symptoms and a biological sample from another patient. The method further can include by the device or by the second device, submitting to the updated database a second query based on the respective identifiers of each of the one or more symptoms. The method further can include, by the device or by the second device, receiving from the updated database a response to the second query, the response comprising one or more updated nucleic acid tests based on the nucleic acid sequences respectively associated with the one or more symptoms identified in the second query, wherein at least one of the one or more updated nucleic acid tests is different than at least one of the one or more nucleic acid tests. The method also can include, by the device or by the second device, outputting respective representations of the updated one or more nucleic acid tests. The method also can include receiving, by the receptacle of the device or by a receptacle of the second device, a second cartridge configured to perform at least one of the updated one or more nucleic acid tests.
Under another aspect, a device (e.g., an instrument such as described herein with reference to
Optionally, the cartridge can include a first nucleic acid capture module configured to capture a first subset of the nucleic acids that are present in the biological sample (e.g., component 3), the first subset of the nucleic acids having a first origin. The cartridge further can include a second nucleic acid capture module configured to capture a second subset of the nucleic acids that are present in the biological sample (e.g., component 3), the second subset of the nucleic acids having a second origin. The device further can include a nucleic acid quantifier configured to quantify a respective amount of each of the first and second subsets of captured nucleic acids (e.g., components 5A-5B). The device further can include a diagnosis module (e.g., components 5A-5B) configured to determine at least one possible diagnosis based on the amount of the first subset of the nucleic acids and based on the amount of the second subset of the nucleic acids. The output module can be configured to output an indication of the at least one possible diagnosis. The input module further can be configured to receive an indication of at least one of: a diagnosis, a result of a laboratory test or a procedure performed on the subject, a symptomatic code, a site of injury, a cellular response, a host-immune response, a contribution of a non-human organism, or an origin of cells or symptoms. The query module further can be configured to transmit by the device to the database the received indication for use in updating the database.
Optionally, the input module further can be configured to receive respective identifiers of one or more symptoms experienced by a second patient, wherein the symptoms experienced by the second patient are the same as the symptoms experienced by the first patient. The query module further can be configured to submit to the updated database a second query based on the respective identifiers of each of the one or more symptoms. The query module further can be configured to receive from the updated database a response to the second query, the response comprising one or more updated nucleic acid tests based on the nucleic acid sequences respectively associated with the one or more symptoms identified in the second query, wherein at least one of the one or more updated nucleic acid tests is different than at least one of the one or more nucleic acid tests. The output module further can be configured to output respective representations of the updated one or more nucleic acid tests. The receptacle of the device further can be configured to receive a second cartridge configured to perform at least one of the updated one or more nucleic acid tests, e.g., a second component 9.
Method 20 further can include capturing by the device a first plurality of nucleic acids of the first set that are present in the first biological sample (step 22). For example, in some embodiments, the first set of complementary nucleic acids can capture a first plurality of nucleic acids of the first set that are present in the first biological sample.
Method 20 further can include, for each of the captured nucleic acids of the first plurality, quantifying by the device an amount of the captured nucleic acid that is present in the first biological sample; sequencing that captured nucleic acid; and, based on the sequence of that captured nucleic acid, identifying by the device an origin of that captured nucleic acid (step 23). For example, in some embodiments, the device can include a nucleic acid quantifier configured to quantify an amount of each of the captured nucleic acids that is present in the first biological sample. In some embodiments, the device also can include a nucleic acid sequencer that is configured to sequence each captured nucleic acid that is present in the first biological sample. In some embodiments, the device also can include a processor coupled to the quantifier and to the sequencer and being suitably programmed to identify an origin of each captured nucleic acid based on the sequence of that captured nucleic acid.
Method 20 further can include outputting by the device an indication of the quantified amount and the identified origin of at least one captured nucleic acid that is present in the first biological sample (step 24). For example, the device further can include a display coupled to the processor, the processor further being suitably programmed to cause the display to output an indication of the quantified amount and the identified origin of at least one captured nucleic acid that is present in the first biological sample.
In some embodiments, method 20 optionally can include preselecting the first set of nucleic acids for analysis comprises receiving by the device a first symptom-specific cartridge comprising a first set of complementary nucleic acids configured to capture the first set of nucleic acids for analysis. Optionally, method 20 further comprises, after the outputting step, removing the first symptom-specific cartridge from the device and receiving by the device a second symptom-specific cartridge comprising a second set of complementary nucleic acids. For example, in some embodiments, the device is configured to receive the first set of complementary nucleic acids within a first symptom-specific cartridge. Optionally, the first symptom-specific cartridge is removable and replaceable with a second symptom-specific cartridge comprising a second set of complementary nucleic acids. Optionally, the first set of complementary nucleic acids is different than the second set of complementary nucleic acids.
In some embodiments, method 20 optionally includes outputting by the device an indication of the quantified amount of each of the captured nucleic acids of the first plurality. For example, in some embodiments, the processor further is suitably programmed to cause the display to output an indication of the quantified amount of each of the captured nucleic acids of the first plurality.
In some embodiments, the capturing step (step 22) of method 20 comprises separating extracellular nucleic acids in the first biological sample from intracellular nucleic acids in the first biological sample; and the quantifying and sequencing steps are performed separately on the separated extracellular nucleic acids and on the intracellular nucleic acids. Optionally, the method further includes outputting by the device an indication of the quantified amount of at least one of the extracellular nucleic acids and an indication of the quantified amount of at least one of the intracellular nucleic acids. For example, in some embodiments, the device further includes a separator configured to separate extracellular nucleic acids in the first biological sample from intracellular nucleic acids in the first biological sample. Optionally, the nucleic acid quantifier and nucleic acid sequencer separately operate on the separated extracellular nucleic acids and on the intracellular nucleic acids. Optionally, the processor further is suitably programmed to cause the display to output an indication of the quantified amount of at least one of the extracellular nucleic acids and an indication of the quantified amount of at least one of the intracellular nucleic acids.
In some embodiments, the identifying by the device the origin of the captured nucleic acid (step 23 of method 20) comprises comparing the sequence of that nucleic acid to sequences stored in a library stored in a computer-readable medium of the device. For example, in some embodiments, the device further includes a computer-readable medium coupled to the processor. The processor further can be suitably programmed to identify the origin of the captured nucleic acid based on comparing the sequence of that nucleic acid to sequences stored in a library stored in the computer-readable medium. Optionally, the library stores nucleic acid sequences for a human and for a plurality of pathogens. Optionally, the output indicates the relative number of a pathogen per human cell, where the number of human cells can be defined for example by the number of cells detected by light, electromagnetic, thermal, mass, volume displacement or inferred by nucleic acid, protein, lipid or other chemical component.
In some embodiments, method 20 further includes receiving by the device a second biological sample obtained from the subject, the second biological sample being different from the first biological sample. The method further can include capturing by the device a second plurality of nucleic acids of the first set that are present in the second biological sample. The method further can include, for each of the captured nucleic acids of the second plurality, quantifying by the device an amount of that captured nucleic acid; sequencing by the device that captured nucleic acid; and, based on the sequence of that captured nucleic acid, identifying by the device an origin of that captured nucleic acid. The outputting by the device further can include an indication of the quantified amount and the identified origin of at least one captured nucleic acid that is present in the second biological sample. For example, in some embodiments, the first set of complementary nucleic acids further captures a second plurality of nucleic acids of the first set that are present in a second biological sample obtained from the subject, the second biological sample being different from the first biological sample. The nucleic acid quantifier further can quantify an amount of each of the captured nucleic acids that is present in the second biological sample. The nucleic acid sequencer further can sequence each of the captured nucleic acids that is present in the second biological sample. The processor further can be suitably programmed so as to identify an origin of each captured nucleic acid based on the sequence of the captured nucleic acid that is present in the second biological sample. The processor further can be suitably programmed so as to cause the display to output the an indication of quantified amount and the identified origin of at least one captured nucleic acid that is present in the second biological sample.
In some embodiments, method 20 further includes outputting by the device an indication of at least one potential diagnosis for the subject and an indication of the likelihood of the at least one based on the quantified amount and the identified origin of at least one captured nucleic acid that is present in the first biological sample. For example, in some embodiments, the processor further is suitably programmed to cause the display to output an indication of at least one potential diagnosis for the subject and an indication of the likelihood of the at least one diagnosis based on the quantified amount and the identified origin of at least one captured nucleic acid that is present in the first biological sample.
It is further noted that suitable aspects of the present devices and methods can be implemented using various types of data processor environments (e.g., using one or more data processors) which execute instructions (e.g., software instructions) to perform operations disclosed herein. Non-limiting examples include implementation on a single general purpose computer or workstation, or on a networked system, or in a client-server configuration, or in an application service provider configuration. For example, suitable aspects of the methods and devices described herein may be implemented using many different types of processing devices by program code comprising program instructions that are executable by the device processing subsystem. The software program instructions may include source code, object code, machine code, or any other stored data that is operable to cause a processing system to perform the methods and operations described herein. Other implementations may also be used, however, such as firmware or even appropriately designed hardware configured to carry out the methods and devices described herein. For example, a computer can be programmed with instructions to perform suitable steps of the flowcharts or exemplary analyses shown in
It is further noted that the devices and methods may include data signals conveyed via networks (e.g., local area network, wide area network, internet, combinations thereof, etc.), fiber optic medium, carrier waves, wireless networks, etc. for communication with one or more data processing devices. The data signals can carry any or all of the data disclosed herein that is provided to or from a device.
The devices' and methods' data (e.g., associations, mappings, data input, data output, intermediate data results, final data results, etc.) may be stored and implemented in one or more different types of computer-implemented data stores, such as different types of storage devices and programming constructs (e.g., RAM, ROM, Flash memory, flat files, databases, programming data structures, programming variables, IF-THEN (or similar type) statement constructs, etc.). It is noted that data structures describe formats for use in organizing and storing data in databases, programs, memory, or other computer-readable media for use by a computer program.
The devices and methods may be provided on many different types of computer-readable storage media including computer storage mechanisms (e.g., non-transitory media, such as CD-ROM, diskette, RAM, flash memory, computer's hard drive, etc.) that contain instructions (e.g., software) for use in execution by a processor to perform the methods' operations and implement the devices described herein.
Additionally, the computer components, analysis modules, software modules, functions, data stores and data structures (e.g., databases) described herein may be connected directly or indirectly to each other in order to allow the flow of data needed for their operations. It is also noted that a module or processor includes but is not limited to a unit of code that performs a software operation, and can be implemented for example as a subroutine unit of code, or as a software function unit of code, or as an object (as in an object-oriented paradigm), or as an applet, or in a computer script language, or as another type of computer code. The software components and/or functionality may be located on a single computer or distributed across multiple computers depending upon the situation at hand.
It should be understood that as used in the description herein and throughout the claims that follow, the meaning of “a,” “an,” and “the” includes plural reference unless the context clearly dictates otherwise. Also, as used in the description herein and throughout the claims that follow, the meaning of “in” includes “in” and “on” unless the context clearly dictates otherwise. Finally, as used in the description herein and throughout the claims that follow, the meanings of “and” and “or” include both the conjunctive and disjunctive and may be used interchangeably unless the context expressly dictates otherwise; the phrase “exclusive or” may be used to indicate situation where only the disjunctive meaning may apply.
Although the disclosure has been described with reference to the disclosed embodiments, those skilled in the art will readily appreciate that the specific examples detailed above are only illustrative of the disclosure. It should be understood that various modifications can be made without departing from the spirit of the disclosure. The appended claims are intended to cover all such changes and modifications that fall within the true spirit and scope of the invention.
This application is a continuation application of U.S. application Ser. No. 15/011,036, filed Jan. 29, 2016, which claims the benefit of U.S. Provisional Patent Application No. 62/110,175, filed Jan. 30, 2015 and entitled “Devices and Methods for Diagnostics Based on Analysis of Nucleic Acids,” each of which is incorporated herein by reference in its entirety.
Number | Date | Country | |
---|---|---|---|
62110175 | Jan 2015 | US |
Number | Date | Country | |
---|---|---|---|
Parent | 15011036 | Jan 2016 | US |
Child | 17372199 | US |