SYSTEMS AND METHODS FOR DETECTING PARTICLES OF INTEREST USING SPECTRAL ANALYSIS

Information

  • Patent Application
  • 20240027266
  • Publication Number
    20240027266
  • Date Filed
    July 25, 2023
    a year ago
  • Date Published
    January 25, 2024
    9 months ago
Abstract
An example method includes receiving a first set of data that includes spectral metrics provided by a spectral acquisition apparatus that obtains the spectral metrics based on interactions of electromagnetic radiation with a sample. The first set of data is processed to obtain a second set of data that includes the spectral metrics. One or more trained models are applied to the spectral metrics or a set of values based on the spectral metrics to obtain a result. Based on the result, either a positive particle of interest detection or a negative particle of interest detection for the particle of interest for the sample is determined. A particle of interest detection notification that indicates either the positive particle of interest detection or the negative particle of interest detection for the particle of interest for the sample may be generated and provided.
Description
FIELD OF THE INVENTION(S)

Embodiments of the present invention(s) are generally related to detecting particle of interests by analyzing spectral metrics, and in particular to detecting particle of interests such as foodborne pathogens, infectious pathogens of humans, and environmental pathogens.


BACKGROUND

Foodborne illnesses may be caused by consuming food or beverages that are contaminated by pathogens, allergens, foreign particles, or fraudulent ingredients, such as bacteria, toxins produced by bacteria, viruses, parasites, chemicals, foreign material (e.g., metal shavings), and/or the like. The United States Food and Drug Administration (U.S. FDA) estimates that there are approximately 48 million cases of foodborne illness each year in the United States. The U.S. FDA further estimates that 1 in 6 Americans are affected by foodborne illnesses, resulting in 128,000 hospitalizations and 3,000 deaths per year.


Food or beverages (collectively, food) may be contaminated during any stage in the supply chain (e.g., in the field, while undergoing processing at food production or processing facilities (collectively, food processing facilities), or during shipping or transport), distribution, retail, restaurant, or improper storage/handling/expiration at a home. However, the contamination may not be discovered until after people are sickened from consuming the food, which may be due to the fact that food processors cannot wait for test results as they need to ship food to meet shelf-life expectations. Unfortunately, government agencies, such as the U.S. FDA, often declare an outbreak of a foodborne illness and issue recalls of the food suspected of causing the outbreak only after a number of people are sickened.


In addition to the deleterious effects on individual health, there are economic costs to recalls. For example, a food producer or processor (collectively, a food processor) may voluntarily or be required to recall numerous lots of food or entire production runs. Such recalls may sicken many and may tarnish the brand of the food processor, leading to consumer distrust, reduced sales, and large costs for product recalls, legal defense, damage control, and insurance premiums.


Furthermore, during a pandemic and the aftermath, it may be vital to identify infected people. Multiple testing methods have been developed to diagnose viral infections, including polymerase chain reaction (PCR), enzyme-linked immunosorbent assay, immunofluorescent assay, and others. However, these methods are impractical when it comes to wide-scale screening because of lack of speed, lack of accuracy, lack of resources, dependency on foreign supply chains for reagents, and cost. As seen with the COVID-19 pandemic, when attempting to screen large populations, reagent supplies may become depleted, and current testing methodologies may take days to return a result back to a patient. Due to the limited supply of test equipment, testing may be performed on people who actively present symptoms and self-identify. The testing is primarily used to verify the diagnosis.


Relying on a person to present symptoms is a significant challenge for containment because of the reliance on a person's immune system's response to the virus (such as running a fever or developing a persistent dry cough). In the case of COVID-19, infected people may be contagious but asymptomatic during the virus' long incubation period (e.g., 2-14 days). The long incubation period has made the virus nearly impossible to contain (which may be due in part to long-lead times for diagnosis), and has led governments to take strong action to reduce spread of the virus. These strong actions include orders for long-term shelter-in-place and social distancing.


These problems can be common for many different pathogens, allergens, or harmful agents. There are many bacteria and viruses, for example, which may be asymptomatic for a period of time but may have serious health consequences. Further, many bacteria and viruses may be highly infectious either before or after symptoms appear. Testing for any number of pathogens can be invasive, uncomfortable, and/or painful. In addition, many tests for a specific pathogen may be inaccurate, slow, expensive, or unavailable to the mass population. Moreover, the potency or effectiveness of many compounds (e.g., reagents) used to test pathogens may change due to age, exposure to environmental conditions, and/or improper handling.


Furthermore, environmental hazards such as methane, ethylene, methylene, and other volatile organic compounds (VOCs) may pose risks to human health and environmental health. The detection and mitigation of such environmental hazards may be required by various governmental agencies and/or corporate policies.


Furthermore, a substance or item may become contaminated with or come in contact with another, undesirable, substance. For example, air may become polluted by carbon monoxide. As another example, water may become contaminated with chemicals. Such contaminants may be difficult or impossible for humans to detect with only their innate senses.


SUMMARY

In some aspects, the techniques described herein relate to a non-transitory computer-readable medium including executable instructions, the executable instructions being executable by one or more processors to perform a method, the method including: receiving a first set of data in a first format, the first set of data including a set of spectral metrics, the first set of data provided by an apparatus that obtains the set of spectral metrics based on interactions of electromagnetic radiation with a sample; processing the first set of data to obtain a second set of data in a second format different from the first format, the second set of data including the set of spectral metrics; applying one or more trained models to at least one of the set of spectral metrics and a set of values based on the set of spectral metrics to obtain a result, the one or more trained models trained on a set of training samples for a particle of interest; based on the result, determining either a positive particle of interest detection or a negative particle of interest detection for the particle of interest for the sample; generating a particle of interest detection notification that indicates either the positive particle of interest detection or the negative particle of interest detection for the particle of interest for the sample; and providing the particle of interest detection notification.


In some aspects, the techniques described herein relate to a non-transitory computer-readable medium, wherein the method further includes: receiving metadata associated with at least one of the apparatus, the sample, a date and time at which the apparatus obtains the set of spectral metrics; and storing the metadata in association with the result.


In some aspects, the techniques described herein relate to a non-transitory computer-readable medium, wherein the sample is at least one of a sample of a food processing byproduct, a sample from a person, and an environmental sample, and the particle of interest is at least one of a foodborne pathogen, an infectious pathogen of humans, and an environmental particle of interest.


In some aspects, the techniques described herein relate to a non-transitory computer-readable medium, wherein the set of spectral metrics is a first set of spectral metrics, the apparatus is a first apparatus from a first manufacturer sited at a first location, the sample is a first sample, the set of values is a first set of values, the result is a first result, the positive particle of interest detection is a first positive particle of interest detection, the negative particle of interest detection is a first negative particle of interest detection, the particle of interest detection notification is a first particle of interest detection notification, and the method further includes: receiving a third set of data in a third format, the third set of data including a second set of spectral metrics, the second set of spectral metrics provided by a second apparatus that obtains the second set of spectral metrics based on interactions of electromagnetic radiation with a second sample, and the second apparatus is sited at a second location different from the first location; processing the third set of data to obtain a fourth set of data in the second format, the fourth set of data including the second set of spectral metrics; applying the one or more trained models to at least one of the second set of spectral metrics and a second set of values based on the second set of spectral metrics to obtain a second result; based on the second result, determining either a second positive particle of interest detection or a second negative particle of interest detection for the particle of interest in the second sample; generating a second particle of interest detection notification that indicates either the second positive particle of interest detection or the second negative particle of interest detection for the particle of interest in the second sample; and providing the second particle of interest detection notification.


In some aspects, the techniques described herein relate to a non-transitory computer-readable medium, wherein the method further includes normalizing each spectral metric in the set of spectral metrics to be between zero, inclusive, and one, inclusive, to obtain the set of values, and wherein applying the one or more trained models to at least one of the set of spectral metrics and the set of values based on the set of spectral metrics to obtain the result includes applying the one or more trained models to the set of values.


In some aspects, the techniques described herein relate to a non-transitory computer-readable medium, wherein the method further includes training one or more models on the set of training samples for the particle of interest to obtain the one or more trained models.


In some aspects, the techniques described herein relate to a non-transitory computer-readable medium wherein spectral metrics in the set of spectral metrics are one of absorbance metrics, transmittance metrics, reflectance metrics, and scattering metrics.


In some aspects, the techniques described herein relate to a non-transitory computer-readable medium wherein the result indicates the positive particle of interest detection if the result meets or exceeds a threshold.


In some aspects, the techniques described herein relate to a non-transitory computer-readable medium wherein the electromagnetic radiation includes at least one of ultraviolet light, visible light, and infrared light.


In some aspects, the techniques described herein relate to a non-transitory computer-readable medium wherein the one or more trained models include a set of trained decision trees.


In some aspects, the techniques described herein relate to a method including: receiving a first set of data in a first format, the first set of data including a set of spectral metrics, the first set of data provided by an apparatus that obtains the set of spectral metrics based on interactions of electromagnetic radiation with a sample; processing the first set of data to obtain a second set of data in a second format different from the first format, the second set of data including the set of spectral metrics; applying one or more trained models to at least one of the set of spectral metrics and a set of values based on the set of spectral metrics to obtain a result, the one or more trained models trained on a set of training samples for a particle of interest; based on the result, determining either a positive particle of interest detection or a negative particle of interest detection for the particle of interest for the sample; generating a particle of interest detection notification that indicates either the positive particle of interest detection or the negative particle of interest detection for the particle of interest for the sample; and providing the particle of interest detection notification.


In some aspects, the techniques described herein relate to a method, further including: receiving metadata associated with at least one of the apparatus, the sample, a date and time at which the apparatus obtains the set of spectral metrics; and storing the metadata in association with the result.


In some aspects, the techniques described herein relate to a method, wherein the sample is at least one of a sample of a food processing byproduct, a sample from a person, and an environmental sample, and the particle of interest is at least one of a foodborne pathogen, an infectious pathogen of humans, and an environmental particle of interest.


In some aspects, the techniques described herein relate to a method, wherein the set of spectral metrics is a first set of spectral metrics, the apparatus is a first apparatus from a first manufacturer sited at a first location, the sample is a first sample, the set of values is a first set of values, the result is a first result, the positive particle of interest detection is a first positive particle of interest detection, the negative particle of interest detection is a first negative particle of interest detection, the particle of interest detection notification is a first particle of interest detection notification, and the method further includes: receiving a third set of data in a third format, the third set of data including a second set of spectral metrics, the second set of spectral metrics provided by a second apparatus that obtains the second set of spectral metrics based on interactions of electromagnetic radiation with a second sample, and the second apparatus is sited at a second location different from the first location; processing the third set of data to obtain a fourth set of data in the second format, the fourth set of data including the second set of spectral metrics; applying the one or more trained models to at least one of the second set of spectral metrics and a second set of values based on the second set of spectral metrics to obtain a second result; based on the second result, determining either a second positive particle of interest detection or a second negative particle of interest detection for the particle of interest in the second sample; generating a second particle of interest detection notification that indicates either the second positive particle of interest detection or the second negative particle of interest detection for the particle of interest in the second sample; and providing the second particle of interest detection notification.


In some aspects, the techniques described herein relate to a method, further including normalizing each spectral metric in the set of spectral metrics to be between zero, inclusive, and one, inclusive, to obtain the set of values, and wherein applying the one or more trained models to at least one of the set of spectral metrics and the set of values based on the set of spectral metrics to obtain the result includes applying the one or more trained models to the set of values.


In some aspects, the techniques described herein relate to a method, further including training one or more models on the set of training samples for the particle of interest to obtain the one or more trained models.


In some aspects, the techniques described herein relate to a method wherein spectral metrics in the set of spectral metrics are one of absorbance metrics, transmittance metrics, reflectance metrics, and scattering metrics.


In some aspects, the techniques described herein relate to a method wherein the result indicates the positive particle of interest detection if the result meets or exceeds a threshold.


In some aspects, the techniques described herein relate to a method wherein the electromagnetic radiation includes at least one of ultraviolet light, visible light, and infrared light.


In some aspects, the techniques described herein relate to a method wherein the one or more trained models include a set of trained decision trees.


In some aspects, the techniques described herein relate to a system including: a first computing device configured to: receive a first set of data in a first format, the first set of data including a set of spectral metrics, the first set of data provided by an apparatus configured to obtain the set of spectral metrics based on interactions of electromagnetic radiation with a sample; process the first set of data to obtain a second set of data in a second format different from the first format, the second set of data including the set of spectral metrics; and transmit the second set of data; and a second computing device configured to: receive the second set of data; apply one or more trained models to at least one of the set of spectral metrics and a set of values based on the set of spectral metrics to obtain a result, the one or more trained models trained on a set of training samples for a particle of interest; based on the result, determine either a positive particle of interest detection or a negative particle of interest detection for the particle of interest for the sample; and transmit to the first computing device either the positive particle of interest detection or the negative particle of interest detection for the particle of interest for the sample, wherein the first computing device is further configured to: generate a particle of interest detection notification that indicates either the positive particle of interest detection or the negative particle of interest detection for the particle of interest for the sample; and provide the particle of interest detection notification.


In some aspects, the techniques described herein relate to a system, further including the apparatus, wherein the apparatus is configured to: obtain the set of spectral metrics based on interactions of electromagnetic radiation with a sample; and provide the first set of data including the set of spectral metrics to the first computing device.


In some aspects, the techniques described herein relate to a system wherein the first computing device is further configured to transmit metadata associated with at least one of the apparatus, the sample, a date and time at which the apparatus obtains the set of spectral metrics to the second computing device, and the second computing device is further configured to: receive the metadata; and store the metadata in association with the result.


In some aspects, the techniques described herein relate to a system wherein spectral metrics in the set of spectral metrics are one of absorbance metrics, transmittance metrics, reflectance metrics, and scattering metrics.


In some aspects, the techniques described herein relate to a system wherein the electromagnetic radiation includes at least one of ultraviolet light, visible light, and infrared light.





BRIEF DESCRIPTION OF THE DRAWINGS


FIG. 1A depicts an example particle of interest detection environment in some embodiments.



FIG. 1B depicts another example particle of interest detection environment according to some embodiments.



FIG. 2A depicts an example food processing apparatus, an example computing device, and an example spectral acquisition apparatus in some embodiments.



FIG. 2B depicts another example food processing apparatus, an example computing device, and an example of spectral acquisition apparatus in some embodiments.



FIG. 3A is a block diagram of components of an example computing device in some embodiments.



FIG. 3B is a block diagram of components of an example particle of interest detection system in some embodiments.



FIG. 4 is a flowchart showing a method for detecting particles of interest in some embodiments.



FIG. 5 depicts in further detail an initialization phase and a data retrieval phase of a method for detecting particle of interests in some embodiments.



FIG. 6 depicts in further detail a data pre-processing phase of a method for detecting particle of interests in some embodiments.



FIG. 7 depicts in further detail a data processing phase of a method for detecting particle of interests in some embodiments.



FIG. 8 depicts in further detail a notification phase of a method for detecting particle of interests in some embodiments.



FIG. 9 is a flowchart showing a method for training sets of decision trees for detecting particle of interests in some embodiments.



FIG. 10 depicts an example user interface for providing authentication information according to some embodiments.



FIG. 11 depicts an example user interface for providing information regarding a particular sample according to some embodiments.



FIG. 12 depicts an example user interface for displaying particle of interest detection results according to some embodiments.



FIG. 13 depicts a graph of absorption for a set of wavelengths for multiple instances of light that passed through multiple samples containing different concentrations of E. coli and multiple samples that do not contain E. coli in some embodiments.



FIG. 14A depicts a confusion matrix for the results of applications of a set of trained decision trees to multiple sets of values based on intensity measurements of multiple instances of light that have passed through multiple testing samples containing E. coli at seven different concentrations and multiple testing samples that do not contain E. coli in some embodiments.



FIG. 14B depicts a confusion matrix for the results of applications of a set of trained decision trees to multiple sets of values based on intensity measurements of multiple instances of light that have passed through multiple testing samples containing E. coli at four different concentrations and multiple testing samples that do not contain E. coli in some embodiments.



FIG. 15 depicts a confusion matrix for the results of applications of a set of trained decision trees to multiple sets of values based on intensity measurements of multiple instances of light that have passed through multiple testing samples containing E. coli at seven different concentrations and multiple testing samples that do not contain E. coli in some embodiments.



FIG. 16 depicts a graph of absorption for a set of wavelengths for multiple instances of light that passed through multiple samples containing different concentrations of microspheres in some embodiments.



FIGS. 17A and 17B depict confusion matrices for results of applications of two sets of trained decision trees to multiple sets of values based on intensity measurements of multiple instances of light that have passed through samples containing 1000 nanometer microspheres or samples that do not contain microspheres in some embodiments.



FIG. 18 depicts a confusion matrix for the results of applications of a set of trained decision trees to multiple sets of values based on intensity measurements of multiple instances of light that have passed through samples containing 500 nanometer microspheres or samples that do not contain microspheres in some embodiments.



FIGS. 19A and 19B depict confusion matrices for results of applications of two sets of trained decision trees to multiple sets of values based on intensity measurements of multiple instances of light that have passed through samples containing 50 nanometer microspheres or samples that do not contain microspheres in some embodiments.



FIG. 20 depicts a confusion matrix for the results of applications of a set of trained decision trees to multiple sets of values based on intensity measurements of multiple instances of light that have passed through samples containing green microspheres at ten different concentrations or samples that do not contain microspheres in some embodiments.



FIG. 21A depicts a confusion matrix for the results of applications of a set of trained decision trees to multiple sets of values based on intensity measurements of multiple instances of light that have passed through samples containing green microspheres, samples containing red microspheres, or samples that do not contain microspheres in some embodiments.



FIG. 21B depicts a confusion matrix for the results of applications of a set of trained decision trees to multiple sets of values based on intensity measurements of multiple instances of light that have passed through samples containing green microspheres, samples containing red microspheres, samples containing a mixture of red microspheres and green microspheres, or samples that do not contain microspheres in some embodiments.



FIG. 22 depicts a comparison of current food safety testing procedures and the particle of interest detection system according to some embodiments.



FIG. 23 depicts a block diagram of an example digital device in some embodiments.





Throughout the drawings, like reference numerals will be understood to refer to like parts, components, and structures.


DETAILED DESCRIPTION

A government agency such as the U.S. FDA may not declare a foodborne illness outbreak until after a large number of persons have been sickened and/or if a producer receives a positive result. Before declaring the outbreak, the government agency may have to perform an investigation to determine the food that is causing the outbreak, which may be difficult to do and/or take significant time. If the government agency is able to determine the food, testing for foodborne pathogens has to be performed to identify the particular foodborne pathogen responsible for the foodborne illnesses. The investigation and testing may take a large amount of time, during which more persons may be affected by the contaminated food. One reason for the large amount of time is that it may take approximately 42 hours to approximately 132 hours to obtain test results confirming a foodborne pathogen.


In various embodiments, systems and methods discussed herein may enable early detection of foodborne pathogens during food farming, harvest, production or processing (collectively, food processing) across the entire supply chain (farm to fork). The systems may utilize spectral acquisition apparatuses, which may be or include device that obtains and/or generates spectral metrics, such as spectrometers, spectrophotometers, or thermal emissions spectrometers, or other electromagnetic sources and detectors. The spectral acquisition apparatuses may obtain and/or generate spectral metrics acquired by causing and/or detecting electromagnetic radiation interacting with water, environmental surfaces/spaces, preparation surfaces, drains, food, or any other zones as defined by government agencies (e.g., the U.S. FDA) used or produced by food processing apparatuses. The spectral acquisition apparatuses may transmit the spectral metrics to a particle of interest detection system that utilizes one or more computing systems containing machine learning (ML) and/or artificial intelligence (AI) models to detect evidence of foodborne pathogens from the spectral metrics. The particle of interest detection system may provide results to personnel working in food processing facilities, farmers, distributors, retail, or the household. In the event of a positive detection of a foodborne pathogen, the food may be immediately quarantined and start remedial measures, such as cleaning food processing equipment, discarding contaminated food, and/or performing additional testing or detection to determine root cause and corrective action.


Such early detection of foodborne pathogens allows food processors to identify contaminated food prior to shipping the food out to wholesalers, distributors, retailers, and/or consumers. This early detection may save food processors the costs of recalling food, which may be significant. In addition, early detection may prevent or reduce the occurrence of foodborne illness outbreaks, which may prevent or reduce illnesses, hospitalizations, and deaths.


In various embodiments, the systems and methods described herein are applicable to detect a wide variety of foodborne pathogens that cause foodborne illnesses. Such foodborne pathogens include norovirus, Salmonella (non-typhoidal), Clostridium perfringens, Campylobacter, Staphylococcus aureus, Toxoplasma gondii, Escherichia coli (E. coli), Clostridium botulinum, Cryptosporidium, Cyclospora, hepatitis A virus, Shigella, Yersinia, and Listeria monocytogenes (Listeria), among many others. The particle of interest detection systems may train one or more ML and/or AI models for each foodborne pathogen. Upon receiving spectral metrics from spectral acquisition apparatuses, the particle of interest detection systems may apply the trained machine learning and/or AI models to the spectral metrics. In this way, the particle of interest detection systems may be able to detect multiple particle of interests from spectral metrics of a single sample of a food processing byproduct, rather than using multiple assays and reagents to test particles of interest. One advantage of some embodiments of the systems and methods described herein is that they may decrease the Limit of Detection (LOD) from the Classical Limit of Detection (cLOD) of the spectral analysis equipment, which is limited by physics, to the machine learning limit of detection (mlLOD) that may be one to two orders of magnitude lower than the cLOD. Current detection technology requires the sample to be enriched and incubated in order to stimulate growth/replication in order to detect. The proposed technology reduces the enrichment and incubation time and cost due to the lower LOD (mlLOD) that are capable of detecting very low concentrations of organisms without the need for enrichment/incubation.


In various embodiments, the spectral acquisition apparatuses may be or include spectrometers or other spectral analysis technology, such as commercially available spectrometers or customized UV/VIS/NIR/MWIR/LWIR sensors that are capable of communicating with the particles of interest detection system or are couplable to digital devices capable of communicating with the particles of interest detection system. Food processors may widely deploy the spectral acquisition apparatuses at food processing facilities to detect foodborne pathogens in their food processing. The particles of interest detection systems and associated methods described herein, because they provide more accurate results more quickly and economically than other systems and methods, are broadly applicable to any location where food is processed, such as farms, food processing facilities, packaging facilities, distributors, restaurants, grocery stores, homes, and other locations. Accordingly, the particles of interest detection systems and associated methods described herein may provide significant benefits to farmers, food processors, distributors, restaurant operators, grocery store operators, households, consumers, and others (e.g., any entity in the farm-to-fork supply chain).


The particles of interest detection systems and associated methods, due to the ability to perform rapid and continuous testing of foods, also allow for food processors to quarantine food that may be contaminated by particles of interests prior to shipping out such food. For example, a food processor, upon detection of a foodborne pathogen during a particular food processing run, may be able to quarantine food processed during that run or food processed after the last “clean” test prior to shipping out that food. The food processor may then test the food (e.g., using laboratory tests) to confirm the presence of foodborne pathogens. The food processor may also be able to clean food processing equipment and/or parts of the food processing facility to prevent or reduce contamination of further food. The food processor may then retest food processing byproducts and/or equipment for contamination. As a result, the food processor may confirm that the machinery and/or byproducts are “clean” (e.g., without detected foodborne pathogens) before returning to food processing.


Accordingly, food processors may be able to reduce economic costs associated with foodborne illness outbreaks (and lower food recall insurance premiums due to significantly lower risk). Furthermore, effects on individual health and/or public health may be avoided or reduced by the deployment of the particle of interest detection systems and associated methods described herein.


The particle of interest detection systems and associated methods described herein may also aid food processors (and all other entities in the food supply chain from “farm to fork” in complying with food safety laws and regulations, such as those promulgated by government agencies such as the U.S. FDA.


The particle of interest detection systems and associated methods may also be utilized to detect other particles, allergens, contaminants, or pollutants that can harm human health, human safety, and/or the environment. Accordingly, the particle of interest detection systems and associated methods described herein may also aid community water systems and/or other water suppliers with complying with water quality standards, such as those promulgated by government agencies such as the U.S. Environmental Protection Agency, FDA (for pharmaceuticals), or semiconductor industry associations specifying water quality.


In various embodiments, the particle of interest detection systems and associated methods discussed herein may enable early detection of persons infected with SARS-CoV-2 (or other diseases) prior to those persons presenting symptoms. In one example, persons may provide samples, obtained from saliva, nasal, skin, or other swabs, and/or breath. A spectral acquisition apparatus may perform spectrometer scans of the samples and transmit such scans to particle of interest detection systems for processing. The particle of interest detection systems may apply machine learning algorithms to detect pathogens (e.g., SARS-CoV-2 virions) or other harmful diseases or contamination in the samples. The particle of interest detection systems may then transmit results to the spectral acquisition apparatuses and/or to personal devices of the persons who gave samples. The entire process may return results quickly (e.g., within seconds or minutes), which is a significant improvement over the time it takes to obtain results from PCR, antigen, or other biological tests. Because results can be obtained quickly, infected individuals can be quickly identified, and such individuals may take appropriate measures to prevent the spread to others and seek timely medical care.


The spectral acquisition apparatuses may be or include spectrometers and/or spectrophotometers. The spectrometers may be commercially available optical spectrometers that are capable of communicating with the particle of interest detection systems or are couplable to digital devices capable of communicating with the particle of interest detection systems. As such, the spectral acquisition apparatuses may be widely distributed across geographies and deployed to locations where screening and/or detection of persons infected with SARS-CoV-2 (and/or other communicable diseases) is important. Such locations may include popular entertainment venues (e.g., concert halls), medical facilities (e.g., hospitals), business sites of common carriers (e.g., airports and train stations), and the like. As the particle of interest detection systems may provide results quickly, the screening and/or detection of concert attendees, patients, medical personnel and visitors, travelers and others may be performed rapidly without unduly interfering with the movement of persons into and out of such locations.


For example, individuals in concert sound and stage crews may wish to know of a SARS-CoV-2 infection as soon as possible. This is because such individuals may interact with musicians who must perform or risk significant financial losses. The particle of interest detection systems as described herein, because they can provide quick results, and can provide them accurately and repeatedly, may allow for rapid identification of infected individuals. Accordingly, the particle of interest detection systems can reduce disruptions to concerts and other musical events.


More generally, the particle of interest detection systems and associated methods described herein, because they provide more accurate results more quickly and potentially more economically than PCR, enzyme-linked immunosorbent assay, immunofluorescent assay, and other biological testing methods, may be broadly applicable to any location, event, or circumstances where it is desirable to rapidly identify infected individuals and potentially quarantine them. Accordingly, the particle of interest detection systems and associated methods described herein may reduce social and/or economic disruptions, which may provide significant health and economic benefits to individuals, organizations, and governments. Rapid, low cost, reagent free, and digital spectral test results has the ability to quickly contain the spread of new diseases and/or biological agents that may cause harm to humans.


While detection of SARS-CoV-2 virions is discussed herein, it will be appreciated that the particle of interest detection systems may detect other pathogens, such as, but not limited to, human immunodeficiency virus (HIV) virions, the various strains of influenza virus virions, and/or the like in addition to or as an alternative. Furthermore, the particle of interest detection systems may detect chemical or protein composition from a number of sources. As such, some of the methods and systems described herein may be applied to food composition analysis, chemical composition analysis, water purity, and the like.


While systems described herein may refer to detection of the pathogens, it will be appreciated that the systems described herein may detect specific pathogens by identifying or detecting a body's reaction to such pathogens in addition to or instead of the pathogen itself. Similarly or alternatively, the systems described herein may determine possible or likely infection by detecting chemical or biological materials (e.g., proteins) that may be related to infection by one or more pathogens.


The systems and methods described herein are also applicable to detection of environmental hazards such as ethylene, methylene, other volatile organic compounds (VOCs), and biological agents intended to cause harm to humans. Accordingly, the systems and methods allow for early detection and appropriate mitigation of such environmental hazards. In general, the systems and methods described herein may be utilized to detect any particle of interest that may pose a threat to human and/or animal health and/or harm the environment.



FIG. 1A depicts an example particle of interest detection environment 100 in some embodiments. The particle of interest detection environment 100 includes a food processing apparatus 106, a spectral acquisition apparatus 102, a computing device 110, a communication network 108, and a particle of interest detection system 104. Although a single spectral acquisition apparatus 102 is depicted in FIG. 1A, the particle of interest detection environment 100 may include any number of spectral acquisition apparatuses 102. Although a single food processing apparatus 106 is depicted in FIG. 1A, the particle of interest detection environment 100 may include any number of food processing apparatuses 106. Although a single particle of interest detection system 104 is depicted in FIG. 1A, the particle of interest detection environment 100 may include any number of particle of interest detection systems 104. The particle of interest detection environment 100 may also include other systems, apparatuses, devices, machines, and/or components not illustrated in FIG. 1A, such as cleaning systems, water supply and water drain systems, and/or electrical and communication systems.


The food processing apparatus 106 may be or include any device, machine, and/or apparatus that processes food or facilitates processing food for human or animal consumption. For example, the food processing apparatus 106 may be a washing machine that washes fruits and vegetables such as leafy greens, apples, carrots, and the like using water. As another example, the food processing apparatus 106 may be a commercial spinner that dries washed lettuce and other vegetables, which produces water to be drained away. Other examples of food processing apparatuses 106 are within the scope of this disclosure. The food processing apparatus 106 may be or include any number of digital devices. Digital devices are discussed, for example, with reference to FIG. 23. The food processing apparatus 106 may be connected to the communication network 108.


The spectral acquisition apparatus 102 may be utilized as a part of a method to detect particles of interest. Particles of interest may be pathogens that affect the environment and/or human health and/or safety. For example, the particles of interest may include metals such as lead, arsenic, or mercury, toxins produced by bacteria such as coliform bacteria that may be found in water, air, or other bodily fluid such as blood or urine. The spectral acquisition apparatus 102 may be or include any number of digital devices. The spectral acquisition apparatus 102 may be or include any spectral acquisition device, but not limited to spectrometer, spectrophotometers, thermal emissions spectrometer, or other electromagnetic sources and detectors. A spectral acquisition device may be any device capable of obtaining spectral metrics and/or spectral data from the electromagnetic spectrum to find particles of interest, which may be invisible to unaided humans. In one example, the spectral acquisition apparatus 102 may be a Hach DR3900 spectrophotometer of the Hach Company of Loveland, Colorado, United States of America. In some embodiments, the spectral acquisition apparatus 102 may be a spectrometer of the INSION GmbH company of Germany. In another example, the spectral acquisition apparatus 102 may each be or include a different spectrophotometer, spectrometer, sensor, or detector capable of network communication. The spectral acquisition apparatus 102 may perform the functions of a spectrophotometer or a spectrometer. The spectral acquisition apparatus 102 may receive samples of food processing byproducts, detect light that has passed through the samples, and measure intensities of a set of wavelengths of the electromagnetic spectrum or light that has passed through the samples. The spectral acquisition apparatus 102 may then transmit a set of spectral metrics based on the measured intensities for the set of wavelengths of the electromagnetic spectrum or electromagnetic energy to the computing device 110. The set of spectral metrics may be the measured intensities, or they may be other values based on the measured intensities, such as absorbance, transmittance, reflectance, and/or scattering values.


The computing device 110 may be or include any number of digital devices. A software application 112 may be executed by the computing device 110. The software application 112 may receive the set of spectral metrics which the spectral acquisition apparatus 102 obtains based on interactions of electromagnetic radiation with the samples of food processing byproducts. In some embodiments, the software application 112 normalizes multiple sets of spectral metrics received from the spectral acquisition apparatus 102 as the software application 112 receives it. In one example, the software application 112 normalizes multiple sets of spectral metrics after the software application 112 receives all of them.


In one example, the computing device 110 may be a laptop and may be connected to the spectral acquisition apparatus 102 via a physical cable, such as a Universal Serial Bus (USB) cable. In some embodiments, the spectral acquisition apparatus 102 and the computing device 110 are not directly connected via a physical cable but are indirectly connected through a network, such as an IP-based Local Area Network (LAN), which may be part of or connected to the communication network 108. The software application 112 may perform pre-processing steps before transmitting the processed data to the particle of interest detection system 104.


The particle of interest detection system 104 may be or include any number of digital devices. The particle of interest detection system 104 may receive the set of spectral metrics, process the set of spectral metrics as described herein, generate a particle of interest detection notification, and provide the particle of interest detection notification. In one example, the particle of interest detection system 104 utilizes ML and/or AI models to detect evidence of particles of interest based on the set of values. The AI technology enables significantly reduced limits of detection (LOD), well beyond the LOD of the spectrophotometer (or other spectral acquisition device) alone. LOD is classically limited by the capabilities of the hardware and optical components. AI reaches beyond hardware and optical limitations.


In various embodiments, the particle of interest detection system 104 may be a cloud-based application. In one example, the particle of interest detection system 104 may include serverless computing in which a cloud provider assigns computing resources on-demand. In some embodiments, the particle of interest detection system 104 provides the particle of interest detection notification to the computing device 110.


The spectral acquisition apparatus 102, the computing device 110, and/or the particle of interest detection system 104 may, in the event of a positive particle of interest detection notification, notify third-party systems such as those operated by food processors, those operated by government agencies such as the U.S. FDA, and/or those operated by third parties approved by such government agencies. In such an event, the spectral acquisition apparatus 102, the computing device 110, and/or the particle of interest detection system 104 may also recommend further diagnostic analysis by government agencies or other third parties approved by the government agencies.


In some embodiments, communication network 108 represents one or more computer networks (for example, LANs, WANs, and/or the like). The communication network 108 may provide communication between any of the food processing apparatuses 106, any of the spectral acquisition apparatuses 102, any of the computing devices 110, and the particle of interest detection system 104. In some implementations, the communication network 108 comprises computer devices, routers, cables, and/or other network topologies. In some embodiments, the communication network 108 may be wired and/or wireless. In various embodiments, the communication network 108 may comprise the Internet, one or more networks that may be public, private, IP-based, non-IP based, and so forth.


Some embodiments described herein discuss performing spectral analysis on water samples (e.g., obtained from wash water), such as those obtained directly or indirectly from food processing apparatuses 106. It will be appreciated that the spectral acquisition apparatus 102, the computing device 110, and/or the particle of interest detection system 104 may perform spectral analysis on any food processing byproduct. Examples of food processing byproducts include, but are not limited to, water, wash water, oils, greases, animal blood, meat, and feces from animals such as cows, pigs, and chickens. Furthermore, samples may be obtained by swabbing or otherwise sampling food processing equipment, surfaces, residues, or anything that comes into contact with food. It will be understood that food processing byproducts are not limited to the examples described herein.



FIG. 1B depicts another example of particle of interest detection environment 150 according to some embodiments. In the illustrated embodiment of particle of interest detection environment 150, the spectral acquisition apparatus 102 is a spectrophotometer. A sample of food processing byproducts may be obtained and placed in a non-inert container such as a cuvette or test tube. The cuvette may include a machine-readable code such as a barcode or a quick-response (QR) code. The particle of interest detection environment 150 may include a barcode reader 160 or an image capture device coupled to a mobile computing device. The barcode reader may be coupled to the computing device 110 via a cable, such as a USB cable.


The computing device 110 may receive data from a plurality of spectral acquisition apparatuses. The output of each spectral acquisition apparatus may be in different file formats. For example, the output of one spectral acquisition apparatus may be in a comma-separated values (CSV) file format, while the output of another spectral acquisition apparatus may be in an extensible markup language (XML) file format. Furthermore, the format of the data may vary depending on a manufacturer and/or model of the spectral acquisition apparatus.



FIG. 2A depicts an example food processing environment 200 in some embodiments. The food processing environment 200 includes the food processing apparatus 106, the spectral acquisition apparatus 102, and a computing device 110 connected to the spectral acquisition apparatus 102 via a cable 202, which may be a Universal Serial Bus (USB) cable. The food processing apparatus 106 has pieces of produce 204, such as lettuce, on it to be washed. The spectral acquisition apparatus 102 and the computing device 110 may be positioned proximate to the food processing apparatus 106 and may be positioned on a support (e.g., a bench, a table, or the like, not illustrated in FIG. 2A).


The spectral acquisition apparatus 102 may perform scans of samples of food processing byproducts and obtain spectral metrics based on light that has passed through the samples. The samples may be placed in a cuvette 206 positioned in a receptacle of the spectral acquisition apparatus 102. For example, the spectral acquisition apparatus 102 may obtain a set of intensity measurements of a set of wavelengths of the light that has passed through the sample. The spectral acquisition apparatus 102 may convert the intensity measurements in the set of intensity measurements to other values, such as absorbance values, transmittance values, reflection values, scattered values or concentration values. The spectral acquisition apparatus 102 may transmit data including the spectral metrics in a first format to the computing device 110. In some embodiments, the spectral acquisition apparatus 102 transmits the set of intensity measurements to the computing device 110. In some embodiments, the spectral acquisition apparatus 102 measures the detected light in units other than intensity, such as in absorbance units or transmittance units, and transmits the measured other values to the computing device 110. In some embodiments, the spectral acquisition apparatus 102 transmits the spectral metrics to the particle of interest detection system 104. In some embodiments, the spectral acquisition apparatus 102 transmits the spectral metrics to both the computing device 110 and to the particle of interest detection system 104.



FIG. 2B depicts another example food processing environment 250 in some embodiments. In the food processing environment 250, the food processing apparatus 106 is a salad spinner that may be used to dry wet lettuce or other wet produce. The food processing apparatus 106 produces water as it spins, which may drain via one or more drain lines (not illustrated in FIG. 2B). Other like reference numerals in FIG. 2B refers to like elements in FIG. 2A and are not discussed with reference to FIG. 2B.



FIG. 3A is a block diagram of components of a computing device 110 in some embodiments. The computing device 110 includes a software application 112. The software application 112 includes a communication module 302, a device configuration module 304, a pre-processing module 306, a control module 308, a user interface module 310, a notification module 312, and a datastore 314.


The communication module 302 may send and/or receive requests and/or data between the computing device 110 and any of the food processing apparatuses 106, the particle of interest detection system 104, and the spectral acquisition apparatuses 102. The communication module 302 may receive requests and/or data from the food processing apparatuses 106, the particle of interest detection system 104, and/or the spectral acquisition apparatuses 102. The communication module 302 may also send requests and/or data to the food processing apparatuses 106, the particle of interest detection system 104, and/or the spectral acquisition apparatuses 102.


The device configuration module 304 may include an Application Programming Interface (API) or other interface or other interface to allow the one or more spectral acquisition apparatus 102 to communicate with the computing device 110. In some embodiments, the device configuration module 304 may identify a manufacturer, type, or model of the spectral acquisition apparatus 102. Once the device configuration module 304 makes the identification, the software application 112 may identify properties or parameters of spectrometer data that may be sent by spectral acquisition apparatus 102. In one example, the device configuration module 304 determines that the spectral acquisition apparatus 102 is a Hach DR3900 spectrophotometer. Once the device configuration module 304 identifies the manufacturer, type, or model of the spectral acquisition apparatus, the device configuration module 304 may identify a precision or accuracy of the light intensity measurement, available file formats of the output of the spectral acquisition apparatus 102, and other properties associated with the spectral acquisition apparatus. These properties may include whether or software application 112 is able to control the spectral acquisition apparatus 102 or only able to read data from the spectral acquisition apparatus 102, name of the spectral acquisition apparatus 102, data bit depth, step size, integration time, firmware version, and serial number of the spectral acquisition apparatus 102.


The pre-processing module 306 may receive data from the spectral acquisition apparatus 102 in a first format. The pre-processing module 306 may receive from the device configuration module 304 information regarding the file format and the format of the data. In one example, the output of one spectral acquisition apparatus may be in the CSV file format, while the output of another spectral acquisition apparatus may be in an XML file format. The pre-processing module 306 may normalize or arrange the set of values received from the different spectral acquisition apparatuses of the particle of interest detection environment 100 so that it may be inputted into the artificial intelligence and/or machine learning system of the particle of interest detection system 104. The output of the pre-processing module 306 may be a second format, which may be stored in the datastore 314. The second format may include the spectral metrics.


In various embodiments, the file format of the output of one spectral acquisition apparatus is a different file format from another spectral acquisition apparatus. In some embodiments, the pre-processing module 306 may format the data according to the requirements of the artificial intelligence and/or machine learning system of the particle of interest detection system 104.


Certain models of spectral acquisition apparatuses 102 may allow the spectral acquisition apparatus 102 to be remotely controlled by a software application or a computing device. For example, certain models of spectral acquisition apparatuses 102 may include an Application Programming Interface (API) that allows a software application or a computing device to control the spectral acquisition apparatus 102 via the API. In one example, the API of the Hach DR3900 spectrophotometer may include instructions that allow the user of the computing device 110 to identify the number of spectral acquisition apparatuses remotely controllable by the computing device 110, open communication or connect to one or more spectral acquisition apparatuses 102, close communication or disconnect one or more spectral acquisition apparatus 102, provide to the computing device 110 error messages associated with the spectral acquisition apparatus 102, and read one or more sets of values from the spectral acquisition apparatuses.


Other models of spectral acquisition apparatus may allow a software application or a computing device to only receive data from the spectral acquisition apparatus 102. In cases where the spectral acquisition apparatus 102 may be remotely controlled by the software application 112, the control module 308 of the software application 112 may control the spectral acquisition apparatus 102. In one example, the control module 308 may power up or start up the spectral acquisition apparatus 102. In some embodiments, the control module 308 may send a request to the spectral acquisition apparatus 102 to identify and scan a cuvette or test tube placed in the spectral acquisition apparatus 102.


In some embodiments, the control module 308 may send a request to the AI/ML of the particle of interest detection system 104 to detect evidence of particle of interest in the spectrometer scans. The AI/ML determines if the result of the detection indicates a positive particle of interest detection or a negative particle of interest detection. If the result exceeds a threshold, then the AI/ML may output a positive particle of interest detection, and if the result does not exceed a threshold, then the AI/ML may output a negative particle of interest detection. The particle of interest detection system 104 may send the result to the user interface module 310 of the software application 112.


The user interface module 310 provides user interfaces to the graphics display unit 2310 of the digital device 2300 of FIG. 23. In some embodiments, the user interface module 310 provides a user interface that allows users to log in to securely access the software application 112. In one example, the user interface module 310 provides a notification to a user of the computing device 110 of the positive or negative particle of interest detection provided by the AI/ML of the particle of interest detection system 104 in the form of a pop-up window of the user interface.


In some embodiments, the user interface module 310 may provide multiple particle of interest test results to the user interface. An example of this can be found in an example user interface 1200 of FIG. 12. The user interface 1200 includes a table 1210 which provides information regarding multiple particle of interest test results. For example, table 1210 includes a row 1220, which depicts information regarding one particular test result. Each row of table 1210, such as the row 1220, includes information such as the location ID, specimen ID, result, timestamp, data file name, and submission UUID.


In some embodiments, the type of notification provided to the user by the notification module 312 may depend on the type of pathogen being tested or concentration of the pathogen. The notification module 312 may provide a pop-up window to the user interface provided by the user interface module 310. In some embodiments, the notification module 312 may send an email to the user of the software application 112, who may be a lab technician who prepared and tested the sample of water, a text message, a short message/messaging service (SMS), or a telephone call.


The notification module 312 may provide reports, alerts, and/or dashboards that include results, confidence scores, and/or other information.


For example, the computing device 110 may receive the results of particle of interest detections on particular food processing equipment as well as what food was processed on the food processing equipment. As another example, the computing device 110 may receive the results of particle of interest detections in certain parts of a food processing facility as well as what food was processed in those certain parts. The computing device 110 may thus be able to identify food (e.g., particular lots or production runs) and recommend, via the notification module 312, that remedial action, such as quarantining food, recalling food, or other action, should be taken. The notification module 312 may optionally notify appropriate third parties (e.g., government agencies such as the U.S. FDA) of the detection of foodborne pathogens. The notification module 312 may, in some embodiments, prepare reports to aid in compliance with food safety laws and regulations.


The datastore 314 may include data stored, accessed, and/or modified by any of the modules of the computing device 110. The datastore 314 may include any number of data storage structures such as tables, databases, lists, and/or the like.


Although the software application 112 is depicted as described as including modules 302 through 312 and notification module 312, the modules and/or datastore may be included in the computing device 110. One or more modules and/or datastore of the software application 112 as seen in FIG. 3A may be included in the computing device 110.



FIG. 3B depicts components of a block diagram of the particle of interest detection system 104 in some embodiments. The particle of interest detection system 104 includes a communication module 352, a processing module 354, a training and curation module 356, a particle of interest prediction module 358, a notification module 360, a spectral metrics datastore 362, an AI/ML datastore 364, a reference datastore 366, and a system datastore 368.


The communication module 352 may send and/or receive requests and/or data between the particle of interest detection system 104 and any of the food processing apparatuses 106, the computing devices 110, and the spectral acquisition apparatuses 102. The communication module 352 may receive requests and/or data from the food processing apparatuses 106, the computing devices 110, and/or the spectral acquisition apparatuses 102. The communication module 352 may also send requests and/or data to the food processing apparatuses 106, the computing devices 110, and/or the spectral acquisition apparatuses 102.


The processing module 354 may receive a request from the software application 112 that data from the spectral acquisition apparatus 102 will be transmitted to the processing module 354. The processing module 354 may receive this request and transmit a request to the particle of interest prediction module 358 that data will be transmitted to the particle of interest prediction module 358. In some embodiments, the data transmitted to the processing module 354 may be a first set of data in the first format. In one example, the data transmitted to the processing module 354 may be a second set of data in the second format. The first set of data in the first format is processed to obtain the second set of data in the second format.


The training and curation module 356 may train an artificial intelligence and/or machine learning system (e.g., such as a set of decision trees) to be applied to the processed data from the spectral acquisition apparatus 102.


The particle of interest prediction module 358 may apply the trained artificial intelligence and/or machine learning system (e.g., such as the set of trained decision trees) to the second set of values to obtain a result. The artificial intelligence and/or machine learning system may have been previously trained by the training and curation module 356. In some embodiments, the interest prediction module 358 may be previously trained to detect particles of interest. Particles of interest may be pathogens that affect the environment or human health and safety. In one example, a result may indicate either a positive (a positive particular of interest detection) or a negative (a negative particular of interest detection) for a foodborne pathogen for the sample of the food processing byproduct.


The artificial intelligence technology enables significantly reduced limits of detection (LOD), well beyond the LOD of the spectrophotometer (or other spectral acquisition device) alone. LOD is classically limited by the capabilities of the hardware and optical components. AI reaches beyond hardware and optical limitations.


The notification module 360 may generate and provide notifications that include results of foodborne pathogen detections of the sample as well as other information, such as a confidence score. The notification module 360 may provide reports, alerts, and/or dashboards that include results, confidence scores, and/or other information. For example, the particle of interest detection system 104 may track foodborne pathogen detections on particular food processing equipment as well as what food was processed on the food processing equipment. As another example, the particle of interest detection system 104 may track foodborne pathogen detections in certain parts of a food processing facility as well as what food was processed in those certain parts. The particle of interest detection system 104 may thus be able to identify food (e.g., particular lots or production runs) and recommend, via the notification module 360, that remedial action, such as quarantining food, recalling food, or other action, should be taken. The notification module 360 may optionally notify appropriate third parties (e.g., government agencies such as the U.S. FDA) of the detection of foodborne pathogens. The notification module 360 may, in some embodiments, prepare reports to aid in compliance with food safety laws and regulations.


The spectral metrics datastore 362 stores raw data received from the spectral acquisition apparatus 102. A data store is any data structure (e.g., one or more tables, databases, and/or the like) for storing information. The raw data may be stored for auditing purposes. In some embodiments, the spectral metrics datastore 362 receives pre-processed data from the pre-processing module 306 of FIG. 3A. The spectral metrics datastore 362 may include any number of data storage structures such as tables, databases, lists, and/or the like.


The AI/ML datastore 364 stores artificial intelligence/machine learning models (e.g., such as a set of decision trees) used to detect evidence of foodborne pathogens in the spectrometer scans. In some embodiments, the AI/ML datastore 364 stores the results of the artificial intelligence/machine learning models. In some embodiments, the artificial intelligence/machine learning models are stored on the particle of interest detection system 104, which may be a cloud-based application. In various embodiments, the artificial intelligence/machine learning models are stored locally on the same network as the food processing apparatus 106. In one example, the artificial intelligence/machine learning models are stored on edge devices. An edge device is a device that provides an opening or entry point into an enterprise network. For security reasons, some corporations may require the artificial intelligence/machine learning models to be stored locally instead of storing the artificial intelligence/machine learning model in a cloud-based application that may be external to the corporation's enterprise network.


The reference datastore 366 stores reference data and metadata associated with the spectrometer data, spectrometer, and properties associated with the data.


For example, reference data may include the name of the manufacturer, model, and the serial number of the spectral acquisition apparatus, operator identifier (ID), and spectral acquisition apparatus firmware version.


In one example, metadata may include a scan universal unique identifier, an external reference identifier, a specimen scanned data timestamp, which may be the date and time at which the spectral acquisition apparatus obtains the set of spectral metrics, spectral data file name, target particle or pathogen, inference request type name, operator ID, scan mode type code or name, medium name, modality name, device model, scan code or version code, device ID, location code name, customer ID, software version number code, and device firmware version identifier.


The scan universal unique identifier may be an identifier or identification number associated with a particular sample or specimen.


The external reference identifier may be a unique identifier for a particular sample assigned by a customer. The external reference identifier may link to a particular data file and metadata associated with the particular sample. The external reference identifier may enable a chain of custody which may be used for tracking purposes.


The specimen scanned data timestamp may include a date and timestamp of when the sample or specimen was scanned by the computing device 110.


The spectral data file name may be the name of the spectral data file.


The target particle or pathogen may be a name of the pathogen or particle that the particle of interest detection system 104 is trying to detect. In some embodiments, the target particle or pathogen may be E. Coli, Salmonella, or Listeria, or an infectious pathogen of humans such as Respiratory Syncytial Virus (RSV), or Coronavirus (COVID-19).


The inference request type name specifies whether the result or inference needs to be sent to the software application 112 or not. The metadata associated with the inference request type name may be synchronous or asynchronous. A synchronous value may represent that the result of the data acquired and processed by the particle of interest detection system 104 may be communicated or sent to the software application 112. An asynchronous value may represent that the result of the data acquired and processed by the particle of interest detection system 104 may not be communicated or sent to the software application 112.


In one example, the operator ID may be an internal ID number or name associated with a particular user of the 110 or laboratory technician.


Metadata associated with the scan mode type code or name may specify whether the sample or specimen was scanned in transmission mode or absorption mode. The results of scans in transmission mode may be expressed as a percentage or ratio (% T). The results of scans in absorption mode may be expressed in Absorption Unit (AU). In one example, for Hach spectrophotometers, to convert from Absorption Unit to transmission ratio, the following equation may be utilized:





% T=antilog(2−AU)


In various embodiments, the medium name identifies the medium in which the target particle is prepared for a scan. Examples of medium names may include DI Water, phosphate buffer solution (PBS), saline water, CITOSWAB®.


In another example, the modality name may refer to where the target particle was collected from. Examples of modality names include low nasal swabs, upper nasal swabs, oral swabs, oral rinse, urine, and blood sample.


The device mode may refer to whether the spectral metrics is processed by the artificial intelligence and/or machine learning system right away or if the spectral metrics is not processed right away.


In some embodiments, the scan code or version code may be a code that is generated by the particle of interest detection system 104 when the particle of interest detection system 104 is in operation mode (e.g., non-training mode). The scan code or version code may be used to validate user input.


The device ID may be an identifier assigned by the software application.


In one example, the location code name may represent an identifier of where a particular piece of equipment or device is deployed for sample/specimen collection.


In some embodiments, the customer ID is an identifier for each customer of the particle of interest detection system 104.


The software version number code and device firmware version identifier may represent or identify the version of the software being used.


The system datastore 368 stores the results of foodborne pathogen detections of the sample as well as other information, such as a confidence score. Reports and/or dashboards that include results, confidence scores, and/or other information may be stored in the system datastore 368.


A module of the computing device 110 or the particle of interest detection system 104 may be hardware, software, firmware, or any combination. For example, each module may include functions performed by dedicated hardware (e.g., an Application-Specific Integrated Circuit (ASIC), a Field Programmable Gate Array (FPGA), or the like), software, instructions maintained in ROM, and/or any combination. Software may be executed by one or more processors. Although a limited number of modules are depicted in FIGS. 3A and 3B, there may be any number of modules. Further, individual modules may perform any number of functions, including functions of multiple modules, as shown herein. Further, modules depicted as being included in the computing device 110 may be additionally or alternatively included in the particle of interest detection system 104, and modules included in the particle of interest detection system 104 may be additionally or alternatively included in the computing device 110.



FIG. 4 is a flowchart showing a method 400 for detecting particles of interest in some embodiments. Various modules of both the computing device 110 and the particle of interest detection system 104 may perform the method 400. In some embodiments, various modules of only the computing device 110 perform the method 400. In some embodiments, various modules of only the particle of interest detection system 104 perform the method 400.


The method 400 includes an initialization phase 402, a data retrieval phase 404, a data pre-processing phase 406, a data processing phase 408, and a notification phase 410. More details regarding each of these steps will be discussed in FIG. 5 through FIG. 8.



FIG. 5 depicts in further detail of the initialization phase 402 and the data retrieval phase 404 in some embodiments. The initialization phase 402 includes steps 502 through 506. The data retrieval phase 404 includes step 508.


In step 502, a user of the software application 112 may interact with an input device of the computing device 110 to authenticate the user. The communication module 302 may receive data to authenticate a user. The data may include a username and password, which the user may input by interacting with the fields 1010 and 1012 of FIG. 10. The software application 112 or the particle of interest detection system 104 may authenticate the user. In some embodiments, the user may provide biometric data (such as by a fingerprint sensor integrated into the computing device 110, by facial or voice recognition, and/or by other ID verification means—e.g., a radio frequency identification (RFID) card or the like).


In step 504, the device configuration module 304 may receive set-up data associated with the spectral acquisition apparatus 102. The setup data may include data such as a manufacturer, type, or model of the spectral acquisition apparatus 102. The user of the computing device 110 may input this information to the software application 112. In some embodiments, each piece of equipment or asset may be tagged with barcodes, RFID, or QR codes. Data associated with the tags may include information such as a date a particular piece of equipment was purchased, firmware version, last time the equipment was service, and the like. The device configuration module 304 may further receive information regarding the spectral acquisition apparatus 102, such as whether or not the software application 112 is able to control the spectral acquisition apparatus 102 or only able to read data from the spectral acquisition apparatus 102.


In step 506, the device configuration module 304 may receive set up data associated with another apparatus, such as the food processing apparatus 106. The set up data may include data such as the different types of food items which may be processed by the food processing apparatus 106 or the type of testing medium. In some embodiments, the device configuration module 304 may receive additional information such as the frequency with which the food processing byproduct is sampled.


As discussed herein, the particle of interest detection system 104 may be employed to detect foodborne pathogens. The following describes detection of foodborne pathogens, but it will be appreciated that the particle of interest detection system 104 may be utilized to detect any particle of interest.


A person working in a food processing facility in which the food processing apparatus 106 is located may fills the cuvette 206 or other suitable container with a sample of food processing byproduct and place the cuvette 206 in an appropriate receptacle of the spectral acquisition apparatus 102. The person may fill the cuvette 206 periodically, as needed, for each lot or shipment of food to be processed or on a predetermined schedule. It will be understood that samples of food processing byproducts may be tested at various times. In some embodiments, the cuvette 206 may be filled by an automated device or system without intervention by a person. In some embodiments, the sample of food processing byproduct may be mixed with a reagent and/or cultivated in a solution, such as a tryptic soy broth (TSB) solution. In some embodiments, the sample of food processing byproduct may be mixed with a neutral or inert substance.


After filling the cuvette 206, the person may then start a scan of the sample of the food processing byproduct using an interface of the spectral acquisition apparatus 102. Additionally, or alternatively, the person may start the scan using the computing device 110, which may control the spectral acquisition apparatus 102. The spectral acquisition apparatus 102 may generate electromagnetic radiation that interacts with the sample. For example, the spectral acquisition apparatus 102 may generate light that passes through at least a portion of the sample of the food processing byproduct in the cuvette 206 and detect the light that has passed through at least the portion of the sample of the food processing byproduct in the cuvette 206. The spectral acquisition apparatus 102 may measure the intensities of the detected light for a set of wavelengths of the light and obtains a set of intensity measurements for the set of wavelengths of the light. In some embodiments, the set of wavelengths of light includes wavelengths of light in the ultraviolet, visible, and infrared spectrums. In some embodiments, the set of wavelengths of light includes wavelengths of light ranging from approximately 300 nanometers (nm) (for example, approximately 320 nm) to approximately 1100 nm (for example, approximately 1100 nm). In some embodiments, the spectral acquisition apparatus 102 has a resolution of 1 nm and obtains a set of 781 intensity measurements for a set of 781 wavelengths of light. The spectral acquisition apparatus 102 obtains a set of spectral metrics based on interactions of the electromagnetic radiation with the sample.


Additionally or alternatively, the spectral acquisition apparatus 102 may measure absorbance, transmittance, reflectance and/or scattering of electromagnetic radiation to obtain the set of spectral metrics. The electromagnetic radiation generated by the spectral acquisition apparatus 102 may include ultraviolet light, visible light, and/or infrared light. The spectral acquisition apparatus 102 may also utilize electromagnetic radiation from portions of the electromagnetic spectrum other than ultraviolet, visible, and infrared portions.


In step 508, the communication module 302 receives a first set of data in a first format from the spectral acquisition apparatus 102. The first set of data includes the set of spectral metrics. The set of spectral metrics may be or be based on a set of intensity measurements for a set of wavelengths of light that the spectral acquisition apparatus 102 obtained. Additionally or alternatively, the set of spectral metrics may be or be based on absorbance, transmittance, reflectance and/or scattering measurements that the spectral acquisition apparatus 102 obtains. When the communication module 302 receives the set of spectral metrics, the user interface module 310 may provide the set of spectral metrics to a user interface regarding the particular specimen. For example, the example user interface 1100 of FIG. 11 may be provided by the user interface module 310 to a graphics display of the computing device 110. The example user interface 1100 may include information such as the specimen container ID, external reference ID, target particle, and the like. For example, a field 1110 of the example user interface 1100 may provide metadata such as a request type. A field 1120 of the example user interface may provide metadata such as the target particle. At the same time, a field 1130 of the example user interface may provide metadata such as the location code.


In some embodiments, the set of spectral metrics from the spectral acquisition apparatus 102 may be stored in the light intensity measuring datastore 362 of the particle of interest detection system 104. A copy of data from the spectral acquisition apparatus 102 may be stored in the particle of interest detection system 104 for various purposes, such as for auditing purposes.



FIG. 6 depicts in further detail the data pre-processing phase 406 in some embodiments.


In step 602, the pre-processing module 306 may receive file format data from the spectral acquisition apparatus 102. In one example, the output of one spectral acquisition apparatus may be in the CSV file format, while the output of another spectral acquisition apparatus may be in an XML file format. In step 604, the pre-processing module 306 may receive data regarding the format of data from the spectral acquisition apparatus 102.


In step 606, the pre-processing module 306 processes or normalizes the first set of data including the set of spectral metrics received from the spectral acquisition apparatus 102 so that it may be inputted to the artificial intelligence and/or machine learning system of the particle of interest detection system 104. The output of the pre-processing module 306 may be a second set of data, which includes the set of spectral metrics, which may be stored in the datastore 314.



FIG. 7 depicts in further detail the data processing phase 408 in some embodiments. In some embodiments, steps of the data processing phase 408 are executed by the particle of interest detection system 104.


At step 702, the particle of interest prediction module 358 applies one or more machine learning or artificial intelligence models to the set of spectral metrics or a set of values based on the set of spectral metrics received from the pre-processing module 306. In some embodiments, the particle of interest prediction module 358 applies a set of trained decision trees to the set of spectral metrics or a set of values based on the set of spectral metrics received from the pre-processing module 306. The training and curation module 356 may have trained a set of decision trees on a set of training samples. A first subset of training samples of the set of training samples contain a particle of interest at a first concentration, and a second subset of training samples of the set of training samples contain the particle of interest at a second concentration different from the first concentration. In some embodiments, the particle of interest prediction module 358 utilizes the following Python code to apply the set of trained decision trees to the set of values to obtain a result:






y_pred=model.predict(x_test)


In this code, y_pred is the result and x_test is testing data. In some embodiments, the set of trained decision trees may operate in a binary mode. In such embodiments, the result may be a float that has a value between zero, inclusive, and one, inclusive. In some embodiments, the set of trained decision trees may operate in a multiclass mode. In such embodiments, the result may be an integer that has value of either zero, one, or another integer value greater than one.


At step 704, the particle of interest prediction module 358, based on the result, determines either a positive particle of interest detection or a negative particle of interest detection for the particle of interest in the sample of the food processing byproduct. In some embodiments, the particle of interest prediction module 358 determines that the result indicates a positive particle of interest detection if the result meets or exceeds a threshold, and that the result indicates a negative particle of interest detection if the result does not meet or exceed the threshold. In embodiments where the result is a float value between zero, inclusive, and one, inclusive, the threshold may be 0.5. In embodiments where the result is an integer with a value of either zero, one, or another integer value greater than one, zero indicates a negative result, and one or another integer value greater than one indicates a positive result. As discussed in more detail herein, in such embodiments, the result may indicate both a positive particle of interest detection as well as a concentration of the particle of interest in the sample of the food processing byproduct.


At step 706, the particle of interest prediction module 358, based on the result, determines an approximate concentration or an approximate range of concentrations for the particle of interest in the sample of the food processing byproduct.


At step 708, the particle of interest prediction module 358 obtains a confidence value for the particle of interest in the sample of the food processing byproduct. In some embodiments, the particle of interest prediction module 358 utilizes the following Python code to obtain the confidence score:






y_score=model.predict_proba(x_test)


In this code, y_score is the confidence value, which may be a float that ranges between zero, inclusive, and one, inclusive. The closer the value is to zero the higher the degree of confidence that the result is negative, and the closer the value is to one the higher the degree of confidence that the result is positive. In some embodiments, the confidence value may be expressed as a percentage between 0% and 100%, inclusive.



FIG. 8 depicts in further detail of the notification phase 410 in some embodiments.


At step 802, the notification module 360 generates a particle of interest detection notification that indicates either the positive particle of interest detection or the negative particle of interest detection for the particle of interest in the sample of the food processing byproduct. In some embodiments, the particle of interest detection notification further indicates the approximate concentration or the approximate range of concentrations for the particle of interest in the sample of the food processing byproduct. In some embodiments, the particle of interest detection notification further indicates the confidence value for the particle of interest in the sample of the food processing byproduct. In some embodiments, if the confidence value is within a certain range or above or below a certain threshold, the particle of interest detection notification may include a flag indicating such. For example, if the confidence value is below a certain threshold, the particle of interest detection notification may flag that there is low confidence in the result. As another example, if the confidence value is above a certain threshold, the particle of interest detection notification may flag that there is high confidence in the result.


At step 804, the notification module 360 provides the particle of interest detection notification. In some embodiments, the notification module 360 provides the particle of interest detection notification to the computing device 110 for display by the user interface module 310. The notification module 360 may provide the particle of interest detection notification for display by other digital devices. At step 806, the notification module 360 generates and provides reports, such as dashboards, spreadsheets, or the like, which include results, confidence scores, and/or other information.


In some embodiments, the set of trained decision trees is a first set of trained decision trees, the set of training samples is a first set of training samples, the particle of interest is a first particle of interest, the result is a first result, the positive particle of interest detection is a first positive particle of interest detection, the negative particle of interest detection is a first negative particle of interest detection, and the particle of interest detection notification is a first particle of interest detection notification. In such embodiments, the particle of interest prediction module 358 may apply a second set of trained decision trees to the second set of values to obtain a second result. The second set of trained decision trees may be trained on a second set of training samples, where a third subset of training samples of the second set of training samples contain a second particle of interest at a third concentration and a fourth subset of training samples of the second set of training samples contain the second particle of interest at a fourth concentration different from the third concentration. The second particle of interest is different from the first particle of interest. For example, the first particle of interest may be a foodborne pathogen such as E. coli, and the second particle of interest may be a foodborne pathogen such as salmonella.


Further, in such embodiments, the particle of interest prediction module 358 may, based on the second result, determine either a second positive particle of interest detection or a second negative particle of interest detection for the second particle of interest in the sample of the food processing byproduct. The particle of interest prediction module 358 may also generate a second particle of interest detection notification that indicates either the second positive particle of interest detection or the second negative particle of interest detection for the second particle of interest in the sample of the food processing byproduct. The notification module 360 may also provide the second particle of interest detection notification.


In some embodiments, the set of spectral metrics is a first set of spectral metrics, and the result is a first result. In such embodiments, the communication module 302 may receive at least one second set of spectral metrics from the computing device 110. The at least one second set of spectral metrics are or are based on at least one second interactions of electromagnetic radiation obtained by the spectral acquisition apparatus 102. For example, the spectral acquisition apparatus 102 may perform multiple scans of the sample of the food processing byproduct and provide the multiple sets of spectral metrics to the particle of interest detection system 104 via the computing device 110. The particle of interest detection system 104 may perform initialization phase 402 to notification phase 410 of the method 400 and thus obtain multiple results.


Further in such embodiments, the particle of interest detection system 104 may then determine either the positive particle of interest detection or the negative particle of interest detection for the particle of interest in the sample of the food processing byproduct based on the multiple results. For example, if the particle of interest detection system 104 obtains three results, the particle of interest detection system 104 may determine either the positive particle of interest detection or the negative particle of interest detection for the particle of interest in the sample of the food processing byproduct based on the two results that have the highest confidence score. As another example, if the particle of interest detection system 104 obtains three results, the particle of interest detection system 104 may determine either the positive particle of interest detection or the negative particle of interest detection for the particle of interest in the sample of the food processing byproduct based on the best two results of the three results. It will be understood that the particle of interest detection system 104 may determine either the positive particle of interest detection or the negative particle of interest detection for the particle of interest in the sample of the food processing byproduct in various ways.



FIG. 9 is a flowchart showing a method 900 for training sets of decision trees for detecting foodborne pathogens in some embodiments. Various modules of the particle of interest detection system 104 perform the method 900. In some embodiments, various modules of the computing device 110 performs the method 900. In some embodiments, various modules of both the computing device 110 and the particle of interest detection system 104 perform the method 900. Although FIG. 9 illustrates and the accompany description describe training sets of decision trees for detecting foodborne pathogens, it will be appreciated that sets of decision trees may be trained to detect any particle of interest. Further, it will be appreciated that other machine learning and/or artificial intelligence models, such as convolutional neural networks, may be trained to detect particles of interest, such as foodborne pathogens.


In some embodiments, a person, such as a laboratory technician, prepares a set of training samples. A foodborne pathogen, such as E. coli, may be cultivated in a solution, such as a tryptic soy broth (TSB) solution. The initial concentration of the foodborne pathogen in the solution may be approximately 1e8 colony-forming units/milliliter (cfu/mL). In some embodiments, the initial concentration of the foodborne pathogen in the solution may range from approximately 1e6 to approximately 1e8 cfu/mL. In some embodiments, the initial concentration of the foodborne pathogen in the solution may be greater than approximately 1e8 cfu/mL. In some embodiments, the initial concentration of the foodborne pathogen in the solution may be lower than approximately 1e8 cfu/mL.


A first subset of training samples at the initial concentration of approximately 1e8 cfu/mL may be prepared. A second subset of training samples may be prepared that have been diluted 10:1 from the initial concentration using the solution, so as to have a second concentration of approximately 1e7 cfu/mL. A third subset of training samples may be prepared that have been diluted 100:1 from the initial concentration using the solution, so as to have a third concentration of approximately 1e6 cfu/mL. A fourth subset of training samples may be prepared that have been diluted 1000:1 from the initial concentration using the solution, so as to have a fourth concentration of approximately 1e5 cfu/mL. A fifth subset of training samples may be prepared that have been diluted 10,000:1 from the initial concentration using the solution, so as to have a fifth concentration of approximately 1e4 cfu/mL. A sixth subset of training samples may be prepared that have been diluted 100,000:1 from the initial concentration using the solution, so as to have a sixth concentration of approximately 1e3 cfu/mL. A seventh subset of training samples may be prepared that have been diluted 1,000,000:1 from the initial concentration using the solution, so as to have a seventh concentration of approximately 1e2 cfu/mL. An eighth subset of training samples may be prepared that have been diluted from the initial concentration using the solution, so as to have an eighth concentration of approximately 1 cfu/mL. A ninth subset of training samples may be prepared that contain only the solution, for example, the TSB solution.


In some embodiments, there may be fewer than or more than eight subsets of training samples at different concentrations. In some embodiments, the different subsets of training samples may be diluted using different dilution ratios to obtain different concentrations than those described herein. In some embodiments, the set of training samples contains approximately 2000 training samples that include a foodborne pathogen, which may be referred to herein as positive training samples. In some embodiments, the set of training samples contains fewer than 2000 positive training samples. In some embodiments, the set of training samples contains more than 2000 positive training samples. In some embodiments, the set of training samples contains approximately the same number of training samples that do not include a foodborne pathogen, which may be referred to herein as negative training samples, as the number of positive training samples. In some embodiments, the number of negative training samples is less than the number of positive training samples. In some embodiments, the number of negative training samples is more than the number of positive training samples.


In some embodiments, the set of training samples is prepared prior to the spectral acquisition apparatus 102 scanning the training samples. In some embodiments, the first subset of training samples at the initial concentration of approximately 1e8 cfu/mL are prepared and scanned by the spectral acquisition apparatus 102. Then, the first subset of training samples is diluted 10:1 from the initial concentration to obtain the second subset of training samples at the second concentration of approximately 1e7 cfu/mL, and then the second subset of training samples is scanned by the spectral acquisition apparatus 102. This dilution and scanning may be repeated several times to obtain, and then scan, the third through eighth subsets of training samples.


To scan a training sample, the cuvette 206 or other suitable container may be filled with a training sample and placed in an appropriate receptacle of the spectral acquisition apparatus 102. The person may then start a scan of the sample of the food processing byproduct using an interface of the spectral acquisition apparatus 102. Additionally or alternatively, the person may start the scan using a computing device 110, which may control the spectral acquisition apparatus 102. The spectral acquisition apparatus 102 generates light that passes through at least a portion of the training sample in the cuvette 206 and detects the light that has passed through at least the portion of the training sample in the cuvette 206. The spectral acquisition apparatus 102 measures intensities of the light for a set of wavelengths of the light and obtains a set of intensity measurements for the set of wavelengths of the light. In some embodiments, the set of wavelengths of light includes wavelengths of light in the ultraviolet, visible, and infrared spectrums. In some embodiments, the set of wavelengths of light includes wavelengths of light ranging from approximately 300 nanometers (nm) (for example, approximately 320 nm) to approximately 1100 nm (for example, approximately 1100 nm). In some embodiments, the spectral acquisition apparatus 102 has a resolution of 1 nm and obtains a set of 781 intensity measurements for a set of 781 wavelengths of the light.


The method 900 begins at step 902 where the communication module 352 receives multiple first sets of values from the spectral acquisition apparatus 102 via the computing device 110 based on multiple scans of the first subset of training samples containing the foodborne pathogen at the first concentration. The multiple first set of values are based on multiple sets of intensity measurements for a set of wavelengths of light that the spectral acquisition apparatus 102 obtained for scans of the first subset of training samples containing the foodborne pathogen at the first concentration.


Step 902 may be performed for each subset of training samples containing the foodborne pathogen at a different concentration. That is, the communication module 352 may perform step 902 for the first subset of training samples at the first concentration of approximately 1e8 cfu/mL, for the second subset of training samples at the second concentration of approximately 1e7 cfu/mL, up to and including for the eighth subset of training samples at the eighth concentration of approximately 1 cfu/mL.



FIG. 13 depicts a graph 1300 of absorption for a set of wavelengths for multiple instances of light that passed through multiple samples containing different concentrations of E. coli and multiple samples that do not contain E. coli in some embodiments. The wavelengths of light range from approximately 320 nm to approximately 1100 nm. The graph 1300 shows the absorption by wavelength for multiple training samples containing E. coli at a first concentration (Ecoli1), multiple training samples containing E. coli at a second concentration lower than the first concentration (Ecoli2), multiple training samples containing E. coli at a third concentration lower than the second concentration (Ecoli3), multiple training samples containing E. coli at a fourth concentration lower than the third concentration (Ecoli4), multiple training samples containing E. coli at a fifth concentration lower than the fourth concentration (Ecoli5), multiple training samples containing E. coli at a sixth concentration lower than the fifth concentration (Ecoli6), multiple training samples containing E. coli at a seventh concentration lower than the sixth concentration (Ecoli7), and multiple training samples that do not contain E. coli (TSB). The graph 1300 illustrates that the absorption across the range of wavelengths generally decreases as the concentration of E. coli in a training sample decreases, or conversely, that the absorption generally increases as the concentration of E. coli in a training sample increases.


Returning to FIG. 9, at step 904, the processing module 354 generates multiple second sets of values based on the multiple first sets of values. In some embodiments, the processing module 354 normalizes each value in the multiple second sets of values to be between zero, inclusive, and one, inclusive. The processing module 354 may further process the values in the multiple second sets of values. The processing module 354 may generate the multiple second sets of values using other techniques, such as applying a fitting function to each value in the multiple first sets of values to generate each value in the multiple second sets of values.


Step 904 may be performed for each subset of training samples containing the foodborne pathogen at a different concentration. That is, the processing module 354 may perform step 904 for the first subset of training samples at the first concentration of approximately 1e8 cfu/mL, for the second subset of training samples at the second concentration of approximately 1e7 cfu/mL, up to and including for the eighth subset of training samples at the eighth concentration of approximately 1 cfu/mL.


At step 906, the communication module 352 receives multiple third sets of values from the spectral acquisition apparatus 102 via the computing device 110 based on multiple scans of the subset of training samples that do not contain the foodborne pathogen. At step 508, the processing module 354 generates multiple fourth sets of values based on the multiple third sets of values. In some embodiments, the processing module 354 normalizes each value in the multiple fourth sets of values to be between zero, inclusive, and one, inclusive. The processing module 354 may further process the values in the multiple fourth sets of values. The processing module 354 may generate the multiple fourth sets of values using other techniques, such as applying a fitting function to each value in the multiple third sets of values to generate each value in the multiple fourth sets of values.


At step 910, the training and curation module 356 prepares training data based on the multiple second sets of values and the multiple fourth sets of values. The training and curation module 356 also prepares training labels for the training data. In embodiments where the set of trained decision trees operate in a binary mode, a training label may be either a zero (0) for a negative training sample and a one (1) for a positive training sample. In embodiments where the set of trained decision trees operate in a multiclass mode, a training label may be either a zero (0) for a negative training sample, a one (1) for a positive training sample having a foodborne pathogen concentration at a first concentration, a two (2) for a positive training sample having a foodborne pathogen concentration at a second concentration, a three (3) for a positive training sample having a foodborne pathogen concentration at a third concentration, a four (4) for a positive training sample having a foodborne pathogen concentration at a fourth concentration, a five (5) for a positive training sample having a foodborne pathogen concentration at a fifth concentration, a six (6) for a positive training sample having a foodborne pathogen concentration at a sixth concentration, a seven (7) for a positive training sample having a foodborne pathogen concentration at a seventh concentration, and an eight (8) for a positive training sample having a foodborne pathogen concentration at an eighth concentration. In some embodiments, there are fewer than eight different concentrations of the foodborne pathogen in the training samples and a corresponding lower number of different training labels. In some embodiments, there are more than eight different concentrations of the foodborne pathogen in the training samples and a corresponding higher number of different training labels.


At step 912, the training and curation module 356 trains a set of decision trees for the foodborne pathogen. In some embodiments, the training and curation module 356 utilizes an optimized distributed gradient boosting library, XGBoost. In some embodiments, the training and curation module 356 utilizes the following Python code to create each set of decision trees:

















 from xgboost import XGBClassifier



 params = {“booster”: “gbtree”,



  “objective”:“binary:logistic”,



  “max_delta_step”:20,



  “eval_metric”:“error”,



  “n_estimators”:10000,



  “verbosity”:0,



  “max_depth”:500,}



  self.config.params = params



model = XGBClassifier(**params)











XGBClassifier may be understood as a single model that is an ensemble of 10,000 decision trees (the “n_estimators”:10000 parameter). In some embodiments, the training and curation module 356 may utilize parameters other than or in addition to those listed herein. In some embodiments, the training and curation module 356 may utilize different values for model parameters than those listed herein.


In some embodiments, the training and curation module 356 utilizes the following Python code to train each set of the multiple sets of decision trees:

    • model.fit(x_train, y_train, eval_set=[(x_train, y_train), (x_test, y_test)], early_stopping_rounds=50)


      In this code, x_train is training data, y_train is training labels, x_test is testing data, and y_test is testing labels. Both x_train and x_test are ground truth data. Both x_train and x_test may include both positive training samples and negative training samples. In some embodiments, both the x_train and x_test data are balanced, meaning that they include equal or generally equal numbers of positive training samples and negative training samples. In some embodiments, the x_train and x_test data may be imbalanced toward negative training samples, meaning that they include more negative training samples than positive training samples. The training and curation module 356 may also use data sets that are imbalanced towards positive training samples, meaning that they include more positive training samples than negative training samples.


The set of trained decision trees may operate in a binary mode or a multiclass mode. The following Python code may be utilized to determine which mode the set of trained decision trees may operate in:

















# if binary, set binary, otherwise, set multiclass



if np.max(y_train) == 1:



 params[‘objective’] = “binary:logistic”



 params[‘eval_metric’] = ‘error’



elif np.max(y_train) > 1:



 params[‘objective’] = “multi:softmax”



 params[‘eval_metric’] = ‘merror’










At step 914, the training and curation module 356 validates the set of trained decision trees. In some embodiments, the training and curation module 356 utilizes both training data and testing data to validate the set of trained decision trees. In some embodiments, the training and curation module 356 utilizes only testing data to validate the set of trained decision trees.



FIG. 14A depicts a confusion matrix 1400 for results of applications of a set of trained decision trees to multiple sets of values based on intensity measurements of multiple instances of light that have passed through multiple testing samples containing E. coli at seven different concentrations and multiple testing samples that do not contain E. coli in some embodiments. The set of trained decision trees that produced the results in the confusion matrix 1400 operated in a binary mode, and there were 108 total testing samples. In the set of testing samples, there were different testing samples containing seven different concentrations of E. coli and testing samples that did not contain E. coli. The upper left-hand quadrant of the confusion matrix 1400 indicates true negatives, of which there are 53. The upper right-hand quadrant of the confusion matrix 1400 indicates false negatives, of which there are 13. The lower left-hand quadrant of the confusion matrix 1400 indicates false positives, of which there are 11. The lower right-hand quadrant of the confusion matrix 1400 indicates true positives, of which there are 31. The confusion matrix indicates that the set of trained decision trees is 77% accurate, with 80% specificity and 74% sensitivity. This indicates that the set of trained decision trees would detect E. coli 74% of the time in samples containing E. coli at the lowest concentration (the seventh concentration, which may be approximately 1.5 cfu/mL) and higher concentrations (for example, concentrations at approximately 1.5e2 cfu/mL to approximately 1.5e7 cfu/mL).



FIG. 14B depicts a confusion matrix 1450 for results of applications of a set of trained decision trees to multiple sets of values based on intensity measurements of multiple instances of light that have passed through multiple testing samples containing E. coli at four different concentrations and multiple testing samples that do not contain E. coli in some embodiments. The set of trained decision trees that produced the results in the confusion matrix 1450 operated in a binary mode, and there were 160 total testing samples. In the set of testing samples, there were different testing samples containing four different concentrations of E. coli and testing samples that did not contain E. coli. The upper left-hand quadrant of the confusion matrix 1450 indicates true negatives, of which there are 122. The upper right-hand quadrant of the confusion matrix 1450 indicates false negatives, of which there are 10. The lower left-hand quadrant of the confusion matrix 1450 indicates false positives, of which there are 3. The lower right-hand quadrant of the confusion matrix 1450 indicates true positives, of which there are 25. The confusion matrix 1450 indicates that the set of trained decision trees is 92% accurate, with 92% specificity and 89% sensitivity. This indicates that the set of trained decision trees would detect E. coli 89% of the time in samples containing E. coli at the lowest concentration (the fourth concentration, which may be approximately 1.5e4 cfu/mL) and higher concentrations (for example concentrations at approximately 1.5e5 cfu/mL to approximately 1.5e7 cfu/mL).



FIG. 15 depicts a confusion matrix 1500 for results of applications of a set of trained decision trees to multiple sets of values based on intensity measurements of multiple instances of light that have passed through multiple testing samples containing E. coli at seven different concentrations and multiple testing samples that do not contain E. coli in some embodiments. The set of trained decision trees that produced the results in the confusion matrix 1400 operated in a multiclass mode, and there were 280 total testing samples. In the set of testing samples, there were different testing samples containing seven different concentrations of E. coli and testing samples that did not contain E. coli. The confusion matrix 1500 indicates that the set of trained decision trees operating in the multiclass mode accurately identifies the concentration of a testing sample up to the third concentration (for example, a concentration at approximately 1.5e4 cfu/mL). The confusion matrix 1500 further indicates that the set of trained decision trees operating in the multiclass mode accurately identifies an approximate range of concentrations for the fourth concentrations through the seventh concentrations.


In some embodiments, the training and curation module 356 performs the method 900 for each of multiple foodborne pathogens. That is, the training and curation module 356 trains a set of decision trees for each of multiple foodborne pathogens, such as E. coli, Salmonella, and Listeria. In some embodiments, the training and curation module 356 may train a set of decision trees for each of the following foodborne pathogens: norovirus, Salmonella (non-typhoidal), Clostridium perfringens, Campylobacter, Staphylococcus aureus, Toxoplasma gondii, Escherichia coli (E. coli), Clostridium botulinum, Cryptosporidium, Cyclospora, hepatitis A virus, Shigella, Yersinia, and Listeria monocytogenes (Listeria). The particle of interest prediction module 358 may apply one or more of the trained sets of decision trees to detect foodborne pathogens. Accordingly, the particle of interest detection system 104 may provide panel detection and notification for various foodborne pathogens. One advantage of the particle of interest detection system 104 is that it may provide results for such panel tests quickly (e.g., within seconds or minutes). Another advantage of the particle of interest detection system 104 is that it obviates the need for sending samples to laboratories for test, which may reduce logistical issues and/or complexity.


In various embodiments, a machine learning and/or AI architecture may be utilized (e.g., random forest, statistical approaches, and/or the like) in addition to or as an alternative to the sets of decision trees discussed herein. The machine learning and/or AI architecture may utilize the features discussed herein to generate predictive models and/or make predictions. In various embodiments, a 1d or 2d convolutional neural network (CNN) may be used as a discriminator to identify measurements indicating foodborne pathogen contamination and non-foodborne pathogen contamination. In various embodiments, a neural network may be trained using measurements or values from the spectral acquisition apparatuses 102 as discussed herein. The neural network may also be trained using laboratory test results to confirm those foods, equipment, and/or surfaces that are contaminated and those that are not contaminated. The neural network may receive or generate a set of features based on the output (i.e., measurement results or values based thereon) of the spectral acquisition apparatuses 102. The neural network may then be tested to confirm predictions against known foodborne pathogen contamination and non-foodborne pathogen contamination results. In various embodiments, the models may utilize time series data generated by the spectral acquisition apparatuses 102 to make determinations about foodborne pathogen contamination.


In various embodiments, the training and curation module 356 may receive new ground truth data for a particular foodborne pathogen (e.g., new data that includes both positive samples and negative samples for the particular foodborne pathogen) and update the training data and the testing data and retrain the set of decision trees corresponding to the particular foodborne pathogen. For example, the training and curation module 356 may receive new ground truth data for Salmonella. The training and curation module 356 may then update the training data and the testing data for Salmonella and retrain the set of decision trees for Salmonella. This may allow the particle of interest detection system 104 to better detect Salmonella in samples of food processing byproducts. The particle of interest detection system 104 may utilize similar processes for other foodborne pathogens such as E. coli and Listeria. As a result, the models and/or AI architecture may be updated, improved, and/or curated based on new positive samples and new negative samples in the new ground truth data.


In some embodiments, the training and curation module 356 may train sets of decision trees using training samples from a particular food processing facility, a region that includes multiple food processing facilities, or one or more classes of food processing facilities. This may allow the training and curation module 356 to create sets of trained decision trees that are customized for a particular food processing facility (for example, a single food processing facility), a particular location (for example, food processing facilities located in the Central Valley of California), or a particular type of food processing facility (for example, food processing facilities that slaughter chickens and process slaughtered chickens).



FIG. 16 depicts a graph 1600 of absorption for a set of wavelengths for multiple instances of light that passed through multiple samples containing different concentrations of microspheres in some embodiments. The wavelengths of light range from approximately 320 nm to approximately 1100 nm. The graph 1600 shows the absorption by wavelength for multiple samples containing—1-micron dragon green (480 nm-520 nm) microspheres at a first concentration of approximately 2.07e10 microspheres/mL (MS1D1), multiple training samples containing microspheres at a second concentration of approximately 2.07e9 microspheres/mL (MS1D2), multiple training samples containing microspheres at a third concentration of approximately 2.07e8 microspheres/mL (MS1D3), multiple training samples containing microspheres at a fourth concentration of approximately 2.07e7 microspheres/mL (MS1D4), multiple training samples containing microspheres at a fifth concentration of approximately 2.07e6 microspheres/mL (MS1D5), multiple training samples containing microspheres at a sixth concentration of approximately 2.07e5 microspheres/mL (MS1D6), multiple training samples containing microspheres at a seventh concentration of approximately 2.07e4 microspheres/mL (MS1D7), multiple training samples containing microspheres at an eighth concentration of approximately 2.07e3 microspheres/mL (MS1D8), multiple training samples containing microspheres at a ninth concentration of approximately 2.07e2 microspheres/mL (MS1D9), and multiple training samples containing microspheres at a tenth concentration of approximately 2.07 microspheres/mL (MS1D10). The graph 1600 illustrates that the absorption across the range of wavelengths generally decreases as the concentration of microspheres in a sample decreases, or conversely, that the absorption generally increases as the concentration of microspheres in a sample increases.



FIGS. 17A and 17B depict a confusion matrix 1700 and a confusion matrix 1750, respectively, for results of applications of two sets of trained decision trees to multiple sets of values based on intensity measurements of multiple instances of light that has passed through samples containing 1000 nanometer microspheres or samples that do not contain microspheres in some embodiments. The set of trained decision trees that produced the results in the confusion matrix 1700 was trained on samples containing 1000 nanometer green microspheres at ten different concentrations, operated in a binary mode, and there were 99 total samples, including samples containing microspheres in water (for example, deionized water) and samples containing only water (for example, deionized water). The set of trained decision trees that produced the results in the confusion matrix 1750 was trained on samples containing 1000 nanometer red microspheres at ten different concentrations, operated in a binary mode, and there were 76 total samples, including samples containing microspheres in water (for example, deionized water) and samples containing only water (for example, deionized water).


In both the confusion matrix 1700 and the confusion matrix 1750, the upper left hand quadrant indicates true negatives, of which there are 20 (FIG. 17A) or 22 (FIG. 17B). The upper right hand quadrant indicates false negatives, of which there are 1 (FIG. 17A) or 0 (FIG. 17B). The lower left hand quadrant indicates false positives, of which there are 0 (both FIG. 17A and FIG. 17B). The lower right hand quadrant indicates true positives, of which there are 78 (FIG. 17A) or 54 (FIG. 17B). The confusion matrix 1700 indicates that the set of trained decision trees is 99% accurate at detecting the presence of 1000 nanometer green microspheres in concentrations as low as 1 microsphere/mL and the confusion matrix 1750 indicates that the set of trained decision trees is 100% accurate at detecting the presence of 1000 nanometer red microspheres in concentrations as low as 1 microsphere/mL. 1000 nanometer microspheres are approximately the same size as common bacteria such as Salmonella and E. coli.



FIG. 18 depicts a confusion matrix 1800 for results of applications of a set of trained decision trees to multiple sets of values based on intensity measurements of multiple instances of light that has passed through samples containing 500 nanometer microspheres or samples that do not contain microspheres in some embodiments. The set of trained decision trees that produced the results in the confusion matrix 1800 was trained on samples containing 500 nanometer red microspheres at ten different concentrations, operated in a binary mode, and there were 76 total testing samples, including testing samples containing microspheres in water (for example, deionized water) and testing samples containing only water (for example, deionized water). The upper left-hand quadrant of the confusion matrix 1800 indicates true negatives, of which there are 20. The upper right-hand quadrant of the confusion matrix 1800 indicates false negatives, of which there are 2. The lower left-hand quadrant of the confusion matrix 1800 indicates false positives, of which there are 0. The lower right-hand quadrant of the confusion matrix 1800 indicates true positives, of which there are 54. The confusion matrix 1800 indicates that the set of trained decision trees is 97% accurate at detecting the presence of 500 nanometer red microspheres in concentrations as low as 1e2 microspheres/mL.



FIGS. 19A and 19B depict a confusion matrix 1900 and a confusion matrix 1950, respectively, for results of applications of two sets of trained decision trees to multiple sets of values based on intensity measurements of multiple instances of light that has passed through samples containing 50 nanometer microspheres or samples that do not contain microspheres in some embodiments. The set of trained decision trees that produced the results in the confusion matrix 1900 was trained on samples containing 50 nanometer green microspheres at ten different concentrations, operated in a binary mode, and there were 58 total testing samples, including testing samples containing microspheres in water (for example, deionized water) and testing samples containing only water (for example, deionized water). The set of trained decision trees that produced the results in the confusion matrix 1950 was trained on samples containing 50 nanometer red microspheres at ten different concentrations, operated in a binary mode, and there were 76 total testing samples, including testing samples containing microspheres in water (for example, deionized water) and testing samples containing only water (for example, deionized water). In both the confusion matrix 1900 and the confusion matrix 1950, the upper left-hand quadrant indicates true negatives, of which there are 28 (FIG. 19A) or 22 (FIG. 17B). The upper right-hand quadrant indicates false negatives, of which there are 0 (both FIG. 19A and FIG. 2A). The lower left-hand quadrant indicates false positives, of which there are 0 (both FIG. 19A and FIG. 19B). The lower right-hand quadrant indicates true positives, of which there are 30 (FIG. 19A) or 54 (FIG. 19B). The confusion matrix 1900 indicates that the set of trained decision trees is 100% accurate at detecting the presence of 50 nanometer green microspheres in concentrations as low as 1e5 microspheres/mL and the confusion matrix 1950 indicates that the set of trained decision trees is 100% accurate at detecting the presence of 1000 nanometer red microspheres in concentrations as low as 1e5 microspheres/mL.



FIG. 20 depicts a confusion matrix 2000 for results of applications of a set of trained decision trees to multiple sets of values based on intensity measurements of multiple instances of light that has passed through samples containing green microspheres at ten different concentrations or samples that do not contain microspheres in some embodiments. The set of trained decision trees that produced the results in the confusion matrix 2000 operated in a multiclass mode and there were 121 total testing samples. In the set of testing samples, there were different testing samples containing ten different concentrations of green microspheres and testing samples that did not contain microspheres. The confusion matrix 2000 indicates that the set of trained decision trees operating in the multiclass mode accurately identifies the concentration of a testing sample up to the fifth concentration (for example, a concentration at approximately 1e6 microspheres/mL). The confusion matrix 2000 further indicates that the set of trained decision trees operating in the multiclass mode accurately identifies an approximate range of concentrations for the sixth concentration through the tenth concentration.



FIG. 21A depicts a confusion matrix 2100 for results of applications of a set of trained decision trees to multiple sets of values based on intensity measurements of multiple instances of light that has passed through samples containing green microspheres, samples containing red microspheres, or samples that do not contain microspheres in some embodiments. The set of trained decision trees that produced the results in the confusion matrix 2100 operated in a multiclass mode and there were 315 total testing samples, including testing samples containing microspheres in water (for example, deionized water) and testing samples containing only water (for example, deionized water). The testing samples contained red or green microspheres of different sizes and at different concentrations. The confusion matrix 2100 indicates that the set of trained decision trees is 99% accurate at detecting the presence and color of microspheres as small as 50 nm in concentrations as low as 1 microsphere/mL.



FIG. 21B depicts a confusion matrix 2150 for results of applications of a set of trained decision trees to multiple sets of values based on intensity measurements of multiple instances of light that has passed through samples containing green microspheres, samples containing red microspheres, samples containing a mixture of red microspheres and green microspheres, or samples that do not contain microspheres in some embodiments. The set of trained decision trees that produced the results in the confusion matrix 2150 operated in a multiclass mode and there were 423 total testing samples, including testing samples containing microspheres in water (for example, deionized water) and testing samples containing only water (for example, deionized water). The testing samples contained red or green microspheres of different sizes and at different concentrations. The confusion matrix 2100 indicates that the set of trained decision trees is 97% accurate at detecting the presence, color, and mixture of microspheres as small as in concentrations as low as 1 microsphere/mL.



FIG. 22 depicts a comparison of previous food safety testing procedures and procedures of the particle of interest detection system 104 of FIG. 1A according to some embodiments. Using previous food safety testing procedures, the time it takes from obtaining the sample to getting a result for foodborne pathogens such as Salmonella and Listeria can range from approximately 42 hours to approximately 132 hours. In contrast, utilizing procedures of the particle of interest detection system 104, the time can be shortened to less than approximately four hours.


Although FIG. 22 depicts previous food safety testing procedures, it will be appreciated that other previous testing procedures, such as testing for environmental contaminants and testing for infectious agents such as viruses may also have lengthy timelines that are measured in days or weeks. The systems and methods described herein provide for significant shortening of the amount of time required to detect particles of interest such as infectious agents and/or environmental contaminants and/or other particles of interest that may affect human and/or animal health and/or environmental health.


One advantage of the particle of interest detection systems and associated methods described herein is that such systems and methods may be utilized to detect any particles of interest that may affect human or animal health and/or the environment. For example, the particle of interest detection system may be utilized to detect the presence of infectious pathogens of humans such as SARS-CoV-2, COVID-19 or RSV. In some embodiments, the particle of interest detection system and associated methods may be operated to detect particles of interest or pathogens such as the Human Papillomavirus (HPV) infection or other sexually transmitted diseases (STDs) or sexually transmitted infections (STIs).


In another example, the particle of interest detection system and associated methods may be operated to detect pathogens and/or contaminants that may have an environmental effect on air quality. The particle of interest detection system may be utilized to detect the presence of ethylene, methylene, and other volatile organic compounds (VOCs). The particle of interest detection system and associated methods may also be utilized to detect pathogens and/or contaminants that may affect water quality. Accordingly, the particle of interest detection system and associated methods described herein may also aid community water systems and/or other water suppliers with complying with water quality standards, such as those promulgated by government agencies such as the U.S. Environmental Protection Agency.


Another advantage of the particle of interest detection systems and associated methods described herein is that such systems and methods may be utilized to detect multiple different particles of interest using a single set of spectral metrics obtained from a single set of interactions of electromagnetic radiation with a single sample. For example, under previous techniques, to detect COVID-19, influenza, Streptococcus, and Respiratory Syncytial Virus Infection (RSV), four different swabs and four different expensive and time-consuming tests to may be required. In contrast, the particle of interest detection systems and associated methods may utilize a single swab to obtain a single set of spectral metrics, to which multiple trained models (e.g., a trained model for detecting COVID-19, a trained model for detecting influenza, a trained model for detecting Streptococcus, and a trained model for detecting RSV) may be applied to detect all four pathogens, and may be able to provide either positive or negative results for each of the four pathogens in a shorter period of time, which may be as short as one or several minutes.


As another example, in the food safety context, under previous techniques, to detect E. coli, Salmonella, and Listeria requires three different swabs and enrichment and incubation in potentially different reagents, which may be expensive. Each test may also require a significant amount of time, such as between approximately 24 to approximately 96 hours, and may be expensive to perform. In contrast, the particle of interest detection systems and associated methods may utilize a single swab to obtain a single set of spectral metrics, to which multiple trained models (e.g., a trained model for detecting E. coli, a trained model for detecting influenza, a trained model for detecting Salmonella, and a trained model for detecting listeria) may be applied to detect all three foodborne pathogens, and may be able to provide either positive or negative results for each of the three foodborne pathogens in a shorter period of time, which may be as short as one or several minutes.


Accordingly, the particle of interest detection systems and associated methods described herein provide for the ability to perform a panel of tests and obtain results in a shorter period of time than previous techniques allow for.


Another advantage of the particle of interest detection systems and associated methods described herein is that the computing device 110 and the software application 112 may be able to obtain spectral metrics from a wide variety of spectral acquisition apparatuses, such as spectrometers, spectrophotometers, thermal emission spectrometers, cameras, and/or any apparatus that may provide spectral metrics. This may be due in part to the extensibility of the software application 112, which may be configured to interface with a wide variety of spectral acquisition apparatuses. Accordingly, the computing device 110 and the software application 112 may be considered hardware-independent, which may facilitate their placement and usage at different facilities, such as food processing facilities, health care facilities such as hospitals, industrial facilities such as factories, and water treatment facilities such as municipal water treatment facilities.


Yet another advantage of the particle of interest detection systems and associated methods described herein is that the particle of interest detection system 104 may store data, such as metadata, reference data, and results data, that may be analyzed and/or accessed so as to obtain insights into the prevalence of harmful particles of interest, such as foodborne pathogens, infectious pathogens, and/or environmental pollutants and/or contaminants. Such insights may be provided to customers and/or partners of entities utilizing the particle of interest detection systems and associated methods, as well as to governmental agencies occupied with human health and/or safety and/or environmental health, such as the U.S. FDA, the U.S. EPA, and/or the U.S. Department of Health and Human Services. Such insights may be utilized to help prevent and/or reduce threats to human health and/or environmental health. Accordingly, analysis of spectral analysis may provide actionable intelligence of significant value.


Additional details regarding techniques used to detect pathogens using spectrometer scans are described in U.S. patent application Ser. No. 18/173,050 filed on Feb. 22, 2023, and entitled “SYSTEMS AND METHODS FOR DETECTING PATHOGENS USING SPECTROMETER SCANS,” the entirety of which is incorporated herein by reference. Furthermore, additional details regarding techniques used to detect pathogens using spectral analysis are described in U.S. patent application Ser. No. 18/173,035 filed on filed on Feb. 22, 2023, and entitled “SYSTEMS AND METHODS FOR DETECTING FOODBORNE PATHOGENS USING SPECTRAL ANALYSIS,” the entirety of which is incorporated herein by reference.



FIG. 23 is a block diagram illustrating components of an example machine able to read instructions from a machine-readable medium and execute those instructions in a processor to perform the machine processing tasks discussed herein, such as the engine operations discussed above. In some embodiments, the digital device 2300 is coupled to a network, such as the communication network 108, via network interface device 2320. Still yet, the spectral acquisition apparatus 102, the particle of interest detection system 104, the food processing apparatus 106, and the computing device 110 can communicate with one or more networks such as a local area network (LAN), a general wide area network (WAN), and/or a public network (for example, the Internet).


Specifically, FIG. 23 shows a diagrammatic representation of a machine in the example form of a digital device 2300 within which instructions 2324 (e.g., software) for causing the machine to perform any one or more of the methodologies discussed herein may be executed. In alternative embodiments, the machine operates as a standalone device or may be connected (e.g., networked) to other machines, for instance, via the Internet. In a networked deployment, the machine may operate in the capacity of a server machine or a client machine in a server-client network environment, or as a peer machine in a peer-to-peer (or distributed) network environment. A system, such as a computing system, may be or include one or more of the digital device 2300.


The machine may be a server computer, a client computer, a personal computer (PC), a tablet PC, a set-top box (STB), a personal digital assistant (PDA), a cellular telephone, a smartphone, a web appliance, a network router, switch or bridge, or any machine capable of executing instructions 2324 (sequential or otherwise) that specify actions to be taken by that machine. Further, while only a single machine is illustrated, the term “machine” shall also be taken to include any collection of machines that individually or jointly execute instructions 2324 to perform any one or more of the methodologies discussed herein.


The example digital device 2300 includes a processor 2302 (e.g., a central processing unit (CPU), a graphics processing unit (GPU), a digital signal processor (DSP), one or more application-specific integrated circuits (ASICs), one or more radio-frequency integrated circuits (RFICs), or any combination of these), a main memory 2304, and a static memory 2306, which are configured to communicate with each other via a bus 2308. The digital device 2300 may further include a graphics display unit 2310 (e.g., a plasma display panel (PDP), a liquid crystal display (LCD), a projector, or a cathode ray tube (CRT)). The digital device 2300 may also include alphanumeric input device 2312 (e.g., a keyboard), a cursor control device 2314 (e.g., a mouse, a trackball, a joystick, a motion sensor, or other pointing instrument), a data store 2316, a signal generation device 2318 (e.g., a speaker), an audio input device 2326 (e.g., a microphone) and a network interface device 2320, which also are configured to communicate via the bus 2308.


Bus 2308 represents one or more of any of several types of bus structures, including a memory bus or memory controller, a peripheral bus, an accelerated graphics port, and a processor or local bus using any of a variety of bus architectures. By way of example, and not limitation, such architectures include Industry Standard Architecture (ISA) bus, Micro Channel Architecture (MCA) bus, Enhanced ISA (EISA) bus, Video Electronics Standards Association (VESA) local bus, and Peripheral Component Interconnect (PCI) bus.


In some embodiments, alphanumeric input device 2312 is any device that inputs data (for example, mouse, keyboard, stylus, sensors, etc.) or outputs data (for example, speaker, display, virtual reality headset).


The data store 2316 includes a machine-readable medium 2322 on which is stored the instructions 2324 (e.g., software) embodying any one or more of the methodologies or functions described herein. The instructions 2324 (e.g., software) may also reside, completely or at least partially, within the main memory 2304 or within the processor 2302 (e.g., within a processor's cache memory) during execution thereof by the digital device 2300, the main memory 2304 and the processor 2302 also constituting machine-readable media. The instructions 2324 (e.g., software) may be transmitted or received over a network (not shown) via network interface device 2320.


While machine-readable medium 2322 is shown in an example embodiment to be a single medium, the term “machine-readable medium” should be taken to include a single medium or multiple media (e.g., a centralized or distributed database, or associated caches and servers) able to store instructions (e.g., instructions 2324). The term “machine-readable medium” shall also be taken to include any medium that is capable of storing instructions (e.g., instructions 2324) for execution by the machine and that cause the machine to perform any one or more of the methodologies disclosed herein. The term “machine-readable medium” includes, but should not be limited to, data repositories in the form of solid-state memories, optical media, and magnetic media.


It should be understood that although not shown, other hardware and/or software components could be used in conjunction with the digital device 2300. Examples include, but are not limited to microcode, device drivers, redundant processing units, and external disk drive arrays, RAID systems, tape drives, and data archival storage systems, etc.


Exemplary embodiments are described herein in detail with reference to the accompanying drawings. However, the present disclosure can be implemented in various manners, and thus should not be construed to be limited to the embodiments disclosed herein. On the contrary, those embodiments are provided for the thorough and complete understanding of the present disclosure, and completely conveying the scope of the present disclosure.


It will be appreciated that aspects of one or more embodiments may be embodied as a system, method, or computer program product. Accordingly, aspects may take the form of an entirely hardware embodiment, an entirely software embodiment (including firmware, resident software, micro-code, etc.) or an embodiment combining software and hardware aspects that may all generally be referred to herein as a “circuit,” “module” or “system.” Furthermore, aspects may take the form of a computer program product embodied in one or more computer-readable medium(s) having computer-readable program code embodied thereon.


Any combination of one or more computer-readable medium(s) may be utilized. The computer-readable medium may be a computer-readable signal medium or a non-transitory computer-readable medium, such as a computer-readable storage medium. A computer-readable storage medium may be, for example, but not limited to, an electronic, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus, or device, or any suitable combination of the foregoing. More specific examples (a non-exhaustive list) of the computer-readable storage medium would include the following: an electrical connection having one or more wires, a portable computer diskette, a hard disk, a random access memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM or Flash memory), a solid state drive (SSD), an optical fiber, a portable compact disc read-only memory (CD-ROM), an optical storage device, a magnetic storage device, or any suitable combination of the foregoing. In the context of this document, a computer-readable storage medium may be any tangible medium that can contain or store a program or data for use by or in connection with an instruction execution system, apparatus, or device.


A transitory computer-readable signal medium may include a propagated data signal with computer-readable program code embodied therein, for example, in baseband or as part of a carrier wave. Such a propagated signal may take any of a variety of forms, including, but not limited to, electro-magnetic, optical, or any suitable combination thereof.


Program code embodied on a computer-readable medium may be transmitted using any appropriate medium, including but not limited to wireless, wireline, optical fiber cable, RF, etc., or any suitable combination of the foregoing.


Computer program code for carrying out operations for aspects of the present invention may be written in any combination of one or more programming languages, including an object-oriented programming language such as Java, Smalltalk, C++, Python, or the like and conventional procedural programming languages, such as the “C” programming language or similar programming languages. The computer program code may execute entirely on any of the systems described herein or on any combination of the systems described herein.


Aspects of the present invention are described below with reference to flowchart illustrations and/or block diagrams of methods, apparatus (systems) and computer program products according to embodiments of the invention. It will be understood that each block of the flowchart illustrations and/or block diagrams, and combinations of blocks in the flowchart illustrations and/or block diagrams, can be implemented by computer program instructions. These computer program instructions may be provided to a processor of a general-purpose computer, special purpose computer, or other programmable data processing apparatus to produce a machine, such that the instructions, which execute via the processor of the computer or other programmable data processing apparatus, create means for implementing the functions/acts specified in the flowchart and/or block diagram block or blocks.


These computer program instructions may also be stored in a computer-readable medium that can direct a computer, other programmable data processing apparatus, or other devices to function in a particular manner, such that the instructions stored in the computer-readable medium produce an article of manufacture including instructions which implement the function/act specified in the flowchart and/or block diagram block or blocks.


The computer program instructions may also be loaded onto a computer, other programmable data processing apparatus, or other devices to cause a series of operational steps to be performed on the computer, other programmable apparatus or other devices to produce a computer implemented process such that the instructions which execute on the computer or other programmable apparatus provide processes for implementing the functions/acts specified in the flowchart and/or block diagram block or blocks.


While specific examples are described above for illustrative purposes, various equivalent modifications are possible. For example, while processes or blocks are presented in a given order, alternative implementations may perform routines having steps, or employ systems having blocks, in a different order, and some processes or blocks may be deleted, moved, added, subdivided, combined, and/or modified to provide alternative or sub-combinations. Each of these processes or blocks may be implemented in a variety of different ways. Also, while processes or blocks are at times shown as being performed in series, these processes or blocks may instead be performed or implemented concurrently or in parallel or may be performed at different times. Further any specific numbers noted herein are only examples: alternative implementations may employ differing values or ranges.


Throughout this specification, plural instances may implement components, operations, or structures described as a single instance. Structures and functionality presented as separate components in example configurations may be implemented as a combined structure or component. Similarly, structures and functionality presented as a single component may be implemented as separate components. These and other variations, modifications, additions, and improvements fall within the scope of the subject matter herein. Furthermore, any specific numbers noted herein are only examples: alternative implementations may employ differing values or ranges.


Components may be described or illustrated as contained within or connected with other components. Such descriptions or illustrations are examples only, and other configurations may achieve the same or similar functionality. Components may be described or illustrated as “coupled”, “couplable”, “operably coupled”, “communicably coupled” and the like to other components. Such description or illustration should be understood as indicating that such components may cooperate or interact with each other, and may be in direct or indirect physical, electrical, or communicative contact with each other.


Components may be described or illustrated as “configured to”, “adapted to”, “operative to”, “configurable to”, “adaptable to”, “operable to” and the like. Such description or illustration should be understood to encompass components both in an active state and in an inactive or standby state unless required otherwise by context.


The use of “or” in this disclosure is not intended to be understood as an exclusive “or.” Rather, “or” is to be understood as including “and/or.” For example, the phrase “providing products or services” is intended to be understood as having several meanings: “providing products,” “providing services”, and “providing products and services.”


It may be apparent that various modifications may be made, and other embodiments may be used without departing from the broader scope of the discussion herein. For example, while the particle of interest detection system 104 is described as providing reports via websites, the particle of interest detection system 104 may provide reports via applications executing on computing devices, such as apps executing on phones and/or mobile devices and/or native applications executing on laptop or desktop computers.


As another example, the particle of interest detection system 104 may detect pathogens and/or contaminants in water, such as water provided by community or municipal water systems, private wells, and/or bottled water producers. For example, the particle of interest detection system 104 may utilize the techniques described herein to test for microorganisms such as Cryptosporidium, Giardia lamblia, Legionella, and enteric viruses, inorganic chemicals such as cadmium, chromium, mercury, selenium, organic chemicals such as benzene, glyphosate, vinyl chloride, disinfectants, disinfection byproducts, and/or radionuclides. It will be understood that the particle of interest detection system 104 may utilize the techniques described herein to detect regulated pathogens and/or contaminants according to standards set by government agencies such as the U.S. Environmental Protection Agency as well as pathogens and/or contaminants unregulated by such government agencies.


Therefore, these and other variations upon the example embodiments are intended to be covered by the disclosure herein.

Claims
  • 1. A non-transitory computer-readable medium comprising executable instructions, the executable instructions being executable by one or more processors to perform a method, the method comprising: receiving a first set of data in a first format, the first set of data including a set of spectral metrics, the first set of data provided by an apparatus that obtains the set of spectral metrics based on interactions of electromagnetic radiation with a sample;processing the first set of data to obtain a second set of data in a second format different from the first format, the second set of data including the set of spectral metrics;applying one or more trained models to at least one of the set of spectral metrics and a set of values based on the set of spectral metrics to obtain a result, the one or more trained models trained on a set of training samples for a particle of interest;based on the result, determining either a positive particle of interest detection or a negative particle of interest detection for the particle of interest for the sample;generating a particle of interest detection notification that indicates either the positive particle of interest detection or the negative particle of interest detection for the particle of interest for the sample; andproviding the particle of interest detection notification.
  • 2. The non-transitory computer-readable medium of claim 1, wherein the method further comprises: receiving metadata associated with at least one of the apparatus, the sample, a date and time at which the apparatus obtains the set of spectral metrics; andstoring the metadata in association with the result.
  • 3. The non-transitory computer-readable medium of claim 1, wherein the sample is at least one of a sample of a food processing byproduct, a sample from a person, and an environmental sample, and the particle of interest is at least one of a foodborne pathogen, an infectious pathogen of humans, and an environmental particle of interest.
  • 4. The non-transitory computer-readable medium of claim 1, wherein the set of spectral metrics is a first set of spectral metrics, the apparatus is a first apparatus from a first manufacturer sited at a first location, the sample is a first sample, the set of values is a first set of values, the result is a first result, the positive particle of interest detection is a first positive particle of interest detection, the negative particle of interest detection is a first negative particle of interest detection, the particle of interest detection notification is a first particle of interest detection notification, and the method further comprises: receiving a third set of data in a third format, the third set of data including a second set of spectral metrics, the second set of spectral metrics provided by a second apparatus that obtains the second set of spectral metrics based on interactions of electromagnetic radiation with a second sample, and the second apparatus is sited at a second location different from the first location;processing the third set of data to obtain a fourth set of data in the second format, the fourth set of data including the second set of spectral metrics;applying the one or more trained models to at least one of the second set of spectral metrics and a second set of values based on the second set of spectral metrics to obtain a second result;based on the second result, determining either a second positive particle of interest detection or a second negative particle of interest detection for the particle of interest in the second sample;generating a second particle of interest detection notification that indicates either the second positive particle of interest detection or the second negative particle of interest detection for the particle of interest in the second sample; andproviding the second particle of interest detection notification.
  • 5. The non-transitory computer-readable medium of claim 1, wherein the method further comprises normalizing each spectral metric in the set of spectral metrics to be between zero, inclusive, and one, inclusive, to obtain the set of values, and wherein applying the one or more trained models to at least one of the set of spectral metrics and the set of values based on the set of spectral metrics to obtain the result includes applying the one or more trained models to the set of values.
  • 6. The non-transitory computer-readable medium of claim 1, wherein the method further comprises training one or more models on the set of training samples for the particle of interest to obtain the one or more trained models.
  • 7. The non-transitory computer-readable medium of claim 1 wherein spectral metrics in the set of spectral metrics are one of absorbance metrics, transmittance metrics, reflectance metrics, and scattering metrics.
  • 8. The non-transitory computer-readable medium of claim 1 wherein the result indicates the positive particle of interest detection if the result meets or exceeds a threshold.
  • 9. The non-transitory computer-readable medium of claim 1 wherein the electromagnetic radiation includes at least one of ultraviolet light, visible light, and infrared light.
  • 10. The non-transitory computer-readable medium of claim 1 wherein the one or more trained models include a set of trained decision trees.
  • 11. A method comprising: receiving a first set of data in a first format, the first set of data including a set of spectral metrics, the first set of data provided by an apparatus that obtains the set of spectral metrics based on interactions of electromagnetic radiation with a sample;processing the first set of data to obtain a second set of data in a second format different from the first format, the second set of data including the set of spectral metrics;applying one or more trained models to at least one of the set of spectral metrics and a set of values based on the set of spectral metrics to obtain a result, the one or more trained models trained on a set of training samples for a particle of interest;based on the result, determining either a positive particle of interest detection or a negative particle of interest detection for the particle of interest for the sample;generating a particle of interest detection notification that indicates either the positive particle of interest detection or the negative particle of interest detection for the particle of interest for the sample; andproviding the particle of interest detection notification.
  • 12. The method of claim 11, further comprising: receiving metadata associated with at least one of the apparatus, the sample, a date and time at which the apparatus obtains the set of spectral metrics; andstoring the metadata in association with the result.
  • 13. The method of claim 11, wherein the sample is at least one of a sample of a food processing byproduct, a sample from a person, and an environmental sample, and the particle of interest is at least one of a foodborne pathogen, an infectious pathogen of humans, and an environmental particle of interest.
  • 14. The method of claim 11, wherein the set of spectral metrics is a first set of spectral metrics, the apparatus is a first apparatus from a first manufacturer sited at a first location, the sample is a first sample, the set of values is a first set of values, the result is a first result, the positive particle of interest detection is a first positive particle of interest detection, the negative particle of interest detection is a first negative particle of interest detection, the particle of interest detection notification is a first particle of interest detection notification, and the method further comprises: receiving a third set of data in a third format, the third set of data including a second set of spectral metrics, the second set of spectral metrics provided by a second apparatus that obtains the second set of spectral metrics based on interactions of electromagnetic radiation with a second sample, and the second apparatus is sited at a second location different from the first location;processing the third set of data to obtain a fourth set of data in the second format, the fourth set of data including the second set of spectral metrics;applying the one or more trained models to at least one of the second set of spectral metrics and a second set of values based on the second set of spectral metrics to obtain a second result;based on the second result, determining either a second positive particle of interest detection or a second negative particle of interest detection for the particle of interest in the second sample;generating a second particle of interest detection notification that indicates either the second positive particle of interest detection or the second negative particle of interest detection for the particle of interest in the second sample; andproviding the second particle of interest detection notification.
  • 15. The method of claim 11, further comprising normalizing each spectral metric in the set of spectral metrics to be between zero, inclusive, and one, inclusive, to obtain the set of values, and wherein applying the one or more trained models to at least one of the set of spectral metrics and the set of values based on the set of spectral metrics to obtain the result includes applying the one or more trained models to the set of values.
  • 16. The method of claim 11, further comprising training one or more models on the set of training samples for the particle of interest to obtain the one or more trained models.
  • 17. The method of claim 11 wherein spectral metrics in the set of spectral metrics are one of absorbance metrics, transmittance metrics, reflectance metrics, and scattering metrics.
  • 18. The method of claim 11 wherein the result indicates the positive particle of interest detection if the result meets or exceeds a threshold.
  • 19. The method of claim 11 wherein the electromagnetic radiation includes at least one of ultraviolet light, visible light, and infrared light.
  • 20. The method of claim 11 wherein the one or more trained models include a set of trained decision trees.
  • 21. A system comprising: a first computing device configured to: receive a first set of data in a first format, the first set of data including a set of spectral metrics, the first set of data provided by an apparatus configured to obtain the set of spectral metrics based on interactions of electromagnetic radiation with a sample;process the first set of data to obtain a second set of data in a second format different from the first format, the second set of data including the set of spectral metrics; andtransmit the second set of data; anda second computing device configured to: receive the second set of data;apply one or more trained models to at least one of the set of spectral metrics and a set of values based on the set of spectral metrics to obtain a result, the one or more trained models trained on a set of training samples for a particle of interest;based on the result, determine either a positive particle of interest detection or a negative particle of interest detection for the particle of interest for the sample; andtransmit to the first computing device either the positive particle of interest detection or the negative particle of interest detection for the particle of interest for the sample,wherein the first computing device is further configured to: generate a particle of interest detection notification that indicates either the positive particle of interest detection or the negative particle of interest detection for the particle of interest for the sample; andprovide the particle of interest detection notification.
  • 22. The system of claim 21, further comprising the apparatus, wherein the apparatus is configured to: obtain the set of spectral metrics based on interactions of electromagnetic radiation with a sample; andprovide the first set of data including the set of spectral metrics to the first computing device.
  • 23. The system of claim 21 wherein the first computing device is further configured to transmit metadata associated with at least one of the apparatus, the sample, a date and time at which the apparatus obtains the set of spectral metrics to the second computing device, and the second computing device is further configured to: receive the metadata; andstore the metadata in association with the result.
  • 24. The system of claim 21 wherein spectral metrics in the set of spectral metrics are one of absorbance metrics, transmittance metrics, reflectance metrics, and scattering metrics.
  • 25. The system of claim 21 wherein the electromagnetic radiation includes at least one of ultraviolet light, visible light, and infrared light.
CROSS-REFERENCE TO RELATED APPLICATIONS

This application claims priority to U.S. Provisional Patent Application No. 63/391,945, filed on Jul. 25, 2022, and entitled “ACQUIRING, PROCESSING AND HARVESTING HYPERSPECTRAL DATA FOR INVISIBLE PARTICLES,” and to U.S. Provisional Patent Application No. 63/391,954, filed on Jul. 25, 2022, and entitled “MOBILE DIGITAL DEVICE WITH SPECTRAL ANALYSIS,” each of which is incorporated in its entirety herein by reference.

Provisional Applications (2)
Number Date Country
63391945 Jul 2022 US
63391954 Jul 2022 US