The present application relates generally to cell cultures (e.g., in a bioreactor), and more specifically to the explicit or implicit determination of packed cell volume.
In the manufacture of certain biopharmaceutical products (e.g., biotherapeutic proteins), bioreactors are used to culture cells prior to harvesting a desired drug substance. Daily sampling from such bioreactors is typically required in process development laboratories, in order to ensure cell health and continuity between runs. This sampling process typically involves manually removing a sample of cell culture from the bioreactor, and then utilizing multiple analytical instruments, a centrifuge, and a manual process to visually measure the packed cell volume. A conventional implementation of this sampling process is shown as sampling process 200 in
On days that packed cell volume is measured, the sample is typically divided into two separate tubes to be centrifuged. The supernatant of the first tube is removed and taken to devices for analytical osmolality and metabolite measurements. The second tube, which is dedicated to packed cell volume measurement, is spun down in a centrifuge. An individual then estimates packed cell volume by visually approximating the percentage of solid cells compacted at the bottom of the spun centrifuge tube in relation to the total volume of cells and liquid in the tube. This daily sampling/analysis process can be very time consuming (e.g., roughly three hours for one person assessing samples from a row of eight bioreactors). Furthermore, the subjective visual estimation process can result in relatively low packed cell volume accuracy, substantial variance between the visual estimations made by different individuals, and possibly substantial variance between the visual estimations made by a single individual at different times.
Systems and methods described herein use a non-linear machine learning model (e.g., a neural network) to infer or predict a value or classification indicative of packed cell volume. For example, the model may infer from measured cell culture characteristics (e.g., experiment day, viable cell density, and viability) that the cell culture has a specific packed cell volume (e.g., expressed as a percentage), or may predict that a hypothetical cell culture with those characteristics would have a specific packed cell volume. Alternatively, the packed cell volume may be only implicitly determined or estimated. For example, the model may predict from measured cell culture characteristics that the cell culture would likely clog the filter during the harvesting process due to high packed cell volume, or that the cell culture would require modification to the timing, feed rate, and/or other parameters associated with a device (e.g., centrifuge or pump) used during the harvesting process. In some embodiments, the inferred or predicted value or classification is used to generate control data for one or more devices used in the harvesting process.
The techniques disclosed herein may obviate or lessen the need for manual assessments of packed cell volume, and may obviate or lessen the need for cell culture sampling (automated or otherwise) for purposes of packed cell volume assessments. Moreover, the disclosed techniques may more accurately estimate packed cell volumes, more reliably predict downstream issues (e.g., whether problems are likely to occur during harvesting, or whether/how centrifuge parameters should be modified to avoid such problems, etc.), and/or control downstream processes in a manner that achieves better performance (e.g., higher harvest yield).
The skilled artisan will understand that the figures described herein are included for purposes of illustration and are not limiting on the present disclosure. The drawings are not necessarily to scale, emphasis instead being placed upon illustrating the principles of the present disclosure. It is to be understood that, in some instances, various aspects of the described implementations may be shown exaggerated or enlarged to facilitate an understanding of the described implementations. In the drawings, like reference characters throughout the various drawings generally refer to functionally similar and/or structurally similar components.
The various concepts introduced above and discussed in greater detail below may be implemented in any of numerous ways, and the described concepts are not limited to any particular manner of implementation. Examples of implementations are provided for illustrative purposes.
In addition to the bioreactor 102, the system 100 includes one or more analytical instruments 104, a computer system 106, and a harvesting system 112. The bioreactor 102 may be any suitable vessel, device, or system that supports a cell culture, which may include living cells and/or substances derived therefrom within a media. The bioreactor 102 may contain recombinant proteins that are being expressed by the cell culture, e.g., such as for research purposes, clinical use, commercial sale, or other distribution. Depending on the biopharmaceutical process taking place, the media may include a particular fluid (e.g., a “broth”) and specific nutrients, and may have a target pH level or range, a target temperature or temperature range, and so on.
The analytical instrument(s) 104, which may or may not be communicatively coupled to the computer system 106 depending on the embodiment, can include any in-line, at-line, and/or off-line instrument (or instruments) configured to measure one or more attributes of the cell culture within the bioreactor 102. For example, the analytical instrument(s) 104 may measure viable cell density (VCD), viability, total cell count (TCC), cell diameter, and/or osmolality. Additionally or alternatively, the analytical instrument(s) 104 may measure one or more media component concentrations, such as metabolite levels (e.g., glucose, lactate, sodium, potassium, glutamine, ammonium, etc.), and/or one or more other cell culture attributes associated with the contents of the bioreactor 102 (e.g., temperature, pH, etc.). While in some embodiments the analytical instrument(s) 104 may use destructive analysis techniques, in other embodiments one, some, or all of the analytical instrument(s) 104 use non-destructive analysis (e.g., “soft sensing”) techniques.
The harvesting system 112 includes one or more devices that are used during the harvest process, e.g., to extract a desired drug substance from the cell culture of the bioreactor 102. As used herein, the term “device” may refer to a free-standing device, a component integrated within another device or system, or any combination thereof. For example, the harvesting system 112 may include one or more filters, a pump, a centrifuge, and/or other suitable devices, any of which may be separate from or integrated within the bioreactor 102. In some embodiments, and as discussed in further detail below with reference to
The computer system 106 may be a server, a desktop computer, a laptop computer, a tablet device, or any other suitable type of computing device or devices. In the example embodiment shown in
The processing unit 120 includes one or more processors, each of which may be a programmable microprocessor that executes software instructions stored in the memory 126 to execute some or all of the functions of the computer system 106 as described herein. Alternatively, some of the processors in the processing unit 120 may be other types of processors (e.g., application-specific integrated circuits (ASICs), field-programmable gate arrays (FPGAs), etc.), and some of the functionality of the computer system 106 as described herein may instead be implemented, in part or in whole, by such hardware. The memory 126 may include one or more physical memory devices or units containing volatile and/or non-volatile memory. Any suitable memory type or types may be used, such as read-only memory (ROM), solid-state drives (SSDs), hard disk drives (HDDs), and so on.
The display device 122 may use any suitable display technology (e.g., LED, OLED, LCD, etc.) to present information to a user, and the user input device 124 may be a keyboard or other suitable input device. In some embodiments, the display device 122 and the user input device 124 are integrated within a single device (e.g., a touchscreen display). Generally, the display device 122 and the user input device 124 mayjointly enable a user to interact with user interfaces (e.g., graphical user interfaces) provided by the computer system 106, e.g., to obtain an estimate of the packed cell volume of the cell culture process occurring within the bioreactor 102 as discussed in further detail below. In some embodiments, however, the computer system 106 does not include the display device 122 and/or the user input device 124.
The memory 126 stores the instructions of one or more software applications, including a packed cell volume (PCV) application 130, as well as data used by and generated by the software application(s). It is understood that the PCV application 130 may be a single software application executed by a single computing device, or may have various units or modules distributed among multiple software applications and/or executed by multiple computing devices. The PCV application 130, when executed by the processing unit 120, is generally configured to infer or predict packed cell volumes, either explicitly or implicitly, based on parameters associated with real-world and/or hypothetical cell cultures. To this end, the PCV application 130 implements a model 132, which may also be stored in the memory 126. The model 132 is a non-linear, machine learning model (e.g., a neural network, a random forest model, or an XGBoost model), which was trained by the computer system 106 (or by another suitable computing device or system) using data in a historical database 140.
The historical database 140 may be stored in the memory unit 126, and/or in one or more other persistent memories that are local or remote from each other (e.g., in a memory coupled to a remote library server, etc.). Generally, the historical database 140 may include numeric and/or categorical parameters associated with real-world cell cultures, such as experiment day (i.e., the count of days since a cell culture was introduced to a bioreactor, starting at Day 0 or Day 1), viable cell density (VCD), viability, total cell count (TCC), cell diameter, osmolality, molecule identifier (e.g., a particular antibody identifier), facility identifier (e.g., an identifier of the facility in which the cell culture was maintained), and/or other parameters (e.g., temperature, pH, particular metabolite levels, etc.). In some embodiments, the historical database 140 stores parameters corresponding to multiple different days of a single experiment (e.g., nine parameters corresponding to VCD at each of Days 0 through 8). Generally, the stored parameters can include any combination of known values (e.g., experiment day, molecule type), direct measurements, and/or indirect (e.g., soft-sensed) measurements and/or derived values, so long as at least some of the stored parameters are correlated in some way with (i.e., have some inferential or predictive strength with respect to) the packed cell volume of a cell culture.
In some embodiments, for each set of parameters associated with a particular experiment/cell culture, the historical database 140 also includes a label representing the packed cell volume corresponding to that set of parameters. The label may be a measured value, or a manually-estimated value (e.g., a packed cell volume estimate made by a skilled individual, or an average of such estimates from multiple individuals), for example. In other embodiments, the labels are not packed cell volume values, and instead are values or classifications that are indicative of packed cell volumes. For example, the labels may be classifications such as “adequate filter performance” or “inadequate filter performance” for a harvesting filter that requires a sufficiently low packed cell volume to avoid clogging. Labels of this sort may be set based on actual filter performance during harvesting (e.g., whether or not clogging actually occurred), or based on measured or estimated packed cell volumes and their known correspondences (e.g., labeling filter performance as “inadequate” in any case where the packed cell volume exceeded a threshold known to correspond to substantial filter clogging). As another example, the labels may be classifiers such as “modification required” or “no modification required,” to indicate whether modification of harvesting centrifuge parameters (e.g., timing parameters) was required as a result of excessive packed cell volume. As still another example, the labels may be specific centrifuge (or other device) parameters that were successfully used during harvesting, so long as those parameters are indicative of packed cell volume (e.g., with higher filter pressures corresponding to higher packed cell volumes, etc.).
The computer system 106, or another device or system, trains the model 132 using the cell culture parameters in the historical database 140 and the corresponding labels. For each set of parameters, for example, the computer system 106 may use the untrained or partially trained model 132 to infer or predict a value or classification, compare the value or classification to the corresponding label, and then modify the model 132 only if the label does not match the inferred or predicted value or classification (e.g., by modifying weights associated with the outputs of particular neural network nodes). In embodiments where a different computing device or system trains the model 132, the computer system 106 may obtain the model 132 via a wired or wireless communication network (e.g., via an Internet or intranet download), or via any other suitable means (e.g., by copying the model 132 from a portable storage medium). In some embodiments, the computer system 106 updates/refines the trained model 132 during operation, by using real-world results (e.g., additional packed cell volume estimates, or indications of whether filter clogging occurred, etc.) as labels for additional sets of model input data.
It is understood that other configurations and/or components may be used instead of those shown in
In operation, after the model 132 is trained, the PCV application 130 obtains parameters associated with a real-world or hypothetical cell culture. In some embodiments, the PCV application 130 obtains some or all of these parameters via a user interface generated by the PCV application 130. One example of such a user interface is the user interface 300 of
In the example embodiment of
A user may observe the packed cell volume (or other value or classification) shown in the field 306, and take one or more appropriate actions, such as deciding whether the cell culture is suitable for harvesting, manually adjusting various parameters of the harvesting system 112 based on the value and/or classification (e.g., adjust centrifuge timing and/or feed rate), and so on. Conversely, if the parameters in fields 302A-302C represent a purely hypothetical cell culture, the user may simply consider the results shown in field 306, and possibly enter new values, or initiate a real-world experiment based on the results, etc.
In other embodiments, the user interface 300 includes fields for more, fewer, and/or different types of input parameters than those shown in
In some embodiments, the PCV application 130 obtains one, some, or all of the inputs to the model 132 by means other than a user interface. For example, and as discussed in more detail below with reference to
In addition to, or alternative to, presenting an output in a field similar to field 306, the user interface 300 may use any other suitable format or formats for displayed output information, such as graphs, tables, and so on. For example, the computer system 106 may monitor the cell culture in the bioreactor 102 over time, in which case the PCV application 130 may periodically obtain measurements from the analytical instrument(s) 104 and apply those measurements as inputs to the model 132 in order to infer packed cell volume values. In such an embodiment, the user interface 300 may dynamically update a graph or table depicting packed cell volume values over time as those values are determined/obtained.
As noted above, the model 132 is a non-linear, machine learning model. While a linear regression model would provide a simpler implementation, the performance of such a model was found to be deficient. Using experiment day (i.e., duration of the cell culture in a bioreactor), total cell count, viable cell density, cell diameter, viability, and osmolality from over 300 datapoints as model inputs (after cleaning the data by removing samples with missing features and/or outliers outside of 2.5 standard deviations), linear regression provided the performance shown in plot 400 of
Plot 420 of
In some embodiments, in addition to or instead of displaying outputs (e.g., as shown in
The control application 530 includes a measurement unit 532, an inference/prediction unit 534, and a controller 536. It is understood that the various units of the control application 530 may be distributed among different software applications, and/or that the functionality of any one such unit may be divided among different software applications. The measurement unit 532 may obtain (e.g., request, or otherwise monitor) the measurements produced by the analytical instrument(s) 104 once, or periodically for any desired number of time intervals (e.g., once per day, once per hour, etc.). In some embodiments, the control application 530 omits the measurement unit 532, and the control application 530 obtains cell culture parameters via user inputs (e.g., via the user interface 300) or by other means.
In some embodiments, the inference/prediction unit 534 infers or predicts packed cell volume values based on the cell culture parameters obtained by the measurement unit 532 (and/or by other means), by applying the cell culture parameters as inputs to the model 132. The controller 536 operates on the inferred/predicted packed cell volume value(s), and possibly also other information (e.g., user-specified or default limits), to generate control data for one or more devices of the harvesting system 112. For example, the computer system 106 may send, to a centrifuge of the harvesting system 112, a command that causes the centrifuge to adjust one or more timing parameters, such as shot frequency (for a bowl shot centrifuge) or solid/liquid split ratio (for a continuous centrifuge). As another example, the computer system 106 may send, to a pump of the harvesting system 112, a command that causes the pump to adjust a feed rate of the contents of the bioreactor 102 to a centrifuge of the harvesting system 112. The controlled device(s) of the harvesting system 112 may include proportional-integral-derivative (PID) controllers, and receive set-points from the computer system 106 (e.g., set points generated by the controller 536) as inputs to the PID controllers, for example. The controller 536 may control the harvesting device(s) before the harvesting process begins and/or during the harvesting process, depending on the embodiment.
In other embodiments, the model 132 directly predicts certain device parameters that are sensitive to packed cell volume, in which case the controller 536 may not need to translate or map packed cell volume values to specific device settings. For example, the model 132 may directly predict a suitable timing parameter value for a centrifuge, in which case the controller 536 simply generates control data that represents that value in a suitable format.
The display device 122 and/or the user input device 124 may be operated by a user to monitor the control operations of the control application 530. For example, the controller 536 may cause the computer system 106 to present the generated device control settings to a user via the display 122, and/or the device(s) of the harvesting system 112 may send settings or readings to the computer system 106 to cause the computer system 106 to present the settings or readings to a user via the display 122.
At block 602, a plurality of parameters associated with the cell culture is obtained. The parameters may include analytical instrument measurements (e.g., as measured by the analytical instrument(s) 104 of
At block 604, a value or classification indicative of packed cell volume is inferred or predicted. Block 604 includes applying the parameters obtained at block 602 as inputs to a non-linear machine learning model, possibly after one or more steps of cleaning, normalizing, and/or otherwise pre-processing the input data. The model (e.g., the model 132) may be a neural network (e.g., a random forest or XGBoost model), for example. In some embodiments, block 604 includes inferring or predicting a packed cell volume value. In other embodiments, block 604 includes inferring or predicting a different value indicative of packed cell volume (e.g., a particular timing parameter for a centrifuge during a harvesting process) or a classification indicative of packed cell volume (e.g., binary indication of whether a filter will likely clog during the harvesting process, or a binary indication of whether the packed cell volume requires modification of one or more centrifuge parameters during the harvesting process, etc.).
At block 606, an output indicative of the inferred or predicted value or classification is generated. For example, block 606 may include generating or populating a user interface (e.g., similar to the user interface 300, or the field 306 thereof) for display to a user (e.g., via the display 122). As another example, block 606 may include generating control data for one or more devices configured to perform at least a portion of the harvesting process (e.g., one or more devices of the harvesting system 112, as discussed above with reference to
In some embodiments, the method 600 includes one or more additional blocks not shown in
Embodiments of the disclosure relate to a non-transitory computer-readable storage medium having computer code thereon for performing various computer-implemented operations. The term “computer-readable storage medium” is used herein to include any medium that is capable of storing or encoding a sequence of instructions or computer codes for performing the operations, methodologies, and techniques described herein. The media and computer code may be those specially designed and constructed for the purposes of the embodiments of the disclosure, or they may be of the kind well known and available to those having skill in the computer software arts. Examples of computer-readable storage media include, but are not limited to: magnetic media such as hard disks, floppy disks, and magnetic tape; optical media such as CD-ROMs and holographic devices; magneto-optical media such as optical disks; and hardware devices that are specially configured to store and execute program code, such as ASICs, programmable logic devices (“PLDs”), and ROM and RAM devices.
Examples of computer code include machine code, such as produced by a compiler, and files containing higher-level code that are executed by a computer using an interpreter or a compiler. For example, an embodiment of the disclosure may be implemented using Java, C++, or other object-oriented programming language and development tools. Additional examples of computer code include encrypted code and compressed code. Moreover, an embodiment of the disclosure may be downloaded as a computer program product, which may be transferred from a remote computer (e.g., a server computer) to a requesting computer (e.g., a client computer or a different server computer) via a transmission channel. Another embodiment of the disclosure may be implemented in hardwired circuitry in place of, or in combination with, machine-executable software instructions.
While the present disclosure has been described and illustrated with reference to specific embodiments thereof, it should be understood by those skilled in the art that various changes may be made and equivalents may be substituted without departing from the true spirit and scope of the present disclosure as defined by the appended claims.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/US22/32308 | 6/6/2022 | WO |
Number | Date | Country | |
---|---|---|---|
63208824 | Jun 2021 | US |