The present disclosure relates to a learning device, a learning method, and a control program.
Technologies for detecting odor from a urine sample of a subject and generating a determination model for determining a disease from the detected odor (that is, sensing result data) have been proposed (such as Patent Literature 1).
Patent Literature 1: Published Japanese Translation of PCT International Publication for Patent Application, No. 2004-531718
However, the technology disclosed in Patent Literature 1 assumes every piece of sensing result data as data used for generation of a determination model (that is, learning target data) without making a selection, and therefore precision of the determination model may not reach a desired level.
An object of the present disclosure is to provide a learning device, a learning method, and a control program that can achieve improved precision of a determination model.
A learning device according to a first aspect includes:
a selection unit configured to, from a plurality of learning candidate data units respectively related to a plurality of subjects including a plurality of cancer patients and a plurality of non-cancer patients, each learning candidate data unit at least including a urine odor data unit acquired from urine of a related subject and a cancer label at least indicating whether the related subject is a cancer patient or a non-cancer patient, select part of the plurality of learning candidate data units as a learning target data set, based on a selection rule; and
a determination model formation unit configured to form a determination model for determining which of urine of a cancer patient and urine of a non-cancer patient a determination target urine odor data unit is related to, by using the selected learning target data set.
A learning method according to a second aspect includes:
from a plurality of learning candidate data units respectively related to a plurality of subjects including a plurality of cancer patients and a plurality of non-cancer patients, each learning candidate data unit at least including a urine odor data unit acquired from urine of a related subject and a cancer label at least indicating whether the related subject is a cancer patient or a non-cancer patient, selecting part of the plurality of learning candidate data units as a learning target data set, based on a selection rule; and
forming a determination model for determining which of urine of a cancer patient and urine of a non-cancer patient a determination target urine odor data unit is related to, by using the selected learning target data set.
A control program according to a third aspect causes a learning device to execute processing of:
from a plurality of learning candidate data units respectively related to a plurality of subjects including a plurality of cancer patients and a plurality of non-cancer patients, each learning candidate data unit at least including a urine odor data unit acquired from urine of a related subject and a cancer label at least indicating whether the related subject is a cancer patient or a non-cancer patient, selecting part of the plurality of learning candidate data units as a learning target data set, based on a selection rule; and
forming a determination model for determining which of urine of a cancer patient and urine of a non-cancer patient a determination target urine odor data unit is related to, by using the selected learning target data set.
The present disclosure enables provision of a learning device, a learning method, and a control program that can achieve improved precision of a determination model.
Example embodiments will be described below referring to drawings. The same or equivalent components are given the same sign in the example embodiments, and redundant description thereof is omitted.
The selection unit 11 receives (inputs) a plurality of “learning candidate data units” (that is, a learning candidate data unit group). The plurality of learning candidate data units are respectively related to a plurality of subjects including a plurality of cancer patients and a plurality of non-cancer patients. Further, each learning candidate data unit includes at least a “urine odor data unit” and a “cancer label.” A urine odor data unit included in a learning candidate data unit is data related to odor detected from urine of a related subject, and, for example, a form thereof may be a vector including feature values of odor or a second-rank or higher tensor. The “cancer label” is a label at least indicating whether a related subject is a cancer patient or a non-cancer patient and, for example, may include a sub-label indicating whether the related subject is a cancer patient or a non-cancer patient. Specifically, for example, in addition to a sub-label indicating whether a related subject is a cancer patient or a non-cancer patient, the “cancer label” may include a sub-label indicating a type of cancer or a sub-label indicating progress of cancer.
Then, from the plurality of input learning candidate data units, the selection unit 11 selects part of the plurality of learning candidate data units as a “learning target data set,” based on a “selection rule.”
The determination model formation unit 12 forms the aforementioned “determination model” by using a learning target data set selected by the selection unit 11. The thus formed determination model is used in determination processing for determining which of urine of a cancer patient and urine of a non-cancer patient a determination target urine odor data unit a subject related to which is not determined to be a cancer patient or a non-cancer patient is related to. A learning method for forming the “determination model” is not particularly limited and, for example, may be logistic regression (LR), a support vector machine (SVM), a random forest (RF), or a neural network (NN).
As described above, from the aforementioned plurality of learning candidate data units, the selection unit 11 in the learning device 10 selects part of the plurality of learning candidate data units as a “learning target data set,” based on a “selection rule,” according to the first example embodiment. The determination model formation unit 12 forms the aforementioned “determination model” by using the learning target data set selected by the selection unit 11.
With the configuration of the learning device 10, a learning candidate data unit to be an actual learning target can be selected, and therefore improved precision of a determination model can be achieved.
A second example embodiment relates to a specific example of the aforementioned “selection rule.” A basic configuration of a learning device according to the second example embodiment is the same as that of the learning device 10 according to the first example embodiment and therefore will be described with reference to
A selection unit 11 in a learning device 10 according to the second example embodiment selects, from a plurality of input learning candidate data units, part of the plurality of learning candidate data units as a “learning target data set,” based on a “selection rule,” similarly to the first example embodiment.
The “selection rule” according to the second example embodiment includes a sub-rule (may be hereinafter referred to as a “first sub-rule”) for balancing, in a “learning target data set,” the number of learning candidate data units having a cancer label indicating a cancer patient with the number of learning candidate data units having a cancer label indicating a non-cancer patient.
Each entry in the left-hand diagram in
As described above, the selection unit 11 in the learning device 10 selects, from a plurality of input learning candidate data units, part of the plurality of learning candidate data units as a “learning target data set,” based on the selection rule, according to the second example embodiment. The “selection rule” includes a sub-rule for balancing, in the “learning target data set,” the number of learning candidate data units having a cancer label indicating a cancer patient with the number of learning candidate data units having a cancer label indicating a non-cancer patient.
With the configuration of the learning device 10, the number of learning candidate data units having a cancer label indicating a cancer patient can be balanced in a “learning target data set” with the number of learning candidate data units having a cancer label indicating a non-cancer patient. Thus, improved precision of a determination model can be achieved.
A third example embodiment relates to a variation of the aforementioned “selection rule.” A basic configuration of a learning device according to the third example embodiment is the same as that of the learning device 10 according to the first example embodiment and therefore will be described with reference to
Each learning candidate data unit according to the third example embodiment includes a “characteristic parameter” related to a subject in addition to the aforementioned “urine odor data unit” and the aforementioned “cancer label.” The “characteristic parameter” may take N (where N is a natural number equal to or greater than 2) pieces of k-th values (where k=1, . . . , N). In other words, the “characteristic parameter” may take at least a first value and a second value. For example, the “characteristic parameter” may be any one item out of “sex,” a “height,” a “weight,” a “comorbidity other than cancer,” and a “medication type” about a subject, or any combination of the above items.
A selection unit 11 in a learning device 10 according to the third example embodiment selects, from a plurality of input learning candidate data units, part of the plurality of learning candidate data units as a “learning target data set,” based on a “selection rule,” similarly to the first example embodiment and the second example embodiment.
The “selection rule” according to the third example embodiment includes a sub-rule (may be hereinafter referred to as a “second sub-rule”) for balancing, in a “learning target data set,” the numbers of learning candidate data units having k-th values. Specifically, the second sub-rule is a rule for balancing, in a learning target data set, the number of learning candidate data units having the aforementioned first value with the number of learning candidate data units having the aforementioned second value. The second sub-rule may be used with the aforementioned first sub-rule or may be used singly.
Each entry in the left-hand diagram in
Each entry in a left-hand diagram in
Further, a medication type given to a subject for treatment of a comorbidity other than cancer may be used as the aforementioned characteristic parameter. In this case, a “selection rule” may include a sub-rule for balancing, in a learning target data set, the number of learning candidate data units having a medication type indicating medication affecting urine of a subject and a cancer label indicating a cancer patient with the number of learning candidate data units having a medication type indicating medication affecting urine of a subject and a cancer label indicating a non-cancer patient. By using a learning target data set selected in accordance with the sub-rule in learning of a determination model, a bad effect of a determination model formed by a determination model formation unit 12 becoming a “determination model determining a medication type affecting urine of a subject” can be prevented.
Then, the determination model formation unit 12 according to the third example embodiment forms the aforementioned “determination model” by using the “learning target data set” selected by the selection unit 11, similarly to the first example embodiment and the second example embodiment. The determination model formation unit 12 may form a determination model by using a urine odor data unit and a cancer label as learning parameters used in learning of a determination model without using, in the learning, a characteristic parameter included in each learning candidate data unit in the learning target data set. The determination model formation unit 12 may instead form a determination model by using all of a characteristic parameter, a urine odor data unit, and a cancer label that are included in each learning candidate data unit in a learning target data set as learning parameters used in learning of a determination model.
As described above, the selection unit 11 in the learning device 10 selects, from a plurality of input learning candidate data units, part of the plurality of learning candidate data units as a “learning target data set,” based on a “selection rule,” according to the third example embodiment. Each learning candidate data unit further includes a “characteristic parameter” that is related to a subject and may take at least a first value and a second value. The “selection rule” includes a sub-rule for balancing, in a learning target data set, the number of learning candidate data units having the first value with the number of learning candidate data units having the second value.
With the configuration of the learning device 10, the numbers of learning candidate data units between characteristic parameter values can be balanced in a learning target data set. Thus, improved precision of a determination model can be achieved.
A fourth example embodiment relates to a learning device that can accept specification of a sub-rule to be used out of a plurality of sub-rules different from one another included in a selection rule.
A “selection rule” according to the fourth example embodiment includes a plurality of sub-rules different from one another. The specification acceptance unit 21 accepts a “specification signal” indicating a single sub-rule or a combination of a plurality of sub-rules specified by a user operating an operation unit (unillustrated). Then, the specification acceptance unit 21 sets the single sub-rule or the combination of a plurality of sub-rules indicated by the specification signal to the selection unit 11 as a “selection rule to be used.” Thus, the selection unit 11 selects, from a plurality of input learning candidate data units, part of the plurality of learning candidate data units as a “learning target data set,” based on the “selection rule to be used” set by the specification acceptance unit 21.
As described above, the specification acceptance unit 21 in the learning device 20 accepts a “specification signal” indicating a single sub-rule or a combination of a plurality of sub-rules specified by a user operating the operation unit (unillustrated), according to the fourth example embodiment. Then, the specification acceptance unit 21 sets the single sub-rule or the combination of a plurality of sub-rules indicated by the specification signal to the selection unit 11 as a “selection rule to be used.”
With the configuration of the learning device 20, a “learning target data set” can be selected by using a selection rule matching user needs.
A fifth example embodiment relates to a cancer examination system including a learning device.
The data acquisition device 30 in
The storage unit 32 stores a urine odor data unit received from the odor sensor 31 in a form of a table (may be hereinafter referred to as a “collected data table”).
The communication unit 33 transmits a collected data table stored in the storage unit 32 to the learning device 40.
The learning device 40 in
The communication unit 41 receives a collected data table transmitted from the data acquisition device 30 and outputs the collected data table to the storage unit 42.
The storage unit 42 stores a collected data table received from the communication unit 41.
The selection unit 43 extracts and acquires a learning candidate data unit from each entry in a collected data table stored in the storage unit 42. Specifically, since each entry in the collected data table also includes an item not required for selection processing in the selection unit 43, information about a required item is extracted from each entry and is acquired as a learning candidate data unit.
Then, the selection unit 43 selects, from a plurality of acquired learning candidate data units, part of the plurality of learning candidate data units as a “learning target data set,” based on a “selection rule,” similarly to the selection unit 11 according to any one of the first to fourth example embodiments.
The determination model formation unit 44 forms the aforementioned “determination model” by using a learning target data set selected by the selection unit 43, similarly to the determination model formation units 12 according to the first to fourth example embodiments.
The determination device 50 in
The odor sensor 51 forms a determination target urine odor data unit by detecting odor from urine of a subject being a determination target and outputs the formed determination target urine odor data unit to the determination unit 52.
The determination unit 52 determines which of urine of a cancer patient and urine of a non-cancer patient a determination target urine odor data unit received from the odor sensor 51 is related to, by using a determination model formed by the learning device 40. When a characteristic parameter is not used and a urine odor data unit is used in learning of a determination model in the learning device 40, the determination unit 52 makes a determination by using a determination target urine odor data unit received from the odor sensor 51. On the other hand, when a characteristic parameter is used with a urine odor data unit in learning of a determination model in the learning device 40, a value of a characteristic parameter related to a subject being a determination target is also input to the determination unit 52. Then, the determination unit 52 determines which of urine of a cancer patient and urine of a non-cancer patient the determination target urine odor data unit is related to, based on the input determination target urine odor data unit, the input characteristic parameter value, and the determination model.
While the determination device 50 has been described above as a device independent of the data acquisition device 30 and the learning device 40, the determination device 50 is not limited to the above. For example, the determination device 50 may be included in the data acquisition device 30. In this case, the odor sensor 31 and the odor sensor 51 may form a single odor sensor. Further, for example, the determination unit 52 in the determination device 50 may be provided in the learning device 40. In this case, a determination target urine odor data unit formed in the odor sensor 51 may be transmitted to the learning device 40 through a communication unit (unillustrated) in the determination device 50, and the determination unit 52 provided in the learning device 40 may determine which of urine of a cancer patient and urine of a non-cancer patient the determination target urine odor data unit is related to.
Example embodiments according to which the selection unit in the learning device selects, from a plurality of learning candidate data units, part of the plurality of learning candidate data units as a “learning target data set,” based on a “selection rule,” have been described in the aforementioned first to fifth example embodiments. Example embodiments according to which a learning target data set is formed in a learning device by assigning a weight of a loss function used for forming a determination model to each of a plurality of learning candidate data units, based on a balancing rule, will be described in a sixth example embodiment and beyond.
The learning target data set formation unit 61 receives (inputs) a plurality of learning candidate data units (a learning candidate data unit group), similarly to the selection units in the learning devices according to the first to fifth example embodiments.
Then, the learning target data set formation unit 61 forms a “learning target data set” by assigning a “weight” to each of the plurality of learning candidate data units, based on a “balancing rule.” The weight is a weight of a loss function used for forming a determination model. When zero is assigned to a learning candidate data unit as a weight, the learning candidate data unit does not contribute to learning by the determination model formation unit 62. Accordingly, assigning a zero value weight to a learning candidate data unit is equivalent to being excluded from a learning target data set in the “selection processing” in the first example embodiment to the fifth example embodiment.
Returning to the description of
Specifically, the determination model formation unit 62 forms a determination model f in such a way as to minimize the sum total summarizing, for every learning candidate data unit, a value acquired by multiplying a weight w by a value of a loss function loss acquired from a urine odor data unit, a cancer label, and the determination model f in each learning candidate data unit in a learning target data set [see Eqn. (1) below]. The loss function is not particularly limited and, for example, may be cross entropy, hinge loss, exponential loss, or 0-1 loss.
[Math. 1]
argminf=ΣiNwiloss(f(xi),yi) (1)
In In Eqn. (1), N denotes the number of learning candidate data units included in a learning target data set. Further, i denotes an i-th learning candidate data unit. Further, wi denotes a weight of an i-th learning candidate data unit. Further, xi denotes an explanatory variable of an i-th learning candidate data unit and at least includes a urine odor data unit of the i-th learning candidate data unit. Further, yi denotes a cancer label.
As described above, the learning target data set formation unit 61 in the learning device 60 forms a learning target data set by assigning a weight of a loss function used for forming a determination model to each of a plurality of input learning candidate data units, based on a balancing rule, according to the sixth example embodiment. The determination model formation unit 62 forms the aforementioned determination model, based on the learning target data set formed by the learning target data set formation unit 61.
With the configuration of the learning device 60, a degree of contribution of each learning candidate data unit to learning by the determination model formation unit 62 can be adjusted. Thus, improved precision of a determination model can be achieved.
A seventh example embodiment relates to a specific example of the aforementioned “balancing rule.” A basic configuration of a learning device according to the seventh example embodiment is the same as that of the learning device 60 according to the sixth example embodiment and therefore will be described with reference to
A learning target data set formation unit 61 in the learning device 60 according to the seventh example embodiment forms a “learning target data set” by assigning a “weight” to each of a plurality of input learning candidate data units, based on a “balancing rule,” similarly to the sixth example embodiment.
The “balancing rule” according to the seventh example embodiment includes a sub-rule A1 for balancing, in a “learning target data set,” the sum total of weights assigned to learning candidate data units having a cancer label indicating a cancer patient with the sum total of weights assigned to learning candidate data units having a cancer label indicating a non-cancer patient.
Each entry in the left-hand diagram in
For example, the determination model formation unit 62 according to the seventh example embodiment forms a determination model in such as way as to minimize a value acquired by Eqn. (1) described above, similarly to the sixth example embodiment. In the example in
As described above, the learning target data set formation unit 61 in the learning device 60 forms a learning target data set by assigning a weight of a loss function used for forming a determination model to each of a plurality of input learning candidate data units, based on a “balancing rule,” according to the seventh example embodiment. The “balancing rule” includes a sub-rule for balancing, in a “learning target data set,” the sum total of weights assigned to learning candidate data units having a cancer label indicating a cancer patient with the sum total of weights assigned to learning candidate data units having a cancer label indicating a non-cancer patient.
With the configuration of the learning device 60, a degree of contribution of the entire learning candidate data units having a cancer label indicating a cancer patient to learning by the determination model formation unit 62 can be balanced in a “learning target data set” with a degree of contribution of the entire learning candidate data units having a cancer label indicating a non-cancer patient. Thus, improved precision of a determination model can be achieved.
An eighth example embodiment relates to a variation of the aforementioned “balancing rule.” A basic configuration of a learning device according to the eighth example embodiment is the same as that of the learning device 60 according to the sixth example embodiment and therefore will be described with reference to
Each learning candidate data unit according to the eighth example embodiment includes a “characteristic parameter” related to a subject in addition to the aforementioned “urine odor data unit” and the aforementioned “cancer label.” The “characteristic parameter” may take N (where N is a natural number equal to or greater than 2) pieces of k-th values (where k=1, . . . , N). In other words, the “characteristic parameter” may take at least a first value and a second value. For example, the “characteristic parameter” may be any one item out of “sex,” a “height,” a “weight,” a “comorbidity other than cancer,” and a “medication type” about a subject, or any combination of the above items.
A learning target data set formation unit 61 in the learning device 60 according to the eighth example embodiment forms a “learning target data set” by assigning a “weight” to each of a plurality of input learning candidate data units, based on a “balancing rule,” similarly to the sixth example embodiment and the seventh example embodiment.
The “balancing rule” according to the eighth example embodiment includes a sub-rule A2 for balancing, in a “learning target data set,” the sum totals of weights of learning candidate data units having k-th values. Specifically, the sub-rule A2 is a rule for balancing, in a learning target data set, the sum total of weights of learning candidate data units having the aforementioned first value with the sum total of weights of learning candidate data units having the aforementioned second value. The sub-rule A2 may be used with the aforementioned sub-rule A1 or may be used singly.
For example, a medication type given to a subject for treatment of a comorbidity other than cancer may be used as the aforementioned characteristic parameter. In this case, the “balancing rule” may include a sub-rule for balancing, in a learning target data set, the sum total of weights of learning candidate data units having a medication type indicating medication affecting urine of a subject and a cancer label indicating a cancer patient with the sum total of weights of learning candidate data units having a medication type indicating medication affecting urine of a subject and a cancer label indicating a non-cancer patient.
As described above, the learning target data set formation unit 61 in the learning device 60 forms a learning target data set by assigning a weight of a loss function used for forming a determination model to each of a plurality of input learning candidate data units, based on a “balancing rule,” according to the eighth example embodiment. Each learning candidate data unit further includes a “characteristic parameter” that is related to a subject and may take at least a first value and a second value. The “balancing rule” includes a sub-rule for balancing, in a learning target data set, the sum total of weights of learning candidate data units having the aforementioned first value with the sum total of weights of learning candidate data units having the aforementioned second value.
With the configuration of the learning device 60, the sum totals of weights between characteristic parameter values can be balanced in a learning target data set. Thus, improved precision of a determination model can be achieved.
A ninth example embodiment relates to a learning device that can accept specification of a sub-rule to be used out of a plurality of sub-rules different from one another included in a balancing rule.
A “balancing rule” according to the ninth example embodiment includes a plurality of sub-rules different from one another. The specification acceptance unit 71 accepts a “specification signal” indicating a single sub-rule or a combination of a plurality of sub-rules specified by a user operating an operation unit (unillustrated). Then, the specification acceptance unit 71 sets the single sub-rule or the combination of a plurality of sub-rules indicated by the specification signal to the learning target data set formation unit 61 as a “balancing rule to be used.” Thus, the learning target data set formation unit 61 can form a learning target data set by assigning a weight of a loss function used for forming a determination model to each input learning candidate data unit, based on the “balancing rule to be used” set by the specification acceptance unit 71.
As described above, the specification acceptance unit 71 in the learning device 70 accepts a “specification signal” indicating a single sub-rule or a combination of a plurality of sub-rules specified by a user operating the operation unit (unillustrated), according to the ninth example embodiment. Then, the specification acceptance unit 71 sets the single sub-rule or the combination of a plurality of sub-rules indicated by the specification signal to the learning target data set formation unit 61 as a “balancing rule to be used.”
With the configuration of the learning device 70, a learning target data set” can be formed by using a balancing rule matching user needs.
A tenth example embodiment is related to a cancer examination system including a learning device.
The learning device 80 in
The learning target data set formation unit 81 extracts and acquires a learning candidate data unit from each entry in a collected data table stored in the storage unit 42. Specifically, since each entry in the collected data table also includes an item not required for selection processing in the selection unit 43, information about a required item is extracted from each entry and is acquired as a learning candidate data unit.
Then, the learning target data set formation unit 81 forms a “learning target data set” by assigning a “weight” to each of a plurality of learning candidate data units, based on a “balancing rule,” similarly to the learning target data set formation unit 61 according to any one of the sixth to ninth example embodiments.
The determination model formation unit 82 forms the aforementioned “determination model” by using a learning target data set formed by the learning target data set formation unit 81, similarly to the determination model formation units 62 according to the sixth to ninth example embodiments.
Each of the learning devices 10, 20, 40, 60, 70, and 80 according to the first to tenth example embodiments may include the hardware configuration illustrated in
While the present invention has been described above with reference to the example embodiments, the present invention is not limited to the above. Various changes and modifications that may be understood by a person skilled in the art may be made to the configurations and details of the present invention without departing from the spirit and scope of the present invention.
The whole or part of the example embodiments disclosed above can be described as, but not limited to, the following supplementary notes.
A learning device including:
a selection unit configured to, from a plurality of learning candidate data units respectively related to a plurality of subjects including a plurality of cancer patients and a plurality of non-cancer patients, each learning candidate data unit at least including a urine odor data unit acquired from urine of a related subject and a cancer label at least indicating whether the related subject is a cancer patient or a non-cancer patient, select part of the plurality of learning candidate data units as a learning target data set, based on a selection rule; and
a determination model formation unit configured to form a determination model for determining which of urine of a cancer patient and urine of a non-cancer patient a determination target urine odor data unit is related to, by using the selected learning target data set.
The learning device according to Supplementary Note A1, wherein
each learning candidate data unit further includes a characteristic parameter that is related to the subject and may take at least a first value and a second value, and
the selection rule includes a first sub-rule for balancing, in the learning target data set, the number of the learning candidate data unit having the first value with the number of the learning candidate data unit having the second value.
The learning device according to Supplementary Note A2, wherein the selection rule further includes a second sub-rule for balancing, in the learning target data set, the number of the learning candidate data unit having the cancer label indicating a cancer patient with the number of the learning candidate data unit having the cancer label indicating a non-cancer patient.
The learning device according to Supplementary Note A2 or A3, wherein the characteristic parameter is any one item out of sex, a height, a weight, a comorbidity other than cancer, and a medication type about the subject, or any combination of the above items.
The learning device according to any one of Supplementary Notes A2 to A4, wherein
the selection rule includes a plurality of sub-rules different from one another, and
the learning device further includes a specification acceptance unit configured to accept specification of a sub-rule used for selection of the learning target data set by the selection unit out of the plurality of sub-rules.
The learning device according to any one of Supplementary Notes A2 to A5, wherein the determination model formation unit forms the determination model by using the urine odor data unit and a cancer label without using, in learning, the characteristic parameter included in each learning candidate data unit in the selected learning target data set.
The learning device according to Supplementary Note A1, wherein
each learning candidate data unit further includes a medication type given to the subject for treatment of a comorbidity other than cancer, and
the selection rule includes a third sub-rule for balancing, in the learning target data set, the number of the learning candidate data unit having the medication type indicating medication affecting urine of the subject and the cancer label indicating a cancer patient with the number of the learning candidate data unit having the medication type indicating medication affecting urine of the subject and the cancer label indicating a non-cancer patient.
The learning device according to any one of Supplementary Notes A1 to A7, wherein the cancer label further includes at least one item out of a type of cancer of the subject and progress of cancer of the subject.
A learning method including:
from a plurality of learning candidate data units respectively related to a plurality of subjects including a plurality of cancer patients and a plurality of non-cancer patients, each learning candidate data unit at least including a urine odor data unit acquired from urine of a related subject and a cancer label at least indicating whether the related subject is a cancer patient or a non-cancer patient, selecting part of the plurality of learning candidate data units as a learning target data set, based on a selection rule; and
forming a determination model for determining which of urine of a cancer patient and urine of a non-cancer patient a determination target urine odor data unit is related to, by using the selected learning target data set.
A control program for causing a learning device to execute processing of:
from a plurality of learning candidate data units respectively related to a plurality of subjects including a plurality of cancer patients and a plurality of non-cancer patients, each learning candidate data unit at least including a urine odor data unit acquired from urine of a related subject and a cancer label at least indicating whether the related subject is a cancer patient or a non-cancer patient, selecting part of the plurality of learning candidate data units as a learning target data set, based on a selection rule; and
forming a determination model for determining which of urine of a cancer patient and urine of a non-cancer patient a determination target urine odor data unit is related to, by using the selected learning target data set.
A learning device including:
a learning target data set formation unit configured to form a learning target data set by assigning, based on a balancing rule, a weight of a loss function used for forming a determination model to each of a plurality of learning candidate data units respectively related to a plurality of subjects including a plurality of cancer patients and a plurality of non-cancer patients, each learning candidate data unit at least including a urine odor data unit acquired from urine of a related subject and a cancer label at least indicating whether the related subject is a cancer patient or a non-cancer patient; and
a determination model formation unit configured to, based on the formed learning target data set, form the determination model for determining which of urine of a cancer patient and urine of a non-cancer patient a determination target urine odor data unit is related to.
The learning device according to Supplementary Note B1, wherein the balancing rule includes a sub-rule for balancing, in the learning target data set, a sum total of a weight assigned to the learning candidate data unit having a cancer label indicating that the subject is a cancer patient with a sum total of a weight assigned to the learning candidate data unit having a cancer label indicating that the subject is a non-cancer patient.
The learning device according to Supplementary Note B1, wherein
each learning candidate data unit further includes a characteristic parameter that is related to the subject and may take at least a first value and a second value, and
the balancing rule includes a sub-rule for balancing, in the learning target data set, a sum total of a weight assigned to the learning candidate data unit having the first value with a sum total of a weight assigned to the learning candidate data unit having the second value.
The learning device according to Supplementary Note B3, wherein the characteristic parameter is any one item out of sex, a height, a weight, a comorbidity other than cancer, and a medication type about the subject, or any combination of the above items.
The learning device according to Supplementary Note B3 or B4, wherein
the balancing rule includes a plurality of sub-rules different from one another, and
the learning device further includes a specification acceptance unit configured to accept specification of a sub-rule used for formation of the learning target data set by the learning target data set formation unit out of the plurality of sub-rules.
The learning device according to Supplementary Note B1, wherein
each learning candidate data unit further includes a medication type given to the subject for treatment of a comorbidity other than cancer, and
the balancing rule includes a sub-rule for balancing, in the learning target data set, a sum total of a weight of the learning candidate data unit having the medication type indicating medication affecting urine of the subject and the cancer label indicating a cancer patient with a sum total of a weight of the learning candidate data unit having the medication type indicating medication affecting urine of the subject and the cancer label indicating a non-cancer patient.
The learning device according to any one of Supplementary Notes B1 to B6, wherein the learning target data set formation unit excludes part of the plurality of learning candidate data units from the learning target data set by assigning the weight with a zero value to the part of the learning candidate data units.
The learning device according to any one of Supplementary Notes B1 to B7, wherein the cancer label further includes at least one item out of a type of cancer of the subject and progress of cancer of the subject.
A learning method including:
forming a learning target data set by assigning, based on a balancing rule, a weight of a loss function used for forming a determination model to each of a plurality of learning candidate data units respectively related to a plurality of subjects including a plurality of cancer patients and a plurality of non-cancer patients, each learning candidate data unit at least including a urine odor data unit acquired from urine of a related subject and a cancer label at least indicating whether the related subject is a cancer patient or a non-cancer patient; and,
based on the formed learning target data set, forming the determination model for determining which of urine of a cancer patient and urine of a non-cancer patient a determination target urine odor data unit is related to.
A control program for causing a learning device to execute processing of:
forming a learning target data set by assigning, based on a balancing rule, a weight of a loss function used for forming a determination model to each of a plurality of learning candidate data units respectively related to a plurality of subjects including a plurality of cancer patients and a plurality of non-cancer patients, each learning candidate data unit at least including a urine odor data unit acquired from urine of a related subject and a cancer label at least indicating whether the related subject is a cancer patient or a non-cancer patient; and,
based on the formed learning target data set, forming the determination model for determining which of urine of a cancer patient and urine of a non-cancer patient a determination target urine odor data unit is related to.
This application is based upon and claims the benefit of priority from Japanese patent application No. 2019-074032, filed on Apr. 9, 2019, the disclosure of which is incorporated herein in its entirety by reference.
Number | Date | Country | Kind |
---|---|---|---|
2019-074032 | Apr 2019 | JP | national |
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/JP2020/015290 | 4/3/2020 | WO | 00 |