The present disclosure relates to a biological information processing method, a biological information processing apparatus, and a biological information processing system.
In recent years, in order to save and share medical data, efforts to record medical data as electronic health records have been made. For example, Patent Literature 1 below discloses recording medical data as an electronic health record and transmitting part of the electronic health record to an EDC (Electronic Data Capture) system.
Patent Literature 1: Japanese Unexamined Patent Application Publication No. 2017-208039
Patent Literature 2: Japanese Unexamined Patent Application Publication No. 2012-30038
Linkage of medical data between each hospital however has not sufficiently progressed. For example, because the electronic health record system differs in each hospital, even when an electronic health record is linked from another hospital, each hospital is not able to appropriately utilize medical data that is recorded in the electronic health record.
The disclosure was thus made in view of the above-described circumstances and the disclosure provides a biological information processing method, a biological information processing apparatus, and a biological information processing system that are new and improved and that enable more appropriate linkage of medical data between each hospital.
For solving the problem described above, a biological information processing method according to one aspect of the present disclosure has acquiring biological information on a subject; based on the biological information, generating condition information representing a biological condition of the subject; and registering the condition information in a P2P database.
For solving the problem described above, a biological information processing apparatus according to one aspect of the present disclosure has a biological information acquisition unit configured to acquire biological information on a subject; a condition information generator configured to generate condition information representing a biological condition of the subject based on the biological information; and a register configure to register the condition information in a P2P database.
For solving the problem described above, a biological information processing system according to one aspect of the present disclosure has a biological information acquisition unit configured to acquire biological information on a subject; a condition information generator configured to generate condition information representing a biological condition of the subject based on the biological information; and a register configure to register the condition information in a P2P database.
For solving the problem described above, a biological information processing method according to one aspect of the present disclosure has acquiring biological information on a subject; generating condition information representing a biological condition of the subject based on the biological information; and registering the condition information as data of a distributed network.
According to the disclosure, it is possible to link condition information that is one type of medical data with each hospital that has a P2P database.
As described above, according to the disclosure, it is possible to more appropriately link medical data between each hospital.
Note that the above-described effect is not necessarily definitive and, together with the above-described effect or instead of the above-described effect, any one of the effects described in the description or another effect that can be identified from the description may be achieved.
With reference to the accompanying drawings, preferable embodiments of the disclosure will be described in detail below. In the description and drawings, components that have substantially the same functional configuration are denoted with the same number and thus redundant description thereof is omitted.
Description will be given in the following order.
1. Background
2. Embodiment
3. Summary
First of all, the background of the disclosure will be described.
1.1. Linkage of Medical Data Between Hospitals
As described above, linkage of medical data between each hospital has not sufficiently progressed. Various reasons are considered for insufficient progress of medical data linkage and, as one of the reasons, differences in the electronic health record system among each hospital or each doctor are taken. More specifically, items in an electronic health record, the order of the items, and the data form, etc., differ depending on each hospital (or each electronic heath record system that each hospital employs). Particularly, as for items in which entries can be made freely in an electronic health record, the content of data entered in the electronic health record differs depending on each doctor. For this reason, data difficult to understand properly and unnecessary data may be entered in the electronic health record.
Thus, even when an electronic health record is linked from another hospital, each hospital has difficulties in properly utilizing the medical data that is recorded in the electronic health record. For example, each hospital is sometimes required to, for each hospital that links an electronic health record, build a system (for example, a conversion system) to convert medical data that is linked from another hospital into a data form in which the medical data can be utilized and this causes a considerable work. When the medical data that is linked from another hospital contains data difficult to understand properly and unnecessary data, it is required to examine methods for the respective sets of data.
Even when medical data is linked properly between each hospital, because a appropriate analysis method and a utilization method for the linked medical data (for example, a method of analyzing medical data and utilizing the result of analysis for treatment) have not been established and the effect of linking medical data has not been clear, motivation of each hospital in linking medical data has not been high.
Medical data may contain data that can specify an individual and, from the point of view of protection of personal information, it has been difficult to link medical data between hospitals and it has been required to have a consensus of a patient on linking medical data. Determining whether each set of medical data is data that can specify an individual and managing such sets of data distinctively also cause a considerable work. Particularly, as for data that can specify an individual, it is required to manage the data in a secured environment in order to prevent the data from being falsified and misused, and it is not easy to realize a system (system used to link medical data) whose scale is large to be accessible by a lot of hospitals and that is secure.
1.2. Individual Medicine
Considering conventional utilization of medical data, randomized controlled trials (RCT) have been actively performed as a method of evaluating effectiveness of treatment. This is an approach of, in order to prevent occurrence of bias of subjective or arbitrary evaluation, extracting persons randomly from a population, sorting the persons into a treatment group and a control group, and analyzing a result of treatment on each of the persons. Such analysis based on a statistical process enables prediction of the course of treatment on a group (for example, general patients) but there is a limitation in predicting the course of treatment on individual members (for example, individual patients) who belong to the group.
Taking atopic skin as an example, a more specific description will be given. Randomized controlled trials enable development of effective medical treatment against atopic skin and this can be described as equivalent to development of more effective medical treatment based on an average of feature values of the skin of general patients with atopic skin. The background of the onset, such as sensitiveness of skin and a degree of allergy, however differs depending on the patient and thus, logically, there are not patients whose have completely the same symptom even when they have the same disease name, atopic skin.
The effect of a treatment method on which it is determined that the method is effective through randomized controlled trials differs depending on the patient. For this reason, while realization of individual medicine in which medical treatment is changed according to each patient is expected, it can be described that adjusting the amount of medicine or changing the type of medicine while diagnosing the symptoms of the patient based on experiences of each doctor or limited cases in the past are the limit of current individual medicine.
Individual medicine using genome information has been proposed but information that can be obtained by analyzing genome information using current medical techniques is limited (for example, only information on a tendency (type) of a patient can be obtained). It is considered that it is possible to realize more effective individual medicine by linking information on the body of the patient (for example, genome information, anthropometric information, diagnostic information, treatment information, operation information, or information representing the condition of the body of the patient, such as a blood pressure and an electrocardiogram (ECG), that is acquired using a wearable terminal device that is worn by a patient, which will be referred to as “physical information” below), or information on an environment that affects the patient (for example, information on lifestyle habits of the patient or medication information, or information representing the condition of the environment of the patient, such as an acceleration or an angular velocity that is acquired by a wearable terminal device that is worn by the patient, which will be referred to as “environment information” below) and using the information for analysis; however, it is hard to say that a system for individual medicine using such information has been built. Furthermore, it is hard to say that a system for individual medicine has been built by combining medical information that a medical setting records and non-medical information that each individual records.
Considering from a different point of view, it is argued that, in the current medicine, not definitive therapy to control the root cause of a disease but supportive therapy to increase spontaneous remission to promote cure by administering treatment to reduce main symptoms is often focused on. For example, because definitive therapy against abnormality of immune that is the root cause of atopic skin has not been sufficiently established, supportive therapy to reduce inflammation with steroid topical medication or antihistamine has been widely practiced as medical treatment against atopic skin. It can be described that the current medicine practices diagnosis and treatment based on understanding of features of a patient at the time of practice of diagnosis and treatment, that is, based on spatial features.
It is considered that, in order to promote and develop definitive therapy (or more effectively implement supportive therapy), analyzing not only data that is obtained during diagnosis but also time-series data in which data is arranged chronologically (features over time) is effective. For example, analyzing time-series data in which physical information or environment information on the patient are arranged chronologically clarifies the background of the onset per patient and enables more effective treatment. Currently, however, it is hard to say that a system enabling collection of and analysis on time-series data of physical information or environment information has been built.
1.3. Use of Blockchain
As one of techniques that are expected to link medical data between hospitals and link physical information and environment information in order to realize individual medicine, there is “blockchain”.
A “blockchain” is data in which multiple blocks in which data (transaction data) is stored are chained like a chain using hash values, or the like. Multiple information processing devices (peers and node devices) manages the block chain in a distributed manner, thereby securing authenticity of data that is stored in the blockchain. Furthermore, with the blockchain, it is expected that intervening of approval by the patient in sharing personal information on the patient secures the right of self-determination of the patient.
Basically, the block chain keeps storing data that was registered in the past and thus there is a possibility that the data size of the whole block chain would be enormous as the blockchain is practiced. Data in a large size increases the burden of computing for hashing. For this reason, registering large-sized data in a blockchain is not preferable. Thus, for example, a method of registering only (part of) an electronic heath record in a block chain, a method of registering, in a blockchain, only a site (pass) in a given database in which medical data is saved, etc., have been proposed. In the former method, however, because the electronic health record system differs between hospitals as described above, it is still difficult to link and utilize medical data and it is not possible to link physical information and environment information that are managed outside the electronic health record. The latter method requires access to the given database to acquire medical data and additionally requires a system for control on access to the database and thus it is hard to say that medical data is linked appropriately. From the point of view of protection of personal information, a problem still remains in both the former and latter methods in that it is necessary to have a consensus from the patient on linking the medical data and it is necessary to determine whether each set of data is data that can specify an individual and manage such data distinctively.
The disclosers reached creation of the technique according to the disclosure in view of the above-described background. A biological information processing apparatus according to the disclosure acquires biological information on a subject (covering a patient), generates condition information representing a biological condition of the subject based on the biological information, and registers the condition information in a P2P database (covering a blockchain). More specifically, the biological information processing apparatus generates a condition code representing a biological condition of the subject by encoding the biological information using a given method and registers condition information containing the condition code in the P2P database. An embodiment of the disclosure will be described in detail below.
2.1. Condition Allocation
First of all, details of a process performed by the biological information processing apparatus according to the embodiment to generate condition information based on biological information (the process is referred to as “condition allocation” below) will be described.
The “biological information” that is used for condition allocation is a concept covering physical information or environment information on the subject.
The “physical information” covered by biological information is information on the body of the subject (covering a patient) as described above and includes, for example, anthropometric information (for example, a height, seating height, a weight, a BMI (Body Mass Index), a body fat percentage, a visual acuity, or an audibility); diagnosis (medical interview) information (for example, a name of disease, an X-ray image, an MRI image, a gamma GTP, or a subjective symptom); treatment information (for example, the content of treatment or the time of treatment); or operation information (for example, the content of an operation or the duration of an operation). Note that the content of physical information is not limited to them. For example, the physical information may contain genetic information, such as genome information or epigenome information; information of molecular indices obtained using liquid biopsy, such as hormone, cytokine, growth factors, and circulating nucleic acids in a body fluid sample, such as the blood; or information representing the physical condition of the patient acquired using a wearable terminal device that is worn by the subject and a sensing terminal device having a function of sensing the subject from radio waves, image information, or the like (for example, information obtained by various sensors that the sensing terminal device includes, such as vital signs including a heart rate, autonomic nerves, and a sleep rhythm, an amount of oxygen in the blood, a blood glucose level, a blood pressure, or uric protein). The physical information may contain attribute information on the subject (covering a patient) that is recorded in the electronic health record (for example, a name, a date of birth, an age, a gender, a blood type, an address, a phone number, or a place of employment) and medical information, such as the names of the doctor and hospital that administered treatment and performed an operation on the subject. The physical information, such as anthropometric information, treatment information, and operational information, may contain elements as a time series (a log of each set of information). Treatment refers to medical practices not corresponding to operations.
The “environment information” contained in the biological information is information on the environment that affects the subject (covering a patient) as described above and, for example, contains information on lifestyle habits of the subject (for example, habits, such as smoking, drinking alcohol, diets, sleeping, or exercise, and stresses), medication information (for example, a type of drug, a dosage and an administration), information representing the condition of the environment of the patient acquired by the wearable terminal device that is worn by the subject or the sensing terminal having a function of sensing the subject from waveforms or image information (for example, information acquired by the various sensors of the sensing terminal, such as an acceleration, an angular velocity, etc.), and information that is obtained using the liquid biopsy, such as molecular indices of blood, body fluids, etc. The environment information may contain information on the environment that is estimated from the physical information (for example, vital signs, such as a heart rate, autonomic nerves, and a sleep rhythm, an amount of oxygen in the blood, a blood glucose level, a blood pressure, or uric protein). For example, the environment information may contain information, such as a time to wake up that is estimated from the sleep rhythm and exercise information) that is estimated from the heart rate. When it is information for estimating an environment of the patient, the physical information may be partly recorded as the environment information. The content of the environment information is not limited to them. The information that is acquired using the various sensors may be sensor information itself or may be feature value information of the sensor information that is output because the sensor information is analyzed (from the point of view of processing efficiency and data size, it is preferable that the information acquired using the various sensors be feature value information of the sensor information). Medication by the doctor during the treatment or operation is preferably incorporated in the treatment information or the operation information and medication by the subject himself/herself (ingestion of drug) is preferably contained in the medication information.
The condition allocation refers to inputting the above-described biological information (containing the physical information or the environment information) to a given classifier, thus outputting a classification result, and, using the classifying result, generating condition information. The classifier may simply refer to a table (more specifically, a table in which biological information and conditions are associated with each other) or an approach of machine learning. For example, condition allocation may be performed in a way that the biological information is converted using a table serving as a classifier as follows “atopic skin Type A→1A, the medical condition site is the right upper arm→7, a lot of redness→3). Condition allocation on the biological information may be performed using a method of machine learning, such as a support vector machine or a neural network. For example, a classifier obtained by performing learning using learning data in which given biological information and conditions are associated is generated and the biological information is input to the classifier and thereby dimensional compression is performed and condition allocation is performed. Condition allocation in which the biological information is input to a neural network with a given parameter and a vector value or a scholar value obtained by performing dimensional compression is performed serves as a condition may be performed. Note that, because classification by the aforementioned classifier enables compression of dimensions of the biological information, the condition allocation can be also referred to as generating condition information by performing a process of compressing the dimensions of the biological information using a table conversion or machine learning approach (referred to as “dimensional compression process” below).
As for the support vector machine, for example, combining multiple support vector machines builds a support vector machine model for multiclass classification and inputting learning data (the biological information) to the model generates a classifier. As for the neural network, building a multilayer neural network, inputting learning data (data of combination of the biological information and conditions corresponding to the biological information), and adjusting the parameter of the multiplayer neural network generate a classifier. The biological information processing apparatus may perform condition allocation using an artificial intelligence (AI) as the classifier. Dimensional compression using machine learning will be described in detail below.
A specific example of the dimensional compression process performed in condition allocation will be described. For example, when blood pressure information on the subject (for example, numeric data of the systolic blood pressure and the diastolic blood pressure) is input to the classifier described above and the classifier accordingly determines that the subject is “Type 1: Optimum blood pressure” from among “Type 1: Optimum blood pressure”, “Type 2: Normal blood pressure” and “Type 3: High blood pressure”, the biological information processing apparatus may output “1”, thereby implementing the dimensional compression process. For example, when acceleration information on the subject (for example, numeric data of acceleration within a given duration) is input to the classifier described above and accordingly it is determined that the lifestyle habits of the subject is “Type 3: Evening person” from among “Type 1: Morning person”, “Type 2: Standard” and “Type 3: Evening person”, the biological information processing apparatus may output “3”, thereby implementing the dimensional compression process. By using more types of biological information, the biological information processing apparatus is able to improve accuracy of condition allocation and increase possible conditions for allocation. For example, by further using not only the blood pressure information and acceleration information but also heart rate information and blood glucose level information, the biological information processing apparatus is able to increase accuracy of condition allocation and the number of possible conditions for allocation.
When performing classification using the above-described classifier, the biological information processing apparatus may convert biological information into a form that is easier to classify. For example, the biological information processing apparatus may perform dimensional compression by performing given approximation processing on the biological information to simplify the biological information (output an approximation model), thereby converting the biological information into a form that is easy to classify. For example, as illustrated in
When performing classification using the above-described classifier, the biological information processing apparatus is able to perform classification using a different classifier depending on the information contained in the biological information. For example, the biological information processing apparatus may classify blood pressure information using the table as a classifier and classify acceleration information using the support vector machine approach as a classifier. The biological information processing apparatus may perform classification using a combination of multiple classifiers. For example, after classifying blood pressure information using the table as a classifier, the biological information processing apparatus may further perform classification using the support vector machine approach as a classifier.
After performing classification using different classifiers depending on the types of biological information, the biological information processing apparatus may further perform a dimensional compression process on each of the results of classification, thereby performing condition allocation. For example, the biological information processing apparatus generates a matrix {A, B, C, . . . } by arranging character strings representing respective results of classification of multiple types of biological information: a matrix A representing a result of condition allocation of the blood pressure information, a matrix B representing a result of condition allocation of the acceleration information, a matrix C representing a result of condition allocation of the heart rate information, . . . . Based on the matrix, the biological information processing apparatus performs condition allocation by performing the dimensional compression process and generates information representing the condition of the owner of the biological information (condition information). In other words, the condition is represented as a condition vector (matrix). The physical information and environment information are also represented by condition vectors (matrices).
Physical feature values are enormous and thus the biological information processing apparatus preferably performs the process of dimensional compression to dimensions by which the features of the biological condition of the subject can be identified properly (a degree at which the features of the biological condition of the subject can be identified is referred to as “identification efficiency” below). With reference to
The biological information processing apparatus may perform the dimensional compression process based on medical knowledge and perform condition allocation. Each of disorders, such as cancers and atopic skin, is evaluated using eight feature values (immunogram) that are extracted based on medical knowledge from the feature values of enormous biological information. The biological information processing apparatus is able to perform dimensional compression by extracting a feature value associated with each disease that is specified based on medical knowledge from the input biological information. For example, atopic skin is known as being evaluable based on clinical trials using the eight feature values of the biological information representing a cutaneous barrier function, immunoregulation, and bacterial flora. Thus, an input of “atopic skin” that is diagnostic information triggers the biological information processing apparatus to extract the eight feature values on atopic skin from the input biological information, perform the dimensional compression process, thereby generate condition information. After the dimensional compression process based on the medical knowledge, the biological information processing apparatus may further perform dimensional compression using the above-described classifier, thereby generating condition information. The dimensional compression method makes it possible to, for various disorders, perform layering that has clinical meaning medically.
The biological information processing apparatus performs the dimensional compression process using the above-described classifier, thereby generating an encoded physical code or an encoded environment code. The biological information processing apparatus performs the dimensional compression process using a classifier on at least any one of the physical code and the environment code, thereby generating an encoded condition code. The biological information processing apparatus further incorporates the encoded information in the condition information. It is preferable that the dimensional compression process makes the data size of the condition information be equal to or smaller than the block capacity of a block chain.
With reference to
As described above, the “condition information” refers to information representing biological condition of a subject (covering a subject). With reference to
The “standard code” is information representing a method that is used to generate the condition information based on the biological information (note that, in other words, the standard code can be referred to as a method code representing the method used to generate the condition information). In
The “physical code” is information that is encoded by performing the dimensional compression process on the physical information. Encoding the physical information by the dimensional compression process makes it more difficult to specify the individual.
In the example in
The “environment code” in
In the example in
The “condition code” in
The “error detection checksum” is information that is used to detect an error in the condition information (for example, falsification of the condition information or corruption of the condition information due to some kind of cause). More specifically, the biological information processing apparatus that has acquired the condition information performs a given operation (for example, calculation of a hash value) on part of the condition information (more specifically, the part of the condition information excluding the error detection checksum) and confirms that no error is contained in the condition information based on matching between the result of operation and the error detection checksum. Incorporating the error detection checksum in the condition information prevents incorrect condition information from being used for processing.
For example, when an approach of machine learning, such as a support vector machine or a neural network, is used for the dimensional compression process, the biological information is converted into a position in a vector space. Thus, as illustrated in
The condition information is not limited to the example illustrated in
When generating condition information based on biological information, the biological information processing apparatus outputs accompanying information together. The “accompanying information” refers to information that is generated based on information that is not used to generate the condition information among the biological information. With reference to
As illustrated in
Like the standard code of the condition information described above, the “standard code” is information representing a method that is used to generate the accompanying information based on the biological information. In
The “generator ID” is information representing a generator that has generated the condition information and the accompanying information are generated. For example, the generator ID is a data string “ir9wro” of Data string No (2) in
The “management ID” is information that is used to manage the condition information and the accompanying information. For example, the generator ID is a data string “3ni89” of Data string number (3) representing an “electronic health record number” and is information that is generated by performing the given conversion process (for example, a coding process or a hashing process) on the electronic health record number (before conversion) that is set in the electronic health record that is used to generate the condition information and the accompanying information. Incorporating the management ID in the accompanying information makes it possible to specify the electronic health record that is used to generate the condition information and the accompanying information, etc.
The “personal ID” is information representing the subject of the condition information and the accompanying information. For example, the personal ID is a data string “usofna1wor7po” of Data string number (4) representing a “patient number” and is information that is generated by performing the given conversion process (for example, a coding process or a hashing process) on the patient number (before conversion) that is set in the patient who is the subject of the condition information and the accompanying information. Incorporating the personal ID in the accompanying information allows the setting that generates the condition information and the accompanying information to specify the subject of the condition information and the accompanying information.
The “sensing device information” is information representing sensing device information that is used to generate the condition information (or the type of the sensing device). For example, the sensing device information is a data string “3mnrtj0eikpf4dis0uf203pojmfioe8hfj” of Data string number (5) representing “identification information of the sensing device that is used by the patient” and is information that is generated by performing the given conversion process (for example, a coding process or a hashing process) on the identification information (before conversion) that is set previously in the sensing device that is used by the patient. Incorporating the sensing device information in the accompanying information, for example, allows analysis on the information of each sensor contained in the condition information.
Like the error detection checksum of the above-described condition information, the “error detection checksum” is information that is used to detect an error in the accompanying information (for example, falsification of the accompanying information or corruption of the accompanying information due to some kind of cause). Incorporating the error detection checksum in the accompanying information prevents incorrect accompanying information from being used for processing.
The accompanying information is not limited to the example illustrated in
With reference to
First of all, at step S1000, the biological information processing apparatus classifies electronic health record information into personal information and non-personal information. The “personal information” is information only by which the subject can be specified (or information by which the subject is highly likely to be specified) and contains, for example, attribute information on the subject, insurance information, a hospital visit history, or an X-ray image. The “non-personal information” is information only by which the subject cannot be specified (information by which the subject is not likely to be specified) and contains, for example, a subjective symptom, a medical interview symptom, or a diagnostic result.
At step S1004, the biological information processing apparatus converts part of the personal information into non-personal information (“non-personal process” below). The “non-personal process” refers to converting information into information by which the subject is less likely to be specified by, for example, converting information “Age: 25” contained in the attribute information into information “Age group: twenties” or converting information “Place of employment: ABC Inc.” into information “Occupation: Company Employee”. The non-personal information is converted into a condition code by the following process and, because information, such as the aforementioned age group and occupation, can also be a factor affecting the biological condition of the subject, the biological information processing apparatus adds such information to the non-personal information by performing the non-personal process. The non-personal process may be performed on any information as long as the information can be a factor that affects the biological condition of the subject.
At step S1008, the biological information processing apparatus stores the personal information in a storage in the hospital and issues a personal ID (for example, a patient number) and a management ID (for example, an electronic health record number).
At step S1012, the biological information processing apparatus generates a personal ID (after conversion) and a management ID (after conversion) by performing a given conversion process (for example, a coding process or a hashing process) on a personal ID (before conversion) and a management ID (before conversion) and generates accompanying information containing the personal ID (after conversion) and the management ID (after conversion) (note that description of the conversion process on the generator ID and the sensing device information illustrated in
At step S1016, the biological information processing apparatus generates condition information by performing condition allocation on the non-personal information by the dimensional compression process. Through the above-described process, condition information and accompanying information are generated. Thereafter, at step S1020, the biological information processing apparats performs a process of registering the condition information and the accompanying information in the P2P database, where transaction data is generated using the condition information and the accompanying information. A specific example of the process of registration in the P2P database will be described below.
2.2. Chronological Analysis on Condition Information
The condition allocation performed by the biological information processing apparatus according to the embodiment has been described above. Subsequently, chronological analysis on the condition information will be described.
When registering the condition information that is generated as described above (and the accompanying information) in the P2P database, the biological information processing apparatus determines whether the condition information is effective and, when it is determined that the condition information is effective, registers the condition information in the P2P database.
The biological condition of the subject is expressed by consecution of changes that are irreversible and occurring in the living body (referred to as “irreversible changes” below). For example, a human undergoes irreversible changes from the moment of fertilization to the birth, growth, and aging and then dies. The onset of a disorder also develops from the pre-stage of the onset affected by a potential change to the onset, a disorder of a specific physiological function, a loss of the specific physiological function, a physical disorder, and the death. Thus, in order to properly express the biological condition of the subject, it is more important to identify the irreversible changes.
The biological information (the physical information and environment information) from which the condition information originates is information containing changes other than irreversible changes. More specifically, as illustrated in
The “rhythm component” is mainly information based on the 24-hour circadian rhythm and is a component that varies in a given rhythm regardless whether an irreversible change occurs.
The “stimuli-responsive component” is information representing a direct output to an input (stimulus) (response to a stimulus) when the input is made to a living body. For example, when an input that is medication is made to a living body, an effect of the drug appears as a stimuli-responsive component.
The “baseline component” is information that is left after the rhythm component and the stimuli-responsive component are removed from the biological information and is information that irreversibly changes because of some kind of an input (stimulus) that is made to the living body or changes over time (the baseline component is information representing irreversible changes in time-series data). In other words, capturing a change in the baseline component makes it possible to capture an irreversible change. The irreversible change in the biological condition is, for example, an irreversible change over time in genome information and epigenome information in a molecular mechanism, in other words, an irreversible change in chromosome because of genetic modification and epigenetic modification. Discretizing the change in the biological condition over time based on the baseline component makes it possible to express the irreversible change.
Patent Literature 2 above discloses a method of dividing time-series data into a rhythm component, a stimuli-responsive component, and a baseline component. More specifically, Patent Literature 2 above discloses a method of dissolving time-series data on expression of molecules produced in a living body into a rhythm component (a periodic component in Patent Literature 2) using a season adjustment model, into a stimuli-responsive component (an environmental stimuli-responsive component in Patent Literature 2) using a multilinear model, and into a baseline component using a polynomial smoothing spline model.
Using the method described in Patent literature 2, or the like, the biological information processing apparatus performs extraction of a baseline component from the time-series data in which biological information is arranged chronologically as one type of the dimensional compression process and, based on variation in the baseline component, controls registration of the condition information on the subject in the P2P database. More specifically, when it is confirmed that the baseline component varies significantly with respect to a given threshold from the time of previous registration of the condition information on the subject in the P2P database, the biological information processing apparatus determines to register the condition information in the P2P database. Accordingly, the biological information processing apparatus is able to register the more effective condition information in the P2P database. In other words, even data is in an enormous amount like genome information or epigenome information, the biological information processing apparatus is able to compress the data volume into an analyzable form by discretization based on changes in the baseline component and register the data in the P2P database.
By performing chronological analysis on the condition information on the subject, the biological information processing apparatus is able to predict a condition of the subject at a certain future time and make an appropriate proposal based on the result of the prediction.
More specific description will be given. First of all, the biological information processing apparatus acquires condition information on a subject from the P2P database. The biological information processing apparatus then analyzes the condition information, thereby recognizing the condition of the subject. The biological information processing apparatus may acquire multiple sets of condition information that are generated within a given period and analyze the sets of condition information, thereby recognizing a pattern of transition of the condition of the subject (referred to as a “condition transition pattern” below) during the period.
Thereafter, the biological information processing apparatus searches the P2P database for another subject (referred to as a “similar subject” below) who had a condition in the past (or a condition transition pattern) similar to the condition (or the condition transition pattern) of the subject (in other words, the biological information processing apparatus compares the condition information on the subject and condition information on another subject (another set of condition information) and thus extracts a similar subject) with each other). It is more preferable that a similar subject with a condition transition pattern similar to that of the subject during a period as long as possible be found. The number of similar subjects is not particularly limited.
When a similar subject can be found, the biological information processing apparatus acquires condition information on the similar subject at and after the time when the similar subject was similar to the subject in condition (or condition transition pattern) from the P2P database and analyzes the condition information, thereby recognizing the following transition pattern of the similar subject. Accordingly, the biological information processing apparatus is able to predict a condition of the subject at a certain future time.
The biological information processing apparatus is able to notify the subject of the result of prediction of a condition of the subject at a certain future time. It is preferable that the biological information processing apparatus convert the meaning of the condition such that the subject can understand. More specific description will be given. The biological condition of the subject is converted into the condition code, and the subject is unable to recognize his/her biological condition by recognizing only the condition code. The biological information processing apparatus thus converts the condition code such that the subject can recognize the biological condition. For example, the biological information processing apparatus performs inverse conversion the condition code using a given table or verbalize the condition represented by a position in a vector space (for example, when the position of the subject in the vector space is close to positions of many patients with back problems, the biological information processing apparatus notifies that the subject in a condition with a back problem).
When it is predicted that the future condition of the subject is not preferable, the biological information processing apparatus is able to represent a method for making the condition of the subject preferable based on the condition transition pattern of the similar subject. More specifically, when there is a condition positioned in a vector space where the condition is considered as unhealthy among conditions in which the subject will be highly likely to be n years later, the biological information processing apparatus makes a comparison with a condition positioned in a vector space where the condition is considered as healthy and proposes a method to be in a healthy condition. For example, the biological information processing apparatus calculates how to update the environment information to achieve a healthy condition and represents information necessary for the update (for example, an amount of exercise and content of medication). When the condition in which the subject will be most likely to be after n years is positioned in a vector space where a condition is considered unhealthy, the biological information processing apparatus may calculate how the condition of the subject will change if given environment information (for example, an amount of exercise and content of medication) is changed and specify and represent environment information for shifting the condition to a vector space where the condition is considered to be healthy. In other words, the biological information processing apparatus may specify a parameter having a point in being changed (a parameter that contributes to shifting the condition to a vector space where the condition is considered as healthy) and present a method of changing the parameter (such as a method that is derived from medical knowledge (for example, prescribing Drug A when the uric acid level is high)). The “vector space where the condition is considered as healthy” is a vector space where conditions of healthy people are assembled.
An example of the chronological analysis on the condition transition will be described in detail using
Furthermore, it can be considered that xtn is a function f(xtn) where the time t is a variable. When it is possible to calculate the function f(xtn), it is possible to estimate a condition xtn of the subject at a future time tn. In the real world, however, parameters are enormous and it is practically impossible to calculate the function f(xtn). Thus, the biological information processing apparatus generates a machine learning model obtained by making a parameter adjustment using changes in condition information (or environment information or physical information) on a subject as learning data and inputs the changes in the condition information on the subject until the current time, thereby statistically estimating a condition of the subject at a future time tn. At a time t1, a probability that a condition xt1 will turn into a condition xt2 at a time t1 is represented as z1 and a probability that the condition xt1 will turn into a condition xt3 is represented by z3. Furthermore, a probability that the condition xt2 will turn into a condition xt3 at a time t2 is represented by z2. Such a condition change model, that is, a model that determines the condition of the current time probabilistically depending on the chronologically previous condition (condition transition probability) is referred to as a hidden Markov model (or a multilayer hidden Markov model). It is more preferable that the above-described estimation of a condition transition probability be performed using an algorithm by which the hidden Markov model is performed easily, for example, a machine learning algorithm using an RNN (Recurrent Neural Network). Not referring to all parameters in the real world but referring to changes probabilistically (statistically) as described above makes it possible to estimate a future condition.
2.3. P2P Database
Chronological analysis on the condition information has been described above. Condition information that is generated by the biological information processing apparatus is registered in the P2P database and is managed. Subsequently, an overview of the P2P database will be described.
The biological information processing system according to the embodiment uses a distributed P2P database that is distributed in the P2P network. A P2P network is sometimes referred to as a P2P distributed file system. As an example of the P2P database, a blockchain that is distributed in the P2P network is taken. With reference to
As illustrated in
As the block chain, for example, a block chain that is used to trade data of virtual currency, such as Bitcoin, is taken. In a block chain that is used to trade virtual currency data, for example, a hash value of a previous block and a value referred to as nonce are contained. The hash value of the previous block is information that is used to determine whether “it is a correct block” that is continuous correctly from the previous block. A nonce is information that is used to prevent spoofing in authentication using a hash value and using a nonce prevents falsification. As a nonce, for example, a character string, a number string, or data representing a combination thereof is taken.
In a blockchain, assigning a digital signature using an encryption key to each set of transaction data prevents spoofing. Each set of transaction data is open and is shared over the P2P network. Each set of transaction data may be encrypted using an encryption key.
In the blockchain system, for example, using a sidechain technique makes it possible to incorporate other subject data different from virtual currency in an existing blockchain that is used to trade data of virtual currency, such as a Bitcoin blockchain.
As described above, the biological information processing system according to the embodiment uses the distributed P2P database that is distributed in the P2P network; however, note that a distributed network in which a distributed processing is performed by multiple biological information processing apparatuses may be used. The distributed network may be a network including a cloud server that is accessible by, for example, only an authorized user and a biological information processing system in which the above-described condition information is recorded in a storage of the cloud server that is associated with the ID of each user and the condition information is browsed only by an ID that is authorized by each user to access may be built.
2.4. Example of System Configuration
The overview of the P2P database has been described. Subsequently, with reference to
As illustrated in
Application Backend 100
The application backend 100 is a biological information processing apparatus that is mainly used by a doctor who makes a diagnosis on, administers treatment on, or performs an operation on the patient.
More specially, the application backend 100 accesses a storage in the hospital, a storage of the biological information processing apparatus, or an external server (for example, a cloud server) and thus acquires biological information (physical information or environment information). The application backend 100 generates time-series data by arranging the biological information chronologically and extracts a baseline component from the time-series data using the method disclosed in Patent Literature 2 above, or the like.
Based on variation in the baseline component, the application backend 100 then determines whether to register condition information in the P2P database. When it is determined to register condition information in the P2P database, the application backend 100 performs a given conversion process (a dimensional compression process, an encryption process, or a hashing process) on the biological information, thereby generating condition information and accompanying information.
Thereafter, the application backend 100 provides the condition information and the accompanying information to the external server 300. Accordingly, the external server 300 is able to generate transaction data using these sets of information and register the transaction data in the P2P database.
The above-described process content of the application backend 100 is changeable as appropriate. The type of the apparatus that embodies the application backend 100 is not particularly limited. For example, the application backend 100 can be embodied with a freely-selected device covering a PC (Personal Computer), a tablet PC, or a smartphone.
Internal Server 200 and External Server 300
The internal server 200 is a biological information processing apparatus that is connected to the P2P network 500 and that has shared data (covering a P2P database). The internal server 200 generates transaction data using the condition information and the accompanying information that are provided from the application backend 100. The internal server 200 temporarily stores the transaction data in the shared data, thereby sharing the transaction data with the external server 300.
The external server 300 has the same function as that of the internal server 200 and generates transaction data using the condition information and the accompanying information that are generated by an application backend (not illustrated in
The internal server 200 and the external server 300 update the P2P database that each device includes while cooperating with each other and thus maintaining consistency (performing the process is referred to as “forming a consensus” below).
The internal server and the external server 300 are capable of performing not only the process of registering transaction data in the P2P database but also a process of acquiring the transaction data from the P2P database.
When the internal server 200 and the external server 300 access the P2P database (in other words, when registration or acquisition of transaction data are performed), the internal server 200 and the external server 300 basically use a given program that is provided in the P2P databased and that is executed in the P2P database (referred to as a “P2P database program” below). Using the P2P database program, for example, realizes various processes including trading a virtual currency, such as Bitcoin, according to given rules. Providing the P2P database program in the P2P database reduces a risk that the program is fraudulently modified.
The P2P database program is a tune code in Hyperledger; however, the P2P database program is not limited thereto. For example, the P2P database program may refer to a smart contract. The internal server 200 and the external server 300 may properly realize access to the P2P database properly using a program other than the P2P database program.
In the embodiment, description will be given, providing that the internal server 200 and the external server 300 have the same function, but the internal server 200 and the external server 300 may have different functions. For example, a device that approves registration of transaction data in the P2P database (for example, an endorsing peer), a device that gives an instruction for registration to each device after approval (for example, an ordering peer), or a device that registers transaction data in the P2P database (for example, a committing peer) may be provided and the internal server 200 and the external server 300 may share and implement the functions of the devices.
The process content of the internal server 200 and the external server 300 describe above can be changed as appropriate. The type of devices that embody the internal server 200 and the external server 300 is not particularly limited. For example, the internal server 200 and the external server 300 are embodied by a freely-selected device covering a general-purpose computer, a PC, a tablet PC or a smartphone.
P2P Network 500
The P2P network 500 is a network in which the P2P database is distributed. As described above, the internal server 200 and the external server 300 are able to form a consensus with another device by connecting to the P2P network 500.
Note that the embodiment provides that the P2P network 500 is a network of a consortium system that is run by multiple organizations; however, the type of the P2P network 500 is not limited thereto. For example, the P2P network 500 may be a network of a private system that is run by only a single origination or a network of a public system that does not particularly restrict participants.
The communication system that is used for the P2P network 500 or the type of line is not particularly limited. For example, the P2P network 500 may be implemented using a dedicated network, such as an IP-VPN (Internet Protocol-Virtual Private Network). The P2P network 500 may be implemented using a public network, such as a telephone network or a satellite network. The P2P network 500 may be implemented using various types of LAN (Local Area Network) and WAN (Wide Area Network) including Ethernet (trademark). The P2P network 500 may be implemented using a wireless communication network, such as Wi-Fi (trademark) or Bluetooth (trademark).
Internal Network 400
The internal network 400 is a network that connects the application backend 100 and the internal server 200. Like the P2P network 500, the communication system that is used for the internal network 400 or the type of line is not particularly limited.
The example of the configuration of the biological information processing system according to the embodiment has been described. Note that the configuration described above with reference to
2.5. Example of Functional Configuration of Each Device
The example of the functional configuration of the biological information processing system according to the embodiment has been described. Subsequently, an example of a functional configuration of each device will be described.
2.5.1. Example of Functional Configuration of Application Backend 100
An example of a functional configuration of the application backend 100 will be described.
As illustrated in
Processor 110
The processor 110 is a functional configuration that implements the general process of the application backend 100. For example, an operation input made by a doctor triggers the processor 110 to start a process on registering condition information in the P2P database and, after generating condition information and accompanying information, the processor 110 provides these sets of information to the internal server 200. An operation input made by a doctor triggers the processor 110 to start a process on making a proposal to a patient. The triggers to start these processes are not particularly limited. The content of the processes implemented by the processor 110 is not limited thereto. For example, the processor 110 may implement a process that is generally performed in a PC, a tablet PC or a smartphone (for example, a process performed by an OS (Operating System)). As illustrated in
Biological Information Acquisition Unit 111
A biological information acquisition unit 111 is a functional configuration that acquires at least any one of physical information and environment information that is biological information on the subject. As described above, the physical information covers anthropometric information, diagnostic information, treatment information, or operation information and the environment information covers information on lifestyle habits of the subject, medication information, or information acquired by a wearable terminal device that is worn by the subject, and the biological information acquisition unit 111 acquires these sets of information by accessing the storage in the hospital, the storage unit 120 that the application backend 100 includes or an external server (for example, a cloud server). For example, when the biological information is managed using a persona ID, or the like, the biological information acquisition unit 111 searches the storage in the hospital, or the like, for the biological information on the subject using the personal ID, or the like, and acquires the biological information. When the biological information is registered in the P2P database, the biological information acquisition unit 111 may acquiring the biological information by accessing the P2P database via the internal server 200. The biological information acquisition unit 111 provides the acquired biological information to the condition information generator 112, the accompanying information generator 113, and the registration determination unit 114.
Condition Information Generator 112
The condition information generator 112 is a functional configuration that generates condition information representing a biological condition of the subject based on the biological information that is provided from the biological information acquisition unit 111. As described with reference to
Accompanying Information Generator 113
The accompanying information generator 113 is a functional configuration that generates accompanying information when condition information is generated based on the biological information that is provided from the biological information acquisition unit 111. As described with reference to
Registration Determination Unit 114
The registration determination unit 114 is a functional configuration that determines whether to register condition information in the P2P database by determining whether the condition information is effective based on the biological information that is provided from the biological information acquisition unit 111. More specifically, the registration determination unit 114 generates time-series data by chronologically arranging the biological information that is provided from the biological information acquisition unit 111 and, using the method described in Patent Literature 2, or the like, (for example, the polynomial smoothing spline model) extracts the baseline component from the time-series data.
When it is confirmed that the baseline component largely varies with respect to the given threshold from the time of previous registration of the condition information on the subject in the P2P database, the registration determination unit 114 determines to register the condition information in the P2P database. The method of storing the baseline component at the time of previous registration of the condition information on the subject in the P2P database is not particularly limited. For example, when the condition information on the subject is registered in the P2P database previously, the baseline component may be stored in the storage in the hospital or may be registered in the P2P database (when the baseline component is registered in the P2P database, the registration determination unit 114 acquires the baseline component from the P2P database via the internal server 200).
Proposal Unit 115
The proposal unit 115 is a functional configuration that performs chronological analysis on the condition information on the subject, thus predicts a condition of the subject at a future time, and makes a proposal based on the result of the prediction. More specifically, the proposal unit 115 acquires the condition information on the subject from the P2P database via the internal server 200. The proposal unit 115 then recognizes the condition of the subject by analyzing the condition information. The proposal unit 115 may acquire multiple sets of condition information that are generated during a certain period and analyze these sets of condition information, thereby recognizing a condition transition pattern of the subject during the period.
The proposal unit 115 then searches the P2P database for a similar subject who had in the past a condition (or a condition transition pattern) similar to the condition of the subject (or the condition transition pattern). When a similar subject can be found, the proposal unit 115 acquires the condition information on the similar subject at and after the time when the similar subject was similar to the subject in condition (or condition transition pattern) from the P2P database and analyzes the condition information on the similar subject, thereby recognizing the following condition transition pattern of the similar subject. Accordingly, the proposal unit 15 is able to predict a condition of the subject at a future time (n years later). The proposal unit 115 presents the result of the prediction of the condition of the subject n years later to the subject.
When it is predicted that the future condition of the subject is not preferable, the proposal unit 115 is able to present a method of making the condition of the subject preferable based on the condition transition pattern of the similar subject. More specifically, when there is a condition positioned in a vector space where the condition is considered as unhealthy among conditions in which the subject will be highly likely to be n years later, the proposal unit 115 makes a comparison with a condition positioned in a vector space where the condition is considered as healthy, thereby proposing a method to be in a healthy condition. For example, the proposal unit 115 calculates how to update the environment information in order to be in a healthy condition and proposes information necessary for the update (for example, an amount of exercise and content of medication) to the subject. When a condition in which the subject will be most likely to be in n years later is positioned in a vector space where the condition is considered unhealthy, the proposal unit 115 may calculate how the condition of the subject will change if the given environment information (for example, the amount of exercise and the content of medication) is changed, specify environment information for shifting the condition to a vector space where the condition is considered healthy, and represent the environment information to the subject.
The method of making a proposal by the proposal unit 115 is not limited the above-described method. The proposal unit 115 may make a proposal using the given machine learning approach or an artificial intelligence (AI). For example, the proposal unit 115 may adjust the parameter that is used for the process through leaning a considerable number of proposal processes using the above-described approach (for example, generating a classifier formed of a multilayer neural network where the parameter is adjusted using learning data in which transition patterns and proposal content are associated), thereby improving accuracy of proposal.
The method of proposing the content of a proposal to the subject is not particularly limited. For example, the proposal unit 115 may present the content of a proposal to the subject by controlling the output unit 150 and thus displaying the content of the proposal on a display or outputting the content of proposal by sound from a speaker.
Storage Unit 120
The storage unit 120 is a functional configuration that stores various types of information. For example, the storage unit 120 stores information that is used for various processes performed by the processor 110 and information that is generated by the various processes (for example, the biological information covering physical information and environment information, the condition information, the accompanying information, the personal ID, or the management ID). The storage unit 120 stores programs or parameters that are used for the processes performed by the respective functional configurations. The information that is stored in the storage unit 120 is not limited to them.
Communication Unit 130
The communication unit 130 is a functional configuration that communicates with external devices. For example, the communication unit 130 transmits the condition information that is generated by the condition information generator 112, the accompanying information that is generated by the accompanying information generator 113, etc., to the internal server 200 and receives, from the internal server 200, various types of data that are acquired by the internal server 200 from the P2P database. The content of information that the communication unit 130 communicates is not limited to them.
Input Unit 140
The input unit 140 acquires an input made by a doctor. For example, the input unit 140 has an input mechanism, such as a touch panel, a keyboard, a mouse or a button, and, when the doctor performs various operations on the input mechanism, the input unit 140 generates input information based on the operations and provides input information to the processor 110. The input mechanism that the input unit 140 includes and the content of input are not particularly limited.
Output Unit 150
The output unit 150 controls various outputs. For example, the output unit 150 includes an output mechanism, such as a display, a speaker, or a lamp, and displays various types of information on a display according to the result of processes performed by the processor 110 or outputs various types of sound using a speaker. The output mechanism that the output unit 150 includes and the content of output are not particularly limited.
The example of the functional configuration of the application backend 100 has been described. Note that the functional configuration described above using
2.5.2. Example of Functional Configuration of Internal Server 200
With reference to
As illustrated in
Processor 210
The processor 210 is a functional configuration that implements the general process performed by the internal server 200. For example, the processor 210 controls the start and end of the process of registering the condition information and the accompanying information that are provided from the application backend 100 in the P2P database. The content of the process that is implemented by the processor 210 is not limited thereto. For example, the processor 210 may implement a process that is generally performed by various servers, a PC, a tablet PC or a smartphone (for example, a process performed by an OS). As illustrated in
Acquisition Unit 211
The acquisition unit 211 is a functional configuration that acquires various types of information. For example, the acquisition unit 211 acquires the condition information, the accompanying information, etc., from the application backend 100 via the communication unit 230. The acquisition unit 211 is also able to acquire various types of information (for example, the condition information on the subject) from the P2P database. Note that the information acquired by the acquisition unit 211 is not limited to them.
Transaction Generator 212
The transaction generator 212 is a functional configuration that generates transaction data to be registered in the P2P database. More specifically, when the condition information and the accompanying information that are provided from the application backend 100 are acquired by the acquisition unit 211, the transaction generator 212 generates transaction data containing these sets of information.
With reference to
“The digital signature using the secret key of the internal server 200” is information that is generated using the secret key of the internal server 200 that generates the transaction data and is information that is used to detect spoofing. “The digital signature using the secret key of the internal server 200” may be replaced with a digital signature that is generated using a secret key that a hospital or a doctor who has generated transaction data other than the internal server 200 holds.
“The public key of the internal server 200” is information that makes it possible to decode the digital signature. Incorporating “the public key of the internal server 200” in the transaction data makes it possible to verify whether spoofing is performed based on the result of decoding the digital signature.
“The address of the internal server 200” is information that can identify the internal server 200 that has generated transaction data. Incorporating “the address of the internal server 200” in the transaction data makes it possible to identify the generator of the transaction data. “The address of the internal server 200” may be replaced with “the public key of the internal server 200” or “the generator ID” illustrated in
“The address of the receiver” is information that is registered when there is a receiver that receives the transaction data (or the condition information). Incorporating “the address of the receiver” in the transaction data makes it possible to identify the receiver of the transaction data.
“The hash value of the previous transaction data” is the hash value of the transaction data at the time of previous registration of the condition information on the subject in the P2P database. Incorporating “the hash value of the previous transaction data” in the transaction data represents connection between sets of transaction data on the same subject (in other words, a change in the condition information on the same subject is represented).
“The condition information” is information that has been described with reference to
“The accompanying information” is information that has been described with reference to
“The accompanying information (the hash value of the attribute information on the subject)” is a hash value of the attribute information on the subject that is used to generate the accompanying information (for example, a name, a birth date, an age, gender, a blood type, an address, a phone number, or a place of employment). In order to specify the subject of the condition information based on “the accompanying information”, it is necessary to acquire the personal ID and the management ID that are stored in the storage in the hospital, or the like. On the other hand, incorporating “the accompanying information (the hash value of the attribute information on the subject)” in the transaction data makes it possible to specify the subject of the condition information when the given attribute information can be acquired.
“The version information” is information representing a version of the system (or software) that was used to generate the transaction data (or the condition information). Incorporating “the version information” in the transaction data allows the biological information processing apparatus that has acquired the transaction data to appropriately perform various types of processing using the transaction data.
The content of the transaction data that is generated by the transaction generator 212 is not limited to the above-described content. For example, the transaction generator 212 may generate transaction data by omitting the information described above or adding information that is not described above.
Consensus Formation Unit 213
The consensus formation unit 213 is a functional configuration that performs a process on forming a consensus with the external server 300 (referred to as a “consensus formation process” below) and thus registers the transaction data that is generated by the transaction generator 212 in the P2P database (in other words, the consensus formation unit 213 functions as a register that registers the condition information in the P2P database). The content of the consensus formation process that is performed by the consensus formation unit 213 is not particularly limited. For example, when the P2P database is a blockchain, the consensus formation unit 213 is able to perform the consensus formation process using a consensus algorithm that is known for blockchain techniques. For example, the consensus formation unit 213 performs the consensus formation process using PBFT (Practical Byzantine Fault Tolerance) and thus is able to store transaction data in a new block and register the block in the blockchain. The consensus formation unit 213 may perform the consensus formation process using another consensus algorithm, such as Proof of Work, Proof of Stake, Paxos, Raft, or Sieve.
Storage 220
The storage 220 is a functional configuration that stores various types of information. For example, the storage 220 stores programs or parameters that are used by the respective functional configurations of the internal server 200. The content of information that the storage 220 stores is not limited thereto. As illustrated in
Shared Data 221
The shared data 221 is a set of data that is shared between biological information processing apparatuses that are connected to the P2P network 500. Each of the biological information processing apparatuses acquires the shared data 221 via the P2P network 500 and updates the shared data 221 while maintaining consistency with the shared data that other biological information processing apparatuses store. As illustrated in
Transaction Storage 222
The transaction storage 222 is a functional configuration that stores transaction data that is not registered in the P2P database 223. The transaction storage 222 registers transaction data that is generated by the transaction generator 212 and transaction data that is generated by the external server 300 and is shared via the P2P network 500. The transaction data that is registered in the transaction storage 222 is basically the same as the transaction data that is registered in the external server 300.
P2P Database 223
The P2P database 223 is a database that is stored in the internal server 200 and is, for example, a blockchain. As described above, transaction data containing the condition information and the accompanying information on the subject is registered in the P2P database 223. The data that is registered in the P2P database 223 is not limited thereto. For example, when transaction data is registered in the P2P database 223 or when a charge is made when the transaction data is acquired from the P2P database 223, data on assets of the subject (for example, coins of Bitcoin) may be registered in the P2P database 223. In the P2P database 223, the above-described P2P database program may be registered. The development language of the P2P database program or the number of P2P database programs that are formed in the P2P database 223 is not particularly limited.
Communication Unit 230
The communication unit 230 is a functional configuration that communicates with external devices. For example, the communication unit 230 receives the condition information and the accompanying information from the application backend 100 or transmits various types of data that are acquired by the acquisition unit 211 from the P2P database 223 to the application backend 100. The communication unit 230 transmits and receives various types of information used for the consensus formation process performed by the consensus formation unit 213 to and from the external server 300. The content of information that the communication unit 230 communicates is not limited to them.
The example of the functional configuration of the internal server 200 has been described. Note that the functional configuration described above using
The external server 300 has the same functional configuration as that of the internal server 200 and thus description thereof will be omitted. Note that the external server 300 need not necessarily have the same functional configuration as that of the internal server 200 and part of the functional configuration may be omitted or the internal server 300 may have a functional configuration that that internal server 200 does not have.
2.6. Example of Flow of Process of Each Device
The example of the functional configuration of each device has been described. Subsequently, an example of a flow of a process of each device will be described.
Process of Registering Condition Information in P2P Database
First of all, with reference to
At step S1100, the registration determination unit 114 of the application backend 100 determines whether condition information is effective based on biological information, thereby determining whether to register the condition information in the P2P database 223. When the registration determination unit 114 determines to register the condition information in the P2P database 223 (step S1104/Yes), at step S1108, the condition information generator 112 generates condition information on a subject based on biological information and the accompanying information generator 113 generates accompanying information based on the biological information.
At step S1112, the transaction generator 212 of the internal server 200 generates transaction data using the condition information and the accompanying information that are provided from the application backend 100. At step S1116, the consensus formation unit 213 performs the consensus formation process using the consensus algorithm, such as PBFT, thereby registering the transaction data in the P2P database 223 and a series of steps ends.
At step S1104, when the registration determination unit 114 determines not to register the condition information in the P2P database 223 (step S1104/No), the condition information is not registered in the P2P database 223 by the process at steps S1108 to S1116 and a series of steps ends.
2.6.2. Process of Determining Whether Registering Condition Information in P2P Database is Appropriate
Subsequently, the process of determining whether to register condition information in the P2P database 223, which has been described at step S1100 in
At step S1200, the registration determination unit 114 of the application backend 100 chronologically arranges the biological information that is provided from the biological information acquisition unit 111, thereby generating time-series data. At step S1204, the registration determination unit 114 extracts a baseline component from the time-series data using the method described in Patent Literature 2 (for example, the polynomial smoothing spline model), or the like.
At step S1208, the registration determination unit 114 compares the baseline component at the time of previous registration of condition information on the subject in the P2P database 223 with the baseline component that is extracted at step S1204. When the baseline component largely varies with respect to the given threshold from the time of previous registration of condition information on the subject in the P2P database (step S1212/Yes), at step S1216, the registration determination unit 114 determines to register the condition information in the P2P database 223 and a series of steps ends. On the other hand, when the baseline component does not largely vary with respect to the given threshold from the time of the previous registration of condition information on the subject in the P2P database (step S1212/No), at step S1220, the registration determination unit 114 determines not to register the condition information in the P2P database 223 and a series of steps ends.
Process of Generating Condition Information and Accompanying Information
The process of generating condition information and accompanying information that has been described at step S1108 in
At step S1300, the condition information generator 112 and the accompanying information generator 113 of the application backend 100 classify the biological information that is provided from the biological information acquisition unit 111 into personal information and non-personal information. At step S1304, the condition information generator 112 performs the non-personal process on part of the personal information (for example, performs a process of converting information “Age: 25” into information “Age group: twenties”). At step S1308, the condition information generator 112 performs condition allocation on the non-personal information by the dimensional compression process, thereby generating condition information.
At step S1312, the accompanying information generator 113 issues a personal ID (for example, a patient number) and a management ID (for example, an electronic health record number) using the personal information. At step S1316, the accompanying information generator 113 performs the given conversion process (for example, an encryption process or a hashing process) on the personal ID (before conversion) and the management ID (before conversion), thus generates a personal ID (after conversion) and a management ID (after conversion), and then generates accompanying information containing the IDs. Accordingly, a series of steps ends.
2.6.4. Process of Making Proposal Based on Condition Information
Subsequently, with reference to
At step S1400, the proposal unit 115 of the application backend 100 acquires the condition information on the subject from the P2P database 223 via the internal server 200. At step S1404, the proposal unit 115 analyzes the condition information, thus recognizes the condition of the subject, and then searches the P2P database 223 for a similar subject who had in the past a condition (or a condition transition pattern) similar to the condition (or the condition transition pattern) of the subject.
When a similar subject is found in the P2P database 223 (step S1408/Yes), at step S1412, the proposal unit 115 acquires the condition information on the similar subject at and after the time when the similar subject was similar to the subject in condition (or in condition transition pattern) from the P2P database 223 and analyzes the condition information, thereby recognizing the following condition transition pattern of the similar subject. At step S1416, the proposal unit 115 predicts a condition transition pattern of the subject based on the following condition transition pattern of the similar subject and presents the condition transition pattern to the subject.
At step S1420, the proposal unit 115 proposes an improvement plan to improve the condition of the subject. For example, when there is a condition positioned in a vector space where the condition is considered as unhealthy among conditions in which the subject will be highly likely to be at a certain future time, the proposal unit 115 makes a comparison with a condition positioned in a vector space where the condition is considered as healthy, thereby proposing a method to be in a healthy condition. Accordingly, a series of steps ends.
When no similar subject is found in the P2P database 223 (step S1408/NO), at step S1424, the proposal unit 115 makes an output indicating that no similar subject is found and a series of steps ends.
2.7. Case of Use
The example of the flow of the process of each device has been described. Subsequently, examples of use of the biological information processing system according to the embodiment will be described.
As described above, registering the condition information on the subject in the P2P database allows devices that are able to access to the P2P network 500 to provide various services using the condition information.
It has been described using
For example, a freely-selected company X that is a solution provider (for example, a maker that manufactures products, such as drugs, or a service provider company, such as a training gym) may provide services using the condition information.
For example, at step S1500 in
At step S1512, the application accesses the P2P network 500 via a given API and extracts condition information on the subject from the P2P database 223 using the personal ID (after conversion). At step S1516, using the condition information, the application performs a proposal process like that described using
As described above, the biological information processing system according to the embodiment can be widely used by various providers that provide products and services to subjects.
Furthermore, for example, aggregating condition information that is recorded in the P2P database 223 with respect to each region or age and analyzing the condition information makes it possible to detect a degree of expansion of an infection and a regional disease. For example, condition information that is recorded in the P2P database 223 periodically is acquired and a region in which a hospital contained in a condition code and a physical code is positioned is extracted from the condition information. Furthermore, chronological statistical analysis, for example, analysis using an autoregression model or a moving average model, is performed on the extracted information. Accordingly, it is possible to analyze whether the condition code of each region has a statistical change according to the period, for example, whether the number of people in a high-temperature condition is gradually increasing or there are more people in a high-temperature condition than in other regions. Note that performing analysis per generation makes it possible to further increase accuracy.
Each step in the flowcharts in
2.8. Example of Hardware Configuration of Each Device
The example of the flow of the process of each device has been described. Subsequently, with reference to
The information processing apparatus 900 includes an MPU 901, a ROM 902, a RAM 903, a recording medium 904, an input-output interface 905, an operation input device 906, a display device 907, and a communication interface 908. The information processing apparatus 900, for example, connects each component via a bus 909 serving as a data transmission path.
The MPU 901 is formed of at least one processor that is formed of an operation circuit, such as an MPU, or various processing circuits and the MPU 901 functions as the processor 110 of the application backend 100 or the processor 210 of the internal server 200. Note that these functional configurations may be formed of a dedicated (or general-purpose) circuit (for example, a processor independent of the MPU 901) that can implement the various processes described above.
The ROM 902 stores control data, such as programs and operation parameters that are used by the MPU 901. The RAM 903 temporarily stores, for example, programs that are executed by the MPU 901, etc.
The recording medium 904 functions as the storage unit 120 of the application backend 100 or the storage 220 of the internal server 200 and stores various types of data, such as data on information processing and various programs. As the recording medium 904, for example, a magnetic recording medium, such as a hard disk, or a non-volatile memory, such as a flash memory, is taken. The recording medium 904 may be detachable from the information processing apparatus 900.
The input-output interface 905, for example, connects the operation input device 906 and the display device 907. As the input-output interface 905, for example, a USB (Universal Serial Bus) terminal, a DVI (Digital Visual Interface) terminal, a HDMI (High-Definition Multimedia Interface) (trademark) terminal, or various processing circuits are taken.
The operation input device 906 is, for example, on the information processing apparatus 900 and is connected to the input-output interface 905 in the information processing apparatus 900. As the operation input device 906, for example, a keyboard, a mouse, a keypad, a touch panel, a microphone, an operation button, a rotary selector, such as an orientation key or a jog dial, or a combination thereof is taken. The operation input device 906 functions as the input unit 140 of the application backend 100.
The display device 907 is, for example, on the information processing apparatus 900 and is connected to the input-output interface 905 in the information processing apparatus 900. As the display device 907, for example, a liquid crystal display or an organic EL (electro-luminescence) display is taken. The display device 907 functions as the output unit 150 of the application backend 100.
Needless to say, the input-output interface 905 is connectable to external devices, such as an operation input device or an external display device outside the information processing apparatus 900. The display device 907 may be, for example a device enabling display and user operations, such as a touch panel.
The communication interface 908 is a communication unit that the information processing apparatus 900 includes and the communication interface 908 functions as the communication unit 130 of the application backend 100 or the communication unit 230 of the internal server 200. The communication interface 908 may have a function of communicating with a freely-selected external device, such as a server, in a wired or wireless manner via a freely-selected network (or directly). As the communication interface 908, for example, a communication antenna and an RF (Radio Frequency) circuit (wireless communication), an IEEE802.15.1 port and a transmitter-receiver circuit (wireless communication), an IEEE802.11 port and a transmitter0receiver circuit (wireless communication), or a LAN (Local Area Network) terminal and a transmitter-receiver circuit (wired communication) are taken.
The hardware configuration of the information processing apparatus 900 is not limited to the configuration illustrated in
Subsequently, dimensional compression using the machine learning approach described above will be described in detail. The case where the dimensional compression process is realized by SAE (Stacked Auto-Encoders) that is one type of multilayer neural network will be described. The SAE is a neural network having a configuration in which neural networks refereed too as auto-encoders are stacked into a multilayer network. In SAE, based on learning data, the parameter of SAE (which means a network coefficient of each layer) is adjusted. The auto-encoder is a neural network having a configuration in which the number of neurons (the number of units) is equal between an input layer and an output layer and the number of neurons in an intermediate layer (also referred to as a hidden layer) is smaller that that of the input layer (or the output layer). Learning of the SAE is performed per auto-encoder forming the SAE and, for example, learning by backpropagation using learning data is performed to adjust the parameter.
For example, a classifier is generated by performing learning using learning data in which biological information and conditions are associated while dimensional compression and dimensional decompression is performed using 49 dimensions, 16 dimensions, three dimensions, 16 dimensions, and 49 dimensions as the numbers of dimensions of the respective layers of the SAE formed of five layers and thus making a parameter adjustment enabling proper classification. When biological information is input to the classifier, compression to three dimensions is performed and the biological information is converted into a value in a three-dimensional space vector. Classification of condition may be also performed. Using the machine learning approach thus makes it possible to generate a value in a freely-selected n-dimensional space vector obtained by dimensional compression or a result of classification. The value in the n-dimensional space vector is dealt with as a condition (or a condition code), or the result of classification is dealt with as a condition (or a condition code). Note that the number of dimensions in dimensional compression is an item to be designed and is not limited to three dimensions if classification is possible and any number of dimensions can be employed. When dimensional compression is possible, the machine learning approach is not particularly limited.
This, for example, makes it is possible to perform table conversion on information “Having atopic skin of Type A, a lot of redness on the right upper arm, BMI of 20, . . . ” to “Type 1 with tendency of causing skin inflammation, severity of reaction of Level 2, BMI of 5 representing an average, . . . ”, thereby generating a sequence vector X (1, 2, 2, . . . ). By inputting the sequence vector X into the neural network on which a parameter adjustment is made previously using learning data containing a given condition and a sequence vector, it is possible to generate a condition Y. Note that the condition Y may be a value in a n-dimensional space or may be a result of classification along a table that is prepared in advance.
For example, by inputting biological information as a1=height and a2=weight in each item, such as BMI, a matrix {a1, a2, a3, . . . an} having n-dimensional information is generated. The matrix is input to a machine learning model using a multiplayer neural network and the matrix is compressed to the two-dimensional information of a matrix {b1, b2}. The machine learning model is a machine learning model obtained by setting, in a multilayer network, a parameter by learning learning data in which n-dimensional information and two-dimensional information are associated with each other. This makes it possible to compress the n-dimensional information to two dimensions and allocate the information to the condition of the matrix {b1, b2}. The approach of expressing the biological information as a matrix of one row and n columns and performing dimensional compression into a matrix of one row and two columns has been described, and any matrix form may be taken according to the calculation model. On the same person, biological information measured for the first time may be represented by a matrix T1{a1, a2, a3, . . . , an}, biological information measured for the second time may be represented by a matrix T2{b1, b2, b3, . . . , bn}, and the matrix T1 and the matrix T2 may be connected. In other words, the first row generates the matrix T1 and the second row generates the matrix Tn. By inputting the matrix Tn to the multiplayer network as in the above-described case, the matrix may be compressed to two-dimensional information and condition allocation may be performed.
As described above, the biological information processing apparatus according to the disclosure (for example, the application backend 100) acquires biological information on a subject, generates condition information representing a biological condition of the subject based on the biological information, and registers the condition information in the P2P database 223 (covering a blockchain). More specifically, the biological information processing apparatus generates a condition code representing the biological condition of the subject by encoding the biological information by performing a dimensional compression process and registers condition information containing the condition code in the P2P database 223. Encoding the biological information by the dimensional compression process makes it more difficult to specify the individual and reduce the data size of the condition information. Registering the condition information in the P2P database 223 allows linkage of the condition information with another hospital with authenticity of the condition information being secured. Standardizing the condition information according to given standards allows each hospital to appropriately analyze and utilize the condition information.
The biological information processing apparatus is able to recognize the condition transition pattern of the subject by performing chronological analysis on the condition information using a given method and thus identify the background of the onset of a disease per subject. The biological information processing apparatus generates condition information in consideration of not only diagnostic information from a doctor but also environment information and thus is able to calculate condition information on the subject accurately. Accordingly, the biological information processing apparatus according to the disclosure is able to contribute to realization of individual medicine (or improvement in supportive therapy).
The preferable embodiments of the disclosure have been described in detail with reference to the accompanying drawings; however, the technical scope of the disclosure is not limited to the examples. It is obvious that those with general knowledge in the technical field of the disclosure can reach various modifications or corrections within the scope of the technical idea described in the claims and it is understood that they naturally belong to the technical scope of the disclosure.
The effects described herein are explanatory or exemplary only and thus are not definitive. In other words, the technique according to the disclosure can achieve, together with the above-described effects or instead of the above-described effects, other effects obvious to those skilled in the art from the description herein.
The following configuration also belongs to the technical scope of the disclosure.
(1) A biological information processing method comprising:
acquiring biological information on a subject;
based on the biological information, generating condition information representing a biological condition of the subject; and
registering the condition information in a P2P database.
(2) The biological information processing method according to (1), wherein the condition information contains a condition code that is generated by encoding the biological information using a given method and that represents the biological condition.
(3) The biological information processing method according to (2), wherein the biological information contains at least any one of physical information that is information on a body of the subject and environment information that is information on an environment that affects the subject.
(4) The biological information processing method according to (3), wherein the condition code is generated by, using the given method, encoding at least any one of a physical code that is generated by encoding the physical information using the given method and an environment code that is generated by encoding the environment information using the given method.
(5) The biological information processing method according to (4), wherein the condition information contains, in addition to the condition code, at least any one of the physical code and the environment code.
(6) The biological information processing method according to any one of (2) to (5), wherein the condition information contains, in addition to the condition code, a method code representing the given method.
(7) The biological information processing method according to any one of (2) to (6), wherein the biological information processing method comprises, as the given method, generating the condition code by compressing dimensions of the biological information.
(8) The biological information processing method according to (7), wherein the biological information processing method comprises, compressing the dimensions using at least any one of table conversion and a machine learning approach.
(9) The biological information processing method according to any one of (3) to (5), wherein the physical information contains at least any one of anthropometric information, diagnostic information, treatment information, and operation information on the subject.
(10) The biological information processing method according to any one of (3) to (5), wherein the environment information contains at least any one of information on lifestyle habits, medication information, and information that is acquired using a wearable terminal device that is worn by the subject, which are sets of information on the subject.
(11) The biological information processing method according to any one of (1) to (10), further comprising:
extracting, from time-series data in which the biological information is arranged chronologically, a baseline component representing an irreversible change in the time-series data; and
based on variation in the baseline component, controlling registration of the condition information in the P2P database.
(12) The biological information processing method according to (11), the biological information processing method comprises determining to register the condition information in the P2P database when it is confirmed that the baseline component varies largely with respect to a given threshold from a time of previous registration of the condition information on the subject in the P2P database.
(13) The biological information processing method according to any one of (1) to (12), further comprising:
by comparing the condition information representing the biological condition of the subject with other sets of condition information representing biological conditions of other subjects, extracting a similar subject who had in the past a biological condition similar to that of the subject from the other subjects; and
based on a transition pattern of the biological condition of the similar subject, predicting a transition pattern of a future biological condition of the subject.
(14) The biological information processing method according to (13), further comprising, when it is predicated that the future biological condition of the subject is not preferable, presenting a method of making the biological condition of the subject preferable based on the transition pattern of the biological condition of the similar subject.
(15) The biological information processing method according to any one of (1) to (14), wherein the P2P database is a blockchain.
(16) A biological information processing apparatus comprising:
a biological information acquisition unit configured to acquire biological information on a subject;
a condition information generator configured to generate condition information representing a biological condition of the subject based on the biological information; and
a register configure to register the condition information in a P2P database.
(17) A biological information processing system comprising:
a biological information acquisition unit configured to acquire biological information on a subject;
a condition information generator configured to generate condition information representing a biological condition of the subject based on the biological information; and
a register configure to register the condition information in a P2P database.
(18) A biological information processing method comprising:
acquiring biological information on a subject;
generating condition information representing a biological condition of the subject based on the biological information; and
registering the condition information as data of a distributed network.
Number | Date | Country | Kind |
---|---|---|---|
2018-116003 | Jun 2018 | JP | national |
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/JP2019/024357 | 6/19/2019 | WO | 00 |