This application claims priority based on Japanese patent application, No. 2016-197783 filed on Oct. 6, 2016, the entire contents of which are incorporated herein by reference.
The present invention relates to a system and a method for data management.
Advancement of communication network techniques has made collection and analysis for information from sensors and devices from all over the world much easier. Collective analysis on various types of information enables an operator of a data management system to find association from a comprehensive point of view and achieve system optimization.
Data (raw data) simply collected from sensors and devices is provided with no associated information (metadata) required in later processing such as analysis. The raw data provided with metadata can be associated with another piece of data, and thus has a higher value.
For example, data with a simple 32-bit string “0x42140000” would not be so valuable. When the bit string data is provided with metadata indicating that the data is a single-precision floating point, the bit string can be interpreted as data representing a numerical value “37.0”.
Furthermore, the raw data can have an even higher value when provided with metadata indicating that the bit string is temperature data, described in Celsius, is information detected by a sensor disposed at a certain position, or is data measured at a certain time point. The metadata can be provided at a time point when the raw data is generated, or after the raw data has been collected. Hereinafter, the raw data may be simply referred to as data.
Metadata provided to raw data has a system (metadata system) with which the data can have meanings or can be associated with another piece of data. For example, it is defined in the metadata system that a concept A is subordinate to another concept B, or that two concepts with different names actually have the same meaning.
The metadata system can be used for estimation to associate data with another piece of data in a wider range than in a case where only the metadata directly provided to the data is used. In this context, for example, positional information may be associated with an administrative division of a region including the position. Furthermore, for example, an average temperature of a certain district may be obtained with an estimation based on the association with a larger administrative division including the region.
In a data management system that independently collects and analyzes data, metadata and a metadata system provided to each raw data are fixed, and the metadata or metadata system provided are rarely changed.
Still, an additional function of enabling the metadata and the metadata system to be changed after establishment of the data management system would make way for a wider range of use of data through new association that has not been contemplated when the system was designed.
To achieve this, techniques for supporting provision of new metadata to existing data have conventionally been proposed. Japanese Patent Application Laid-open No. 2007-115069 discloses a method of selecting data estimated to be related to provision of metadata, and presenting the data to a user, so that the user can efficiently provide the metadata. With this technique, the user can receive recommendation about appropriate data to which metadata is provided.
The conventional technique described in Japanese Patent Application Laid-open No. 2007-115069 can recommend new metadata to raw data from an existing metadata group and metadata system, but cannot support a system administrator making new changes in the existing metadata system.
Furthermore, an impact of changing a metadata system on the system cannot be recognized. For this reason, it would be substantially difficult for an administrator who is concerned about the negative impact to change the metadata system.
The present invention is made in view of the above, and an object of the present invention is to provide a system and a method of data management with which a variance between cases where a plurality of metadata systems serving as comparison targets are used is obtained to support the selection of a metadata system, so that a user can enjoy higher usability. A further object of the present invention is to provide a system and a method of data management with which an index for selecting a metadata system is provided, so that changes in a metadata system can be facilitated.
To solve the problems described above, a data management system according to the present invention is a data management system that manages data including metadata and includes: a data storage unit configured to store: a data group that is a group of data provided with at least one metadata; and one or more metadata systems for interpreting the metadata, and to search for the data group with a selected metadata system, in accordance with a query; an impact evaluation unit configured to obtain a variance between a plurality of metadata systems serving as comparison targets when the plurality of metadata systems execute a same query for the data storage unit; and a metadata system management unit configured to select a metadata system, based on the variance obtained by the impact evaluation unit.
In the present invention, the variance between the plurality of metadata systems serving as the comparison targets when the plurality of metadata systems execute the same query is obtained, and a metadata system can be selected based on the variance. Thus, a user managing the metadata system can be supported, and thus can enjoy higher usability.
The details of one or more implementations of the subject matter described in the specification are set forth in the accompanying drawings and the description below. Other features, aspects, and advantages of the subject matter will become apparent from the description, the drawings, and the claims.
Embodiments of the present invention are described below with reference to the drawings. As described later, in the embodiments, management of a metadata system by a user is supported with an impact of changing the metadata system estimated before the metadata system is changed.
To achieve this, a data management system according to the embodiments includes: a data register/search unit 101 (also referred to as a data storage function 101), an impact evaluation unit 201 (also referred to as an impact evaluation function 201), and a metadata system management unit 301 (also referred to as a metadata system management function 301).
The data storage function 101 stores therein one or more metadata systems 103 and a data group 102 provided with metadata, and search is executed with the data group 102 combined with one of the metadata systems 103. The impact evaluation function 201 obtains a variance between the plurality of metadata systems 103(1) and 103(2) when the plurality of metadata systems 103(1) and 103(2) execute the same query for the data storage function 101. The variance may be hereinafter referred to as difference. The metadata system management function 301 presents the variance, presented by the impact evaluation function 201, to a user such as a system administrator, and issues an instruction indicating one or more metadata systems 103 selected by the user, as the metadata system of the data storage function 101. Thus, the determination of the user can be supported with the impact of changing the metadata system presented beforehand.
The query, employed by the plurality of metadata systems serving as comparison targets, is a test query for estimating the impact of the changing the metadata system. The test query may be prepared before a test, or generated at the time of the test. In the present embodiment, a history of the queries used in the past is managed, and the test query is selected from the query history to be used.
In one embodiment described below, the plurality of metadata systems serving as the comparison targets, are concurrently operated, and a variance in the search result over a predetermined period is examined. The user determines the impact of changing the metadata system, based on a variance in the operation result over the predetermined period. Thus, the variance between the plurality of metadata systems can be obtained while the metadata systems are actually operating concurrently, to support the determination of the user.
In another embodiment described below, the query is issued for a plurality of times to the plurality of metadata systems serving as the comparison targets. A second query is selected based on the search result corresponding to a first query and is issued to the plurality of metadata systems serving as the comparison targets, to obtain a second search result.
In the present embodiment with the configuration described above, a level of impact of changing the metadata system can be determined, so that changes in the metadata system can be facilitated. Thus, data utilization creating new values can be additionally introduced.
Embodiment 1 is described with reference to
The data management system that manages data including metadata includes, for example, a data storage apparatus 1, an impact evaluation apparatus 2, and a metadata system management apparatus 3. The data management system may further include a data collection apparatus 4 and a query processing apparatus 5. A data generation apparatus 6 and a data using apparatus 7, which are generally not included in the data management system, but the apparatuses 6 and 7 may be included in the data management system in some cases.
An example of configurations of the data storage apparatus 1, the impact evaluation apparatus 2, the metadata system management apparatus 3, the data collection apparatus 4, and the query processing apparatus 5 is described later in detail. As described later, the apparatuses 1 to 5 may be provided on different computers, or may be provided on a common computer. In a case described in the present embodiment, the apparatuses 1 to 5 are implemented with different computers.
For example, the data storage apparatus 1 includes; the data register/search unit 101 that registers data and searches for the registered data; the database 102 that stores and manages raw data to which metadata is set; the first metadata system 103(1); and the second metadata system 103(2). An example of the configuration of the data storage apparatus 1 is described later with reference to
A portion that implements a predetermined function may be referred to as “function”. For example, the data register/search unit 101 may be referred to as a data register/search function 101. The functions 101, 201, 301, 401 and 501 are implemented on the apparatuses 1, 2, 3, 4, and 5 formed as the computers. Thus, for example, a “data storage unit” may be the data storage apparatus 1, and an “impact evaluation unit” may be the impact evaluation apparatus 2.
For example, the impact evaluation apparatus 2 includes: the impact evaluation unit 201 that evaluates an impact of changing a metadata system; and a query history DB 202 that stores and manages queries issued in the past. An example of the configuration of the impact evaluation apparatus 2 is described later in detail with reference to
For example, the metadata system management apparatus 3 includes the metadata system management unit 301 that manages the metadata system based on an evaluation performed by the impact evaluation apparatus 2. An example of the configuration of the metadata system management unit 301 is described below in detail with reference to
For example, the data collection apparatus 4 includes: a raw data registration unit 401 that registers raw data, received from the data generation apparatus 6, in the data storage apparatus 1; and a template DB 402. An example of the configuration of the data collection apparatus 4 is described later with reference to
For example, the query processing apparatus 5 includes a query unit 501 that issues a query for data (raw data to which the metadata is set) stored in the data storage apparatus 1, and receives a result of the query (search result). An example of the configuration of the query processing apparatus 5 is described below in detail with reference to
The data generation apparatus 6 is an apparatus that generates raw data. For example, the data generation apparatus 6 includes various sensors and various devices, and generates a measured value of a sensor and log information on an apparatus, as the raw data. Although
The data using apparatus 7 is an apparatus that uses a data group managed in the data management system and requests the query processing apparatus 5 to search for data. The data using apparatus 7 receives a search result from the query processing apparatus 5, and uses the search result for data analysis or the like. Although
A connection configuration of the apparatuses 1 to 7 is described. The data storage apparatus 1, the impact evaluation apparatus 2, and the metadata system management apparatus 3 are connected to each other through a communication network CN1. The data generation apparatus 6 and the data collection apparatus 4 are connected to each other through a communication network CN2. The data using apparatus 7 and the query processing apparatus 5 are connected to each other through a communication network CN3. The communication networks CN1 to CN3 may be different communication networks, or may be a common communication network.
An overview of an operation in the data management system is described. The data generation apparatus 6 transmits raw data to the data collection apparatus 4 through the communication network CN2. The data collection apparatus 4 provides metadata to the raw data on the basis of a template set in advance, and stores the resultant data in the data storage apparatus 1. For example, the data generation apparatus 6 may have a sensing function, a storage function, and a communication function. A detail description on an example of the configuration of the data generation apparatus 6 will be omitted.
The data using apparatus 7 requests the query processing apparatus 5 to search for data, through the communication network CN3. The query processing apparatus 5 requests the data storage apparatus 1 to search for data. Upon receiving a response to the query from the data storage apparatus 1, the query processing apparatus 5 transmits the content of the response to the data using apparatus 7. For example, the data using apparatus 7 is formed as a computer, and has a calculation processing function, a storage function, a communication function, and a user interface function. A detail description on an example of the configuration of the data using apparatus 7 will be omitted.
When the user changes the metadata system, the metadata system management apparatus 3 uses the data storage apparatus 1 and the impact evaluation apparatus 2 to examine the impact of changing the metadata system. The metadata system management apparatus 3 presents an examination result of the impact of changing the metadata system to the user. The user can instruct the metadata system management apparatus 3 to change or not to change the metadata system, after checking the examination result. Changing the metadata system set in the data storage unit 1 may be expressed as “updating the metadata system” in the description below.
The storage unit 43 includes a rewritable storage medium such as a semiconductor memory and a hard disk, for example. For example, the storage unit 43 stores therein a program for implementing the raw data registration function 401 and the template DB 402. The raw data registration function 401 interprets a computer program, stored in the storage unit 43, with the information processing unit 41 to execute processing. This operation is hereinafter referred to as “the raw data registration function executes processing”.
The temperature sensor generates data D1 indicating that 37.0 degrees Celsius has been measured at a time point “2016/01/01 19:00”, and the data D1 is assumed to be received by the data collection apparatus 4. The raw data matches the first line in the template DB 402 illustrated in
The storage unit 13 stores therein: a computer program for implementing data registration and search function 101; the raw data+provided metadata DB 102; and the metadata system DB 103. In the present embodiment, the database is divided into the metadata system DB 103 and the raw data+provided metadata DB 102. Thus, version management can be easily performed for the metadata system only.
The raw data+provided metadata DB 102 stores and manages data (data such as data D2 illustrated in
The metadata system DB 103 manages at least one metadata system. The metadata system DB 103 illustrated in
The query processing apparatus 5 transmits the search result, received from the data storage apparatus 1, to the data using apparatus 7 as a response (S22). As a final step, the content of the query to the data storage apparatus 1 is transmitted from the query processing apparatus 5 to the impact evaluation apparatus 2 to be registered in the query history DB 202 (S23).
A data adding and data searching procedure in a case where the data management system according to the present embodiment is used is as described above. Components and procedures required for updating the metadata system with the data management system are described below.
The user can change the metadata system set in the data storage apparatus 1 for performing more appropriate data analysis or the like. Here, it is assumed that a single metadata system can be set in the data storage apparatus 1 for the sake of description. However, this should not be construed in a limiting sense, and a plurality of metadata systems can be set in the data storage apparatus 1, so that the user can designate a metadata system used for searching for data.
Changing the metadata system by the user might make a large impact on the search result obtained. For example, the search result obtained by using the metadata system after the change might largely increase or largely decrease compared with the case where the metadata system before the change is used. However, a conventional technique fails to enable a user to recognize the level of impact of changing the metadata system, and thus requires a long period of time before the metadata system can be appropriately changed, leading to low usability. In view of this, in the present embodiment, an impact of a change is estimated and presented to the user before the change to the metadata system is confirmed.
As illustrated on the left side of
Upon receiving the request from the metadata system management apparatus 3 (S31), the impact evaluation function 201 of the impact evaluation apparatus 2 requests the data storage apparatus 1 to prepare for two versions including an old metadata system 103(1) before the change is applied and a new metadata system 103(2) after the change is applied (S32).
The impact evaluation function 201 randomly extracts N queries (N>0) used in the past from the query history DB 202 (S33). An extraction method other than the random extraction, such as extraction of N latest queries may be employed.
The impact evaluation function 201 executes each query extracted in step S33 with the old metadata system 103(1) and for the new metadata system 103(2) (S34). The impact evaluation function 201 calculates a difference between a search result obtained by using the old metadata system 103(1) and a search result obtained by using the new metadata system 103(2) for the same query, and transmits the difference to the metadata system management apparatus as a response (S35).
Upon receiving the response from the impact evaluation apparatus 2 (S36), the metadata system management function 301 of the metadata system management apparatus 3 presents the difference, in the response, to the user who has issued the instruction to change the metadata system (S37). The user can determine whether to change (update) the old metadata system 103(1) to the new metadata system 103(2) by checking the difference. Changing the metadata system from the old system to the new system may be hereinafter referred to as “updating the metadata system”.
The metadata system management apparatus 3 receives an instruction from the user and determines whether the user has permitted the update (S38). When the user has permitted the update of the metadata system (S38: YES), an instruction to employ the new metadata system 103(2) is issued to the data storage apparatus 1 (S39). The old metadata system 103(1) may be discarded from the data storage apparatus 1, or may be maintained for a predetermined period.
When the user does not permit the update of the metadata system (S38:NO), the metadata system management function 301 issues an instruction to discard the new metadata system 103(2) to the data storage apparatus 1 (S40).
For example, the metadata system update management screen G1 includes: changed point input units GP11 and GP12 for inputting a changed point in the metadata system; a difference display unit GP13 that displays the difference due to the change (update) of the metadata system; and buttons BP11 to BP13.
The changed point input units GP11 and GP12 include: an input unit GP11 for partially adding a metadata system; and an input unit GP12 for partially deleting a metadata system. The adding metadata system input unit GP11 is a display area in which the metadata system to be added to the currently operating metadata system 103(1) is input. The deleting metadata system input unit GP12 is a display area in which the metadata system to be deleted from the currently operating metadata system 103(2) is input.
When the user inputs a change in the currently operating metadata system 103(1) by using one of the input units GP11 and GP12 or both and then presses a confirm button BP11, the metadata system management function 301 receives the change request from the user.
Then, as described above with reference to
Upon receiving the difference from the impact evaluation function 201 (S36), the metadata system management function 301 displays the difference on the difference display unit GP13. The difference display unit GP13 displays a newly detected search result and a lost search result for each query.
In
The user can determine whether to update or cancel the update of the metadata system, after checking the content displayed on the difference display unit GP13. A user who desires to update the currently operating metadata system presses an update button BP12. A user who desires to cancel or redo the update of the currently operating metadata system presses a cancel button BP13. The user can repeat the inputting of the changed point in the metadata system and the checking of the difference in the search result to achieve the metadata system desired by the user.
Steps S100 to S102 are data registration sequence. When the raw data arrives from the data generation apparatus 6 (S100), the data collection apparatus 4 uses the template DB 402 to generate data to be stored in the data storage apparatus 1 (S101). The data generated by applying the raw data to a template is transmitted from the data collection apparatus 4 to the data storage apparatus 1 to be stored (S102).
Steps S103 to S108 represent a sequence in a case where the data using apparatus 7 requests the data search. Upon receiving the data acquisition request from the data using apparatus 7 (S103), the query processing apparatus 5 transmits the search request to the data storage apparatus 1 (S104). The data storage apparatus 1 processes the search request to generate a search result (S105). The data storage apparatus 1 transmits the search result to the query processing apparatus 105 as a response (S106). The query processing apparatus 5 transmits the acquired result to the data using apparatus 7 as a response (S107). The query content is transmitted from the query processing apparatus 5 to the impact evaluation apparatus 2 to be registered in the query history DB 202 (S108).
Steps S110 to S123 represent a sequence related to the updating of the metadata system. Upon receiving the change request for the metadata system (S110), the metadata system management apparatus 3 transmits a difference obtaining request to the impact evaluation apparatus 2 (S111). The impact evaluation apparatus 2 notifies the data storage apparatus 1 of the changed point in the metadata system, and requests the preparation of the two versions of metadata systems that are old and new (S112). Upon receiving the request in step S112, the data storage apparatus 1 prepares the two versions of metadata systems that are old and new (S113), and notifies the impact evaluation apparatus 2 of the completion of the preparation (S114).
The impact evaluation apparatus 2 extracts the N past queries from the query history DB 202 (S115), and instructs the query processing apparatus to process each query in the two versions of the metadata systems that are old and new (S116, S117). In
The query processing apparatus 5 transmits a search request to the data storage apparatus 1 and transfers a search result, as a response to the search request, to the impact evaluation apparatus 2. The impact evaluation apparatus 2 obtains the difference (difference in the search result) in the response between the old and the new metadata systems applied to the same query group, and transmits the difference to the metadata system management apparatus 3 as a response (S118).
The metadata system management apparatus 3 presents the difference, received from the impact evaluation apparatus 2, to the user (S119), and waits for an instruction from the user. Here, it is assumed that the user has permitted the update. Upon receiving the permission to update from the user (S120), the metadata system management apparatus 3 issues an instruction indicating that the new metadata system 103(2) is to be used and that the old metadata system 103(1) is to be discarded, to the data storage apparatus 1 (S121).
The data storage apparatus 1 follows the instruction from the metadata system management apparatus 3, and thus switches the metadata system to be operated from the old metadata system 103(1) to the new metadata system 103(2) (S122). Then, the data storage apparatus 1 notifies the metadata system management apparatus 3 of the completion of the switching of the metadata system (S123).
With the present embodiment having the configuration described above, when the user attempts to change a metadata system, an impact on the search result due to the change can be estimated in advance. Thus, the user can issue an instruction to execute update processing for the metadata system, after reviewing the potential impact, and thus can enjoy higher usability. Furthermore, the user can search for data while correcting the metadata system, and thus can achieve an appropriate data analysis.
Embodiment 2 is described with reference to
In Embodiment 1, the impact of the update of the metadata system is presented to the user in a form of a difference in the search result between the old and the new metadata systems applied to the same query.
In Embodiment 2, the content of the difference presented to the user is changed in accordance with an amount of the difference with respect to the query. In the present embodiment, the metadata system is automatically updated when the difference is small and thus an impact on the search is small, to save time and effort of the user.
Upon receiving the update instruction for the metadata system from the user, the metadata system management function 301 of the metadata system management apparatus 3 requests the impact evaluation apparatus 2 to calculate the difference in the search result between the old and the new metadata systems applied to the same query (S50). Then, the metadata system management function 301 receives the difference with respect to the query as a response from the impact evaluation function 201 (S51).
The metadata system management function 301 calculates an amount of the difference ΔR (S52). For example, the metadata system management function 301 calculates the number of added, deleted, and updated data sets for each query to obtain the difference amount ΔR (S52). Specifically, the metadata system management function 301 obtains the difference amount ΔR as a sum or the like of the number of newly added data sets, the number of deleted data sets, and the number of data sets the content of which has been changed, in the search result as a result of applying the old and the new metadata systems to the same query.
The metadata system management function 301 checks whether the difference amount ΔR calculated in step S52 is larger than a predetermined threshold Th (S53). When the calculated difference amount ΔR is equal to or smaller than the threshold Th (S53: NO), the metadata system management function 301 determines that the range of the impact is small, and thus switches the old metadata system 103(1) to the new metadata system 103(2) (S56).
When the difference amount ΔR calculated in step S52 is larger than the threshold Th (S53: YES), the metadata system management function 301 regards the range of the impact as being large and thus presents the difference to the user to urge the user to determine whether the metadata system can be updated (S54). Then, the update instruction from the user is checked as in Embodiment 1 described above (S55). When the user permits the update (S55: YES), the metadata system management function 301 switches the system to the new metadata system 103(2) (S56). When the user cancels the update (S55: NO), the metadata system management function 301 causes the discarding of the new metadata system 103(2) (S57).
This embodiment with the configuration described above also provides advantageous effects similar to those of Embodiment 1. The present embodiment has an additional feature that the metadata system is automatically updated when the difference amount ΔR, as a result of executing the same query with the old and the new metadata systems, is equal to or smaller than the threshold Th. Thus, the usability can further be improved with the time effort of the user saved when the impact of the switching of the metadata system is small.
Embodiment 3 is described with reference to
Comparison between a flowchart illustrated in
The present embodiment additionally provides a function of retuning the search result, for the search processing executed in the data storage apparatus 1, including information on the calculation resources used in the search. Such a function can be implemented by using a resource management function of an operating system of a general computer. An example of the resource management function includes a vmstat command, a time command, and a ps command of the operating system as well as a job management function of a cluster management system.
In the present embodiment, the impact evaluation function 201 does not receive the query result but receives the used amount of calculation resources involved in the search using the old and the new metadata systems, as the result of the query executed in step S64. Then, the impact evaluation function 201 transmits the difference in the used amount of the calculation resources to the metadata system management function 301 (S65).
Upon receiving the difference from the impact evaluation function 201 (S66), the metadata system management function 301 calculates the difference in the calculation resources used in the old and the new metadata systems (S67), and presents the difference in the calculation resources calculated in step S67 to the user (S68).
The user can determine whether to update the metadata system by checking the variance between the calculation resource amount required for the search using the old metadata system 103(1) and the calculation resource amount required for the search using the new metadata system 103(2).
When the user permits the update (S69: YES), the metadata system management function 301 instructs the data storage apparatus 1 to update the old metadata system 103(1) with the new metadata system 103(2) (S70). When the user does not permit the update (S69: NO), the metadata system management function 301 instructs the data storage apparatus 1 to discard the new metadata system 103(2) (S71).
This embodiment with the configuration described above also provides advantageous effects similar to those of Embodiment 1. The present embodiment has an additional feature that the impact of changing the old metadata system to the new metadata system is calculated as the used amount of the calculation resources, and thus whether to permit the change can be determined based on the used amount of the calculation resources. Thus, the user can enjoy even higher usability.
Even when the difference in the data search result is not very large, the amount of calculation resources used for obtaining the result might largely differ. In a data management system involving a large variety of data, the used amount of calculation resources might largely fluctuate. Even in such a case, the data management system according to the present embodiment can be applied to improve the usability.
Embodiment 4 is described with reference to
Upon receiving the request from the metadata system management function 301 (S82), the impact evaluation function 201 requests the data storage apparatus 1 to generate the old and the new metadata systems (S83). Upon receiving the request from the impact evaluation function 201 (S84), the data storage apparatus 1 generates the old and the new metadata systems (S85). The old metadata system 103(1) is the currently operating metadata system and thus already exists. Still, the data storage apparatus 1 may prepare the copy of the old metadata system for investigation use for checking the impact.
Upon receiving a data acquisition request from the data using apparatus 7 (S90), the query processing apparatus 5 issues a query to the data storage apparatus 1. The data storage apparatus 1 searches for data by applying each of the old and the new metadata systems to the query (S91).
The impact evaluation function 201 calculates the difference between the search result by using the new metadata system and the search result obtained by using the old metadata system (S92), and transmits the result to the metadata system management function 301.
The metadata system management function 301 presents the received difference to the user (S93), and waits for the instruction from the user (S94). The user checks the presented difference, and issues an instruction to permit or not to permit the update of the metadata system or to suspend the decision, to the metadata system management function 301.
When the user permits the update, the metadata system management function 301 instructs the data storage apparatus 1 to perform the update to the new metadata system (S95). When the user does not permit the update, the metadata system management function 301 instructs the data storage apparatus 1 to discard the new metadata system (S96).
When the user suspends the decision to permit or not to permit the update, the metadata system management function 301 terminates the processing. A button for issuing the instruction indicating that the decision is suspended may be added to the screen G1 illustrated in
In step S93, the metadata system management function 301 may present only the difference related to the latest query to the user, or may present, for example, the history of the difference or a cumulated value of the difference related to the queries issued within a predetermined period of time to the user. The screen G1 may be provided with buttons and the like or a display section for displaying, for example, the history or the cumulated value of the difference.
This embodiment with the configuration described above also provides advantageous effects similar to those of Embodiment 1. The present embodiment has an additional feature that the difference with respect to the query can be monitored with the old and the new metadata systems concurrently operated. Thus, the user can determine whether to update the metadata system based on the monitoring result in the actual operation, and thus can more accurately review the range of the impact of changing the metadata system, compared with the case where the difference is examined with a group of test queries.
Embodiment 5 is described with reference to
Upon receiving the change request for the metadata system, the metadata system management function 301 requests the impact evaluation function 201 to calculate the difference in the case where the old and the new metadata systems are applied to the same query (S200).
Upon receiving the request from the metadata system management function 301, the impact evaluation function 201 evaluates the impact of changing the metadata system as described above with reference to steps S31 to S35 in
Specifically, the impact evaluation function 201 instructs the data storage apparatus 1 to prepare the old and the new metadata systems, randomly extracts the queries used in the past from the query history DB 202 for a predetermined number of times, executes data search with the old and the new metadata systems applied to the queries, and receives the difference as a response from the data storage apparatus 1 (S201). Then, the impact evaluation function 201 transmits the difference as the response to the metadata system management function 301 (S201).
Upon receiving the difference as a response related to the old and the new metadata systems (S202), the metadata system management function 301 presents the difference to the user (S203). The difference as a response may be the difference in the content of the response, the number of differences (the number of hits), or the used amount of the calculation resources.
The user can determine whether to change (update) the metadata system by checking the impact evaluation presented by the metadata system management function 301. The user may be unable to determine whether to change the metadata system in a case where the difference is small for example.
Thus, in the present embodiment, the impact evaluation can be repeated. The metadata system update screen G1 illustrated in
The impact evaluation function 201 randomly selects a predetermined number of queries other than the query used for first impact evaluation from the query history DB 202 in accordance with the difference obtained in the first impact evaluation (S205). The impact evaluation function 201 selects the query used in second impact evaluation in accordance with the difference obtained in the first impact evaluation. For example, when the first difference is smaller than a predetermined value set in advance, the impact evaluation function 201 may select a query similar or related to the first query. For example, when the content of the first query is “average temperature in A region”, “average power consumption in A region”, or “average temperature of a section as a part of A region” may be selected as the second query.
The impact evaluation function 201 searches for data with the reselected query (another query) executed to each of the old and the new metadata systems (S206). The impact evaluation function 201 transmits a difference in the response between cases where the query is applied to the old and the new metadata systems to the metadata system management function 301 (S207).
Upon receiving the difference obtained by the second impact evaluation (S202), the metadata system management function 301 presents the difference to the user (S203). The user may check the result of the second impact evaluation, and then issue an instruction to perform a third impact evaluation (S204: YES).
The user may not issue an instruction to execute a further impact evaluation (S204: NO), and permit the update of the metadata system (S208: YES). Upon receiving the permission from the user, the metadata system management function 301 instructs the data storage apparatus 1 to update the old metadata system with the new metadata system (S209). When the user does not permit the update (S209: NO), the metadata system management function 301 instructs the data storage apparatus 1 to discard the new metadata system (S210).
This embodiment with the configuration described above also provides advantageous effects similar to those of Embodiment 1. The present embodiment has an additional feature that the impact of changing the metadata system can be evaluated for a plurality of times with queries of different contents.
In the present embodiment, the query content related to the previous query content or the query content similar to the previous query content is randomly extracted from the query history DB 202 to be used in the current impact evaluation, and thus the impact of changing the metadata system can be more appropriately recognized.
Embodiment 6 is described with reference to
For example, in step S205A in the flowchart, when “temperature” is used as the previous query content, a query including “sensor type” obtained by tracing back “temperature” in the raw data+provided metadata DB 102 or the metadata system DB 103 for a predetermined number of nodes is selected as the current query content. This embodiment with the configuration described above also provides advantageous effects similar to those of Embodiment 6.
The embodiments described above may be implemented individually, or some of the embodiments may be implemented in combination as appropriate. Although the embodiments have been described with different functions performed in respective apparatuses, the present invention is not limited to this. These functions and DBs may be implemented in any other distribution form while the relations for calling the functions are maintained.
The present invention is not limited to the embodiments described above, but includes various modifications. The embodiments are described in detail for easy understanding of the present invention, and are not intended to limit the present invention to the one including all the components described above. At least part of the components of each embodiment may be provided with an additional component, and have some components omitted or replaced with other components, without departing from the spirit and scope of the claimed subject matter.
Part or the whole of the above-described components, functions, processing units, processors, and the like may be implemented in hardware, such as integrated circuits designed to achieve these, for example. The present invention may also be implemented by program codes for software implementing the functions of the embodiments. In this case, a computer is provided with a recording medium storing a program code and a CPU in the computer reads the program code stored in the recording medium. In this case, the program code itself read from the recording medium implements the functions of the embodiments described above, and the program code itself and the recording medium storing the program code constitute the present invention. Examples of the recording medium for providing the program code include flexible disks, CD-ROMs, DVD-ROMs, hard disks, SSDs (Solid State Drives), optical discs, magneto optical discs, CD-Rs, magnetic tapes, nonvolatile memory cards, ROMs, and the like.
Although the present disclosure has been described with reference to exemplary embodiments, those skilled in the art will recognize that various changes and modifications may be made in form and detail without departing from the spirit and scope of the claimed subject matter.
Number | Date | Country | Kind |
---|---|---|---|
2016-197783 | Oct 2016 | JP | national |