This application is based upon and claims the benefit of priority from the prior Japanese Patent Application No. 2007-249019, filed on Sep. 26, 2007; the entire contents of which are incorporated herein by reference.
1. Field of the Invention
The present invention relates to an apparatus, a computer program product, and a method for managing meta data.
2. Description of the Related Art
Hierarchical databases, like Object-Oriented Databases (OODBs) and Object-Relational Databases (ORDBs), each have a hierarchical structure in which subordinate categories inherit the attributes of their superordinate category. In such a hierarchical database, the number of attributes progressively increases for the subordinate categories due to the inheritance of the attributes. A subordinate category's inheriting the attributes of its subordinate category is generally referred to as “inheritance”.
In an object-oriented database, each of the categories in the hierarchical levels is often referred to as a “class”. In an object relational database (ORDB), each of the tables that are allowed to be inherited corresponds to a class. Between two tables having a superordinate-subordinate relationship, attributes are inherited from the superordinate table to the subordinate table. In other words, the header information of the columns in the superordinate table is inherited to the subordinate table. Pieces of data that respectively belong to the categories in a hierarchical level and have mutually the same type of attribute are referred to as “instances”. A set of instances is referred to as a “population” of data. In a relational database (RDB) or in an ORDB, a population of data is usually stored in a structure called a “table”. A row of attributes included in a table is referred to as a “header” of the table.
An example of a hierarchical database is International Organization for Standardization (ISO) 13584 Parts Library standard (hereinafter, the “PLIB standard”), which is an international standard for implementing an electronic catalogue system that electronically provides information of products. The PLIB standard is an international standard that is made up of a plurality of “Parts” and defines an object-oriented method for writing library data of products or parts as well as the semantics for the exchange file format. In other words, the PLIB standard defines what terms, what description method, and what data type are used. Part 42 of the “PLIB” standard has the same contents as International Electrotechnical Commission (IEC) 61360-2 (Part 2). The PLIB standard is a system for categorizing products in an object-oriented manner, defining the attributes with which each of the categories is characterized, and exchanging files with respect to the contents for the categories. Needless to say, the concept of inheriting attributes is included in the system. Also, the PLIB standard is made by making reference to ISO 6523 “Structure for the identification of organizations and organization parts”. In particular, by utilizing the International Code Designator (ICD) defined in ISO 6523, it is possible to assign a world-wide unique identifier to each of the attributes.
In recent years, a number of systems that are compliant with the “PLIB” standard have been proposed (cf. JP-A 2004-177996 (KOKAI) and JP-A 2004-178015 (KOKAI)).
Categories (i.e., classes) and attributes (i.e., properties) that are necessary when the product data is written are collectively referred to as a “dictionary”. Examples of international standard dictionaries that are compliant with the “PLIB” data model include ISO 13584-501 related to measuring instruments, ISO 13584-511 related to fasteners (e.g., screws), and IEC 61360-4 related to electric and electronic products. In Japan, ECALS dictionary and JeMarche dictionary are examples of standard dictionaries in the industrial field and are used for exchanging the specification data of products. Also, in other countries of the world, dictionaries are actively being developed.
Dictionary maintenance and management organizations maintain and manage the international standard dictionaries as described above by using a mechanism such as Registration Authority (RA) or Maintenance Agency (MA). The dictionary maintenance and management organizations receive suggestions for corrections from members in and out of the organizations and update the dictionaries after the suggestions for corrections are approved by, for example, a majority vote. The standard dictionaries in the industrial field are also maintained and managed by using a similar mechanism.
Generally speaking, instance data editors who write product data (hereinafter, “instance data”) by using dictionaries are different from dictionary editors who create, maintain, and manage dictionaries. As for the definitions in a dictionary, it is often the case that there is a disparity between the dictionary editor's understanding thereof and the instance data editor's understanding thereof. For example, the instance data editor may find that the definitions in the dictionary provided by the dictionary editor are not sufficient and may feel like he/she does not know what should be written in what way. As another example, the instance data editor may not be able to write what the dictionary editor has had in his/her mind. As yet another example, the definitions in the dictionary may not be regarded appropriate when the instance data editor writes the actual product data by using them.
The disparity between the dictionary editor and the instance data editor with respect to their understanding of the definitions in the dictionary prevents the product data from being written properly and also immediately brings about deterioration in quality of the product data.
According to one aspect of the present invention, a meta data management apparatus includes a dictionary storing unit that stores a dictionary that is meta data defining a schema including a restriction of an instance data description; a rule storing unit that stores dictionary change rules each of which defines, in a manner of a rule, a change of the dictionary in correspondence with an error pattern of the instance data description; a receiving unit that receives an instance data described according to the schema via an Instance data editor terminal used by an editor of the instance data; an error detecting unit that detects specifics of an error when the instance data received by the receiving unit does not satisfy the restriction of the instance data description; and a suggestion presenting unit that presents a change proposal for dictionary (hereinafter, the “dictionary change proposal”) to a dictionary editor terminal used by a dictionary editor, the dictionary change proposal suggesting the change of the dictionary created based on the specifics of the error and one of the dictionary change rules corresponding to the specifics of the error.
According to another aspect of the present invention, a meta data management method includes receiving an instance data described according to a schema that is defined in a dictionary being the meta data and includes a restriction of an instance data description, via an instance data editor terminal used by an editor of the instance data; detecting specifics of an error from the received instance data, when the received instance data does not satisfy the restriction of the instance data description; and presenting a dictionary change proposal to a dictionary editor terminal used by a dictionary editor, the dictionary change proposal suggesting a change of the dictionary created from the specifics of the error, based on a dictionary change rule which defines, in a manner of a rule, the change of the dictionary corresponding to an error pattern of the instance data description.
A computer program product according to still another aspect of the present invention causes a computer to perform the method according to the present invention.
Exemplary embodiments of the present invention will be explained with reference to
In the server 1 and each of the clients 3, when the operator turns on the electric power, the CPU 101 runs a program that is called a loader and is stored in the ROM 102. A program that is called an Operating System (OS) and that manages hardware and software of the computer is read from the HDD 104 into the RAM 103 so that the OS is activated. The OS runs other programs, reads information, and stores information, according to an operation by the operator. Typical examples of an OS that are conventionally known include Windows (registered trademark). Operation programs that run on such an OS are called application programs. Application programs include not only programs that operate on a predetermined OS, but also programs that cause an OS to take over execution of a part of various types of processes described later, as well as programs that are contained in a group of program files that constitute predetermined application software or an OS.
In the server 1, a meta data management program is stored in the HDD 104, as an application program. In this regard, the HDD 104 functions as a storage medium that stores therein the meta data management program.
On the other hand, in each of the clients 3, an editing processing program is stored in the HDD 104, as an application program. In this regard, the HDD 104 functions as a storage medium that stores therein the editing processing program.
Also, generally speaking, the application programs to be installed in the HDD 104 included in the server 1 and each of the clients 3 can be recorded in one or more storage media 110 including various types of optical disks such as CD-ROMs and Digital Versatile Disks (DVDs), various types of magneto optical disks, various types of magnetic disks such as flexible disks, and media that use various methods such as semiconductor memories, so that the operation programs recorded on the storage media 110 can be installed into the HDD 104. Thus, storage media 110 that are portable, like optical information recording media such as CD-ROMs and magnetic media such as Floppy Disks (FDs), can also be each used as a storage medium for storing therein the application programs. Further, it is also acceptable to install the application programs into the HDDs 104 after obtaining the application programs from an external source via, for example, the communication controlling device 106.
In the server 1, when the meta data management program that operates on the OS is run, the CPU 101 performs various types of computation processes and controls the functional units in an integrated manner, according to the meta data management program. On the other hand, in each of the clients 3, when the editing processing program that operates on the OS is run, the CPU 101 performs various types of computation processes and controls the functional units in an integrated manner, according to the editing processing program. Of the various types of computation processes performed by the CPU 101 included in the server 1 and each of the clients 3, characteristic processes according to the present embodiment will be explained below.
By following the editing processing program, each of the clients 3 outputs, via a Graphic User Interface (GUI), data received from the server 1 to the displaying unit 107. Each of the clients 3 also receives, via the GUI, data and commands based on operations and settings that have been performed and configured by the operator via the input unit 108 on screens displayed on the displaying unit 107, and further transmits the received data and commands to the server 1. The editing processing program achieves various types of functions according to the authority granted to each operator. As explained in detail later, by following the editing processing program, each of the clients 3 according to the present embodiment functions, according to the authority granted to each operator, as a dictionary editor terminal 31 that is to be used by a dictionary editor and edits dictionaries or an instance data editor terminal 32 that is to be used by an instance data editor and accepts inputs of created instance data, as shown in
On the other hand, as shown in
Also, by following the meta data management program, the server 1 includes: a data error detecting unit 51 that functions as an error detecting unit; a dictionary change proposal presenting unit 52 that functions as a suggestion presenting unit; a change proposal processing unit 53 that functions as a suggestion processing unit; a dictionary change proposal receiving unit 54 that functions as a suggestion receiving unit; a user management unit 55 that manages identifiers of instance data editors; an error knowledge detecting unit 56; and a dictionary change notifying unit 57.
Next, each of these functional units will be explained in detail.
First, the dictionary DB 41 will be explained.
In the explanation of the present embodiment, the instance data are also shown in
The dictionary version control rule DB 45 is a database that stores therein dictionary version control rules each of which describes the effect that will be made by a change in the dictionary elements (e.g., the classes, the properties). In
The dictionary change rule DB 43 is a database that stores therein dictionary change rules each of which defines, in the manner of a rule, providing of an annotation in correspondence with an error pattern that is found during the instance data description, the annotation being a piece of knowledge that is helpful when the dictionary is improved or when an instance data is written. Examples of the change proposal rules are shown in Tables 1 to 5 below. Shown in Tables 1 to 5 are processes related to data-type errors of the properties that are applicable to the example using the PLIB data model. The degrees of the effects that will be made on the dictionary are indicated based on the definitions in the PLIB; however, the degrees of the effects are written for the sake of convenience in the explanation and may vary. Also, the degrees of the effects that will be made on the dictionary and the available data types may vary depending on the model being used. In Tables 1 to 5, “Change proposal” indicates a change proposal rule; “Pattern of Change” indicates what change will be made on what attribute of the dictionary elements; “Error instance data Usability” indicates whether it is possible to use (Yes: Y) or not (No: N) the instance data that currently has an error without modifying it, if the dictionary happens to be updated according to the change proposal rule; “Degree of Effect on Dictionary” indicates the effect that will be made by a change in the dictionary and is obtained based on the dictionary management rules stored in the dictionary-version control rule DB 45. As for “Error instance data Usability”, when the dictionary has been improved in the case where the value is “Yes: Y”, the Instance data editor terminal 32 used by the instance data editor who has caused the error is notified, so that the information is helpful for the update of the instance data.
For example, in the case where the specifics of an error are indicated as “an enumeration value {EJ} that is not defined as one of the enumeration values has been input for a property {P}”, change proposals as shown in Table 1 are presented.
For example, in the case where the specifics of an error are indicated as “the number of digits is excessive”, change proposals as shown in Table 2 are presented.
For example, in the case where the specifics of an error are indicated as “a character string has been input although the data type is a numerical value type”, change proposals as shown in Table 3 are presented.
For example, in the case where the specifics of an error are indicated as “there was a violation related to restricted characters”, change proposals as shown in Table 4 are presented. The restrictive condition is indicated as {C}. It is assumed that the example identified with the ID “1” in Table 4 below are prepared in a plurality of patterns for different types of errors.
The data type is a “class reference type” (so we call “class instance type” in PLIB data model) provides a reference (i.e., link) to another class in a dictionary. This data type is used for, for example, describing component parts. In a dictionary, (an identifier for) a class being a reference destination is defined as the data type of a property. In an instance data, we further specify a lower class of the reference destination class defined in the data type and plural pair of a property and its value. With this arrangement, when a class-reference type property is used, it is possible to refer to the instance data that belongs to the reference destination class.
For example, let us discuss a situation in which the specifics of an error are indicated as “the reference destination class {CC} that is specified in the instance data is not one of the child classes of the reference destination class {DC} that is defined in the dictionary”. As shown in Table 5 below, the change proposals identified with the IDs “1”, “2”, and “3” each provide a help so as to make it possible to write a correct reference destination class. Also, as shown in Table 5 below, the change proposal identified with the ID “4” suggests that the data pattern should be changed so that an upper class for both DC and CC that also satisfies the reference destination class {CC} specified in the instance data should be newly defined in the dictionary as a reference class. In addition, as shown in Table 5 below, the change proposal identified with the ID “5” suggests that a new property of which the reference destination class is {CC} may be defined. The change proposal s identified with the IDs “1”, “2”, and “3” refer to the processes that are performed in the case where the dictionary editor has judged that {CC} is an error, whereas the change proposals identified with the IDs “4” and “5” refer to the processes that are performed in the case where the dictionary editor has judged that the dictionary should be improved in such a manner that the description of {CC} is also acceptable. In Table 5 below, the notation “subclasses {C1}” denotes all the classes that are positioned below the class C1. The notation “superclass {C1, C2}” denotes the class that is positioned the lowest among the classes that are positioned above both C1 and C2.
The data error detecting unit 51 detects an error out of an instance data that has been input by the instance data editor via the Instance data editor terminal 32 according to the dictionary stored in the dictionary DB 41, based on the restrictions (e.g., the data type, the keys, and the requirements) that are defined in the dictionary. The specifics of the error are returned to the instance data editor via the Instance data editor terminal 32. The instance data editor corrects the instance data properly on the Instance data editor terminal 32. In
In addition, the data error detecting unit 51 records the specifics of the error that has been detected into the error DB 42. In
As for errors of the same type corresponding to mutually different properties, an arrangement is acceptable in which a search is conducted in the process record DB 46 so that the error is brought into correspondence with a process that was selected last time or a process that has been selected the largest number of times.
Next, let us discuss a situation in detail in which an input error has occurred while the data type is an enumeration type, as shown in
At a predetermined point in time, the dictionary change proposal presenting unit 52 obtains the specifics of the error (e.g., an “enumeration value error”) that are stored in the error DB 42 and one of the dictionary change rules that have been described in the dictionary change rule DB 43 that corresponds to the specifics of the error. The dictionary change proposal presenting unit 52 creates one or more dictionary change proposals that are suitable for the actual specifics of the error and stores the created dictionary change proposals into the change proposal for dictionary DB 47. Also, the dictionary change proposal presenting unit 52 reads the dictionary change proposals from the change proposal for dictionary DB 47 and presents the dictionary change proposals onto the dictionary editor terminal 31 used by the dictionary editor. In this situation, the predetermined point in time may be one of the following: once every year, once every hour, when a new error has been stored, when an error having a high level of importance has been stored, and when the dictionary editor triggered it. To create the change proposals, it is acceptable to use the status of the dictionary elements in another dictionary included in the dictionary so as to derive relationships of synonymous terms and the like. According to the dictionary change proposals presented on the dictionary editor terminal 31, the dictionary editor chooses one of the following: to record the error into the knowledge describing instance data DB 44 as an annotation and an example of error; to change the dictionary; and not to make any change.
For example, in the dictionary change rule DB 43, the dictionary change rules that correspond to an “enumeration value error” are as shown in Table 1. Table 1 uses the dictionary data model of ISO 13584-12 as an example. The degrees of the effects that will be made on the dictionary by the changes are based on the definitions in the ISO standard. For example, the attributes (e.g., the name, synonyms, abbreviated names, the data type, and the definition of a property) that serve as definition elements of a property may vary according to the data model being used. Thus, the change proposal rules may also vary according to the data model being used. Also, the effect that will be made on the dictionary by a change in the attributes may vary according to the data model being used. Examples of the dictionary change proposals that are presented to the dictionary editor according to the change proposal rules are shown in
In the examples shown in
After that, as shown in
According to the present embodiment, the dictionary change proposals are displayed together with the degrees of effects that will be made on the dictionary. Thus, the dictionary editor is able to improve the dictionary while comparing the effects with the specifics of the errors. Generally speaking, the smaller the effect made on the dictionary is, the smaller the effect made on the existing instance data is. Thus, the dictionary editor tends to select a dictionary change proposal that has a lower degree of effect on the dictionary.
Another arrangement is acceptable in which the dictionary change proposals described above are presented in ascending order of the degree of the effect that will be made on the dictionary. When a big change such as changing the data type is made on the dictionary, it is not possible to respond to the change by upgrading the version of the dictionary element, and it is necessary to define a new dictionary element. As for dictionaries that are currently used with instance data, it is desirable to be able to keep using the existing instance data, and it is preferable to define as few new dictionary elements as possible. Accordingly, the dictionary change proposals are displayed while the degrees of the effects are determined in the following order (see
Further, in the case where a change proposal has been made for a dictionary element that is similar to another dictionary element for which a change proposal has previously been made, an arrangement is acceptable in which the dictionary change proposal presenting unit 52 conducts a search in the process record DB 46 for the process that was selected by the dictionary editor in the past and presents the dictionary change proposals while giving a higher priority to the example in the past by adding the information of the result of the search to the dictionary change proposals.
In the case where the dictionary editor has judged that there is no dictionary change proposal to be implemented among the ones that have been presented by the dictionary change proposal presenting unit 52 onto the dictionary editor terminal 31, it is acceptable for the dictionary editor to create a new dictionary change rule or a new change proposal.
When the data error detecting unit 51 has detected an error, the dictionary change proposal receiving unit 54 receives the dictionary change proposals that have been made by the instance data editor via the Instance data editor terminal 32 with respect to the detected error. Shown in
The dictionary change proposal receiving unit 54 stores the dictionary change proposal that has been received as described above into the change proposal for dictionary DB 47. At a predetermined point in time, the dictionary change proposal from the instance data editor that has been stored in the change proposal for dictionary DB 47 is presented by the dictionary change proposal presenting unit 52 onto the dictionary editor terminal 31 used by the dictionary editor. In this situation, the predetermined point in time may be one of the following: once every year, once every hour, and when a new dictionary change proposal has been received. The dictionary change proposal presented on the dictionary editor terminal 31 is processed in the same manner as the change proposal that has been made based on an error as described above.
In the case where the dictionary editor has defined a new dictionary change rule, the change proposal processing unit 53 additionally stores the newly-defined dictionary change rule into the dictionary change rule DB 43 so that the newly-defined dictionary change rule can be used when the dictionary change proposals are presented next time or later. Also, based on a dictionary change plan that has been selected or created by the dictionary editor in response to the dictionary change proposal, the change proposal processing unit 53 makes a change in the dictionary DB 41 or adds an annotation to the knowledge describing instance data DB 44 according to the dictionary version control rules. When a change is made in the dictionary DB 41, the version and the revision of the dictionary are also changed according to the version control rules defined in the dictionary-version control rule DB 45. In addition, the change proposal processing unit 53 records the error and the dictionary change proposal that has been selected or created for the error into the process record DB 46.
According to the process performed by the change proposal processing unit 53, the error knowledge detecting unit 56 obtains the knowledge that has not been put into the dictionary but has adherently been provided and is required in the instance data description, such as annotations, comments, error examples, from the knowledge describing instance data DB 44 and displays the obtained knowledge onto the Instance data editor terminal 32 when the instance data editor writes instance data, as shown in
When the dictionary has been updated at a point in time, the dictionary change notifying unit 57 notifies the instance data editor as to which one of the errors that were caused by the instance data editor in the past is no longer an error because the dictionary has been updated, based on the process record DB 46 and the error DB 42. As a result, the instance data editor is able to rewrite the instance data in such a manner that the instance data better complies with the actual circumstances.
Shown in
Because instance data editors rarely browse the instance data that they edited in the past, an arrangement is acceptable in which the instance data editor is notified, by e-mail or the like, that the dictionary has been revised and that there is a possibility that the instance datas created in the past may be corrected.
At the point in time when the instance data editor is notified of an error, he/she makes a correction in units of inputs of instance data. Thus, the same errors in one property are corrected all at once, as long as there is no misunderstanding again.
Another arrangement is also acceptable in which, as for the instance data errors that are caused by the instance data editor, the number of times the data error detecting unit 51 has detected an instance data error is tallied for each of the properties. It is effective to tally the number of times an instance data error is detected for each of the properties because it is often the case that an instance data editor creates a large number of instance data in a mechanical fashion. It is safe -to say that an error that is caused by misinterpretation of a property occurs only once. Needless to say, another arrangement is acceptable in which the number of times an instance data error is detected is stored into the error DB 42. By tallying the number of times an instance data error is detected, it is possible to conjecture that an instance data editor who has an especially high ratio of errors may have a problem in his/her capability of understanding (i.e., he/she does not have specialized knowledge). Accordingly, an arrangement is acceptable in which the dictionary change proposal presenting unit 52 puts a mark to each of the dictionary change proposals that have been made by such an instance data editor, so as to indicate that the instance data editor is a user having a high rate of causing errors while errors are presented. By indicating that the instance data editor is a user having a high rate of causing errors, it is possible to allow the dictionary editor to judge more carefully whether the errors should be reflected in the change of the dictionary.
Next, from the processes performed by the functional elements described above, the procedures in the instance data editing process and the dictionary change proposal presenting process will be explained.
First, the instance data editing process will be explained with reference to the flowchart in
In the case where the data error detecting unit 51 has judged that there is no error (step S2: No), the instance data that has been input is written (step S3), and the process is ended.
On the other hand, in the case where the data error detecting unit 51 has judged that there are one or more errors (step S2: Yes), the data error detecting unit 51 records the specifics of the errors that have been detected into the error DB 42 (step S4). The data error detecting unit 51 also returns the specifics of the errors to the instance data editor via the Instance data editor terminal 32 (step S5), and the process returns to step S1. The instance data editor refers to the specifics of the errors that have been returned and corrects the instance data properly on the Instance data editor terminal 32.
Next, the dictionary change proposal presenting process will be explained with reference to the flowchart in
As explained above, according to the present embodiment, in the case where the specifics of an error have been detected out of an instance data -that has been received by the receiving unit via the Instance data editor terminal and that does not satisfy the restrictions being applied to the instance data description and being defined in the dictionary that is meta data, the dictionary change proposals that have been created based on the specifics of the errors and suggest the changes that can be made on the dictionary are presented onto the dictionary editor terminal, based on the dictionary change rules each of which defines, in the manner of a rule, a change that can be made on the dictionary in correspondence with an error pattern that is found during the instance data description. It is therefore possible to eliminate the disparity between the instance data editor and the dictionary editor and to improve the quality of the instance data.
Additional advantages and modifications will readily occur to those skilled in the art. Therefore, the invention in its broader aspects is not limited to the specific details and representative embodiments shown and described herein. Accordingly, various modifications may be made without departing from the spirit or scope of the general inventive concept as defined by the appended claims and their equivalents.
Number | Date | Country | Kind |
---|---|---|---|
2007-249019 | Sep 2007 | JP | national |