The present application claims priority from Japanese application JP2006-104803 filed on Apr. 6, 2006, the content of which is hereby incorporated by reference into this application.
The present invention relates to a personal analysis and search system, and particularly to a system and a method for generating personal-connection information by using electronic information communication data, calculating a index from personal-connections, and presenting the personal-connections.
According to the growth of computerized communication means such as an “electronic mail” and a “weblog”, these communication histories have been saved or accumulated with electronically formed conditions. A relationship between users as principals of the communications is extracted from the electronically formed communication logs, and the relationship is processed and re-used, which has been known as techniques for improving smoothness and liveliness of the communications.
In the invention disclosed in JP-A-10-301905 (Patent document 1), communications between information processing devices are set to as a pipe, and a relationship between persons is managed as a set of pipes each having a meaning and a weight representing connections. The document has also proposed a pipe expansion method for synthesizing plural pipe information to generate new pipes. Incidentally, the pipes are managed by respective information managing devices from the viewpoint of privacy protection.
A technique has also been known for searching persons matched with a search condition by using a relationship between persons and personal profiles. In the case of the invention disclosed in JP-A-2003-216799 (Patent document 2), business activity histories are registered in a storage unit, and a path is provided to be connected between an intellectual and a support-requested person in a company by a directly or indirectly acquainted person, in response to the business activities to be requested supports.
A method has been known for quantifying a relationship between persons and analyzing networks in communications among organized constituent members. In the invention disclosed in JP-A-2005-182439 (Patent document 3), a relationship between electronic mails is checked by only information of an addresser, an addressee and an originating time contained in the electronic mail to thereby investigate the communications in the organization.
A method of controlling communications has been known in response to a type of a relationship between persons. In the invention disclosed in JP-A-11-203322 (Patent document 4), personal-connection networks are divided into private and official relationships to implement a different processing given to one another. Incidentally, a relationship attribute is registered by user.
As a further relevant art, JP03446886 (Patent document 5) discloses a personal management system and personal search method in which communication histories for individuals and others are managed as personal-connection information to effectively use with the individual personal-connection information in common.
Patent document 1: JP-A-10-301905
Patent document 2: JP-A-2003-216799
Patent document 3: JP-A-2005-182439
Patent document 4: JP-A-11-203322
Patent document 5: JP 03446886
non-patent document 1: “Basis of Social Network Analysis” by Jun Kanamitsu, Keisou Publishing Inc., Chart 6 “Centrality”, Dec. 20, 2003, vol. 1, Version 1
In Patent documents 1, 2 and 5, there is no different position for users, but disclose the same position of communications.
In Patent documents 3 and 4, a processing is implemented on the basis of an organization and an official or private relationship, but members belonged to the organization and official or private relationship are set by user in advance.
Heretofore, it has been contemplated that communications within organizations, companies, etc. are unnecessary, except for formal (official) communications relative to organized operations. However, according to the progress of knowledge society in these years, the business operations carried through only by a top-down superintendent command have been decreased, particularly in the organizations such as modern enterprises. On the contrary, it has been important for informal activities such as autonomous decision making operations, information collection activities, and study meetings regardless of a ladder-connection in the organization, by company members as constituent members etc. In the meantime, the modern enterprise has maintained formal ladder-connections in the organization from the viewpoint of efficiency in the entire organization and carried through their business activities so that duties are divided into every ladder-connection and defined as they are.
In such case of the modern enterprises, they should be required for both the formal and informal activities, but their boundary between the activities are not only clearly divided, but also unclear so that one activity is high in a formal degree, and the other is low.
However, in the case where the constituent members in the organization take action for other party, particularly, for communications, it is difficult to judge such formal degree, approach by an appropriate way, and communicate directly to other parties.
It is therefore an object of the invention to provide a quantization of an index relative to formality for judging whether, of relationships between constituent members, a relationship is strongly relevant to a business operation or not in an organization having a formal structure with ladder-connections or the like, and also provide an organization display in accordance with a quantized formal degree and approach means for new other parties.
In order to achieve the objects, according to a personal-connection analysis and search system in the invention, a word is extracted from a text used in communications between constituent members in an organization, the number of times of appearing the word is accumulated as communication flows, and the communications in the organization are made into networks as a weight equivalent to the communication flows. The communication networks can be represented by a matrix or represented by (the number of constituent members)×(the number of constituent members), as communication flows are defined as values in the matrix. The communication flows are classified into every word or every concept which is categorized on the basis of the meaning of word, when the communication flows are accumulated. Further, when a user designates a word group or a natural text as a query, the system synthesizes the communication networks which are classified to be corresponded to the designations to then output a communication network on the basis of the designation by the user.
Further, according to the personal-connection analysis and search system in the invention, a set of constituent members (communication core) having a high communication density is extracted from the communication networks. A ratio of “communication flows between two constituent members each having a ladder-connection relationship” to “a total of communication flows” in the communication core is defined as a formal degree (a core formal degree) of the communication core.
Further, according to the personal-connection analysis and search system in the invention, a ratio of “formal communication flows” to the “a total of communication flows” between the two constituent members is defined as a formal degree (a communication formal degree) of the communications between the two constituent members. The formal communication flows are defined by a product of communication flows based on respective keywords multiplied by the formal degree of the communication core belonging to the communications. The formal communication flows in the communications which does not belong to the communication core is defined by “0”.
According to the invention, the communications based on subjects designated by user are represented by a matrix as communication networks, therefore, it is possible to analyze the communication networks, and it is also possible to present a set of constituent members and constituent members who are actively communicated with each other in relation to the designated subjects, as advantages.
Further, a formal degree of the communication core is calculated, therefore, it is possible to specify informal community in the organization.
A formal degree (path formal degree) of a search path is calculated by using the communication formal degree, therefore, it is possible to present a path corresponding to a query attribute (formal or not) by user.
Other objects, features and advantages of the invention will become apparent from the following description of the embodiments of the invention taken in conjunction with the accompanying drawings.
Hereinafter, embodiments of a personal-connection analysis and search system in the invention will be described with reference to the drawings.
In step S41 of “communication-log data analysis”, information transmitted and received by respective constituted members is analyzed as a text to generate a constitute member profile and keyword network. The constituent member profile contains words and weights representing professions and interests of the constituent members, which is used for a processing of searching a knowledge possessing person for after-mentioned-personal connections. The keyword network is a communication network which is communication flows equivalent to an appearance frequency of a keyword, which will be used for after-mentioned processings of a personal-connection data generation, personal-connection analysis, and personal-connection search.
In step S42 of “personal-connection data generation”, various data generations relative to relationships between the constituent members are performed, including generation of a ladder-connection network, calculation of a core formal degree of the keyword network, calculation of a communication formal degree, etc.
In step S43 of “query reception”, a query is entered from a terminal. In step S44 of “personal-connection analysis”, the communication network synthesized the keyword network and generated on the basis of the query is analyzed so that a centrality of the communication network and a communication core are outputted. In this way, it is possible that a main person who is concerned with the communication network and a set of active constituent members in the communications are presented to the user.
In step S45 of “personal-connection search”, a constituent member who possesses the information on the basis of the query is selected (KnowWho function), and a path is outputted for use from the constituent member (or user itself in this case) designated by the user to the constituent member selected by the step S45. Therefore, it is possible that the constituent member who possesses the necessary information and the path to be directed to this constituent member are presented to the user. In step S46 of “result data transmission”, the outputs from the step S44 of “personal-connection analysis” and the step S45 of “personal-connection search” are transmitted to a terminal.
In step S51, communication-log data between the constituent members is analyzed as a text, and words are extracted from the text.
In step S52 of “count word appearance frequency”, an appearance frequency is calculated from the words extracted by the step S51.
In step S53, a value of the appearance frequency calculated by the step S52 is added to the value of word appearance frequency of the constituent member profile.
In step S54, it is confirmed whether the keyword network already exists in the respective words extracted by the step S51.
In step 55, if a judged result in the step S54 is “yes”, the value of appearance frequency calculated by the step S52 is added to a value of the communication flows between the constituent members in the keyword network.
In step S56, if the judged result in the step S54 is “no”, a keyword network of the words is generated, and the value of appearance frequency calculated by the step S52 is substituted for the value of communication flows between the constituent members.
In step S61 of “generate ladder-connection network”, a ladder-connection network is generated from ladder-connection information. The ladder connection information and ladder-connection network will be described in detail later.
In step S62 of “calculate formal degree of keyword-network cores”, a communication core is extracted from the keyword networks of every word generated by the step S61 to calculate a core formal degree. The extraction of communication core and the calculation of core formal degree will be described in detail later.
In step S63 of “calculate formal degree of communications”, a communication formal degree is calculated by using the core formal degree obtained by the step S62 of “calculate formal degree of keyword-network cores” in accordance with the definition by the following Expression 1. Incidentally, in the keyword networks, the core formal degree of communications which does not belong to the communication core is set by “0”.
The communication formal degree means an index representing a formal degree of the communications between two constituent members. Using the communication formal degree, it is possible to output an appropriate path between the constituent members in response to an attribute of the personal-connection search request by user, when the personal-connection search function is realized.
In step S71 of “extract communication cores”, a partial network (communication cores) having communications with high density is extracted from the communication networks.
In step S72 of “calculate formal degree of cores”, a formal degree (core formal degree) of communication cores is calculated in accordance with the following Expression 2,
The core formal degree is defined with the communication flows between two constitute members having a ladder-connection relationship in the communication cores, with respect to a total of the communication flows in the communication cores. The core formal degree represents a degree indicating that the communication cores are made up of communications on the basis of a formal structure in the organization. By calculating the core formal degree, it is possible to specify informal communication cores in the organization and calculate a communication formal degree between the two constituent members.
Here, in the case of extracting the communication cores by the step S71 of “extract communication cores”, various and known algorisms can be used, such as a complete graph, n-Clique, n-Clan, K-core, etc. In this embodiment, the communication cores are extracted by using the n-Clique. The clique becomes synonymous with the core, in this case.
The n-Clique is an algorism for obtaining a sub-graph capable of connecting arbitrary two nodes in a clique-in-n hop. Specifically, summation of an adjacency matrix to the nth power, in which the adjacency matrix represents presence or absence of a relationship between the two nodes, is calculated. In this way, an adjacency matrix representing whether the two constituent members can be directed with each other within n hops, is generated. With respect to the matrix, a clique is extracted by repeating exchanges of the rows and columns so that elements having a value 1 are gathered on diagonal in the matrix.
In step S81 of “synthesize keyword-network”, the keyword network for respective words, which is contained in the query received by the step S43 of “query reception”, is added to one another to generate a communication network on the basis of the designation by the user.
In step S82 of “extract communication cores” and in step S83 of “calculate formal degree of the cores”, identical processings to the step S71 of “extract communication cores” and the step S72 of “calculate formal degree of cores” are implemented by using the communication network, as an input, generated in the step S81 of “synthesize keyword-network”.
In step S84 of “calculate centrality”, a centrality for respective constituent members is calculated in the communication network generated by the step S81. The centrality means a value indicating whether each of the nodes has how many connections are concerned with other nodes in a network and how many the other nodes are intermediately transmitted through the network. In the case of the invention, a degree indicating how the respective constituent members are contributed to communications is represented by the centrality in the communication network on the basis of the keyword entered by user. By outputting the centrality, the user can discriminate main constituent members in the communication network. The centrality can be calculated by a known algorism disclosed in “Basis of Social Network Analysis” by Jun Kanamitsu, Keisou Publishing Inc., Chart 6 “Centrality”, Dec. 20, 2003, Vol. 1, Version 1. (non-patent document 1)
In step S91 of “know-who search for person with knowledge”, the constituent member profile generated by the step S41 of “communication-log data analysis” is searched by using the query received by the step S43 of “query reception”, so that a constituent member to be presumed to possess necessary knowledge with high probability is detected by user.
In step S92 of “search for intermediate path”, a path is detected as a connected path through constituent members other than the two constituent members, between the constituent member (or user itself in this case) designated by user and the knowledge possessing constituent member detected by the step S91. The search for intermediate path uses either the communication network generated by the step S81 of “synthesize keyword-network” or the communication network synthesized over the total of keywords in the keyword network. There is Warshall-Floyd method or the like to search a path for connecting between two designated constituent members by the shortest distance.
In step S93 of “calculate formal degree of the path”, a formal degree of the path detected by the step S92 is calculated in accordance with the following Expression 3.
The path formal degree is defined with an average value of the communication formal degrees in the paths. By outputting the path formal degree, the user can select an optimal path for the knowledge possessing person in response to a background which needs knowledge.
In step S94 of “calculate centrality”, a centrality is calculated in the communication network used by the step S92 of “search for intermediate path”.
In this embodiment, a plurality of feature quantity sets of relationship between the two constituent members are used such as the communication network. As a method of holding these values, there is a method of using a matrix representation. A matrix representing a social relationship between the constituent members is particularly referred to as a sociomatrix. Matrix representations of the respective networks and feature quantities will be described with reference to
The core formal degrees as important indexes are displayed every core in the communication core extracted result 165 and centrality calculated result 166.
A reference numeral 167 denotes a display of the communication networks based on the keyword extracted from the query. In the display 167, the presence and absence of the ladder-connection relationship are dividedly displayed by solid lines (a ladder-connection is present) and dotted lines (a ladder-connection is absent).
Further, when the user designates a path, the designated path is displayed emphatically by a width of edges, and a knowledge possessing person is displayed emphatically by a color of node. An example of an emphatically displayed screen is shown in
The displays for the communication networks in
Generally, according to a formal attribute indicating whether a condition of requesting information by a user is officially or privately, if knowledge possessing constituent members are selected appropriately in such a way that it is decided that a query is easily reported to the knowledge possessing constituent members from the user and easily obtained a response from the members, more effective information collection and formation for the personal-connections can be implemented. A response evaluation index (⊚: good, ◯: fair, Δ: no good) for a combination of formal attributes and path formal degrees is shown in
For example, in the case where a user sends the knowledge possessing constituent members a formal query relevant to a business operation, a good response can be expected to receive from the members if the user queries the members or query destination members by using a query destination (core) formal degree, the query destination having a high formal degree with a path formal degree, and a query path (personal connection).
On the contrary, in the case where a query of the user is low relevancy to his/her business operation or privately informal for him/her, a good response can also be expected to receive from the members if the user queries the members by using a path of the foregoing formal query destination and the query destination having a low formal degree rather than the path.
If the foregoing control is designated by the user with the formal attribute to request information, it is possible to realize automation.
Therefore, the invention is applicable to a person and personal-connection search system such as KnowWho system, and an organization management support system for support to make an organization activated.
It should be further understood by those skilled in the art that although the foregoing description has been made on embodiments of the invention, the invention is not limited thereto and various changes and modifications may be made without departing from the spirit of the invention and the scope of the appended claims.
Number | Date | Country | Kind |
---|---|---|---|
2006-104803 | Apr 2006 | JP | national |