This application is based on and claims priority under 35 USC 119 from Japanese Patent Application No. 2010-204785 filed Sep. 13, 2010.
(i) Technical Field
The present invention relates to a program and a method for collection of information, and an information collection support system.
(ii) Related Art
On information communication networks such as the Internet and a corporate intranet, information is newly created every day and distributed without limitation or within a predetermined range by using various systems and services, such as a document management system, various databases like a research paper database, an electronic bulletin board, a blog (Web log), and an image- or movie-sharing service.
A typical approach is to select, from the enormous amount of information on the network, information that matches the preferences and interests of a user, and to provide that information to the user. Tools and systems that automatically collect information relevant to the user and provide the information to the user have been proposed.
According to an exemplary embodiment of the invention, a computer readable medium stores a program causing a computer to execute a process for collection of information, the process including: receiving a division condition of a value range of an attribute from a user; dividing the value range of the attribute into a plurality of ranges under the received division condition; selecting writers who have an attribute value corresponding to the respective divided ranges of the attribute by referencing a writer information storage unit that stores values of the attribute of the writers; collecting information that is posted by the selected writers from a posted information storage device that stores information posted by the writers; and providing the collected information to the user.
Exemplary embodiment(s) of the present invention will be described in detail based on the following figures, wherein:
Collecting information that meets the interests of a user is a natural and necessary approach. However, if information is repeatedly collected in this way, the collected information may become limited to a certain aspect, and the user's view may be narrowed. Accordingly, this exemplary embodiment proposes a system that intentionally collects multifaceted information and provides the information to a user.
Referring to
In
An information collection and management device 200 is a computer that is capable of communicating with the information terminal 100 through an information communication network, such as the Internet or a corporate intranetwork (for example, a local area network). The information collection and management device 200 manages information collecting activities for the information terminal 100 (user). The information collection and management device 200 includes functional modules including a work style information storage unit 202, a work style information collection unit 204, and a multifaceted information collection unit 210.
The work style information storage unit 202 is a functional module that stores work style information of users who are registered in the system according to this exemplary embodiment. In this exemplary embodiment, the users who serve as supply sources of the information provided to the information terminal 100 (that is, users who post information) are selected based on attributes corresponding to the work styles (ways of working) of the respective users.
Among the attribute items of the exemplary work style information illustrated in
In this example, the user is an employee of a corporation that uses the system. The user ID may be, for example, an employee ID that is used in the corporation. The user logs in to various information systems (e.g., a document management system 308 and the SNS system 310, which will be described later) on the corporate network by using the employee ID of the user. Not all information processing systems on the network have to use the employee IDs as the user IDs. However, it is assumed that the user IDs in each information processing system may be converted into the employee IDs by a directory server (not shown) or the like. Each employee logs in to each system, such as the document management system 308 or the SNS system 310 provided on the network, and browses and posts documents or other information. Each system stores the document posted by the logged-in user in association with the user ID, and records, as log information, each action performed by the user in the system, such as browsing or posting, together with the object of the action (information such as a document). It is to be noted that since the employee of the corporation is a user of the system according to this exemplary embodiment, hereinafter, the employee is simply called a user unless the employee has to be particularly distinguished from a mere user. When the user represents a person who posts information, the user may occasionally be called a writer.
The information that is stored in the work style information storage unit 202 is collected by the work style information collection unit 204 from various information sources provided on the network. For example, the information sources include a work time management system 302, a scheduling system 304, a welfare service system 306, the document management system 308, and the social networking service (SNS) system 310.
The work time management system 302 manages information of an attendance state of each user (for example, information of used vacation days, start and end times of work, and a commute time). The scheduling system 304 manages information of a work schedule of each user (for example, information of a schedule of business trips and meetings). The welfare service system 306 manages a use status of welfare services of each user (for example, information of attending private trips). The document management system 308 saves an electronic document (for example, a document file created with a word processor) posted by each user (in this case, the user is a writer because the user creates a document and posts it as information), and distributes the saved document to each user in accordance with access control information if it is set. The SNS system 310 provides a social networking service. For example, the functions of the SNS system 310 include a profile publishing function, a user searching function, a blog posting function, a community creating function, and an information sharing function in the community. The document management system 308 and the SNS system 310 are examples of systems that serve as fields where users share the information. In addition to the aforementioned systems, a system for distribution of information, such as an electronic bulletin board system, may be provided. The document management system 308 and the SNS system 310 may permit an access from a non-employee user who is outside the corporation but has special permission.
Among the attribute items of the exemplary work style information illustrated in
The multifaceted information collection unit 210 provides a collecting service of multifaceted information for the user through the information terminal 100. The “multifaceted information” mentioned here is information that is as varied as possible with respect to the characteristics of its writers. In this exemplary embodiment, users (writers) are selected from multiple users (writers) as subjects of a collection of posted information, by focusing on the attributes of the users (writers). This selection is made based on a policy that, within a limited number of persons, persons whose attribute values differ from one another as much as possible are selected. Information posted by the selected users is collected through an information centralization system 320.
The multifaceted information collection unit 210 is functionally divided into a condition designation reception unit 212, a collection subject selection unit 214, and a collection request unit 216. The condition designation reception unit 212 receives designation of information collection conditions from the information terminal 100. The collection subject selection unit 214 selects various writers from records of users stored in the work style information storage unit 202, in accordance with the information collection conditions. The collection request unit 216 requests the information centralization system 320 to collect information posted by the selected writers, and provides the information collected in accordance with the request to the information terminal 100.
The information centralization system 320 collects the information posted by the users and requested by the collection request unit 216 from information sources, such as the document management system 308 and the SNS system 310, through the network, and provides the collected information to the collection request unit 216.
The overview of the system has been described above. Now, the details will be described below.
Referring to
The condition relating to the number of information pieces to be collected corresponds to two items on the right side in
The condition that ensures variety is designated as a division number by which the range of attribute values in each attribute item of users, such as the “commute time” and “the number of vacation days,” is divided. To be more specific, in this exemplary embodiment, the entire range of values that are expected to be set in an attribute item is divided into plural ranges (sections) that do not overlap each other, and users whose attribute values fall in the respective divided ranges are selected as information sources. Accordingly, the variety of writers is ensured.
For example, regarding the “commute time,” if the division number is set to three by a slide bar 406, the distribution range of commute time of users is divided into three, and information sources are selected so as to include at least one person from each of the three divided ranges. In addition, regarding “the number of business trips,” if the division number is set to five by a slide bar 408, the distribution range of the number of business trips of users is divided into five, and information sources are selected so as to include at least one person from each of the five divided ranges. Accordingly, in this example, at least three persons are selected in the aspect of the “commute time,” at least five persons are selected in the aspect of “the number of business trips,” and hence at least eight persons in total are selected as information sources.
It is to be noted that there may be an attribute item for which a division number is not designated. In this case, no writer is selected in the aspect of such an attribute item. For example, the system may determine that a division number is not designated when the slide bar 406 indicates “zero.”
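The division of an attribute's value range can be sketched in Python as follows. This is a minimal sketch that assumes equal-width ranges over a known minimum and maximum; the embodiment does not state how the range boundaries are chosen, and the function names and sample commute times are hypothetical.

```python
from typing import Dict, List, Tuple


def divide_range(low: float, high: float, division_number: int) -> List[Tuple[float, float]]:
    """Split [low, high] into equal-width, non-overlapping ranges (assumed scheme)."""
    if division_number < 1:
        raise ValueError("division_number must be at least 1")
    if high <= low:
        raise ValueError("high must be greater than low")
    width = (high - low) / division_number
    return [(low + i * width, low + (i + 1) * width) for i in range(division_number)]


def range_index(value: float, low: float, high: float, division_number: int) -> int:
    """Return the index of the divided range that contains value.

    Values outside [low, high] are clamped to the first or last range,
    and the upper edge `high` is counted in the last range.
    """
    width = (high - low) / division_number
    index = int((value - low) / width)
    return min(max(index, 0), division_number - 1)


def group_by_range(attribute_values: Dict[str, float], low: float, high: float,
                   division_number: int) -> List[List[str]]:
    """Group user IDs by the divided range their attribute value falls in."""
    groups: List[List[str]] = [[] for _ in range(division_number)]
    for user_id, value in attribute_values.items():
        groups[range_index(value, low, high, division_number)].append(user_id)
    return groups


# Hypothetical commute times in minutes, divided into three ranges.
commute_minutes = {"u1": 10, "u2": 45, "u3": 80, "u4": 30}
groups = group_by_range(commute_minutes, low=0, high=90, division_number=3)
```

With this sample data, `groups` is `[["u1"], ["u2", "u4"], ["u3"]]`, so at least one candidate writer is available in each of the three divided ranges.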
Here, since at least one writer is selected from each divided range, the sum of the division numbers set for the respective attribute items has to be the “upper limit number of selecting persons” or smaller. The range selectable by the slide bar of each attribute item may be limited so that the sum of the division numbers becomes the “upper limit number of selecting persons” or smaller.
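The constraint on the sum of the division numbers, and the corresponding limit on each slide bar, might be checked as in the following Python sketch. The function names and attribute keys are hypothetical and are not taken from the embodiment.

```python
from typing import Dict


def division_numbers_are_valid(division_numbers: Dict[str, int], upper_limit_persons: int) -> bool:
    """True if at least one writer per divided range fits within the upper limit."""
    return sum(division_numbers.values()) <= upper_limit_persons


def max_division_number(attribute: str, division_numbers: Dict[str, int],
                        upper_limit_persons: int) -> int:
    """Largest division number the slide bar for `attribute` may offer,
    given the division numbers already set for the other attribute items."""
    used_by_others = sum(n for name, n in division_numbers.items() if name != attribute)
    return max(upper_limit_persons - used_by_others, 0)


# Example from the text: three ranges for commute time, five for business trips.
settings = {"commute_time": 3, "business_trips": 5}
```

For these settings, an upper limit of eight persons is valid and an upper limit of seven is not. With an upper limit of ten, the slide bar for the commute time could be allowed to go up to five.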
The writers are selected such that the numbers of selecting persons become as even as possible among the divided ranges. To be more specific, if the total number of divided ranges is three and the upper limit number of selecting persons is six, two persons are selected from each of the three divided ranges. If the upper limit number of selecting persons is not evenly divisible by the total number of divided ranges, the same number of persons may be selected from each of the ranges within the upper limit, and the remainder may be left unused. Alternatively, the same number of persons may be selected from each of the ranges within the upper limit, and the remainder may be allocated one each to some of the ranges. An operator of this system may choose either of these methods.
Similarly, the same number of information pieces is collected from each of the selected writers. If the upper limit number of collecting information pieces is not evenly divisible by the number of selected writers, the same number of pieces may be collected from each of the writers, and the remainder may be left unused. Alternatively, the same number of pieces may be collected from each of the writers, and the remainder may be allocated one each to some of the writers. The operator of this system may choose either of these methods.
The window for designating collection conditions may be provided by a collection condition designation program that is installed in the information terminal 100 (if the condition storage unit 110 is included in the information terminal 100), or may be provided by the condition storage unit 110, for example, in the form of a Web page.
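The even allocation described for selecting persons and for collecting information pieces can be sketched with a single Python helper. This is a minimal sketch: the function name is hypothetical, and giving the remainder to the first slots is an assumption, since the text only says that the remainder goes to some of the ranges or writers.

```python
from typing import List


def allocate_evenly(upper_limit: int, slots: int, distribute_remainder: bool = False) -> List[int]:
    """Allocate `upper_limit` items across `slots` as evenly as possible.

    With distribute_remainder=False the remainder is left unused.
    With distribute_remainder=True the remainder is handed out one each,
    here to the first slots (an assumed choice).
    """
    if slots < 1:
        raise ValueError("slots must be at least 1")
    base, remainder = divmod(upper_limit, slots)
    counts = [base] * slots
    if distribute_remainder:
        for i in range(remainder):
            counts[i] += 1
    return counts
```

For example, `allocate_evenly(6, 3)` gives `[2, 2, 2]`, matching the case of three divided ranges and an upper limit of six persons. `allocate_evenly(7, 3)` leaves one unused and gives `[2, 2, 2]`, while `allocate_evenly(7, 3, distribute_remainder=True)` gives `[3, 2, 2]`.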
Now, a flow of designating the collection conditions will be described with reference to
Now, a flow of collection of information under information collection conditions set by a user according to this exemplary embodiment will be described with reference to
In the multifaceted information collection unit 210, the condition designation reception unit 212 receives the request including the information collection conditions. The collection subject selection unit 214 requests the work style information storage unit 202 to select writers, who are the collection subjects of information, in accordance with the upper limit number of selecting persons and the division numbers for the ranges of the respective attribute items included in the information collection conditions (S38). With this selection request, the entire range of values that are expected to be set in each attribute item is divided into the number of ranges (division number) determined in the information collection condition of that attribute item, and a request is made to the work style information storage unit 202 to obtain users having attribute values that belong to the respective divided ranges. The work style information storage unit 202 that has received the request selects writers, for example randomly, from among the users having the attribute values that belong to the respective divided ranges, the number of writers per range being obtained by allocating the upper limit number of selecting persons evenly to the respective divided ranges (S40). User IDs of the selected users are returned to the collection subject selection unit 214 (S42). The collection subject selection unit 214 gives a list of the selected user IDs to the collection request unit 216.
The collection request unit 216 gives the received list of the user IDs and the upper limit number of collecting information pieces determined in the information collection conditions to the information centralization system 320, and requests collection of information (S44). The information centralization system 320 obtains the number of information pieces to collect per person by allocating the upper limit number of collecting information pieces evenly to the respective users in the received list, and searches for and acquires that number of information pieces per person from the information posted in the document management system 308 and the SNS system 310 by the users (S46). In this search, if there are plural information source devices to which information is posted (for example, the document management system 308 and the SNS system 310), information pieces may be collected as evenly as possible from the respective information sources. In the information sources, information pieces that have been posted at various times from the past to the present usually coexist. Hence, in the search processing in S46, the information to be provided may be searched for among recently posted information. For example, the determined number of information pieces may be retrieved in order from the latest information posted by the selected writers. Also, the last request time of the user who requests the collection may be recorded in, for example, the multifaceted information collection unit 210, and the determined number of information pieces may be retrieved in S46 from the information posted after the last request time. Various other methods are conceivable.
When the determined number of information pieces posted by each user (writer) who is a collection subject has been collected in this way, the information centralization system 320 returns the IDs and additional information (if provided) of the information to the collection request unit 216 (S48). Herein, a unique ID (identification information) may be assigned to each piece of information posted to various information sources, such as the document management system 308 and the SNS system 310, and the ID may be used as an information ID. Alternatively, a combination of identification information (for example, a domain name) that specifies a device of each information source and a storage location (for example, a path name) in the device, or a uniform resource locator (URL) of each piece of information, may serve as an information ID. Also, examples of the additional information include attribute information (creator name, date and time of creation, file size, summary, etc.) of the collected information.
The collection request unit 216 returns the list of the information IDs and the additional information, which is the collection result acquired from the information centralization system 320, to the information terminal 100 that originally made the request (S50). The list may be provided in the form of a Web page in which, for example, links to the respective pieces of information and the additional information are listed. The information terminal 100 displays the received list on a window (S52). The user selects information that the user wants to browse from the list, and browses the information (S54). After browsing, the user may return to S34 to check whether the information has been updated since the last collection of information, and may instruct the system to collect information again (S56). If the user performs an operation for ending browsing on the displayed window of the tool in S56, the series of processing is finished.
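The random selection of writers from each divided range (S40) might look like the following Python sketch. The function name, the grouped input, and the rule that a user already chosen for one range is not chosen again for another are assumptions made for illustration; the embodiment does not say how a user who falls in ranges of several attribute items is handled.

```python
import random
from typing import List, Optional, Sequence


def select_writers(groups: Sequence[Sequence[str]], counts: Sequence[int],
                   rng: Optional[random.Random] = None) -> List[str]:
    """Randomly pick up to counts[i] user IDs from groups[i].

    `groups` holds, per divided range, the user IDs whose attribute value
    falls in that range. A user is selected at most once (assumed rule).
    """
    rng = rng or random.Random()
    selected: List[str] = []
    for members, count in zip(groups, counts):
        candidates = [user_id for user_id in members if user_id not in selected]
        k = min(count, len(candidates))  # a range may hold fewer users than requested
        selected.extend(rng.sample(candidates, k))
    return selected


# Hypothetical groups for three divided ranges, one writer per range.
writers = select_writers([["u1"], ["u2", "u4"], ["u3"]], [1, 1, 1], rng=random.Random(0))
```

Passing a seeded `random.Random` makes a run reproducible for testing. Without a seed, the selection differs from run to run, which fits the later remark that the selected persons may be changed each time information is collected.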
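The search in S46 can be illustrated by a short Python sketch that takes the latest pieces per writer, optionally restricted to pieces posted after the last request time. The `PostedItem` record, its field names, and the in-memory list standing in for the information sources are all hypothetical.

```python
from dataclasses import dataclass
from datetime import datetime
from typing import List, Optional, Sequence


@dataclass
class PostedItem:
    info_id: str         # information ID
    writer_id: str       # user ID of the writer
    posted_at: datetime  # date and time of posting


def collect_recent(items: Sequence[PostedItem], writer_ids: Sequence[str],
                   per_writer: int, since: Optional[datetime] = None) -> List[PostedItem]:
    """Collect up to `per_writer` latest items for each selected writer.

    If `since` is given, only items posted after that time are considered.
    """
    collected: List[PostedItem] = []
    for writer_id in writer_ids:
        own = [item for item in items
               if item.writer_id == writer_id
               and (since is None or item.posted_at > since)]
        own.sort(key=lambda item: item.posted_at, reverse=True)  # latest first
        collected.extend(own[:per_writer])
    return collected


# Hypothetical posted information.
posted = [
    PostedItem("doc-1", "u1", datetime(2010, 9, 1)),
    PostedItem("doc-2", "u1", datetime(2010, 9, 5)),
    PostedItem("doc-3", "u1", datetime(2010, 9, 9)),
    PostedItem("blog-1", "u2", datetime(2010, 9, 3)),
]
latest = collect_recent(posted, ["u1", "u2"], per_writer=2)
```

Here `latest` holds `doc-3` and `doc-2` for writer `u1` and `blog-1` for writer `u2`. Passing `since=datetime(2010, 9, 4)` would instead return only `doc-3` and `doc-2`, because `blog-1` was posted before that time.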
In the above example, the list of the collected information IDs is provided to the information terminal 100. Alternatively, collected information itself may be provided to the information terminal 100 instead of the list.
Also, in the above example, the multifaceted information collection unit 210 performs the processing for the collection of information in response to the instruction for the collection of information from the user. However, this is merely an example. Alternatively, the information collection and management device 200 may register information collection conditions of respective users, periodically collect information in accordance with the conditions, and distribute the information to the information terminal 100.
Also, in this exemplary embodiment, the persons selected from the user groups corresponding to the ranges of each attribute item divided in accordance with the information collection conditions may be changed every time information is collected, or may be randomly changed at a certain time interval. Accordingly, the variety of information to be collected is further increased.
Also, the information terminal 100 or the information collection and management device 200 may periodically recommend that the user change or review the information collection conditions. The recommendation may be made on a mobile tool, for example, by displaying a message recommending that the user change the settings of the information collection conditions on a window of the multifaceted information collection tool. If information is collected for a long period under the same conditions, the user who browses the information may get used to the way of collection, and the collection may become mere routine. To avoid this, the user may be periodically recommended to change the conditions.
Also, although the user may intend at the beginning to collect and browse many pieces of information, as the user actually operates this system, the user may frequently have difficulty browsing many pieces of information for lack of time. If such a state is left unaddressed, the user may feel burdened because the user's goal is not being attained. This may drive the user away from the cycle of collecting and browsing information. Therefore, the information collection and management device 200 or the information terminal 100 may monitor the amount of information that is actually browsed by the user, from among the information of the collection result provided by the information collection and management device 200 to the information terminal 100. If the ratio of the amount of information actually browsed by the user to the amount of collected information is a predetermined ratio or smaller, the information collection and management device 200 or the information terminal 100 may automatically suggest that the user decrease the upper limit value of the number of collecting information pieces. Alternatively, a request for an information entity corresponding to an information ID may be transmitted from the information terminal 100 to the information collection and management device 200, and the information collection and management device 200 may acquire the information corresponding to the information ID from the device of the information source and provide the information, so that the information collection and management device 200 recognizes the amount of information actually browsed by the user.
If the monitoring shows that the ratio of the amount of information actually browsed by the user to the information in the list of the collection result is smaller than the predetermined ratio, a monitoring tool installed in the information collection and management device 200 or the information terminal 100 may transmit, to the information terminal 100, a message recommending that the user lower the upper limit value of the number of collecting information pieces.
The information terminal 100 and the information collection and management device 200 exemplified above are provided, for example, by causing a general-purpose computer to execute a program that represents the processing of the aforementioned respective function modules. The computer has, for example, a circuit configuration including, as hardware, a microprocessor such as a central processing unit (CPU), memories (primary memories) such as a random-access memory (RAM) and a read-only memory (ROM), an HDD controller that controls a hard disk drive (HDD), various input/output (I/O) interfaces, and a network interface that performs control for connection with a network such as a local area network. These units are connected through, for example, a bus. Also, a disk drive that reads and/or writes data from and/or on a portable disk storage medium, such as a compact disc (CD) or a digital versatile disc (DVD), or a memory reader/writer that reads and/or writes data from and/or on a portable non-volatile storage medium of any of various standards, such as a flash memory, may be connected to the bus through, for example, an I/O interface. The program in which the processing contents of the function modules exemplified above are written is saved in a fixed storage device, such as the hard disk drive, through the storage medium, such as the CD or DVD, or through a communication channel, such as a network, and is installed in the computer. The program stored in the fixed storage device is read into the RAM, and is executed by the microprocessor such as the CPU. Accordingly, the group of function modules exemplified above is provided.
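The browsing-ratio check can be sketched in Python as follows. The function name and the threshold value are hypothetical; the comparison uses "the predetermined ratio or smaller," one of the two wordings that appear in the text, and treating an empty collection result as needing no suggestion is an assumption.

```python
def should_suggest_lower_limit(browsed_count: int, collected_count: int,
                               threshold_ratio: float) -> bool:
    """True if the share of collected information actually browsed
    is the predetermined ratio or smaller."""
    if collected_count == 0:
        # Nothing was collected, so there is no ratio to evaluate (assumed behavior).
        return False
    return browsed_count / collected_count <= threshold_ratio


# Hypothetical counts: 2 of 10 collected pieces were browsed, threshold 30%.
suggest = should_suggest_lower_limit(browsed_count=2, collected_count=10, threshold_ratio=0.3)
```

In this example `suggest` is `True`, so the monitoring tool would send the message recommending a lower upper limit. With 5 of 10 pieces browsed, the function returns `False`.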
The foregoing description of the exemplary embodiments of the present invention has been provided for the purposes of illustration and description. It is not intended to be exhaustive or to limit the invention to the precise forms disclosed. Obviously, many modifications and variations will be apparent to practitioners skilled in the art. The embodiments were chosen and described in order to best explain the principles of the invention and its practical applications, thereby enabling others skilled in the art to understand the invention for various embodiments and with the various modifications as are suited to the particular use contemplated. It is intended that the scope of the invention be defined by the following claims and their equivalents.
Number | Date | Country | Kind |
---|---|---|---|
2010-204785 | Sep 2010 | JP | national |