This application is based upon and claims the benefit of priority of the prior Japanese Patent Application No. 2008-099101, filed on Apr. 7, 2008, the entire contents of which are incorporated herein by reference.
1. Field of the Invention
The present invention relates to a retrieving apparatus, a retrieving method, and a retrieval program that present results of retrieval by various conditions and in particular, relates to a retrieving apparatus, a retrieving method, and a retrieval program that present results of retrieval by desired conditions in an easy-to-understand, intuitive screen.
2. Description of the Related Art
Conventionally, retrieval services receive input of a search keyword and, retrieve and present web pages that include the search keyword. The web pages may be presented classified into hierarchical categories. New retrieval services have appeared such as meta search engines that execute cross-searches using plural search engines (for example, refer to Japanese Patent Application Laid-Open Publication No. 2002-351916).
Conventional technologies, however, fail to provide a screen capable of appropriately presenting the retrieved results to a user. For example, when the results are displayed in a hierarchical form, manual operation is required to switch the hierarchy by which the results are displayed. If search conditions are changed during the course of a search, the search cannot be executed to reflect cumulatively the results obtained. At the same time, with respect to the display screen presenting the retrieved results, a mere display of the results according to search condition requires switching between search result displays and cannot provide a desirable and appropriate presentation to the user. Thus, a problem arises in that despite the high performance of a search engine, desired information may not be found among enormous amount of information retrieved.
It is an object of the present invention to at least solve the above problems in the conventional technologies.
A retrieving apparatus according to one aspect of the invention includes an input unit that receives plural search conditions of different categories; plural search units that are provided respectively for each of the categories and execute searches based on the search conditions; a processing unit that, based on the search conditions received by the input unit, causes a search unit provided for a corresponding category to execute a search and outputs search results obtained; and a display control unit that causes the search results output from the processing unit to be displayed by a display screen having a layout corresponding to the category. The processing unit, through an operation of the display screen depicting the search results, further receives search conditions of another category and narrows down the search conditions.
A retrieving method according to another aspect of the invention includes receiving plural search conditions of different categories; searching with respect to each of the categories and based on the search conditions; processing to, based on the search conditions received at the receiving, cause searching with respect to a corresponding category and output search results obtained; and controlling to cause the search results output at the processing to be displayed by a display screen having a layout corresponding to the category. The processing includes processing, through an operation of the display screen depicting the search results, to further receive search conditions of another category and narrow down the search conditions.
A computer-readable recording medium according to still another aspect of the present invention stores therein a retrieval program that causes a computer to execute the retrieving method according to claim 15.
The other objects, features, and advantages of the present invention are specifically set forth in or will become apparent from the following detailed description of the invention when read in conjunction with the accompanying drawings.
Referring to the accompanying drawings, exemplary embodiments according to the present invention are explained in detail below.
According to exemplary embodiments, information display is proposed that can narrow down information by taking various search conditions (keys) as axes and combining plural axes to intuitively and simply present search results to a user.
The retrieving server 130 acquires from the web server 122, the contents of web pages made available by the web server 122; analyses the contents; generates index information; and registers the information in a database. The user terminal 124 accesses the retrieving server 130 and makes a query indicating search conditions. The retrieving server 130 searches the database, retrieves content matching the search conditions indicated by the user, and presents to the user terminal 124, a list of URLs of the content matching the search conditions. By clicking a link to the desired content among the list, the user may browse the desired content.
The retrieving server 130 provides a retrieval service capable of searching based on different categories (referred to as axes) such as “word (phrase) search”, “map” “time”, and “person” as keys. The retrieving server 130 has a function of clustering the results retrieved. Search engines respectively provide an intuitive and easy-to-operate search UI according to characteristics of the information used as the key. For example, a search engine that executes searches with “time” as a key provides a UI such as a calendar and a time slider for receiving designation of time. A search engine that executes searches with a “map” as a key provides a UI of a map, etc.
The retrieving server 130 provides an integrated UI that combines these search engines and narrows down the information, thereby enabling complex information acquisition such as obtaining the results of a search by “map” further narrowed down by “time”. Further, it becomes possible to make a comparative study from various aspects while maintaining the results of various search engines, to efficiently support the user in his knowledge discovery, and to greatly enhance usability.
A hardware configuration of the retrieving apparatus according to an embodiment of the present invention will be described.
The CPU 201 governs overall control of the retrieving apparatus 130. The ROM 202 stores therein programs such as a retrieval program. The retrieval program stored in the ROM 202 is read and executed under the control of the CPU 201. The RAM 203 is used as a work area of the CPU 201.
The HDD 204, under the control of the CPU 201, controls the reading and writing of data with respect to the HD 205. The HD 205 stores therein the data written under control of the HDD 204. The data stored in the HD 205 is read out under the control of the HDD 204.
The FDD 206, under the control of the CPU 201, controls the reading and writing of data with respect to the FD 207. The FD 207 stores therein the data written under control of the FDD 206, the data being read out under the control of the FDD 206.
The CD-RWD 208, under the control of the CPU 201, controls the reading and writing of data with respect to the CD-RW 209. The CD-RW 209 is a removable recording medium storing data written under the control of the CD-RWD 208. Data stored on the CD-RW 209 is read out under the control of the CD-RWD 208.
The display 210, under the control of the CPU 201, displays a cursor, menus, windows, or various types of data such as documents, images, etc. The keyboard 211 is an input device equipped with keys for the input of characters, numerals, and various instructions, and data is entered through the keyboard 211. The mouse 212 performs cursor movement, range selection, and movement, size change, etc., of a window. The mouse 212 uses a touch panel for receiving input of stroke by a finger or a pen at the time of hand-written input. The mouse 212 may use a tablet composed of the display 210 and a transparent touch panel provided over the display 210.
The network I/F 213 connects with the network 120 such as a LAN and a WAN by way of the communication cable 214 and functions as an interface between the network 120 and the CPU 201.
The retrieving server 130 according to the present embodiment is equipped with the above hardware and executes retrieval by executing the retrieval program stored on the ROM 202, under the control of the CPU 201. A function of the retrieval program is to present retrieved results to the user on an easy-to-understand display screen.
Functional configuration will be described of the retrieving server 130.
As depicted in
The content acquiring unit 332 accesses the web server 122 by way of the network 120 and acquires the contents of a web page stored by the web server 122. When the retrieving server 130 retrieves the contents stored in a hard disk, etc. thereof in a stand-alone environment, the content acquiring unit 332 accesses a storage device connected to the apparatus and acquires the contents stored therein. The content analyzing unit 334 analyzes the contents acquired by the content acquiring unit 332 and extracts information serving as keys for a search. The registering unit 336 registers index information of the contents extracted by the content analyzing unit 334 to the index information database 338.
The content analyzing unit 334, using a named entity extraction technique, extracts named entities included in the web page, for example, information concerning date, location name, designation of an object, telephone number, personal name, organization name, etc.; indexes the named entities; and registers the named entities in the index information database 338. The content analyzing unit 334 may use a morphological analysis technique to disassemble the text information in the contents into morphemes and extract the named entities from nouns among the morphemes, by referring to a dictionary, etc. stored in a dictionary storage unit 335. At this time, a thesaurus including synonyms, antonyms, etc., or an ontology dictionary, etc. having information representing conceptual classifications may be referenced.
The content analyzing unit 334, when extracting time information from the contents, may acquire information indicative of preparation date/time or update date/time of the contents and supplement the time information. For example, when only the date is stated and the year is missing, preparation year of the contents may be supplemented. The content analyzing unit 334 may specify the extent of the validity of the information concerning a time, an address, etc., by analyzing the structure, etc. of the contents. For example, when, a website of a company includes a web page that outlines the company or the like, location information on the web page is considered to represent the address of the company and therefore, configuration may be such that the web contents under such domain will be regarded collectively and the location of the company will be correlated as the address information thereof.
In a web page including a list of restaurants, correspondence between the name and the address of the restaurants may be analyzed based on the layout of a table, etc. When the web page of a private individual includes a web page of a collection of links, the content analyzing unit 334 may extract and correlate relationships between personal names. The contents are systematized from various aspects by such techniques.
As described, by indexing the extracted information, a search may be realized that uses various types of information as keys. The related URL column 386 stores the URL of related content. Related content may be, for example, content that is referenced such as content to which the particular contents are linked or content that sets the particular contents as a destination of link or may be content highly similar to the particular contents. Content relationships may be built according to folder hierarchy, etc. in the web server 122 or as described above, according to relationships between persons. The index information database 338 may further store keywords included in the contents, preparation date/time and update date/time of the contents and may further store such information as a time range (start time, finish time, and start time to finish time).
The description of
A word retrieving unit 341 searches each column (URL column 380 to related URL column 386) of the index information database 338, using an arbitrary word (phrase) as a key. The word retrieving unit 341 has a function of clustering the retrieved results and, for example, presents the retrieved results, separating phrases of the same classification from phrases of other classifications.
An address/location name information retrieving unit 342 searches the index information database 338, using information such as an address, a location name, and longitude/latitude as a key. The address/location name information retrieving unit 342 provides a map-based screen as the UI, and on a map, maps the content that include the location name or location information, thereby enabling the user to reach the information from map browsing. The address/location name information retrieving unit 342 has a function of clustering content related to the address/location name and, for example, presents the content classified by area.
A time information retrieving unit 343 searches the index information database 338, using time information as a key. The time information retrieving unit 343 displays a screen based on a time axis such as a calendar and a time slider as the UI and maps on the calendar, the time slider, etc., content that includes the time information, so that clicking on any of such mapped content may cause the display on the screen to jump to a corresponding page. The time information retrieving unit 343 further presents the time-related content clustered, for example, by date.
A personal name information retrieving unit 344 searches the index information database 338, using personal name information as a key. The personal name information retrieving unit 344 presents content related to a personal name clustered, for example, by name, where individuals having the same family name and the same first name are classified together. If a person is designated, the personal name information retrieving unit 344 retrieves information so that detailed information on the person, for example, his personal history and latest information, may be known extensively. The personal name information retrieving unit 344 may further present personal relationships.
As described, since each search engine provides an intuitive, easy-to-operate UI, usability may be enhanced.
The integrated retrieving unit 350 includes an input unit 351 that receives the search conditions for each of the search engines from the user terminal 124 and a display control unit 353 that provides the user terminal 124 with a user interface (display screen) that integrates plural user interfaces for a presentation of the retrieved results. A processing unit 352 notifies the search engines of their respective search conditions received from the input unit 351 and causes the engines to execute searches.
Upon acquiring search results from the search engines, the integrated retrieving unit 350 presents the search results, which satisfy the search conditions received from the user; thereby making it possible to easily perform narrowing-down searches using plural keys taken as axes.
An example will be described of the extraction of content with respect to the database depicted in
The axis window 510 displays, in vertical direction, a phrase (word) search condition setting unit 501, a map search condition setting unit 502, a time search condition setting unit 503, and a people (organization name) search condition setting unit 504. The phrase search condition setting unit 501 is provided with an item input unit 511 for inputting an arbitrary phrase and an icon 521 for selecting the phrase search. The map search condition setting unit 502 is provided with an item input unit 512 for inputting an address or a location name and an icon 522 for selecting the map search. The time search condition setting unit 503 is provided with an item input unit 513 for inputting a date, a period, or a time and an icon 523 for selecting the time search. The people search condition setting unit 504 is provided with an item input unit 514 for inputting a personal name or an organization name and an icon 524 for selecting the people search.
Through an input of search phrases to the item input units 511 to 514 and manipulation of a search button 517, a search is executed based on the input phrases. At the time of this search, the processing unit 352 causes the word retrieving unit 341, the address/location name information retrieving unit 342, the time information retrieving unit 343, and the personal name information retrieving unit 344 corresponding to the respective search condition setting units 501 to 504 to execute a search using the input phrases as keys. A search phrase may be input to one or more of the item input units 511 to 514 and further, a search may be executed by a combination of the phrases. Further, an input of plural phrases to each of the item input units 511 to 514 and a search by a combination of the phrases may be executed.
In the vicinity of the search button 517, a setting slider 517 is provided for setting the number of divisions at the time of clustering to be described later. The setting slider 517 is set at 3-divisions in the illustrated example and 2- to 6-divisions are possible by moving the setting slider 518.
The example depicted in
Encircled numerals, 1, 2, . . . , displayed on the map information 602 correspond to the encircled numerals of the websites displayed in the search result display window 530. For example, an encircled numeral 2 indicates an address or the name of a place included in the website of the “Great Dictionary of Everything” displayed as the second item in the search result display window 530 and the display control unit 353 displays the location on the map information 602 corresponding to such address or location name, marked by the encircled numeral 2. Configuration is such that when the cursor is moved to the encircled numeral 2, the corresponding location name “Ise” is displayed by a pop-up indicator 603, thereby making it possible to identify in more detail the place of the encircled numeral 2 displayed on the map information 602 and aid in the display of the search results.
According to such map information 602, the address or location name information included in the search results for “udon” may be arranged at the corresponding location on the map information 602 and the search results may be displayed at appropriate locations on the map. The pop-up indicator 603, besides displaying the location name in the case of the map information 602, may display a date or time on a time-axis display to be described later.
Specifically, a page group is divided into three parts by the frequency vector of [word|map|time|people] and a circle graph is displayed by the following processing: (1) divide the page group of the search results into three clusters; (2) each cluster is colored by a respective color; (3) when a certain phrase is focused on, the phrase is included in one or more pages, thus the phrase belongs to one or more clusters; (4) display a circle graph indicating in which cluster and at what frequency the phrase is included. Consequently, in the example depicted, the display control unit 353 displays on the display screen 500, a circle graph for “udon” indicating ¼ included in cluster 1 and ¾ in cluster 2.
In the state of
Thus, “(Mr.) Murayama” 811 has personal relationships with “(Mr.) Atsushi” 812 and the “Noodle Expansion Laboratory” 813. The personal name information retrieving unit 344 recognizes “(Mr.) Murayama” 811 and “(Mr.) Atsushi” 812 as individuals and “Noodle Expansion Laboratory” 813 as an organization and the display control unit 353 displays these names with their respectively corresponding icons attached thereto at the vertexes.
As described, through a shift of a vertex via the mouse, etc., the classification that could not be discerned from
In this example, the display control unit 353 records the display screen depicted in
While the screen switching by the computer may be completed in 0.0001 sec, 0.1 sec or more is needed for human visual recognition. Therefore, by giving 0.1 sec or more to smoothly switch the screen, recognition load is alleviated. As described, when search results are changed from one axis to another, the capability of switching, with continuity, a display screen to the subsequent display screen, does not bore the operator and may alleviate recognition load, unlike an instantaneous switching of the display screen.
Such processing may prevent the user from paying attention to the display of the main screen 520 and make the user conscious of the operation with respect to the axis window 510 or the search result display window 530, thereby aiding user operation. As described, through the processing of defocusing the display in the area for displaying the search results (the processing being based on location of the cursor) in switching between viewing the information of the search results and not viewing the search results, the screen may be modulated to improve visual recognition. Further, cursor position may be determined.
Thereafter, after the elapse of a predetermined time, for example, 1 sec, the display control unit 353 performs the image processing to put the title and address display in a focused (clearly visible) state. At this time, a detailed information display screen 712 of the seventh item “udon recipe . . . ” is displayed in such manner that the seventh item “udon recipe . . . ” is drawn from the search result display window 530 to the main screen 520. The detailed information display screen 712 is provided with tags including screen shot, summary, phrase, address/location name, time/hour, people/organization, etc. The screen shot is a graphical version of the corresponding website and represents the results of the search by the word retrieving unit 341, the address/location name information retrieving unit 342, the time information retrieving unit 343, and the personal name information retrieving unit 344. In the search result display window 530, through a selection operation (mouse click) at the location of, for example, the seventh item “udon recipe . . . ”, the corresponding website is displayed as a separate screen.
As described, by focusing and defocusing an item selected in the search result display window 530 (information of corresponding web page), the selected item may be made conspicuous and even if the cursor 550 is positioned on the text. There is no occurrence of losing sight of the location of the selected item and visual recognition may be improved. Even if the cursor is located on a character string of a search result item and is displayed as the vertical line “|” making it difficult to distinguish the cursor from the surrounding text, what is changed is not the display state of the cursor but the display state of the search result item on which the cursor is positioned and therefore, the selected item is easily distinguishable.
Consequently, as depicted in
The example described using
According to the above configuration, since, in performing the narrowing-down search, there is no need for inputting phrases in the item input units 511 to 514 provided in the search condition setting unit of respective axes and the narrowing-down phrases may be set by merely making selection such as designating a range with respect to the information displayed on the main screen 520, a narrowing-down search may be performed easily. Conversely, with respect to the clearing of the narrowing-down conditions as well, since the clearing may be performed by each axis, the trouble of clearing the search conditions for all axes and setting them again may be saved and the clearing of the narrowing-down conditions as well may be performed easily.
(Period Display with Time Axis)
According to the retrieval by the time information retrieving unit 343, the search may be performed with time information such as year, date, and time of web pages as a key. The processing unit 352 further consolidates and arranges the acquired time information according to a predetermined period and the display control unit 353 may display such retrieval results using a period axis.
Among the scales 910 to 914, the scale of the selected time axis is displayed at the uppermost position of the lower part of the screen. In the example depicted in
As described previously, if the cursor 923 is located on a given the search result on the main screen 520, “6.9” is displayed by a pop-up indicator 924. Similarly, the indicator 924 may display what kind of phrase included in the website is construed as the date/time. In the example depicted, “6.9” is construed as “June 9th” and displayed.
Although previously described, if the cursor 926 is located on the above search result on the main screen 520, “11:00 22:00” is displayed by a pop-up indicator 927 and a more detailed time frame (opening time is 11:00 and closing time is 22:00) of the information on which the cursor is located may be displayed.
If only the opening time is obtained from the search result, a long, narrow strip may be displayed thickly only at the opening time and progressively less thickly along the time-axis. For example, a bar 928 indicates a website carrying description of “Noodle Spot Yu@Uguisudani” with the opening time described as 11:30, where the closing time is unknown. Correspondingly, even if there is a description of only the opening time, the bar 928 may be displayed accordingly. Conversely, when there is a description of only a closing time, configuration is such that a long, narrow strip is displayed more and more thickly along the time-axis.
As described, according to the embodiments, an addition of operations to the display screen displaying search results enables the search results to be narrowed down easily, thereby eliminating the need for operations concerning plural search conditions involving the input of phrases and enabling the narrowing-down operation to be easily performed.
In particular, since the results of searches based on search conditions of different categories are displayed respectively in a suitable display mode, the search results may be grasped intuitively and easily. In the display mode in which the search results are displayed, the retrieval results by another category may also be displayed; thus, the search results based on various search conditions may easily be obtained without switching the display screen and organization of the information may be supported, without causing disorder to the information obtained as the search results.
The retrieving method explained in the present embodiment can be implemented by a computer, such as a personal computer and a workstation, executing a program that is prepared in advance. The program is recorded on a computer-readable recording medium such as a hard disk, a flexible disk, a CD-ROM, an MO, and a DVD, and is executed by being read out from the recording medium by a computer. The program can be a transmission medium that can be distributed through a network such as the Internet.
The present invention enables the narrowing-down, etc. of search results using plural search conditions and intuitive, easy-to-understand display of the search results even for complex searches by search conditions of different categories.
Although the invention has been described with respect to a specific embodiment for a complete and clear disclosure, the appended claims are not to be thus limited but are to be construed as embodying all modifications and alternative constructions that may occur to one skilled in the art which fairly fall within the basic teaching herein set forth.
Number | Date | Country | Kind |
---|---|---|---|
2008-099101 | Apr 2008 | JP | national |