1) Field of the Invention
The present invention relates to a server, method, program, and terminal device for searching item data that store item data corresponding to items of periodicals such as newspaper and magazines.
2) Description of the Related Art
Item searching services to perform search of items of newspapers and magazines through the internet have been known. Conventionally, when a user sends a request, with a keyword, for a search for searching an item to an item search server, the item search server searches through the memory unit using the keyword, sorting all the hit item data by, for example, date, and provides a predetermined number of newest item data. In many cases, the predetermined number is set to about several tens cases that can be displayed in a window (page) on a display of the user.
All the hit data are sorted by, for example, date because new items of newspapers and magazines are constantly generated day by day and it-is highly likely that a number of items that cannot be displayed on the display hit.
One example of the conventional technology to sort all the hit data is sorting all the cases that are hit in the database in the order of higher similarity as disclosed in Japanese Patent Application Laid-Open Publication No. H09-73464. This technology is widely adopted as disclosed in various patent documents other than Japanese Patent Application Laid-Open Publication No. H09-73464.
However, the process of sorting-the entire hit item data and narrowing down the data to be provided by limiting to some newest item data from among the hit item data requires much time for the sorting, with the result that efficient searching cannot be performed.
Among the various processes the item search server carries out, such as session establishing, searching, and sorting, the most influenced by the number of item data is the sorting. The more the number of item data searched in the memory unit, the more the time required for sorting the item data and hence the more the delay in the sorting, leading to a decrease in the efficiency of the search. The processes other than the sorting do not take much time even if the number of the item data is large.
It is an object of the present invention to at least solve the problems in the conventional technology.
According to an aspect of the present invention a server for searching item data includes a memory unit that stores item data corresponding to items of periodicals; a searching unit that searches a plurality of item data in the memory unit when a search request requesting a search for item data in with designation of a keyword corresponding to the item data is received from a terminal device, and if number of hit item data exceeds a predetermined number, re-searches the hit item data after shortening a search period; a sorting unit that sorts the hit item data or re-hit item data by date; and an outputting unit that outputs data sorted by the sorting unit to the terminal device.
According to another aspect of the present invention a method for searching item data includes searching a plurality of item data in a memory unit when a search request requesting a search for item data in with designation of a keyword corresponding to the item data is received from a terminal device, and if number of hit item data exceeds a predetermined number, re-searches the hit item data after shortening a search period; sorting the hit item data or re-hit item data by date; and outputting data sorted by the sorting unit to the terminal device.
According to still another aspect of the present invention a computer-readable recording medium stores therein a computer program that causes a computer to the above method according to the present invention.
The other objects, features, and advantages of the present invention are specifically set forth in or will become apparent from the following detailed description of the invention when read in conjunction with the accompanying drawings.
Exemplary embodiments of a server, a method, program, and a terminal device for searching item data according to the present invention are explained in detail below with reference to the accompanying drawings. An embodiment of the present invention relates to an item data search system that operates on a World Wide Web (hereafter, WWW), which is one of distributed systems on the internet and the case in which indices of the items of newspapers and magazines are provided is explained.
First, the configuration of the item data search system according to the present embodiment is explained.
The terminal devices 10a to 10c are installed with a WEB browser capable of having access to a hypertext transport protocol (HTTP) server and can have access to various server devices on the Internet, for example, via internet service providers. More particularly, the terminal devices 10a to 10c have a function to request a search to the WEB server 12 for searching an index data of an item on which the keyword is designated. The WEB server 12 conducts a search for a plurality of index data using the keyword and when the number of the searched index data exceeds a predetermined number, conducts again a search for the item data after shortening the search period. The terminal devices 10a to 10c have a function to receive results of the search conducted by the WEB server 12 and display the result of the search. These functions are realized by a web browser.
The WEB server 12 is an HTTP server that is accessed by the various terminal devices 10a to 10c on the internet and provides a web service such as home page. The WEB server 12 provides not only a general web service but also stores index data of newspapers and magazines in the memory unit 123 and when the request for search is received from the terminal devices 10a to 10c with designation of the keyword is received, conducts a search for the item data corresponding to the keyword in the memory unit 123 and outputs the results of search to the terminal devices 10a to 10c.
More particularly, the WEB server 12 does not sort by date all the index data obtained by the search conducted using the keyword as is done in the conventional technology. Instead, the WEB server 12 conducts a search again after shortening the search period to reduce the number of indices when the number of index data searched exceeds a predetermined number, and then sorts the index data by date, thereby reducing the load of the sorting.
As shown in
The inputting unit 121 is an inputting device, for example, a keyboard or a mouse. The displaying unit 122 is a display device such as a liquid crystal panel or display. The memory unit 123 is a memory device such as a hard disk device. The memory unit 123 stores hypertext 123a and index data 123b. The portion that stores the index data 123b forms a database for the items.
The hypertext 123a is a hypertext for forming a page screen image that is distributed to the terminal devices 10a to 10c. The index data 123b are index data of items of newspapers and magazines. The index data 123b are obtained on-line from the servers of newspaper publishers to which subscription has been made in advance or off-line using a medium such as a compact disk read only memory (CD-ROM).
The control unit 124 controls the whole system of the WEB server 12 that provides index data of items and includes a server processing unit 124a, an index data searching unit 124b, and a sorting unit 124c.
The server processing unit 124a provides the function of an HTTP server to the terminal devices 10a to 10c and more particularly, reads the hypertext 123a stored in the memory unit 123 when accessed by the terminal devices 10a to 10c and performs, for example, transmission process to transmit the read hypertext 123a to the corresponding terminal devices 10a to 10c.
The index data searching unit 124 conducts a search for a plurality of index data 123b in the memory unit 123 using the keyword and conducts a search again after shortening the search period when the number of searched index data exceeds the predetermined number. The reason that a reduction in the number of index data is performed by such a re-searching is that the greater the number of the index data the more delayed the sorting of the searched data by the sorting unit 124c.
More particularly, for example, a search period of two years is set in advance and when a request for search without designation of the search period is received from the terminal devices 10a to 10c, a search using the keyword is performed for a search period of 2 years. If the number of the hit index data exceeds the predetermined number, the date for starting the search and the date for ending the search are shifted based on the number of days in the search period (2 years=730 days), the number of index data that can be displayed on the terminal devices 10a to 10c, and the number of hit index data to restrict the search period, and then a search is conducted again. If the number of the hit index data by this re-search exceeds the predetermined number, a re-search is conducted after shortening the search period in a similar manner. When a request for a search with designation of the search period is received and if the number of the hit index data exceeds the predetermined number, a re-search is conducted after shortening the search period in a similar manner.
The sorting unit 124c sorts by date the hit index data searched by the index data searching unit 124b. Since the index data searching unit 124b controls the number of the index data, the sorting unit 124c need not sort a large number of the index data. The sorted index data 123b are output as a hypertext to the terminal devices 10a to 10c through the server processing unit 124a.
The process procedure performed between the terminal device 10a and the WEB server 12 is explained in detail.
When a WEB browser is booted on the terminal device 10a and access is made to the WEB server 12 through the internet 11 (step S201), the WEB browser 12 prepares a main menu (not shown) (step S202), and displays the main menu on the terminal device 10a.
When a search for index is selected on the main menu (step S203), the selection is notified to the WEB server 12 and the WEB server 12 provides an index search page (step S204). The index search page is displayed on the terminal device 10a.
When a user inputs a keyword and a search period on the index search page 300 to instruct start of the index search (step S205), the searching unit 124b of the WEB server 12 carries out an index search (step S206). When the user inputs no search period, the index search is carried out by setting a predetermined period (for example, two years) as the search period. If the number of the hit index exceeds a predetermined number, a re-search is carried out by shortening the search period.
Display data is prepared based on the results of the index search (step S206), the prepared display data is transmitted to the terminal device 10a (step S208), and the display data is displayed on the terminal device 10a as the result of the search.
The process procedure for carrying out the index search shown in the Step S206 in
The index data searching unit 124b of the WEB server 12 is first initialized to set a variable, n, that indicates the times of search to 0 (zero) (step S401), prepares an initial search formula based on the keywords designated by the terminal devices 10a to 10c (step S402), and conducts a search to search the index data 123b in the memory unit 123 using the initial search formula (step S403).
The index data searching unit 124b examines whether the number of hits of the index data (number of index data) is less than a predetermined number (step S404). If the number of hits of the index data is less than the predetermined number (step S404, yes), the sorting unit 124c sorts the hit index data 123b by date (step S405).
When the number of hits exceeds the predetermined number (step S404, No), the sorting unit 124c examines whether the variable, n, is less than a predetermined times (step S406). If the number of hits exceeds the predetermined times (step S406, No), the sorting unit 124c sorts the index data 123 hit using the initial search formula by date (step S405). That is, in this case, there is a possibility that too much time is required for restriction due to repeated process, so that the repetition is stopped after a predetermined number of times of repetition and the sorting is carried out on the results of initial search.
If the variable, n, is less than the predetermined times (step S406, Yes), the variable, n, is incremented (step S407), and a new search formula in which the search period is shortened is prepared (step S408), and system control is passed to step S403.
The process procedure for preparing the new search formula shown in step S408 in
The index data searching unit 124b of the WEB server 12 calculates the number of days from the start of the search to the end of the search (step S501), and calculates a shortening rate, x (step S502).
The index data searching unit 124b then calculates a product (d×x) of the number of days, d, and the shortening rate, x, as a number of shift days (step S503), and sets a date obtained by shifting the previous search start date by the number of shift days as new search start date (step S504). The index data searching unit 124b prepares a restricted search formula in which the search period is shortened using the new search start date (step S505).
In this manner, the number of index data can be reduced to be less than the predetermined number before the sorting by the sorting unit 124c is carried out. This reduces the time required for the sorting and hence a delay in the processes.
A specific algorithm used when the present invention is carried out using a program is explained. The following assumptions are made. “X” designates a coefficient (1 or more) relating to a restricted maximum number of hits; “Y” designates a coefficient (1 or more) relating to restriction of the search period; “pos” designates a display start position; “max” designates a number of displayed items; “start” designates a search start date; “end” designates a search end date; “keyquery” designates a search formula disregarding the search period; “n” designates a number of times of restricted search; “query(n)” designates a search formula when an n-th time search is conducted; “hits(n)” designates a number of hits when a search is conducted using the query(n); “start(n)” designates a search start date when an n-th time restricted search is conducted; and “end(n)” designates a search end date when an n-th time restricted search ends.
When an initial search formula is prepared,
start(0)=start,
end(0)=end,
query(0)=keyquery AND “item date>start(0)” AND “item date<end(0)”.
When a new search formula is prepared by shifting the search start date,
date obtained by subtracting {(a difference between end(n−1) and start(n−1)×(max+pos)/hits(n−1)×Y} from start (n)=end(n−1)
end(n)=end(n−1)
if start>start(n), then start(n)=start
if end<end(n−1), then end(n)=end
If end<end(n), then end(n)=end
query (n) keyquery AND “item date>start(n)” AND “item date<end(n)”.
When a new search formula in which restriction is performed by shifting the search end date,
start(n)−start(n−1)
date obtained by adding {(a difference between end(n−1) and start(n−1)×(max+pos)/hits(n−1)×Y} to end (n)=start(n−1)
If start>start(n), then start(n)=start
If end<end(n), then end(n)=end
query(n)=keyquery AND “item date>start(n)” AND “item date<end(n)”.
A specific example of judgment of number of hits when the above-mentioned formula is used and date of re-search is explained in more detail. Assumptions are made as follows. The search start date is Jan. 1, 2001 and the search end date is Jan. 1, 2003. The initial number of hits is 5,000. The tuning parameter X for judging number of hits is set 20 folds and the tuning parameter Y for shifting the number of days is 5 folds. When the period is restricted, the search end date is shifted.
Judging number of hits,
(max+pos)×X=(20+0)×20=400
is obtained. Since 400<5,000, a period shortening process is carried out.
Since the number of days, d, from the search start date to the search end date=730 days,
d×x=730×(730×(20/5,000)=3 days.
Therefore, the search end date is after (d×x)×Y=3×5=15 days from Jan. 1, 2003, i.e., Dec. 18, 2001. Accordingly, when a re-search is carried out, the re-search is carried out for a shortened search period of from Jan. 1, 2001 to Dec. 18, 2002.
When a request for a search with designating a keyword and a search period on the terminal devices 10a to 10c is made, the index data searching unit 124b of the WEB server 12 searches a plurality of index data 123b in the memory unit 123 using the keyword. When the number of the hit index data exceeds the predetermined number, a re-search is conducted after shortening the search period and the sorting unit 124c sorts by date the index data 123b hit by the index data searching unit 124b. Accordingly, the delay due to sorting when index data of items in newspapers and magazines and so on can be reduced, thereby enabling an efficient search of the item data.
In the above, the present invention is applied to the case where WWW on the Internet is used. However, the present invention is not limited to this and may be realized by using an exclusive use network.
Moreover, in the above, the case where index data of newspapers and magazines are target for search is explained. However, the present invention is not limited to this and may be applied to the case where the item data themselves are searched.
Furthermore, in the above, the case where repeated restrictions are performed by re-searching is explained. However, the present invention is not limited to this and may be applied to the case where a research is conducted with broadening the search period when the number of hits is small.
According to the present invention, a plurality of item data are searched in the memory unit using a keyword and when the number of the hit index data exceeds a predetermined number, a re-search of the item data is conducted after shortening the search period, and the hit or re-hit item data are sorted by date. Accordingly, the delay in the processes due to the sorting when the item data of newspapers, magazines and so on are hit can be reduced, thereby enabling an efficient search of the item data.
According to the present invention, when a search request containing no search period in search conditions is received from a terminal device, item data having corresponding keywords are searched in the item data having date within a predetermined period, and if a number of the hit item data exceeds a predetermined number, a re-search of the item data is performed after shortening the predetermined period. Accordingly, when no search period is designated, a re-search is performed with appropriately shortening the search period and the delay in the processes that occurs when searching item data of newspapers, magazines and so on due to sorting can be reduced.
According to the present invention, when a search request containing a search period in search conditions is received from a terminal device, item data having corresponding keywords are searched in the item data having date within the search period, and if a number of the hit item data exceeds a predetermined number, a re-search of the item data is performed after shortening the search period. Accordingly, the re-search is performed with appropriately shortening the search period designated by the user and the delay in the processes that occurs when searching item data of newspapers, magazines and so on due to sorting can be reduced.
According to the present invention, when the number of re-hit item data exceeds a predetermined number, re-search is repeated until the number of hit item- data is less than the predetermined number after shortening the search period. Accordingly, the number of item data can be limited to maximum the predetermined number, so that the sorting can be accelerated.
According to the present invention, when the number of the hit item data exceeds a predetermined number, the search period is shortened by shifting the search start date based on the number of days of the search period, the number of the item data that can be displayed on the terminal device, and the number of the item data. Accordingly, the search start date can be flexibly and properly changed depending on the number of days of the search period, the number of the item data that can be displayed on the terminal device, and the number of the item data.
According to the present invention, when the number of the hit item data exceeds a predetermined number, the search period is shortened by shifting the search end date based on the number of days of the search period, the number of the item data that can be displayed on the terminal device, and the number of the item data. Accordingly, the search end date can be flexibly and properly changed depending on the number of days of the search period, the number of the item data that can be displayed on the terminal device, and the number of the item data.
According to the present invention, a search request with designation of a keyword is made to the item data search server, which in response to the search request, searches a plurality of item data using the keyword, and when the number of the hit item data exceeds a predetermined number, re-searches the item data after shortening the search period, and results of the search are received and controlled for display by the terminal device. Accordingly, an appropriate number of the hit item data can be displayed on the terminal device.
Although the invention has been described with respect to a specific embodiment for a complete and clear disclosure, the appended claims are not to be thus limited but are to be construed as embodying all modifications and alternative constructions that may occur to one skilled in the art that-fairly fall within the basic teaching herein set forth.
Number | Date | Country | |
---|---|---|---|
Parent | PCT/JP03/02997 | Mar 2003 | US |
Child | 11104624 | Apr 2005 | US |