This application claims priority based on a Japanese patent application, No. 2009-154544 filed on Jun. 30, 2009, the entire contents of which are incorporated herein by reference.
The present invention relates to a server or a relay apparatus disposed in a communication path between a terminal and a server in a network system for communicating data between the server and the terminal such as WWW (World Wide Web).
Mobile phones are now in widespread use due to the convenience thereof, and parents often buy and provide them for their children. Many mobile phones of these days have Web browsing capability, which enables the browsing of various Web sites on the Internet and the use of contents through mobile phones. However, there exist on the Internet many Web sites containing information harmful to children's education and security such as sites showing pornography and dating websites. Further, it is difficult to monitor and restrict Web site browsing from mobile terminals such as mobile phones.
For this reason, there is required means for controlling access to sites on the Internet from mobile phones and other terminals to restrict the browsing of specific Web sites.
For example, according to a method described in Japanese Patent Application Laid-Open No. 2003-50758, identification information is provided to a terminal, an access permission level is established, and a management site for storing the level of the terminal and a Uniform Resource Locator (hereinafter referred to as URL), subject to access restriction, corresponding to the level is provided beforehand on the Internet, so that when the terminal accesses a site on the Internet, browsing is restricted based on the URL of the site and the level of the terminal.
In many cases, information (called a link) for referring to a file such as another document or image or another Web page is inserted into a Web page, and displayed as a link display in a selectable manner. In Web site browsing, a user selects a link display in a Web page, thereby to move to a referred-to (linked) Web page. However, in Japanese Patent Application Laid-Open No. 2003-50758, a browsing restriction determination is made on an access-requested URL; therefore, whether the linked Web page is subject to browsing restriction cannot be determined until the user actually selects the link display.
Therefore, it is not until the user selects the link display to browse the Web page and then receives a response of browsing restriction that the user recognizes that the linked Web page is under browsing restriction. This leads to interruption of Web page browsing, which impairs the user's convenience. Further, despite unintended access, it may be taken as an attempt of unauthorized access, leading to disadvantage for the user.
Disclosed is a Web page display method of determining whether or not a linked URL contained in a requested Web page is subject to browsing restriction in accordance with the attribute (e.g., level) of a requesting terminal or a requester before the display of the Web page and displaying the Web page along with a determination result on the terminal.
According to the Web page display method, it is determined whether a linked URL contained in a Web page is subject to browsing restriction, and the Web page to which determination result information is added is displayed on the terminal.
The determination result information may be, for example, information for changing the color of a link display subject to browsing restriction. By the display based on the determination result information, the user can recognize which link is under browsing restriction before selecting the link.
Further, whether a URL is subject to browsing restriction may be determined by providing and querying a site (URL management site) for managing a database for storing URLs subject to browsing restriction or URLs not subject to browsing restriction. The queried URL management site searches a URL information database to check whether the queried URL is subject to browsing restriction or is not subject to browsing restriction (this processing is referred to as look-up processing), and sends a result.
The above processing may be performed by a Web page relay apparatus provided on a network between the terminal and a Web server.
Some Web pages contain quite a few links, which may place a large load on the URL management site. Further, the load of string retrieval for extracting links in a Web page may become excessive.
To reduce the load on the URL management site, by the relay apparatus, information about linked URLs contained in each Web page may be created for each Web page, stored in a memory, and used to determine whether a linked URL is subject to browsing restriction. This information is referred to as link information, and indicates, for example, the in-page location of each link contained in a Web page and the attribute (e.g., level) of browsing restriction on each link.
If link information corresponding to a Web page requested by the terminal exists in the memory, the relay apparatus can determine, based on the link information about the page stored in the memory and the attribute of the requesting terminal, whether the linked URL is subject to browsing restriction, without querying the URL management site. This can reduce the load on the URL management site.
The Web page display method includes a transmission step of transmitting a Web page request to the server; a reception step of receiving the Web page from the server; an extraction step of extracting a linked URL contained in the received Web page; a determination step of determining, based on a browsing restriction attribute of a linked Web page indicated by the extracted linked URL and terminal attribute information contained in the request, whether or not the linked Web page is subject to browsing restriction; a creation step of creating a determined Web page with a determination result reflected in a link display corresponding to the linked URL in the received Web page; and a display step of displaying the determined Web page on the terminal, as a response to the Web page request.
Further, a relay apparatus may be provided on a network between the terminal and the Web server, and perform the reception step, the extraction step, the determination step, and the creation step, and the terminal may perform the transmission step and the display step.
Further, the relay apparatus may perform a link information creation step of creating link information indicating a location of the linked URL in the Web page from which the linked URL is extracted in the extraction step and the browsing restriction attribute of the linked Web page, and a link information storage step of storing the link information in association with an identifier for identifying the Web page from which the linked URL is extracted.
Further, the relay apparatus may include a communication relay unit for relaying a Web page request from the terminal to the server and receiving the Web page from the server; a link extraction unit for extracting a linked URL contained in the Web page received from the server; a URL information look-up unit for querying a URL information database for managing a browsing restriction attribute of a Web page, as to a browsing restriction attribute of a linked Web page indicated by the extracted linked URL; a link information creation unit for creating link information indicating a location of the linked URL in the received Web page and the browsing restriction attribute of the linked Web page indicated by the linked URL based on the browsing restriction attribute acquired by the URL information look-up unit and the received Web page; a terminal information look-up unit for extracting terminal identification information contained in the Web page request received from the terminal and querying a terminal information database for managing a combination of terminal identification information and terminal attribute information, as to a terminal attribute corresponding to the terminal identification information; and a browsing restriction determination unit for determining, based on the terminal attribute information acquired by the terminal information look-up unit and the link information, whether the linked Web page indicated in the received Web page is subject to browsing restriction on the terminal.
Further, the browsing restriction determination unit may have a content change unit for creating a determined Web page with a change in an attribute of a link display corresponding to a linked URL determined to be subject to browsing restriction, and the communication relay unit may transmit to the terminal the determined Web page created by the content change unit instead of the Web page acquired from the server.
According to the teaching herein, it is possible to inform the user beforehand whether links contained in a Web page are subject to browsing restriction, without placing a high load on the URL information database.
These and other benefits are described throughout the present specification. A further understanding of the nature and advantages of the invention may be realized by reference to the remaining portions of the specification and the attached drawings.
Hereinafter, embodiments of the present invention will be described with reference to the accompanying drawings.
In the following embodiments, a relay apparatus disposed on a network has a browsing restriction determination function; however, a Web server may have the browsing restriction determination function.
Further, in the following embodiments, one relay apparatus on the network has the browsing restriction determination function; however, the processing units and storage units of the browsing restriction determination function may be separated to physically different devices coupled via the network.
As shown in
In the relay apparatus 100 shown in
A terminal information DB 606 is a storage unit for storing a pair of terminal identification information (e.g., terminal serial number on a mobile phone) registered beforehand and a terminal attribute such as a level.
A terminal information look-up unit 106 extracts terminal identification information (e.g., contained in a User-Agent header in HTTP) contained in a request from the terminal 10 and queries the terminal information DB 606, using the identification information as a key, thereby to acquire a level corresponding to the identification information registered beforehand.
A link extraction unit 104 extracts the URL of a link (e.g., anchor tag in HTML) to another Web page contained in a Web page acquired from the Web server 20 by the communication relay unit 102. The URL is a symbol sequence for specifying the location of an information resource, and is expressed, for example, in the form of “http://host name/path name” in HTTP.
A URL information DB 608 is a database (referred to as DB) for storing a pair of the URL of a Web page registered beforehand and the browsing restriction level of the Web page.
A URL information look-up unit 108 queries the URL information DB 608 as to the browsing restriction attribute (e.g., level at which access is restricted) of a Web page indicated by a URL extracted by the link extraction unit 104, and acquires the level. In the case where a plurality of extracted URLs are equal completely or partially such as in a domain name, the URL information look-up unit 108 may have a query function representatively using one of them.
A look-up control unit 109 determines whether the URL information look-up unit 108 queries the URL information DB 608 to control a query.
Based on a Web page acquired from the Web server 20 by the communication relay unit 102 and the level of a linked URL in the Web page acquired by the URL information look-up unit 108, a link information creation unit 110 creates link information describing the level of the link in the Web page. Further, the link information creation unit 110 caches the created link information into a link information DB 610.
A browsing restriction determination unit 112 determines, based on the link information cached by the link information creation unit 110 and the level of a requesting terminal acquired by the terminal information look-up unit 106, whether the link is subject to browsing restriction.
The content change unit 114 changes, deletes, or adds information as to the display of a link subject to browsing restriction determined by the browsing restriction determination unit 112, thus creating a determined Web page. Such a change is, for example, the change of an attribute such as the color of the link and/or the link background by adding a style attribute to an anchor element in HTML, or the addition of a unique attribute that the terminal 10 can interpret.
The relay apparatus 100 may include an access extraction unit 116 to restrict browsing when the terminal 10 requests direct access to a Web page subject to browsing restriction without following a link from a Web page. When the communication relay unit 102 receives a Web page acquisition request from the terminal 10, the access extraction unit 116 extracts the URL of the requested Web page. For example, in HTTP, the URL of the requested Web page is contained in a request line. The URL information look-up unit 108 acquires the level of the extracted URL, and the terminal information look-up unit 106 acquires the level of the requesting terminal. If the browsing restriction determination unit 112 determines from these pieces of information that the Web page is subject to browsing restriction, the communication relay unit 102 sends an error response or a prepared specific page in response to the request from the terminal 10.
The above method of restricting the browsing of an access-requested Web page without following a link from a Web page can be achieved by a known technique.
The program may be stored beforehand in the memory device 502 or the disk device 504 in the relay apparatus 100, or may be installed from a removable storage medium that the relay apparatus 100 can use or from another device through a communication medium (a network, or a carrier wave or a digital signal that propagates through a network) when needed.
Further, each processing described below is implemented when the processor 501 reads and executes a program stored in the disk device 504.
First, the communication relay unit 102 of the relay apparatus 100 receives a Web page acquisition request from the terminal 10 (S102), and acquires the Web page from the Web server 20 based on the request (S104). Then, the terminal information look-up unit 106 acquires terminal identification information (e.g., contained in a User-Agent header) which the request has (S106), and acquires a level corresponding to the identification information from the terminal information DB 606 (S108). If the terminal information look-up unit 106 fails to acquire the identification information (NO in S106), or fails to perform look-up processing (NO in S108) because the identification information is not registered in the terminal information DB 606, the communication relay unit 102 sends the acquired Web page to the terminal as it is (S120). Further, the steps from S106 to S108 may be executed concurrently without waiting for a response from the Web server.
Then, the look-up control unit 109 determines whether the link information DB 610 having the page identification information DB 610-2 equal to identification information (e.g., the value of an Etag header contained in a response from the Web server) about the acquired Web page exists in a cache (S110). If the link information DB 610 having the page identification information DB 610-2 does not exist in the cache (NO in S110), the link extraction unit 104 extracts all linked URLs contained in the Web page, and the URL information look-up unit 108 acquires from the URL information DB 608 the browsing restriction level 608-L of each extracted URL (S112). If the link information DB 610 exists in the cache (YES in S110), the look-up control unit 109 suppresses the extraction of linked URLs and queries as to the URLs, and uses the link information DB 610 existing in the cache in the subsequent steps (from S116).
If NO in S110, the link information creation unit 110 creates and caches the link information DB 610, using the URL of the Web page, the identification information about the Web page, and the level 608-L of each URL in the Web page (S114).
Then, based on the terminal level 606-L and the link information DB 610, the browsing restriction determination unit 112 determines which URL is subject to browsing restriction (S116). For example, the browsing restriction determination unit 112 determines that a URL located in a location path 610-3 having a level 610-4 greater than the terminal level 606-L is subject to browsing restriction. In response thereto, the content change unit 114 adds, modifies, or deletes information as to the link of the URL subject to browsing restriction in the Web page (S118). Lastly, the communication relay unit 102 sends the Web page changed by the content change unit 114 to the terminal 10 (S120).
Thus, according to this embodiment, in response to a Web page acquisition request from the terminal, the relay apparatus can send to the terminal a Web page to which information about whether links in the Web page are subject to browsing restriction is added, without placing a high load on the URL information database.
In the second embodiment, a relay apparatus creates Web page identification information, so that even if a response from the Web server does not contain page identification information, a browsing restriction determination function can be provided.
An operational flow according to this embodiment will be described. When the communication relay unit 102 acquires a Web page from the Web server 20 (S104 in the first embodiment), the page identification information creation unit 120 inputs the Web page data to a hash function (e.g., Message Digest 5), thereby obtaining the data of page identification information. In this embodiment, instead of S114 in the first embodiment, the link information creation unit 110 creates the link information DB 610, using the page identification information created by the page identification information creation unit 120. In this embodiment, in S118 in the first embodiment, if a link does not exist at a location indicated by the location path 610-3 of the link information DB 610, for example, because the link information creation unit 110 queries an incorrect link information DB 610 due to page identification information overlap caused by hash value collision, it is determined that the link information is incorrect, and the steps from S112 are executed. The other operations are the same as in
Thus, according to this embodiment, even if a response from the Web server does not contain page identification information, the relay apparatus can send to the terminal a Web page to which information about whether links in the Web page are subject to browsing restriction is added, without placing a high load on the URL information database.
In the third embodiment, a category to which a linked Web page belongs is used instead of a level to determine whether the linked Web page is subject to browsing restriction.
The URL information DB 608 has, as an entry, a category 608-C to which a URL belongs, instead of the level 608-L in the first embodiment. The category 608-C is registered beforehand from among certain categories, and a plurality of categories 608-C may be registered for one address 608-U.
A configuration example of a communication system is the same as in
An operational flow according to this embodiment will be described. In this embodiment, the banned category 606-C, the category 608-C, and the category 610-C are used instead of the level 606-L, the level 608-L, and the level 610-4 in the first embodiment, respectively. Instead of S116 in the first embodiment, the browsing restriction determination unit 112 determines that the URL of a location path 610-3 containing at least one banned category 606-C of the terminal in a category 610-C is subject to browsing restriction. The other operations are the same as in
Thus, according to this embodiment, even if browsing is restricted based on the banned category registered by a user, the relay apparatus can send to the terminal a Web page to which information about whether links in the Web page are subject to browsing restriction is added, without placing a high load on the URL information database.
Further, this embodiment can be combined with the first to second embodiments.
In the fourth embodiment, a terminal level corresponding to the current time is used to determine whether a linked Web page is subject to browsing restriction.
An operational flow according to this embodiment will be described. Instead of S108 in the first embodiment, the terminal information look-up unit 106 acquires the current time in addition to terminal identification information contained in a request received by the communication relay unit 102. The terminal information look-up unit 106 acquires a level 606-L that has a terminal identification information DB 606-I equal to the acquired terminal identification information and corresponds to a time period 606-T including the current time. The other operations are the same as in
Thus, according to this embodiment, even in the case where the browsing restriction level of the terminal changes depending on time (in the case of strict browsing restriction at night and loose restriction in the daytime), the relay apparatus can send to the terminal a Web page to which information about whether links in the Web page are subject to browsing restriction is added, without placing a high load on the URL information database.
Further, this embodiment can be combined with the first to third embodiments.
In the fifth embodiment, if a linked URL in a Web page matches (or does not match) a URL pattern registered beforehand in a whitelist (or blacklist), the URL information look-up of the URL is suppressed.
An operational flow according to this embodiment will be described. Instead of S112 in the first embodiment, the look-up control unit 109 suppresses querying the URL information DB 608 as to a URL that matches any of the URL patterns D200 (or URL that does not match any of the URL patterns D200) among URLs extracted by the link extraction unit 104. The URL that has undergone the suppression of querying the URL information DB 608 is not subject to browsing restriction. The other operations are the same as in
Thus, according to this embodiment, linked URLs not subject to browsing restriction are specified beforehand in a whitelist, and linked URLs subject to browsing restriction are specified beforehand in a blacklist, so that the relay apparatus can send to the terminal a Web page to which information about whether links in the Web page are subject to browsing restriction is added, while further suppressing a load on the URL information database.
Further, this embodiment can be combined with the first to fourth embodiments.
In the sixth embodiment, when a relay apparatus sends to the terminal a Web page containing links changed based on a determination result, a link through which the terminal acquire an original unchanged Web page having links is contained in a response Web page having links, so that the selection of the link enables the acquisition of the original Web page.
An operational flow according to this embodiment will be described. In 5118 in the first embodiment, in addition to the change of the link subject to browsing restriction, the content change unit 114 creates a new linked URL (e.g., http://a.co.jp/index.html?original) by adding a predetermined character string (e.g., “original”) as a CGI parameter to the URL of the Web page requested by the terminal, changes the Web page to display the corresponding link display e.g. at the end of the Web page, and creates new link information. The other operations are the same as in
In S102 in the first embodiment, when the communication relay unit 102 receives a Web page request from the terminal 10 through the selection of the new link having the CGI parameter of the character string, the request URL change unit 122 creates a request in which the CGI parameter is eliminated from the URL of the requested Web page, and the communication relay unit 102 transmits the created request to the Web server and relays a response to the terminal as it is.
Thus, according to this embodiment, the terminal can also acquire an original Web page having links not changed by the relay apparatus. For example, in the case where the change of a link display impairs the appearance of a Web page, it is possible to browse the original Web page.
Further, this embodiment can be combined with the first to fifth embodiments.
In the seventh embodiment, if link information corresponding to an acquired Web page does not exist in the cache, a relay apparatus queries another relay apparatus as to the link information.
Thus, according to this embodiment, the relay apparatus can send to the terminal a Web page to which information about whether links in the Web page are subject to browsing restriction is added, while further suppressing a load on the URL information database by acquiring link information from another relay apparatus.
Further, this embodiment can be combined with the first to sixth embodiments.
The specification and drawings are, accordingly, to be regarded in an illustrative rather than a restrictive sense. It will, however, be evident that various modifications and changes may be made thereto without departing from the spirit and scope of the invention as set forth in the claims.
Number | Date | Country | Kind |
---|---|---|---|
2009-154544 | Jun 2009 | JP | national |