1. Field of the Invention
In general, the present invention provides a content searching method, system, program product and architecture. Specifically, the present invention allows for searchable content to be architecturally separated from corresponding displayable content as well as associated formatting tags, thus allowing for faster, more efficient content searching.
2. Related Art
As the use of computer networks such as the Internet becomes more pervasive, search engines have become a valuable tool in locating needed content. For example, today a computer user can utilize a search engine to locate needed goods/services, perform research, and find solutions to various problems/issues. Currently, it is the practice for a search engine database to include both searchable content and displayable content. For example, troubleshooting solutions, which include problem statements and fixes, are loaded into the search engine database. The content is broken down into searchable items which are searched to fulfill a search request/query, and displayable items which are presented when corresponding searchable items are located. To this extent, the searchable content is typically a pared down, unformatted version of the displayable content. That is, since only a small part of the displayable content is needed for searching, the searchable content will generally include only the bare content that might be the target of a user's search request. Conversely, the corresponding displayable content will include all content and formatting that is desired to be presented to the user. For example, when a search request is submitted pursuant to a problem the user is attempting to troubleshoot, the displayable content might not only include a statement of the problem and the possible solution(s), but also other items such as related documents, a feedback mechanism, advertisements, etc.
Unfortunately, the co-location of searchable content and displayable content in the search engine database raises many issues. For example, loading both types of content in the search engine database increases the volume of material therein. As a result, the speed at which searches are handled is reduced. However, no existing system provides a way of architecturally separating searchable content from displayable content so that a search engine database can be more rapidly searched. Providing such separation would greatly improve search engine performance.
In view of the foregoing, there exists a need for an improved content searching method, system, program product and architecture. Specifically, a need exists for an architecture in which searchable content is architecturally separated from displayable content and associated formatting characteristics.
In general, the present invention provides a content searching method, system, program product and architecture. Specifically, under the present invention, searchable content is loaded into a search engine database, while displayable content is stored in a file system (e.g., in an XML file or the like) that is architecturally separate from the search engine database. Searchable content is associated with corresponding displayable content through links/pointers. In addition, tags for formatting the displayable content are provided in a style sheet or the like that is referenced by the displayable content file. When a search request for one or more search items is received by the search engine, the searchable content in the database will be searched. When appropriate searchable content is located, the corresponding displayable content will be located through the links. Once located, the displayable content will be formatted according to formatting tags contained in the associated style sheet. After formatting, the displayable content will then be presented to the user.
A first aspect of the present invention provides a content searching method, comprising: loading searchable content into a search engine; storing displayable content corresponding to the searchable content in a file system that is separate from the search engine, wherein the searchable content loaded into the search engine includes at least one link pointing to the displayable content in the file system; and providing formatting tags for formatting the displayable content in a style sheet.
A second aspect of the present invention provides a content searching architecture, comprising: a search engine having searchable content corresponding to searchable items; a file system separate from the search engine having displayable content corresponding to the searchable content, wherein the searchable content is associated with the displayable by at least one link; and a style sheet containing formatting tags for formatting the displayable content.
A third aspect of the present invention provides a content searching system, comprising: a searchable content location system for locating searchable content in a search engine database based on a search request; a displayable content location system for locating displayable content corresponding to the searchable content in a file system based on at least one link that associates the searchable content with the displayable content, wherein the file system is separate from the search engine database; a content formatting system for formatting the displayable content based on formatting tags contained in a style sheet associated with the displayable content; and an output system for outputting the displayable content after the formatting.
A fourth aspect of the present invention provides a content searching program product stored on a recordable medium, which when executed, comprises: program code for locating searchable content in a search engine database based on a search request; program code for locating displayable content corresponding to the searchable content in a file system based on at least one link that associates the searchable content with the displayable content, wherein the file system is separate from the search engine database; program code for formatting the displayable content based on formatting tags contained in a style sheet associated with the displayable content; and program code for outputting the displayable content after the formatting.
A fifth aspect of the present invention provides a system for deploying a content searching application, comprising: a computer infrastructure being operable to: locate searchable content in a search engine database based on a search request; locate displayable content corresponding to the searchable content in a file system based on at least one link that associates the searchable content with the displayable content, wherein the file system is separate from the search engine database; format the displayable content based on formatting tags contained in a style sheet associated with the displayable content; and output the displayable content after the formatting.
A sixth aspect of the present invention provides content searching computer software embodied in a propagated signal, the content searching computer software comprising instructions for causing a computer system to perform the following functions: locate searchable content in a search engine database based on a search request; locate displayable content corresponding to the searchable content in a file system based on at least one link that associates the searchable content with the displayable content, wherein the file system is separate from the search engine database; format the displayable content based on formatting tags contained in a style sheet associated with the displayable content; and output the displayable content after the formatting.
Therefore, the present invention provides a content searching method, system, program product and architecture.
These and other features of this invention will be more readily understood from the following detailed description of the various aspects of the invention taken in conjunction with the accompanying drawings in which:
The drawings are not necessarily to scale. The drawings are merely schematic representations, not intended to portray specific parameters of the invention. The drawings are intended to depict only typical embodiments of the invention, and therefore should not be considered as limiting the scope of the invention. In the drawings, like numbering represents like elements.
For convenience purposes, the Detailed Description of the Drawings will have the following sections:
I. General Description
II. Computerized Implementation
I. General Description
As indicated above, the present invention provides a content searching method, system, program product and architecture. Specifically, under the present invention, searchable content is loaded into a search engine database, while displayable content is stored in a file system (e.g., in an XML file or the like) that is architecturally separate from the search engine database. Searchable content is associated with corresponding displayable content through links/pointers. In addition, tags for formatting the displayable content are provided in a style sheet or the like that is referenced by the displayable content file. When a search request for one or more search items is received by the search engine, the searchable content in the database will be searched. When appropriate searchable content is located, the corresponding displayable content will be located through the links. Once located, the displayable content will be formatted according to formatting tags contained in the associated style sheet. After formatting, the displayable content will then be presented to the user.
Referring now to
In any event, as shown in
To this extent, referring to
Referring back to
Thereafter, when a user submits a search request (of search items) to search engine 12, search engine logic 14 will perform a search of search engine database 16 in an attempt to locate appropriate searchable content. As known, this function can be carried out in many ways. For example search engine logic 14 could attempt to match keywords or the like contained in the search request with searchable content contained in the search engine database 16. In any event, because search engine database 16 now includes less content, the search request should be processed considerable faster than with previous systems/architectures. Once the appropriate searchable content has been located, the link(s) stored therewith will be followed by search engine logic 14 to locate the corresponding displayable content in file system 16. Thereafter, the corresponding style sheet 20 will be located, and the formatting tags therein will be used to format the displayable content for display to the user.
II. Computerized Implementation
Referring now to
Where search engine 12 is a network-based search engine, user 70 could communicate therewith via a user computer system 72, while system administrator 22 could communicate with search engine 12 via an administrator computer system 76. These computer systems 72 and 76 should be understood to be any type of computerized systems capable of carrying out their respective functions. For example, computer systems 72 and 76 could be desktop computers, workstations, laptop computers, hand held devices, clients, etc. Regardless, the network can be any type of network such as the Internet, a local area network (LAN), a wide area network (WAN), a virtual private network (VPN), etc. To this extent, a direct hardwired connection (e.g., serial port), or an addressable connection with search engine 12 could be implemented. The addressable connection may utilize any combination of wireline and/or wireless transmission methods. Moreover, conventional network connectivity, such as Token Ring, Ethernet, WiFi or other conventional communications standards could be used. Still yet, connectivity could be provided by conventional IP-based protocol.
As also depicted, search engine 12 generally comprises processing unit 60, memory 62, bus 64, input/output (I/O) interfaces 66, external devices/resources 68 and search engine database 16. Processing unit 60 may comprise a single processing unit, or be distributed across one or more processing units in one or more locations, e.g., on a client and server. Memory 62 may comprise any known type of data storage and/or transmission media, including magnetic media, optical media, random access memory (RAM), read-only memory (ROM), a data cache, a data object, etc. Moreover, similar to processing unit 60, memory 62 may reside at a single physical location, comprising one or more types of data storage, or be distributed across a plurality of physical systems in various forms.
I/O interfaces 66 may comprise any system for exchanging information to/from an external source. External devices/resources 68 may comprise any known type of external device, including speakers, a CRT, LED screen, hand-held device, keyboard, mouse, voice recognition system, speech output system, printer, monitor/display, facsimile, pager, etc. Bus 64 provides a communication link between each of the components in search engine 12 and likewise may comprise any known type of transmission link, including electrical, optical, wireless, etc.
Search engine database 16 can be any system capable of providing storage for information under the present invention. Such information could include, among other things, searchable content. As such, search engine database 16 could include one or more storage devices, such as a magnetic disk drive or an optical disk drive. In another embodiment, search engine database 16 includes data distributed across, for example, a local area network (LAN), wide area network (WAN) or a storage area network (SAN) (not shown).
Although not shown, additional components, such as cache memory, communication systems, system software, etc., may be incorporated into search engine 12. Moreover, it should be understood that user computer system 72 and administrator computer system 76 will likely include computerized components similar to search engine 12.
As explained above, searchable content, displayable content and formatting tags are maintained architecturally separate from one another under the present invention. To this extent, shown loaded on administrator computer system 76 is content system 77, which can incorporate components of any known system for providing content to a search engine 12. Under the present invention, when system administrator 22 wishes to submit content for searching by user 70, he/she will utilize (1) searchable content system 78 to load searchable content to search engine 12 (i.e., to search database 16); (2) displayable content system 80 to store the corresponding displayable content in file system 18; and (3) style sheet system 82 to store formatting tags for formatting the displayable content in a style sheet 20. As mentioned above, the searchable content will be stored with links that point to the corresponding displayable content in file system 18 (e.g., the associated XML file). Such links can be stored by either searchable content system 78, displayable content system 80 or by another system not shown in
In providing these functions, content system 77 should be understood to include all interfaces and functionality necessary to architecturally separate the searchable content, the displayable content and the formatting tags, as well as to provide all necessary links, references, etc. To this extent, it should be understood that the depiction of content system 77 of
Regardless, once the searchable content, displayable content and formatting tags have been provided, a user 70 is free to conduct a search thereof. Accordingly, assume that user 70 submits a search request of search items (e.g., natural language words) via user interface 74 (e.g., a web browser). The search request will be received by request reception system 90 of search engine logic 14. Thereafter, searchable content location system 92 will utilize the search items to locate appropriate searchable content (i.e., searchable content that best fulfills the search request). Once located, displayable content location system 94 will follow the link(s) stored with the located searchable content to locate the corresponding displayable content in file system 18. Thereafter, content formatting system 96 will access/retrieve the style sheet 20 associated with the displayable content, and use the formatting tags there to format the displayable content as needed. Once formatted, the displayable content will then be outputted to user 70 by output system (e.g., for display in interface 74).
It should be appreciated that the teachings of the present invention could be offered as a business method on a subscription or fee basis. For example, search engine 12 or administrator system 76 could be created, maintained, supported and/or deployed by a service provider that offers the functions described herein for customers.
It should also be understood that the present invention can be realized in hardware, software, a propagated signal, or any combination thereof. Any kind of computer/server system(s)—or other apparatus adapted for carrying out the methods described herein—is suited. A typical combination of hardware and software could be a general purpose computer system with a computer program that, when loaded and executed, carries out the respective methods described herein. Alternatively, a specific use computer, containing specialized hardware for carrying out one or more of the functional tasks of the invention, could be utilized. The present invention can also be embedded in a computer program product or a propagated signal, which comprises all the respective features enabling the implementation of the methods described herein, and which—when loaded in a computer system—is able to carry out these methods. Computer program, propagated signal, software program, program, or software, in the present context mean any expression, in any language, code or notation, of a set of instructions intended to cause a system having an information processing capability to perform a particular function either directly or after either or both of the following: (a) conversion to another language, code or notation; and/or (b) reproduction in a different material form.
The foregoing description of the preferred embodiments of this invention has been presented for purposes of illustration and description. It is not intended to be exhaustive or to limit the invention to the precise form disclosed, and obviously, many modifications and variations are possible. Such modifications and variations that may be apparent to a person skilled in the art are intended to be included within the scope of this invention as defined by the accompanying claims. For example, the depiction of search engine logic 14 of