The present application claims priority from the Indian (IN) provisional patent application serial no. 3028/CHE/2010 filed on Oct. 13, 2010 and entitled “A system and method for assisting the user to select the context of a search and to display the contexts of search results” the content of which is herein incorporated by reference in its entirety.
The invention relates generally to network and internet search. More particularly, the invention relates to a method and a system for providing assistance to a user to identify one or more contexts of an input or search results.
Search services are currently being improved to increase users' convenience. Accordingly, when a user inputs an appropriate query, the search service must provide satisfactory search results. Specifically, as the user base of search services expands and the number of users without enough background knowledge to formulate appropriate queries increases, it is necessary to guide users in making an appropriate query or in refining the query based on the search results.
The amount of data available on various topics has been increasing drastically over the years, both internally within companies and externally on the internet. The most common way of retrieving relevant information is through filters or keyword-based queries. One problem with the present approach is that the amount of information that shows up for any query is very large, and it is time consuming to sort through the information to determine what is most relevant. This problem is aggravated by the fact that users themselves often do not know the right keywords or contexts to use in the search, even when they have a clear idea of the type of knowledge they are looking for.
In another case, the user does not have sufficient knowledge on the area of search and needs guidance in navigating through relevant topics within the subject. This is the case where the user does not know what to look for but still expects relevant information to be provided by the system.
There have been major improvements in providing relevant information for each user based on contexts derived from data collected explicitly from the user or implicitly from the user's feedback through keystrokes, historical selections or other behavioural patterns. As a result, major search engines such as Google, Yahoo, and Bing retrieve information relevant to the query in most cases. However, these search engines are still limited by the fact that users need to spend long periods of time to reach the intended information. The user can only alter the search query to retrieve the relevant information. There is no guided way available for the user to refine the context of the search results.
There are two major factors for not receiving the intended information from a search. One is that the user is unable to provide sufficient details of the intended information to the search engine. In most cases the input is a number of query terms and the search engine is expected to bring the intended information from the search. First of all, the terms input to the system may not be accurate enough to address the intended information. In addition, the list of a few terms is not sufficient to accurately communicate the intended information to the system.
The second factor that limits the retrieval of intended information from a search is that the search engine does not provide sufficient information about the results that would help the user to select the intended information. Current search engines provide the URL of the retrieved files and representative text or image from the file. These are not sufficient to select the intended information from the list of search results.
For the reasons stated above, which will become apparent to those skilled in the art upon reading and understanding the present specification, there is a need in the art for a method and a system for providing assistance to a user to identify one or more contexts of an input or search results.
This summary is provided to introduce a selection of concepts in a simplified form that are further described below in the Detailed Description. This summary is not intended to identify key features of the claimed subject matter, nor is it intended to be used as an aid in determining the scope of the claimed subject matter.
According to one aspect of the invention there is provided a method of providing assistance to a user to identify at least one context of search results of a search query, the method comprising: receiving an input related to the search query, providing a plurality of first contexts related to the received input, wherein the first contexts include one of the keywords and concepts, displaying a set of results from a web search or a file search based on the selection of the first context, each search result containing a link to a web page or file location and displaying a plurality of second contexts relevant to the search results, wherein the second contexts include one or more topics, and wherein the topics are part of one or more ontologies.
In another aspect of the invention there is provided a method of providing assistance to a user to identify at least one context of a content of at least one input, the method comprising: receiving content location as input from the user, extracting the content from the received content location, and displaying a plurality of contexts relevant to the content from the received content location, wherein the contexts include one or more topics, and wherein the topics are part of one or more ontologies.
According to another aspect of the invention there is provided a method of providing assistance to a user to identify at least one context of search results of a search query, the method comprising: receiving an input as a search query, displaying a set of results from a web search or a file search, each search result containing a link to a web page or file location and displaying a plurality of contexts relevant to the search results, wherein the contexts include one or more topics, and wherein the topics are part of one or more ontologies.
In another aspect, the invention includes a system of providing assistance to a user to identify one or more contexts of the search results of a search query, the system comprising: one or more communication networks, a plurality of user devices connected to the communication network and at least one server coupled to a data store, the server comprising a content extractor and a context module, wherein the server is capable of communicating with the plurality of user devices via at least one communication network, the server is configured for receiving an input as a search query, displaying a set of search results from a web search or a file search, each search result containing a link to a web page or file location and displaying a plurality of contexts related to the search results, wherein the contexts include one or more topics, and wherein the topics are part of one or more ontologies.
In another aspect, the invention provides a machine-readable medium having stored thereon machine-executable instructions that if executed by a machine cause the machine to perform a method of providing assistance to a user to identify one or more contexts of the search results of a search query, the method comprising: receiving an input as a search query, displaying a set of search results from a web search or a file search, each search result containing a link to a web page or file location and displaying a plurality of contexts related to the search results, wherein the contexts include one or more topics, and wherein the topics are part of one or more ontologies.
In another aspect, the invention provides a machine-readable medium having stored thereon machine-executable instructions that if executed by a machine cause the machine to perform a method of providing assistance to a user to identify one or more contexts of the search results of a search query, the method comprising: receiving an input as a search query, displaying a set of search results from a web search or a file search, each search result containing a link to a web page or file location, displaying a plurality of contexts related to the search results, wherein the contexts include one or more topics, and wherein the topics are part of one or more ontologies, and allowing the user to navigate to select at least one of the topics to retrieve a different context of search results, wherein the displayed search results are related to the selected topics.
Additional advantages and features of the present invention will be more apparent from the detailed description and accompanying drawings, which illustrate preferred embodiments of the invention.
In the following detailed description of the preferred embodiments, reference is made to the accompanying drawings that form a part hereof, and in which are shown by way of illustration specific embodiments in which the invention may be practiced. It is understood that other embodiments may be utilized and structural changes may be made without departing from the scope of the present invention.
The leading digit(s) of reference numbers appearing in the Figures generally corresponds to the Figure number in which that component is first introduced, such that the same reference number is used throughout to refer to an identical component which appears in multiple Figures.
The features, structures, or characteristics of the invention described throughout this specification may be combined in any suitable manner in one or more embodiments. For example, reference throughout this specification to “certain embodiments,” “some embodiments,” or similar language means that a particular feature, structure, or characteristic described in connection with the embodiment is included in at least one embodiment of the present invention. Thus, appearances of the phrases “in certain embodiments,” “in demonstrative embodiments,” “in some embodiment,” “in other embodiments,” or similar language throughout this specification do not necessarily all refer to the same group of embodiments and the described features, structures, or characteristics may be combined in any suitable manner in one or more embodiments.
In an example, the system displays a plurality of contexts related to the search results, where the contexts may be or may include topics and topic relations. Once the topics are displayed in the topic window 120, the user is allowed to select one or more topics. After selecting the topic or topics that represent the intended information, the user may click the search button to get the preferred results. In addition, the topics are shown according to their relevancy ranking, in descending or ascending order based on user preference.
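The ordering of topics by relevancy ranking can be illustrated with a minimal sketch. The `RankedTopic` type, the `order_topics` function and the numeric scores are assumptions made for illustration only; the specification does not state how relevancy is computed or represented.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class RankedTopic:
    name: str
    relevancy: float  # hypothetical score; higher means more relevant


def order_topics(topics, descending=True):
    """Return topics sorted by relevancy; the direction follows user preference."""
    return sorted(topics, key=lambda t: t.relevancy, reverse=descending)


# Illustrative scores only; they are not taken from the specification.
topics = [
    RankedTopic("Derivative Pricing", 0.71),
    RankedTopic("Black-Scholes Equation", 0.93),
    RankedTopic("Thermodynamics", 0.40),
]

print([t.name for t in order_topics(topics)])
print([t.name for t in order_topics(topics, descending=False)])
```

Keeping the sort direction as a parameter mirrors the user preference described above without duplicating the ranking logic.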
Further, the graphical representation in the graph window of the topic or topics and/or the semantic proximity of the topics and their relations allows the user to navigate to the search result that the user is actually looking for. In addition, the plurality of contexts and their relations are shown with a relevancy ranking with respect to the query entered by the user. In one example embodiment, the graphical representation is shown in a topology having a context to which all sub-contexts are connected. All the other nodes descend directly from the root node. The root node shown in the graphical representation has the highest relevancy rank, and the other nodes are descendants of the root node.
The topic window 320 is capable of displaying contexts related to the generated or displayed search results in the search results window 360. In other aspects of the present invention, the system is capable of generating relevant results for the user's input query as soon as the input is received at the search query window 310, before the user clicks the search button. Further, the system is capable of displaying or generating a plurality of contexts related to the generated or displayed search results in the search results window 360. The contexts may be or may include topics and topic relations, where the topics and topic relations are based on underlying domain ontologies. In the graph window 330, the contexts displayed in the topic window 320 are shown in a graphical representation which is also a part of the underlying ontology.
In addition to the above, the system is capable of generating one or more concepts and one or more keywords related to the generated or displayed search results. The concepts and keywords are displayed in their respective windows, i.e. concept window 340 and keyword window 350. The displayed concepts are based on semantic proximity and the displayed keywords are based on the words available in the domain.
The system is capable of displaying all the windows simultaneously as soon as the query window receives one or more inputs. By displaying all the windows with different options, the system enables the user to identify at least one context of the search results of a search query and their relationships, one or more concepts, and one or more keywords. The present system helps the user to decide on the search results quickly and with ease, without having to keep looking through the results displayed in the search results window 360. In addition, a provision may be made for adding a keyword or a concept known to the user with the help of a tab (not shown in figures), or for removing a keyword or a concept displayed by the system from its respective window.
The system is capable of generating or displaying contexts in the topic window 420 related to the generated or displayed search results in the search window 460. Also, the system is capable of representing the relation between the topic or topics and/or their semantic proximity in a graphical representation in a graph window 430. The graphical representation of the topic or topics and/or their semantic proximity is also a part of the underlying ontology. The topic window 420 displays a plurality of topics, e.g. Black-Scholes Equation, Derivative Pricing, Brownian Motion, Thermodynamics, Electromagnetic Field, etc. All the topics displayed in the topic window 420 are based on underlying domain ontologies, which include topics and their relations with respect to the search results of the search query, i.e. ‘Black’. The first three topics belong to the financial domain ontology and the last two belong to the physics ontology. In the graph window 430, the contexts displayed in the topic window 420, i.e. topics and topic relations, are shown in a graphical representation which is also a part of the underlying ontology as shown in
In addition to these, the system generates one or more concepts and one or more keywords related to the generated or displayed search results of the search query ‘Black’. The concepts and keywords are displayed in their respective windows i.e. concept window 440 and keyword window 450. In an example, the concepts related to the search results of the query word ‘Black’ are stock market, radiation, spectrum, mobile device light etc. The displayed concepts are based on semantic proximity of the search results of the query word ‘Black’. The keywords related to the search results of the query are color, black body, black-scholes, frequency, blackberry etc. The displayed keywords are based on the words available in the domain.
The system by generating various options in the interface allows the user to choose or to select one or more contexts, one or more concepts, and one or more keywords of search results of a search query.
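The ‘Black’ example can be sketched as a grouping of displayed topics by the domain ontology they belong to. The topic-to-ontology assignments below are the ones stated in the example; the `TOPIC_ONTOLOGY` mapping and the `group_by_ontology` function are hypothetical names, and a flat lookup table is only an assumption about how such membership might be stored.

```python
from collections import defaultdict

# Topic -> ontology assignments as described for the query 'Black'.
TOPIC_ONTOLOGY = {
    "Black-Scholes Equation": "Finance",
    "Derivative Pricing": "Finance",
    "Brownian Motion": "Finance",
    "Thermodynamics": "Physics",
    "Electromagnetic Field": "Physics",
}


def group_by_ontology(topics):
    """Group topic names under the ontology each one belongs to."""
    groups = defaultdict(list)
    for topic in topics:
        # Topics missing from the table are kept under "Unknown".
        groups[TOPIC_ONTOLOGY.get(topic, "Unknown")].append(topic)
    return dict(groups)


for ontology, members in group_by_ontology(list(TOPIC_ONTOLOGY)).items():
    print(ontology, "->", members)
```

Grouping in this way makes the ambiguity of a short query visible: a single term yields topics from two unrelated domains, and the user can pick the intended one.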
The user is allowed to enter a file to upload in the upload box window 510. The uploaded files are displayed in the uploaded file list window 560. The contexts related to the uploaded file are displayed in the topic window 520 before clicking the search button. The context may be or may include topics and topic relation, where the topics and topic relations are based on underlying domain ontologies. In addition to this, the system is capable of representing the relation between the topic or topics or/and their semantic proximity of one topic in a graphical representation in a graph window 530. The graphical representation of the topic or topics or/and their semantic proximity of one topic are also a part of the underlying ontology.
In addition to the above, the system is capable of generating one or more concepts and one or more keywords related to the uploaded file. The concepts and keywords are displayed in their respective windows, i.e. concept window 540 and keyword window 550. The displayed concepts are based on semantic proximity and the displayed keywords are based on the words available in the domain.
The keywords related to the content of the uploaded files are displayed in the keyword window 550 along with a provision for removing a keyword (not shown in figure) from the context. It is also possible to add keywords to the context in the keyword window 550. The concept window 540 displays concepts related to the content of the uploaded files along with a provision for removing a concept from the context. It is also possible to add concepts to the context using the concept window 540. The topics in the topic window 520 are related to the content of the uploaded files and are shown along with their corresponding relevancy factor with respect to that content. Every topic has an underlying ontology which includes multiple topics and their relations. In the graph window 530, the user is allowed to navigate in the graphical representation of the contexts, topics or topic relations. If the user clicks or selects one topic in the representation, the system takes the user to a different part of the ontology, where the selected topic becomes the central theme. Also, one or more connected nodes are displayed along with the root node, where each connected node corresponds to at least one topic and is connected to the root node, each of the connections corresponding to at least one relationship.
At step 820, the method displays a set of results from a web search or a file search, each search result containing a link to a web page or file location.
At step 830, the method displays, or is able to generate, one or more contexts relevant to the search results. The contexts include one or more topics, where the topics are part of one or more ontologies. The method displays the one or more contexts based on a relevance ranking that matches the search results.
At step 840, the method displays a graphical representation of a relation between parts of one or more ontologies related to one or more of the contexts or topics. The graphical representation of the topic or topics and/or their semantic proximity is also a part of the underlying ontology. The method displays a part of one or more ontologies related to one or more of the contexts in a two-dimensional or three-dimensional representation. The one or more ontologies include a plurality of assertions, with each assertion including a first topic, a second topic and a relationship between the first topic and the second topic. The graphical representation includes a root node that depicts at least one topic and one or more connected nodes, each connected node or branch node corresponding to at least one topic, wherein each connected node is connected to the root node, each of the connections corresponding to at least one relationship.
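The assertion structure and the root-and-connected-nodes view described at step 840 can be sketched as follows. The `Assertion` type and the `star_view` function are hypothetical names, and the relationship labels (`used_in`, `assumes`, `related_to`) are invented for illustration; the specification does not name any particular relationship types.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Assertion:
    """One ontology assertion: a first topic, a relationship, a second topic."""
    first_topic: str
    relationship: str
    second_topic: str


def star_view(assertions, root):
    """Return (connected_node, relationship) pairs for assertions touching root."""
    edges = []
    for a in assertions:
        if a.first_topic == root:
            edges.append((a.second_topic, a.relationship))
        elif a.second_topic == root:
            edges.append((a.first_topic, a.relationship))
    return edges


# Relationship labels are assumptions made for this example only.
ONTOLOGY = [
    Assertion("Black-Scholes Equation", "used_in", "Derivative Pricing"),
    Assertion("Black-Scholes Equation", "assumes", "Brownian Motion"),
    Assertion("Thermodynamics", "related_to", "Electromagnetic Field"),
]

for node, relationship in star_view(ONTOLOGY, "Black-Scholes Equation"):
    print(f"Black-Scholes Equation --{relationship}--> {node}")
```

Only assertions that touch the root are returned, so each connected node is joined to the root by exactly the relationship recorded in the ontology.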
At step 850, the method further generates a list of concepts and keywords related to search results. The displayed or generated concepts are based on semantic proximity and the displayed or generated keywords are based on the words available in the domain.
At step 860, the method allows the user to navigate and select the displayed contexts/topics within a graphical region. When, while navigating in the region, the user makes a first selection of any node, the method retrieves a part of the ontology related to the first selected node, and the first selected node becomes the root node. Further, the user may make a second selection of any node, which retrieves a set of documents related to the second selected node and displays them in terms of web page URL or file location.
Although the flowcharts 600, 700 and 800 include steps 610-640, 710-750 and 810-860 that are arranged serially in the exemplary embodiments, other embodiments of the subject matter may execute two or more steps in parallel, using multiple processors or a single processor organized as two or more virtual machines or sub-processors. Moreover, still other embodiments may implement the steps as two or more specific interconnected hardware modules with related control and data signals communicated between and through the modules, or as portions of an application-specific integrated circuit. Thus, the exemplary process flow diagrams are applicable to software, firmware, and/or hardware implementations.
The server 920 further includes or is coupled to a data store 940. One form of data store is a database designed to integrate data from multiple sources for additional operations on the data. The data store 940 may include, but is not limited to, domain thesauri, domain ontology, internal database, and internet sites.
The server 920 further includes a content extractor 950 and a context module 960. The context module 960 further includes a keyword module 962, a concept module 964 and an ontology module 966. The content extractor 950 is capable of extracting the content from the input received from one or more user input devices 910 through a communication network 930. The received input may include, but is not limited to, terms, documents or URLs. The context module 960 is capable of processing the input received from the content extractor 950, and is further coupled to the data store 940 to provide or generate a keyword list, a concept list, and topics and their relations.
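The two kinds of selection at step 860 can be sketched as a small navigator object. The class `TopicNavigator`, its method names, the adjacency table and the `example.com` document locations are all assumptions introduced for illustration; the specification does not say how a first selection is distinguished from a second one in the interface.

```python
class TopicNavigator:
    """Hypothetical sketch of re-rooting and document retrieval at step 860."""

    def __init__(self, neighbours, documents, root):
        self.neighbours = neighbours  # topic -> list of related topics
        self.documents = documents    # topic -> list of URLs or file locations
        self.root = root

    def visible_nodes(self):
        """The root node followed by the nodes connected to it."""
        return [self.root] + self.neighbours.get(self.root, [])

    def select_topic(self, topic):
        """First selection: the chosen node becomes the new root node."""
        if topic not in self.visible_nodes():
            raise ValueError(f"{topic!r} is not currently displayed")
        self.root = topic
        return self.visible_nodes()

    def open_topic(self, topic):
        """Second selection: return document locations for the chosen node."""
        if topic not in self.visible_nodes():
            raise ValueError(f"{topic!r} is not currently displayed")
        return self.documents.get(topic, [])


# Placeholder data; the links are not real documents.
NEIGHBOURS = {
    "Black-Scholes Equation": ["Derivative Pricing", "Brownian Motion"],
    "Derivative Pricing": ["Black-Scholes Equation"],
    "Brownian Motion": ["Black-Scholes Equation"],
}
DOCUMENTS = {"Derivative Pricing": ["https://example.com/derivative-pricing"]}

nav = TopicNavigator(NEIGHBOURS, DOCUMENTS, "Black-Scholes Equation")
print(nav.select_topic("Derivative Pricing"))
print(nav.open_topic("Derivative Pricing"))
```

Restricting both operations to the currently visible nodes reflects the idea that the user navigates within the displayed graphical region rather than jumping to arbitrary topics.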
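Because the received input may be terms, documents or URLs, a content extractor would typically need to decide which kind of input it has before extracting content. The following is a minimal sketch of that dispatch step only; the function `classify_input` and the `DOCUMENT_SUFFIXES` set are assumptions, and the specification does not describe how the content extractor 950 distinguishes input types.

```python
from pathlib import Path
from urllib.parse import urlparse

# Hypothetical set of file extensions treated as documents.
DOCUMENT_SUFFIXES = {".pdf", ".txt", ".doc", ".docx", ".html"}


def classify_input(raw):
    """Classify raw user input as 'url', 'document' or 'terms'."""
    parsed = urlparse(raw)
    if parsed.scheme in ("http", "https") and parsed.netloc:
        return "url"
    if Path(raw).suffix.lower() in DOCUMENT_SUFFIXES:
        return "document"
    # Anything else is treated as free-text query terms.
    return "terms"


for sample in ("https://example.com/page", "report.pdf", "black body radiation"):
    print(sample, "->", classify_input(sample))
```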
The keyword module 962 has a keyword exploration module 963 which is capable of receiving the keyword list provided by the keyword module 962 as input and allows the user to navigate through other keywords related to one or more of the input keywords in the displayed graphical region. The concept module 964 has a concept exploration module 965 which is capable of receiving the concept list provided by the concept module 964 as input and allows the user to navigate through other concepts related to one or more of the input concepts in the displayed graphical region. The ontology module 966 has a topic exploration module 967 which is capable of receiving the topics and relations lists provided by the ontology module 966 as input and allows the user to navigate through other topics related to one or more of the input topics in the displayed graphical region. The outputs of all three modules, i.e. the keyword exploration module, the concept exploration module and the topic exploration module, are further processed by the modification module 968, which receives as input graphs of keywords, concepts and topics, and allows the user to select a set of keywords, concepts and topics from the displayed graphical regions. In addition, the modification module allows the user to add new keywords, concepts and topics.
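The selection and addition of keywords, concepts and topics handled by the modification module can be sketched as a small container with add and remove operations. The `SearchContext` class and its method names are hypothetical, and the use of sets is an assumption; the specification does not describe the internal data structure of the modification module 968.

```python
from dataclasses import dataclass, field


@dataclass
class SearchContext:
    """Hypothetical holder for the user's chosen keywords, concepts and topics."""
    keywords: set = field(default_factory=set)
    concepts: set = field(default_factory=set)
    topics: set = field(default_factory=set)

    def _bucket(self, kind):
        if kind not in ("keywords", "concepts", "topics"):
            raise ValueError(f"unknown kind: {kind!r}")
        return getattr(self, kind)

    def add(self, kind, term):
        """Add a user-supplied term to the given list."""
        self._bucket(kind).add(term)

    def remove(self, kind, term):
        """Remove a term; removing an absent term is a no-op."""
        self._bucket(kind).discard(term)


ctx = SearchContext(keywords={"black body", "blackberry"})
ctx.remove("keywords", "blackberry")  # drop a system-suggested keyword
ctx.add("concepts", "radiation")      # add a concept known to the user
print(ctx)
```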
The system 900 further includes a rules module 970 which defines the rules for the search. The rules can be specified and stored in the system or given as input by the user. Rules may be or may include specifying the proximity of two terms in the search, allotting different weightage for different fields in the extracted content etc.
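The two example rules, term proximity and per-field weightage, can be sketched as follows. The function names, the token-distance definition of proximity and the numeric weights are all assumptions for illustration; the specification does not define how the rules module 970 evaluates either rule.

```python
def within_proximity(tokens, term_a, term_b, max_distance):
    """True if term_a and term_b occur at most max_distance tokens apart."""
    positions_a = [i for i, t in enumerate(tokens) if t == term_a]
    positions_b = [i for i, t in enumerate(tokens) if t == term_b]
    return any(abs(i - j) <= max_distance for i in positions_a for j in positions_b)


def weighted_score(field_hits, field_weights):
    """Sum the match counts per field, scaled by each field's weight."""
    # Fields without an explicit weight default to 1.0 (an assumption).
    return sum(field_weights.get(f, 1.0) * hits for f, hits in field_hits.items())


tokens = "the black body emits thermal radiation".split()
print(within_proximity(tokens, "black", "radiation", max_distance=4))
print(weighted_score({"title": 1, "body": 3}, {"title": 3.0, "body": 1.0}))
```

Rules of this kind could equally be stored with the system or supplied by the user at query time, as the paragraph above describes; the sketch takes them as plain function arguments.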
The methods and systems described herein are operational with numerous other general purpose or special purpose computing system, environments or configurations. Examples of well-known computing systems, environments, and/or configurations that may be suitable for use include, but are not limited to, personal computers, server computers, multiprocessor systems, microprocessor-based systems, network PCs, minicomputers, mainframe computers, distributed computing environments that include any of the above systems or devices, and so on. Compact or subset versions of the framework may also be implemented in clients of limited resources, such as handheld computers, or other computing devices. The invention is practiced in a distributed computing environment where tasks are performed by remote processing devices that are linked through a communications network. In a distributed computing environment, program modules may be located in both local and remote memory storage devices.
With reference to
A computer 1002 typically includes a variety of computer-readable media. Computer-readable media can be any available media that can be accessed by computer 1002 and includes both volatile and non-volatile media, removable and non-removable media. By way of example, and not limitation, computer-readable media may comprise computer storage media and communication media. Computer storage media includes volatile and non-volatile, removable and non-removable media implemented in any method or technology for storage of information such as computer-readable instructions, data structures, program modules, or other data. Computer storage media includes, but is not limited to, RAM, ROM, EEPROM, flash memory or other memory technology, CD-ROM, digital versatile disks (DVD) or other optical disk storage, magnetic cassettes, magnetic tape, magnetic disk storage or other magnetic storage devices, or any other medium which can be used to store the desired information and which can be accessed by computer 1002.
Communication media typically embodies computer-readable instructions, data structures, or program modules, and includes any information delivery media. Combinations of any of the above should also be included within the scope of computer-readable media.
System memory 1006 includes computer storage media in the form of volatile and/or non-volatile memory such as read only memory (ROM) 1010 and random access memory (RAM). A basic input/output system (BIOS) 1014, containing the basic routines that help to transfer information between elements within computer, such as during start-up, is typically stored in ROM; RAM typically contains data and/or program modules that are immediately accessible to and/or presently being operated on by processing unit. By way of example and not limitation,
The computer 1002 may also include other removable/non-removable, volatile/non-volatile computer storage media. By way of example only,
The drives and their associated computer storage media discussed above and illustrated in
In one implementation, a user may enter commands and information into the computer 1002 through input devices such as a keyboard 1040 and pointing device 1042, commonly referred to as a mouse, trackball, or touch pad. Other input devices (not shown) may include a microphone, joystick, game pad, satellite dish, scanner, or the like. These and other input devices are often connected to the processing unit 1004 through a user input interface 1044 that is coupled to the system bus, but may be connected by other interface and bus structures, such as a parallel port, game port, IEEE 1394/FireWire, accelerated graphics port, or a universal serial bus (USB).
The computer operates in a networked environment using logical connections to one or more remote computers, such as a remote computer 1002. The remote computer may be a personal computer, a server, a router, a network PC, a mobile computing device, a peer device, or other common network node, and as a function of its particular implementation, may include many or all of the elements described above relative to the computer 1002, although only a memory storage device 1052 has been illustrated in
When used in a LAN networking environment, the computer 1002 is connected to the LAN 1034 through a network interface or adapter. When used in a WAN networking environment, the computer 1002 typically includes a modem 1060 or other means for establishing communications over the WAN 1056, such as the Internet. The modem 1060, which may be internal or external, may be connected to the system bus 1008 via the user input interface 1044, or other appropriate mechanism. In a networked environment, program modules depicted relative to the computer, or portions thereof, may be stored in the remote memory storage device. By way of example and not limitation,
The present disclosure may be implemented with a variety of combinations of hardware and software. If implemented as a computer-implemented apparatus, the present disclosure is implemented using means for performing all of the steps and functions described above.
The present disclosure can be included in an article of manufacture (e.g., one or more computer program products) having, for instance, computer usable media. The media has embodied therein, for instance, computer readable program code means for providing and facilitating the mechanisms of the present invention. The article of manufacture can be included as part of a computer system or sold separately.
In the foregoing detailed description of embodiments of the invention, various features are grouped together in a single embodiment for the purpose of streamlining the disclosure.
The present invention as described above helps the user by providing more details of the search results to select the most appropriate result from the list of results. The search results provided by the system have more details about the content of each result in addition to providing the relationship among the results. In addition, the present invention helps the user in summarizing the content of each retrieved document. Also, the present invention provides a graphical user interface for the user to easily navigate through the results and select the most relevant document from the results.
Number | Date | Country | Kind |
---|---|---|---|
666/CHE/2011 | Mar 2011 | IN | national |