Method and system for facilitating information searching on electronic devices

Description

FIELD OF THE INVENTION

The present invention relates to providing relevant information to users, and in particular to providing relevant information to users with reduced user input.

BACKGROUND OF THE INVENTION

The Internet has become a popular source of entertainment and information. Most Internet content is designed for access via a web browser, making it difficult for access via most consumer electronics (CE) devices which lack typical computer keyboards. As a result, the Internet is generally restricted to access on personal computers (PC) or via cumbersome interfaces on CE devices.

With advances in hardware and software technologies, CE devices are becoming more powerful. Growth in network infrastructure and the falling prices of hardware have increased the availability of network-capable entertainment devices. Many users are configuring home networks including cable set-top boxes, digital television sets, home media servers, digital audio players, personal video recorders, etc. Home network consumers are also creating, storing and accessing more digital content through CE devices and PCs.

A second trend, running in parallel to the emergence of networked entertainment devices, is the growing use of the Internet for creating and publishing content. Greater broadband penetration and falling memory prices are enabling users to move ever larger media files, such as television (TV) shows and full-length movies, through the Internet.

However, there is a gap between the digital content on the Internet and the networked digital entertainment devices in that most Internet content is structured and organized for access via a web browser not a typical CE device. For example, typically a user searches for Internet information using a search engine or by directly accessing a known website via a PC. When using a search engine, the user is required to form an initial query and then iteratively refine the query depending upon the results obtained. As such, the user is forced to comprehend and analyze large quantities of information to identify/access the exact information the user is looking for. This process may work on a PC, but on CE devices that lack a keyboard and a mouse, the searching/refinement process is awkward and unpleasant. Moreover, users typically expect a “lean-back” experience when it comes to using CE devices in their homes and home networks. For instance, someone watching a television news program on a television may not be inclined to conduct an Internet search if such a search requires any effort more than pushing a few buttons on a remote control.

BRIEF SUMMARY OF THE INVENTION

The present invention provides a method and system for facilitating information searching for a user of an electronic device. One embodiment involves obtaining information about the user interests, identifying potential data of interest to the user, extracting data related to said data of interest to the user, and collecting the extracted related data for presentation to the user on the device.

Identifying potential data of interest to the user may include monitoring user access to content, selecting a set of extraction rules for information extraction, and extracting key information from metadata for the content based on the selected extraction rules.

Selecting a set of extraction rules may include selecting a set of extraction rules based on the content type. Selecting a set of extraction rules may include selecting a set of extraction rules from a rules library based on the content type, wherein the rules library includes a list of rules for extracting various keywords.

Obtaining information about the user interests may include obtaining information about current user activity on the device. Obtaining information about the user interests may further include obtaining contextual information about current user activity on the local network.

Extracting data related to said data of interest to the user may include forming a query based on potential data of interest, to search for said related data. The query is executed to search for related information on a local network and/or external sources. The device may comprise a consumer electronics device, and executing the query may include searching the Internet for said related data.

These and other features, aspects and advantages of the present invention will become understood with reference to the following description, appended claims and accompanying figures.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 shows a network implementing a process for facilitating information searching for users, according to an embodiment of the present invention.

FIG. 2 shows an example architecture for facilitating information searching, according to the invention.

FIG. 3 shows a flowchart of the overall steps involved in facilitating information searching, according to the invention.

FIG. 4 shows an example of keyword selection, according to the invention.

FIG. 5 shows an example of tokenizing, according to the invention.

FIG. 6 shows an architecture of facilitating searching using execution plans, according to the invention.

DETAILED DESCRIPTION OF THE INVENTION

The present invention provides a method and system for facilitating access to information via electronic devices such as consumer electronic (CE) devices. One embodiment involves enabling home users to easily find and access Internet content related to content presented on a CE device. An example is enabling a user to easily find and access Internet content related to a program the user is watching on a television. The user is now able to access relevant information and video content on the Internet in a “lean-back” living room experience while watching TV.

Searching for information on the Internet typically involves two stages: search query formation, and data search and analysis. Query information involves forming a search query that describes the type of information being sought. Data search and analysis involves resolving the search query according to the following steps: potential sources of data are identified; relevant data from such sources are extracted via search queries and then aggregated (collected); and correlations in the form of associations among the aggregated data are identified to make the results more meaningful.

An example implementation for CE devices in a local area network (LAN), such as a home network, is described below, however the present invention is useful with other electronic devices, and electronic devices that are not in a LAN but have access to the Internet. FIG. 1 shows a functional architecture of an example network/system 10, such as a LAN of home devices, embodying aspects of the present invention. The network/system 10 comprises devices 20 such as appliances, a personal computer (PC) 21, CE devices 30 which may include content, and an interface 40 that connects the network/system 10 to an external network 50 (e.g., another local network, the Internet). The external network 50 can be connected to one or more servers 51. The network/system 10 can implement the Universal Plug and Play (UPnP) protocol or other network communication protocols (e.g., Jini, HAVi, IEEE 1394, etc.). The network/system 10 can be a wireless network, a wired network, or a combination thereof. Examples of CE devices include digital televisions (DTVs, PDAs, media players, etc.).

The network 10 further includes a search facilitator system 24 that provides searching, aggregation and analysis functions. The facilitator 24 performs query formation, data search and analysis, wherein query formation includes identifying potential search queries (i.e., potential data of interest to the user) based on the user's context. Further, data search and analysis includes extracting, aggregating and correlating data of interest using execution plans.

FIG. 2 shows an architecture 60 including the facilitator 24, according to an embodiment of the present invention. The architecture 60 implements searching, aggregation and analysis functions via the facilitator 24, to provide information to a user through a user interface of a client module 64 that, in this example, is implemented in a CE device such as the DTV 30. Referring to the example process 45 in FIG. 3, in order to free users from the involved process of query formation, the facilitator 24 identifies potential data of interest to the user (step 46), extracts data related to data of interest to the user (step 47), aggregates the extracted data (step 48) and correlates the said related data to present to the user (step 49).

Information of interest to the user, or user-related information, may include one or more of: user profile, user preferences, content previously/currently accessed by the user, terms previously selected by the user, etc.

In one example, the client module 64 enables the user to obtain desired information from, e.g., the Internet using a simple and intuitive Graphical User Interface (GUI) application, utilizing the facilitator 24, including:

- 1. Mapping the functionalities that support an information search to a small number of keys (e.g., mapping such functionalities to a few keys of a TV remote control 31 (FIG. 1), as an example for receiving user input when using a DTV 30 for information access).
- 2. Enabling the user to express interest in obtaining additional information related to information currently accessed by the user (e.g., providing an “info.” button on the remote control 31 for the user to press, and mapping this action into a “more info” request, etc.).
- 3. Enabling the user to indicate the specific type of additional information the user is looking for, after the user has expressed interest in accessing additional information. An example involves displaying a set of keywords related to the data that the user has expressed interest in (e.g., a TV program or media content the user is accessing). Then, providing a combination of keys (e.g., up/down/right/left arrow keys) on a remote control 31 for the user to select one of the keywords as a search query.
- 4. Enabling the user to refine or edit a suggested keyword/search query, such as by displaying a set of additional query suggestions that contain, or are related to, the selected keyword and providing a combination of the arrow keys (up/down/right/left arrows) on the remote control 31 for the user to select one of the query suggestions. The GUI allows the user to refine the search queries as many times as the user desires by just repeating the process described above. Further, the query suggestions are displayed in an editable text box that allows the user to delete existing characters or enter new characters to modify the query as desired. This can be performed using, e.g., a wireless keyboard or a remote control that has an inbuilt keypad.
- 5. Performing a search based on a formulated query. Then, enabling the user to access the search results by displaying a list of search results corresponding to the keyword previously selected by the user. Then, providing a combination of arrow keys (up/down/right/left arrows) on the remote control device 31 for the user to select one of the refined search results. An example of a search result includes a link to a web page containing information about the search query, wherein the title of the web page is displayed to the user on the GUI.

The user utilizes the client module 64 to access certain content, and the facilitator 24 obtains information related to the accessed content for display to the user. The user then requests that the facilitator 24 provide more information about the accessed content. For example, the user utilizes the client module 64 to request that the facilitator 24 provide more information from Internet data sources 66 about a pre-recorded/broadcast TV program the user is watching on the DTV 30.

Using the client module 64, the user can choose, edit or enter new queries (such as the suggested keywords/categories) with minimal effort on a CE device that may not have a keyboard/mouse. Specifically, the facilitator 24 suggests and displays queries including keywords related to the TV program and information categories related to those keywords. Using the suggested keywords and categories as search queries, users can seamlessly browse/search for related information available on the Internet through their CE devices by simply selecting among the suggested queries for searching. The facilitator 24 identifies many relevant search queries, and allows the user to edit a suggested query or enter a new query. The facilitator 24 then obtains information of interest to the user and presents such information to the user.

In the architecture shown in FIG. 2, the facilitator 24 includes a query identification function 25 and a query resolution function 27, which implement the above steps. Specifically, the query identification function 25 identifies potential data of interest to the user. The query resolution function 27 extracts identified data of potential interest to the user (e.g., from local sources 69 and/or Internet sources 66), aggregates the extracted data and correlates the aggregated data for presentation to the user. The operations of the query identification function 25 and the query resolution function 27 are described below.

In one example, the query identification function 25 identifies potential data of interest to the user, based on the user's current application state. Current application state refers to the state of the application that the user is using at the time the user desires to access relevant Internet content. For example, if the user is watching a television program on DTV 30, the channel the DTV is tuned to and the program being broadcast, constitute the application state.

The query identification function 25 identifies the content used/rendered by the application. Then, the query identification function 25 obtains metadata information and/or other associated data for the content being accessed, and identifies potential search queries that might represent the data of interest to the user. When a user accesses content that has structured meta-data available, the query identification function 25 directly uses field/value pairs from the metadata as potential search queries. For example, if a user is listening to a music album by the artist “Sting” and expresses interest to access related content, the query identification function 25 obtains the following fields from the album's metadata (content=“MusicAlbum” & artist=“Sting”) and using these, the query identification function 25 infers that the user might be interested to access more albums by the same artist and suggests (MusicAlbum, artist, “Sting”) as one of the search queries to the user.

When a user accesses content such as broadcast TV programs and DVDs, the query identification function 25 uses the caption data (closed captions), that is embedded in the content stream, to identify potential search queries. This embedded caption data contains useful information in the form of keywords. When a user watches a TV program and expresses interest to access related content, the query identification function 25 analyzes the TV program's caption text to identify significant keywords and suggests them to the user as possible search queries.

The query identification function 25 can be implemented, e.g., in a stand-alone module, in a device 20 such as a set-top box or in a CE device 30 such as a DTV. A user interface (UI) can be displayed on a device in the network/system 10 capable of displaying information, such as a CE device 30. An example of identifying keywords and suggesting them as possible search keywords by the query identification function 25 utilizing natural language processing (NLP) to analyze closed captions and identify keywords from the captions, is described below.

The closed captions (CC) of a TV program are embedded in the TV signal by the content provider before it is broadcast. They are primarily for the benefit of the hearing impaired. Extracting useful information from this text is not straightforward. The captions typically do not contain any case information, precluding any attempt to extract proper nouns based on case information. Also, they are often ungrammatical (e.g., because of the spoken nature of the content), poorly punctuated and may have typos. Because of these limitations, typical keyword extraction techniques used for text documents may not be suitable for closed caption text. In addition, the content of closed captions is highly dependent on the type of the program. A news program's captions are high-content and factual, whereas a sitcom's captions are typically low on content and full of slang. FIG. 4 shows a closed caption analyzer (CCA) 70 according to an embodiment of the present invention. The CCA extracts search queries from closed captions of a program using NLPs techniques and the Electronic Program Guide (EPG) information 75 to customize the extraction mechanism for different types of programs.

The CCA 70 operates in real-time on broadcast signals and processes a steady stream of closed caption text 74 entering the system. The CCA maintains two history windows over the stream of incoming text (FIG. 5). The smaller, most recent window, spans the last N (N=5 in our prototype) sentences (S_i) and the larger program wide window covers the entire TV program/current news story/current program section, etc. Only the keywords extracted from the program wide window are stored and indexed for recommendation. Also, the keywords extracted from the most recent window are ranked higher than others, such that the most recent keywords appear at the top of the list of keywords presented to the user. As soon as the program or the news story changes (indicated either by special characters in the closed captions, such as ‘>>>’ in the U.S., or determined by looking at the EPG and the current time) both the windows are flushed and restarted.

A CC Tokenizer 78 receives the stream of CC text 74 and breaks it down into sentences. This is done in order to preserve the grammar of the text. A tagger 73 then tags sentences, e.g., using Brill's part-of-speech tagging (Brill 1992). The tagger 73 analyzes the sentence and determines how each word is used in the sentence. The tagger 73 uses lexical rules to assign an initial tag to each word in a sentence, and then uses contextual rules to update the tag based on the context in which the word occurs. The contextual rules are sensitive to the grammar of the input sentence. Ungrammatical or incomplete sentences may result in incorrect tagging of the words in the sentence.

In one example, for an input sentence: “John Wayne ran home”:

The output of tagger 73 would be:

- John<PROP> Wayne<PROP> ran<VB_PST> home<NOUN>

This indicates that in the previous sentence, “John” and “Wayne” are used as proper nouns, “ran” is a verb in past tense and “home” is a noun.

This tagged sentence from the tagger 73 is then passed on to a rule engine 79 which extracts keywords from the tagged sentence based on extraction policy rules from a rule library 71. A rule library 71, R, is an exhaustive set of rules that can be used to extract different kinds of phrases appearing in the sentence. The rules are represented as tag patterns. For example, it may have a rule to extract consecutive proper nouns (<PROP>+) and another rule to extract an adjective followed by one or more nouns (<ADJ> <NOUN>+), etc. A rule selector 72 includes a mapping from genre to an extraction policy. The genre of the program being watched determines the type of keywords to extract from the captions. For example, if the program being watched is a high-content, factual program such as news, the extraction policy is highly aggressive, essentially extracting additional differing types of keywords (e.g., sequences of nouns, compound nouns, proper nouns etc.). On the other hand, if the program is a low-content, non-factual program such as a sitcom, a very conservative extraction policy is used, extracting keywords very selectively, extracting only those keywords considered as having a higher likelihood of being useful (e.g., only proper nouns). The rule engine 79 alters its extraction behavior depending upon the type of program being watched.

Each extraction policy, P_e, corresponds to a subset of the rules in R. This mapping can either be preset, or it can be learned. The mapping essentially defines the kinds of patterns to be used for extracting keywords 76 from a particular type (genre) of program. In one example, the mapping can be determined by conducting a small user study involving four subjects asked to mark the keywords they would like to search for from CC transcripts of four types of sample programs: News, Sitcom, Talk Show and Reality TV. The transcripts were then tagged using Brill's tagger and the tags of the marked keywords were extracted as rules (e.g., if the keyword “Global Warming” in a news program was marked, and if the words were tagged “Global<ADJ> Warming<NOUN>”, then “<ADJ> <NOUN>” is extracted as a rule for the genre “news”). The top ranking rules (based on frequency and a threshold) were used as the rules that form the extraction policy for that kind of program and the union of all rules for all types of programs forms R. This facilitates reusability of rules and extraction policies. The rule engine 79 applies the extraction policy on the text received from the tagger 73 and extracts keywords from it. These keywords are then weighted based on whether they occur in the most recent window. The weighted keywords are then ordered and presented to the user.

The extracted keywords identify information of potential interest to the user. The query resolution function 27 enables extracting data related to identified data of potential interest to the user, aggregating the extracted data and correlating the aggregated data. Such correlation involves identifying associations between data. For example, data A is ‘similar to’ or the ‘same as’ data B.

The query resolution function 27 can be implemented, e.g., in a stand-alone module, in a device 20 such as a set-top box or in a CE device 30 such as a DTV. An example implementation of extracting, aggregating and correlating data by the query resolution function 27 utilizing query plans is described below. XML-based execution plans are provided which encapsulate the steps involved in a search query resolution process. An execution plan comprises one or more plan-steps and each plan-step essentially specifies the type of task (i.e., data extraction, aggregation or correlation) to be performed.

Further, special classes, termed RuleLets, are provided to execute the three tasks (i.e., data extraction, aggregation or correlation) in a typical query resolution process. The RuleLets are: GetDataRuleLet, MergeDataRuleLet and GetContentNotInHomeRuleLet. The GetDataRuleLet obtains data from different data sources, the MergeDataRuleLet merges data obtained from different data sources and the GetContentNotInHomeRuleLet identifies the data/content (from a collection of data extracted from different sources) that are not available on the home devices.

A plan-step essentially specifies the RuleLet to be executed and the set of input and output parameters required for the execution of the RuleLet. The specific fields in a plan-step include the name of the RuleLet to be executed, the input data required for the RuleLet execution, the output-type expected from the execution of the RuleLet and the scope of the desired output data (if applicable). The scope field is used to specify whether the required data should be available in the home (“Local”) or on the “Internet.” In order to cater to different kinds of search queries, a plan library containing different kinds of plans is maintained. When a user chooses a search query, the query resolution function 27 identifies a plan based on the context of the user (e.g., if the user is watching a TV program, DVD or music video, or listening to a music album).

The use of execution plans in a search scenario in conjunction with example execution plans is described below. The search scenario involves a case where a user is watching a broadcast documentary program titled “Drumming Techniques” on a TV. When the user expresses interest to access related Internet content, the search facilitator 24 identifies and displays potential search queries from the program's closed captions (using the techniques described above) by executing the following plan steps: obtain the EPG related to the TV program being watched by the user; obtain keywords from the EPG information obtained in the previous step; obtain the genre of the TV program; based on the genre obtain significant keywords from the closed captions of the TV program; and merge the keywords identified from the EPG and the closed captions. An XML version of such a plan comprises:

<?xml version=“1.0” ?>

<Plan>

<Plan-step>

<RuleLet>GetDataRule</RuleLet>

<OutputType>EPGInfo</OutputType>

<Scope>Internet</Scope>

</Plan-step>

<Plan-step>

<RuleLet>GetDataRule</RuleLet>

<InputType>EPGInfo</InputType

<OutputType>KeywordsFromEPG</OutputType>

<Scope>Local</Scope>

</Plan-step>

<Plan-step>

<RuleLet>GetDataRule</RuleLet>

<OutputType>ProgramGenre</OutputType>

<Scope>Local</Scope>

</Plan-step>

<Plan-step>

<RuleLet>GetDataRule</RuleLet>

<InputType>ProgramGenre</InputType

<OutputType>KeywordsFromCaptions</OutputType>

<Scope>Internet</Scope>

</Plan-step>

<Plan-step>

<RuleLet>MergeDataRule</RuleLet>

<InputType>KeywordsFromEPG</InputType>

<InputType>KeywordsFromCaptions</InputType>

<OutputType>LiveTVKeywords</OutputType>

<Scope>Local</Scope>

</Plan-step>

- </Plan>

The keywords obtained by executing this plan are then displayed to the user. One of the keywords/potential search queries displayed is: “Polyrthymic Drumming”. The user chooses “Polyrthymic Drumming” and expresses interest to see more related videos that the user has not seen before. To resolve this request, the facilitator 24 executes a plan, with “Polyrthymic Drumming” set as the keyword, including the plan steps: obtain videos related to the keyword (“Polyrthymic Drumming”) that are available on the Internet sources 66 (FIG. 2); identify pre-recorded videos available in the home related to “Polyrthymic Drumming”; filter out videos in the list resulting after the last step that are already available in the local sources 69. An XML version of such a plan comprises:

<?xml version=“1.0” ?>

<Plan>

<Plan-step>

<RuleLet>GetDataRule</RuleLet>

<InputType>Keyword </InputType

<OutputType>RelatedVideos</OutputType>

<Scope>Internet</Scope>

</Plan-step>

<Plan-step>

<RuleLet>GetDataRule</RuleLet>

<InputType>Keyword </InputType>

<OutputType>RecordedVideos</OutputType>

<Scope>Local</Scope>

</Plan-step>

<Plan-step>

<RuleLet>GetContentNotInHomeRule</RuleLet>

<InputType>RelatedVideos</InputType>

<InputType>RecordedVideos</InputType>

<OutputType>InternetVideosNotInHome</OutputType>

<Scope>Local</Scope>

</Plan-step>

</Plan>

The related Internet videos that are not already available in the local sources 69 are displayed to the user on the client module.

FIG. 6 shows an example functional architecture 80 for the facilitator system implemented as a context-specific search facilitator system (CSF) 82. The CSF 82 provides query identification functions (e.g., keyword extraction) and query resolution functions (e.g., data extraction, aggregation and correlation), as described above. The CSF 82 includes different layers to enable seamless CE device and Internet content from the data sources 81 for search and access.

The CSF 82 includes a data and query processing (DQP) layer 83. The DQP 83 assists in resolving user queries and also provides an API for client applications 64 to make use of. Though client applications 64 are shown external to the CSF 82, the client applications 64 can also be components of the CSF 82. The DQP 83 includes a query execution planner (QEP) 84 and an information source manager (ISM) 85. The CSF 82 further includes a data execution (DE) layer 86. The DE 86 includes a data extraction manager (DEM) 87 and multiple plug-ins 88.

The QEP 84 provides interfaces for client applications to search for and access locally available data (i.e., data stored on the devices 30 and/or 20) and related data available on the Internet. The QEP 84 maintains a plan library 89, containing a set of pre-defined execution plans that are used to resolve requests for data. The QEP 84 also maintains the RuleLet 90 classes that are executed as part of a plan. When the QEP 84 receives a query from a client application, the QEP 84 retrieves the relevant plan from its plan library 89 and executes it. During the plan execution, the QEP 84 gathers the information/content requested by the user using the plug-ins 88 in the data extraction layer 86 (via the ISM 85). The ISM 85 manages a directory containing details about the types of data each data extraction plug-in component could extract and the input data (if any) expected by the plug-ins 88 to do so. This allows the QEP 84 to identify the plug-in 88 that provides a specific type of data.

The DE 86 includes many plug-ins 88 for extracting content/information from local and Internet data sources. 81 Local data sources refer to, e.g., home devices. Internet data sources include seed sources (e.g., BarnesandNoble.com, YouTube.com) and Internet search engines (e.g., Google, Yahoo). The functionalities provided by the different plug-ins 88 include: (1) A web scraper plug-in allows extracting specific information from specific websites; (2) A content manager plug-in allows accessing media content stored on the home devices; (3) An Internet video search plug-in allows searching for and accessing video content on the Internet; (4) A closed caption analyzer plug-in allows analyzing and identifying keywords from TV captions; and, (5) An EPG plug-in allows obtaining the EPG information for TV programs.

The DE 86 manages the plug-ins 88 and allows new plug-ins 88 to be added or removed with minimal code changes and provides an application programming interface for the higher-level components to use the plug-ins.

As an example of a search facilitation process by the CSF 82, according to the present invention, wherein a TV viewer accesses the Internet is as follows A user Trisha is watching a TV program on her TV about “Drumming Techniques” and is intrigued by the video. She wishes to learn more about the topics discussed in the program, especially about “Polyrhythmic drumming” which has just been mentioned. She presses a button on her TV remote control 31 and finds a host of information regarding the program being watched. A UI graphic on the client module screen shows two menus. One menu 64A provides a list of keywords related to the TV program (assembled by the query identification function of the CSF 82), and the first keyword “Polyrhythmic Drumming” is highlighted. The other menu 64B shows a list of search results (assembled by the query resolution function of the CSF 82) including web links containing information and/or videos related to the keyword “Polyrhythmic Drumming”. Trisha notices that the second link on this menu is a “how to” video. Using the navigation buttons on her remote control she highlights this link, and then presses the “enter” button to select the video and start viewing it.

The above scenario illustrates the following essential features: first, the user need not enter text or queries at any point; interaction is via the navigation buttons on a conventional remote control. Second, the user is able to access desired related Internet information by pushing a few buttons, as there is no need to bring up a search page or enter search terms. In this scenario, the context of the user (the program being watched), helps focus the search to relevant content.

The process for providing relevant information to a user of a CE device on a local network such as a home network generally involves:

- 1. Gathering information about current activities of the user on the local network (e.g., listening to a song, watching a TV program);
- 2. Gathering contextual information about current user activity on the local network (e.g., finding the metadata of a song or a TV program);
- 3. Obtaining additional information interrelated to the information gathered in the above steps from other sources, such as the devices on the local network and/or information from external sources such as the Internet (e.g., obtaining information related to a song or a TV program);
- 4. Identifying correlations in the information obtained in the above steps;
- 5. Using the correlations in forming queries to search for information in local and/or external sources such as the Internet; and
- 6. Presenting the search results to the user as information related to the current user activity (i.e., information of interest to the user).

Identifying correlations can be performed in one or more of the following example ways: (1) identifying correlations between information about current user activity and the interrelated information obtained from local sources, (2) identifying correlations between information about current user activity and the interrelated information obtained from external sources, and (3) identifying correlations between information about current user activity and the interrelated information obtained from local and external sources.

In order to minimize the number of keystrokes a user has to enter to receive information related to the current user activity, functionalities that support information searching are mapped to a small number of keys (e.g., mapping searches to a few keys of a remote control). Then, certain information is gathered about current user activity on CE devices. This includes obtaining metadata contained in media that is accessible only by content-rendering CE devices (e.g., the length and type of content contained in a CD or a DVD).

The process further involves obtaining information embedded in broadcast streams that are accessible only by a receiving/rendering CE device (e.g., subtitles and closed captions). In addition, information is gathered about content already existing on the home network (e.g., songs by Sting that are already owned by the user and the corresponding metadata). Further information is gathered about relevant structured data that exists on the Internet (e.g., gathering metadata about the songs already owned by the user from a compact disk database (CDDB)). Additional relevant information is obtained from semi-structured data that exists on the Internet (e.g., the biography of an artist from the Internet Movie Database (IMDb) and/or from the relevant web pages). Further relevant information is gathered from unstructured data that exists on the Internet (e.g., URLs of the web pages carrying the geographical, economical, political and cultural information about the place from which main events are being reported in the news).

The gathered/obtained information defines the information at hand. Then, when a user operates a CE device, what the user inputs to a CE device is correlated with the information at hand to automatically form queries to search for related information. This minimizes the need for the user to generate queries or use a keyboard in forming queries.

Then, from the information at hand, the data extracted from the Internet sources is correlated with the data extracted from home network content to form a query plan to refine the queries for precise searching. The query plan is then executed for searching the queries on the external network (e.g., the Internet, other resources), without requiring user intervention. The query execution results, in the form of search results, are then presented to the user. Preferably, based on the information at hand, the most relevant information from the search results is selected for presentation to the user, without requiring user intervention. Therefore, the information presented to the user includes information of potential interest to the user as related to the information at hand.

Another example of facilitating searches for the user involves obtaining information about current user activity on a local network, obtaining contextual information about current user activity on the local network, obtaining additional information interrelated to the contextual information and the user activity information, identifying correlations between the additional information, the contextual information and the user activity information, and using the correlations in forming a query to search for information related to the current user activity.

Obtaining additional information may include obtaining additional information interrelated to the contextual information and the user activity information, from sources including the local network and/or external sources. Identifying correlations may include identifying correlations between information about current user activity and interrelated information obtained from local sources. Identifying correlations may include identifying correlations between information about current user activity and the interrelated information obtained from external sources. Identifying correlations may include identifying correlations between information about current user activity and the interrelated information obtained from local and external sources.

Forming a query includes automatically forming a query, without requiring user intervention. The query is executed to obtain search results including information related to the current user activity. Executing the query further may include executing the query to search for related information on the local network and/or external sources. The search results may be presented to the user at this stage on a user interface in a device such as a CE device.

Obtaining information about current user activity on the local network may include obtaining information from user input to the device, or obtaining information from applications running in the network. Obtaining additional information may include obtaining the additional information from external structured data sources. Obtaining additional information may include obtaining additional information that is relevant to user interests from local media content sources.

Obtaining additional information may include obtaining the additional information from external unstructured data sources, from external semi-structured data sources, or from external broadcast data sources.

Obtaining contextual information about current user activity on the local network may include obtaining associated metadata available on the local network. As such forming a query may include using metadata related to the content on the local network for determining a context for query formation. Further, determining a context for query formation may include using metadata related to the content in the network and information from applications on the local network, to determine a context for query formation without requiring user intervention. The query may be used to search the Internet for information related to the current user activity or interest. As such, the above processes also enable improved access to the Internet to the users of CE devices.

As is known to those skilled in the art, the aforementioned example architectures described above, according to the present invention, can be implemented in many ways, such as program instructions for execution by a processor, program product stored on a computer useable medium, computer implemented method, as logic circuits, as an application specific integrated circuit, as firmware, etc. The present invention has been described in considerable detail with reference to certain preferred versions thereof; however, other versions are possible. Therefore, the spirit and scope of the appended claims should not be limited to the description of the preferred versions contained herein.

Claims

1. A method of facilitating information searching for a user of an electronic device, comprising: playing currently accessed multimedia data;placing all words from a closed caption stream for the currently accessed multimedia data in a program wide memory window;keeping a set of most recently received words from the closed caption stream in a most recent memory window distinct from the program wide memory window;extracting keywords from the words stored in the program wide memory window and the most recent memory window;weighting the extracted keywords, wherein keywords contained both in the most recent memory window and the program wide memory window are given greater weight than keywords contained only in the program wide memory window;presenting the weighted extracted keywords to the user for selection, wherein keywords with greater weight are displayed more prominently than those keywords having lesser weight;receiving a selection of one or more keywords from the presented weighted extracted keywords;performing a search using a search query that is based on the selected one or more keywords; andcausing search results from the search to be displayed to the user.
2. The method of claim 1, wherein extracting keywords includes monitoring user access to content, selecting a set of phrase extraction rules for information extraction, and extracting key information from metadata for the content based on the selected phrase extraction rules.
3. The method of claim 2, wherein selecting a set of phrase extraction rules further includes selecting a set of phrase extraction rules based on the content genre.
4. The method of claim 3, wherein selecting a set of phrase extraction rules further includes selecting a set of phrase extraction rules from a rules library based on the content genre, wherein the rules library includes a list of rules for extracting various keywords.
5. The method of claim 2, further comprising; mapping a currently accessed content genre to a phrase extraction policy, wherein the phrase extraction policy includes determining type of keywords for extraction based on the currently accessed content genre, and wherein said mapping defines pattern types used for extracting keywords for a particular genre of a plurality of genres, from the currently accessed multimedia data.
6. The method of claim 5, wherein determining information about the user interests further includes obtaining contextual information about current user activity on the local network.
7. The method of claim 1, wherein performing a search includes searching the Internet.
8. The method of claim 1, further comprising purging both the program wide memory window and the most recent memory window are purged when the currently accessed multimedia data has completed playing.
9. The method of claim 8, wherein it is determined that the currently accessed multimedia data has completed playing by finding a predesignated character sequence in the closed caption screen identifying the end of a section.
10. The method of claim 8, wherein it is determined that the currently accessed multimedia data has completed playing by comparing a current time to an ending time identified for the currently accessed multimedia data in an electronic program guide entry for the currently accessed multimedia data.
11. An apparatus for facilitating information searching for a user of an electronic device, comprising: employing a hardware processor coupled with:an extractor configured forplacing words from a closed caption stream for the currently accessed multimedia data in a program wide memory window;keeping a set of most recently received words from the closed caption stream in a most recent memory window distinct from the program wide memory window; andextracting keywords from the words stored in the program wide memory window and the most recent memory window;a collector for collecting the extracted keywords for weighting the extracted keywords, wherein keywords contained both in the most recent memory window and the program wide memory window are given greater weight than keywords contained only in the program wide memory window and presentation to the user for selection as a search query, wherein the extracted keywords with greater weight are displayed more prominently than those keywords having lesser weight; and a facilitator configured for performing a search using the search query.
12. The apparatus of claim 11, wherein the device comprises a consumer electronics device.
13. The apparatus of claim 12, wherein the extractor is further configured for executing the query by searching the Internet for said related data.
14. A system for facilitating information searching for a user, comprising: an electronic device for access to content; anda facilitator including:
15. A program product stored on a computer useable medium for facilitating information searching for a user of an electronic device, the program product comprising program code instructions for causing a system to perform; determining contextual information form currently accessed multimedia data about the user interests;at a client side, identifying data of potential interest to the user base on the contextual information; extracting data related to said data of potential interest to the user and forming a search query to access further information related to data of potential interest to the user, wherein forming the search query comprises extracting keywords from the currently accessed multimedia data and extracting a phrase from the extracted keywords based on a phrase based rule wherein the extracting the keywords includes: placing words from a closed caption stream for the currently accessed multimedia data in a program wide memory window;keeping a set of most recently received words from the closed caption stream in a most recent memory window distinct from the program wide memory window;weighting the extracted keywords, wherein keywords contained both in the most recent memory window and the program wide memory window are given greater weight than keywords contained only in the program wide memory window;presenting the weighted extracted keywords to the user in such a way that keywords contained with greater weight are displayed more prominently than those keywords having lesser weight;receiving a selection of one or more keywords from the presented weighted extracted keywords;performing a search using a search query that is based on the selected one or more keywords; andcollecting the extracted keywords and search results for presentation to the user on the device as additional information related to data of potential interest to the user.
16. An apparatus for facilitating information searching for a user, comprising: means for playing currently accessed multimedia data;means for placing all words from a closed caption stream for the currently accessed multimedia data in a program wide memory window;means for keeping a set of most recently received words from the closed caption stream in a most recent memory window distinct from the program wide memory window;means for extracting keywords from the words stored in the program wide memory window and the most recent memory window;means for weighting the extracted keywords, wherein keywords contained both in the most recent memory window and the program wide memory window are given greater weight than keywords contained only in the program wide memory window;means for presenting the weighted extracted keywords to the user for selection, wherein keywords with greater weight are displayed more prominently than those keywords having lesser weight;means for receiving a selection of one or more keywords from the presented weighted extracted keywords;means for performing a search using a search query that is based on the selected one or more keywords; andmeans for causing search results from the search to be displayed to the user.
17. A non-transitory program storage device readable by a machine tangibly embodying a program of instructions executable by the machine to perform a method for facilitating information searching for a user, the method comprising: playing currently accessed multimedia data;placing all words from a closed caption stream for the currently accessed multimedia data in a program wide memory window;keeping a set of most recently received words from the closed caption stream in a most recent memory window distinct from the program wide memory window;extracting keywords from the words stored in the program wide memory window and the most recent memory window;weighting the extracted keywords, wherein keywords contained both in the most recent memory window and the program wide memory window are given greater weight than keywords contained only in the program wide memory window;presenting the weighted extracted keywords to the user for selection, wherein keywords with greater weight are displayed more prominently than those keywords having lesser weight;receiving a selection of one or more keywords from the presented weighted extracted keywords;performing a search using a search query that is based on the selected one or more keywords; andcausing search results from the search to be displayed to the user.

RELATED APPLICATION

This application claims priority from U.S. Provisional Patent Application Ser. No. 60/898,257, filed on Jan. 29, 2007. This application is further a Continuation-in-Part of U.S. patent application Ser. No. 11/713,370, filed on Mar. 1, 2007, which in turn claims the benefit of priority of U.S. Provisional Patent Application Ser. No. 60/780,400, filed on Mar. 7, 2006. This application is further a Continuation-in-Part of U.S. patent application Ser. No. 11/713,350, filed on Mar. 1, 2007, which in turn claims the benefit of priority of U.S. Provisional Patent Application Ser. No. 60/780,398, filed on Mar. 7, 2006. This application is further a Continuation-in-Part of U.S. patent application Ser. No. 11/713,312, filed on Mar. 1, 2007, which in turn claims the benefit of priority of U.S. Provisional Patent Application Ser. No. 60/780,105, filed on Mar. 7, 2006. This application is further a Continuation-in-Part of U.S. patent application Ser. No. 11/725,865, filed on Mar. 20, 2007. This application is further a Continuation-in-Part of U.S. patent application Ser. No. 11/726,340, filed on Mar. 21, 2007. This application is further a Continuation-in-Part of U.S. patent application Ser. No. 11/732,887, filed on Apr. 5, 2007. This application is further a Continuation-in-Part of U.S. patent application Ser. No. 11/789,609, filed on Apr. 25, 2007. This application is further a Continuation-in-Part of U.S. patent application Ser. No. 11/803,826, filed on May 15, 2007. This application is further a Continuation-in-Part of U.S. patent application Ser. No. 11/821,938, filed on Jun. 26, 2007, which in turn claims the benefit of priority of U.S. Provisional Patent Application Ser. No. 60/903,962, filed on Feb. 28, 2007. This application is further a Continuation-in-Part of U.S. patent application Ser. No. 11/823,005, filed on Jun. 26, 2007, which in turn claims the benefit of priority of U.S. Provisional Patent Application Ser. No. 60/904,044, filed on Feb. 28, 2007. This application is further a Continuation-in-Part of U.S. patent application Ser. No. 11/825,161, filed on Jul. 5, 2007, which in turn claims the benefit of priority of U.S. Provisional Patent Application Ser. No. 60/904,044, filed on Feb. 28, 2007. This application is further a Continuation-in-Part of U.S. patent application Ser. No. 11/879,569, filed on Jul. 20, 2007. This application is further related to co-pending U.S. patent application Ser. No. 11/981,019, filed on Oct. 31, 2007. This application is further a Continuation-in-Part of U.S. patent application Ser. No. 11/969,837, filed on Jan. 4, 2008, which in turn claims the benefit of priority of U.S. Provisional Patent Application Ser. No. 60/906,082, filed on Mar. 9, 2007.

US Referenced Citations (153)

Number	Name	Date	Kind
5481296	Cragun et al.	Jan 1996	A
5703655	Corey et al.	Dec 1997	A
5790935	Payton	Aug 1998	A
5974406	Bisdikian et al.	Oct 1999	A
5983214	Lang et al.	Nov 1999	A
5983237	Jain et al.	Nov 1999	A
5995959	Friedman et al.	Nov 1999	A
6151603	Wolfe	Nov 2000	A
6253238	Lauder et al.	Jun 2001	B1
6317710	Huang et al.	Nov 2001	B1
6334127	Bieganski et al.	Dec 2001	B1
6412073	Rangan	Jun 2002	B1
6438579	Hosken	Aug 2002	B1
6480844	Cortes et al.	Nov 2002	B1
6637028	Voyticky et al.	Oct 2003	B1
6714909	Gibbon et al.	Mar 2004	B1
6721748	Knight et al.	Apr 2004	B1
6766523	Herley	Jul 2004	B2
6774926	Ellis et al.	Aug 2004	B1
6801895	Huang et al.	Oct 2004	B1
6807675	Maillard	Oct 2004	B1
6826512	Dara-Abrams et al.	Nov 2004	B2
6842877	Robarts et al.	Jan 2005	B2
6954755	Reisman	Oct 2005	B2
6981040	Konig et al.	Dec 2005	B1
7028024	Kommers et al.	Apr 2006	B1
7054875	Keith, Jr.	May 2006	B2
7062561	Reisman	Jun 2006	B1
7069575	Goode et al.	Jun 2006	B1
7110998	Bhandari et al.	Sep 2006	B1
7158961	Charikar	Jan 2007	B1
7158986	Oliver et al.	Jan 2007	B1
7162473	Dumais et al.	Jan 2007	B2
7165080	Kotcheff et al.	Jan 2007	B2
7181438	Szabo	Feb 2007	B1
7184959	Gibbon et al.	Feb 2007	B2
7194460	Komamura	Mar 2007	B2
7203940	Barmettler et al.	Apr 2007	B2
7225187	Dumais et al.	May 2007	B2
7284202	Zenith	Oct 2007	B1
7305384	Omoigui	Dec 2007	B2
7343365	Farnham et al.	Mar 2008	B2
7363294	Billsus et al.	Apr 2008	B2
7386542	Maybury et al.	Jun 2008	B2
7389224	Elworthy	Jun 2008	B1
7389307	Golding	Jun 2008	B2
7433935	Obert	Oct 2008	B1
7552114	Zhang et al.	Jun 2009	B2
7565345	Bailey et al.	Jul 2009	B2
7593921	Goronzy et al.	Sep 2009	B2
7603349	Kraft et al.	Oct 2009	B1
7617176	Zeng et al.	Nov 2009	B2
7634461	Oral et al.	Dec 2009	B2
7657518	Budzik et al.	Feb 2010	B2
7664734	Lawrence et al.	Feb 2010	B2
7685192	Scofield et al.	Mar 2010	B1
7716158	McConnell	May 2010	B2
7716199	Guha	May 2010	B2
7793326	McCoskey	Sep 2010	B2
8060905	Hendricks	Nov 2011	B1
8065697	Wright et al.	Nov 2011	B2
8115869	Rathod et al.	Feb 2012	B2
20010003214	Shastri et al.	Jun 2001	A1
20010023433	Natsubori et al.	Sep 2001	A1
20020022491	McCann et al.	Feb 2002	A1
20020026436	Joory	Feb 2002	A1
20020087535	Kotcheff et al.	Jul 2002	A1
20020161767	Shapiro et al.	Oct 2002	A1
20020162121	Mitchell	Oct 2002	A1
20030028889	McCoskey	Feb 2003	A1
20030033273	Wyse	Feb 2003	A1
20030105682	Dicker et al.	Jun 2003	A1
20030131013	Pope et al.	Jul 2003	A1
20030158855	Farnham et al.	Aug 2003	A1
20030172075	Reisman	Sep 2003	A1
20030184582	Cohen	Oct 2003	A1
20030221198	Sloo	Nov 2003	A1
20030229900	Reisman	Dec 2003	A1
20030231868	Herley	Dec 2003	A1
20040031058	Reisman	Feb 2004	A1
20040073944	Booth	Apr 2004	A1
20040194141	Sanders	Sep 2004	A1
20040244038	Utsuki et al.	Dec 2004	A1
20040249790	Komamura	Dec 2004	A1
20050004910	Trepess	Jan 2005	A1
20050137996	Billsus et al.	Jun 2005	A1
20050144158	Capper et al.	Jun 2005	A1
20050154711	McConnell	Jul 2005	A1
20050160460	Fujiwara et al.	Jul 2005	A1
20050177555	Alpert et al.	Aug 2005	A1
20050240580	Zamir et al.	Oct 2005	A1
20050246726	Labrou et al.	Nov 2005	A1
20050289599	Matsuura et al.	Dec 2005	A1
20060026152	Zeng et al.	Feb 2006	A1
20060028682	Haines	Feb 2006	A1
20060036593	Dean et al.	Feb 2006	A1
20060066573	Matsumoto	Mar 2006	A1
20060074883	Teevan et al.	Apr 2006	A1
20060084430	Ng	Apr 2006	A1
20060095415	Sattler et al.	May 2006	A1
20060106764	Girgensohn et al.	May 2006	A1
20060133391	Kang et al.	Jun 2006	A1
20060136670	Brown et al.	Jun 2006	A1
20060156326	Goronzy et al.	Jul 2006	A1
20060161542	Cucerzan et al.	Jul 2006	A1
20060195362	Jacobi et al.	Aug 2006	A1
20060212897	Li et al.	Sep 2006	A1
20060242283	Shaik et al.	Oct 2006	A1
20070043703	Bhattacharya et al.	Feb 2007	A1
20070050346	Goel et al.	Mar 2007	A1
20070061222	Allocca et al.	Mar 2007	A1
20070061352	Dimitrova et al.	Mar 2007	A1
20070073894	Erickson et al.	Mar 2007	A1
20070078822	Cuzerzan et al.	Apr 2007	A1
20070107019	Romano et al.	May 2007	A1
20070130585	Perret et al.	Jun 2007	A1
20070143266	Tang et al.	Jun 2007	A1
20070156447	Kim et al.	Jul 2007	A1
20070179776	Segond et al.	Aug 2007	A1
20070198485	Ramer et al.	Aug 2007	A1
20070198500	Lucovsky et al.	Aug 2007	A1
20070198508	Yoshimura	Aug 2007	A1
20070214123	Messer et al.	Sep 2007	A1
20070214488	Nguyen et al.	Sep 2007	A1
20070220037	Srivastava et al.	Sep 2007	A1
20070233287	Sheshagiri et al.	Oct 2007	A1
20070300078	Ochi et al.	Dec 2007	A1
20080040316	Lawrence	Feb 2008	A1
20080082744	Nakagawa	Apr 2008	A1
20080114751	Cramer et al.	May 2008	A1
20080133501	Andersen et al.	Jun 2008	A1
20080133504	Messer et al.	Jun 2008	A1
20080162651	Madnani	Jul 2008	A1
20080162731	Kauppinen et al.	Jul 2008	A1
20080183596	Nash et al.	Jul 2008	A1
20080183681	Messer et al.	Jul 2008	A1
20080183698	Messer et al.	Jul 2008	A1
20080204595	Rathod et al.	Aug 2008	A1
20080208839	Sheshagiri et al.	Aug 2008	A1
20080235209	Rathod et al.	Sep 2008	A1
20080235393	Kunjithapatham et al.	Sep 2008	A1
20080242279	Ramer et al.	Oct 2008	A1
20080250010	Rathod et al.	Oct 2008	A1
20080266449	Rathod et al.	Oct 2008	A1
20080288641	Messer et al.	Nov 2008	A1
20090029687	Ramer et al.	Jan 2009	A1
20090055393	Messer	Feb 2009	A1
20090077065	Song et al.	Mar 2009	A1
20090112848	Kunjithapatham et al.	Apr 2009	A1
20100070895	Messer	Mar 2010	A1
20100091182	Gibbon et al.	Apr 2010	A1
20100191619	Dicker et al.	Jul 2010	A1
20100293165	Eldering et al.	Nov 2010	A1

Foreign Referenced Citations (15)

Number	Date	Country
1393107	Jan 2003	CN
1585947	Feb 2005	CN
1723458	Jan 2006	CN
1808430	Jul 2006	CN
2003-099442	Apr 2003	JP
10-2002-0005147	Jan 2002	KR
10-2002-0006810	Jan 2002	KR
10-2004-0052339	Jun 2004	KR
10-2006-0027226	Mar 2006	KR
WO 0137465	May 2001	WO
WO 0243310	May 2002	WO
WO 0243310	May 2002	WO
WO 03042866	May 2003	WO
WO 2005055196	Jun 2005	WO
WO 2007004110	Jan 2007	WO

Related Publications (1)

	Number	Date	Country
	20080183698 A1	Jul 2008	US

Provisional Applications (7)

Number	Date	Country
60898257	Jan 2007	US
60780400	Mar 2006	US
60780398	Mar 2006	US
60780105	Mar 2006	US
60903962	Feb 2007	US
60904044	Feb 2007	US
60906082	Mar 2007	US

Continuation in Parts (13)

	Number	Date	Country
Parent	11713370	Mar 2007	US
Child	11969778		US
Parent	11713350	Mar 2007	US
Child	11713370		US
Parent	11713312	Mar 2007	US
Child	11713350		US
Parent	11725865	Mar 2007	US
Child	11713312		US
Parent	11726340	Mar 2007	US
Child	11725865		US
Parent	11732887	Apr 2007	US
Child	11726340		US
Parent	11789609	Apr 2007	US
Child	11732887		US
Parent	11803826	May 2007	US
Child	11789609		US
Parent	11821938	Jun 2007	US
Child	11803826		US
Parent	11823005	Jun 2007	US
Child	11821938		US
Parent	11825161	Jul 2007	US
Child	11823005		US
Parent	11879569	Jul 2007	US
Child	11825161		US
Parent	11969837	Jan 2008	US
Child	11879569		US

Method and system for facilitating information searching on electronic devices

Information

Patent Number

Date Filed

Date Issued

Inventors

Original Assignees

Examiners

Agents

CPC

US Classifications

Field of Search

US

International Classifications

Disclaimer

Term Extension

Abstract