Service providers and device manufacturers are continually challenged to deliver value and convenience to consumers by, for example, providing compelling services and vast array of media and products. Service providers can provide various user interface applications for use on user equipment that enhance the user's interface experience with the user equipment and utilization of the various products and services offered by the service provider. While many devices offered today provide a user with the capability of communicating with other users and accessing various products and services, such devices either do not have the capability or have limited ability to provide a user with customized interactions with the device and services provided via the device. While current devices are privy to large amounts of contextual information about the user of the device, such devices do not utilize such contextual information in a manner that is useful to the user or in a manner that accentuates the services provided by service providers. Currently available user interface applications have clear limitations in their ability to utilize the vast amount of information passing therethrough, and thus fail to provide the user with an interface that can allow for the user to fully appreciate and utilize the various products and services offered by the service provider.
Therefore, there is a need for an approach for providing context attributes and informational links for media data.
According to one embodiment, a method comprises receiving media data on an apparatus, and receiving one or more context attributes related to the apparatus or accessed by the apparatus. The method further comprises determining whether the one or more context attributes relate to the media data, and causing, at least in part, display of the media data with the one or more context attributes that are determined to relate to the media data.
According to another embodiment, an apparatus comprising at least one processor, and at least one memory including computer program code, the at least one memory and the computer program code configured to, with the at least one processor, cause the apparatus to perform at least the following: receive media data on the apparatus; receive one or more context attributes related to the apparatus or accessed by the apparatus; determine whether the one or more context attributes relate to the media data; and cause, at least in part, display of the media data with the one or more context attributes that are determined to relate to the media data.
According to another embodiment, a computer-readable storage medium carrying one or more sequences of one or more instructions which, when executed by one or more processors, cause an apparatus to at least perform the following steps: receiving media data on an apparatus; receiving one or more context attributes related to the apparatus or accessed by the apparatus; determining whether the one or more context attributes relate to the media data; and causing, at least in part, display of the media data with the one or more context attributes that are determined to relate to the media data.
According to another embodiment, an apparatus comprises means for receiving media data on an apparatus, and means for receiving one or more context attributes related to the apparatus or accessed by the apparatus. The apparatus further comprises means for determining whether the one or more context attributes relate to the media data, and means for causing, at least in part, display of the media data with the one or more context attributes that are determined to relate to the media data.
According to another embodiment, a method comprises receiving media data on an apparatus, parsing the media data into one or more structured elements, determining one or more informational links that relate to the one or more structured elements of the media data, and causing, at least in part, display of the media data with the one or more informational links that are determined to relate to the one or more structured elements.
According to another embodiment, an apparatus comprises: at least one processor; and at least one memory including computer program code, the at least one memory and the computer program code configured to, with the at least one processor, cause the apparatus to perform at least the following, receive media data on the apparatus; parse the media data into one or more structured elements; determine one or more informational links that relate to the one or more structured elements of the media data; and cause, at least in part, display of the media data with the one or more informational links that are determined to relate to the one or more structured elements.
According to another embodiment, a computer-readable storage medium carrying one or more sequences of one or more instructions which, when executed by one or more processors, cause an apparatus to at least perform the following steps: receiving media data on an apparatus; parsing the media data into one or more structured elements; determining one or more informational links that relate to the one or more structured elements of the media data; and causing, at least in part, display of the media data with the one or more informational links that are determined to relate to the one or more structured elements.
According to another embodiment, an apparatus comprises means for receiving media data on an apparatus, means for parsing the media data into one or more structured elements, means for determining one or more informational links that relate to the one or more structured elements of the media data, and means for causing, at least in part, display of the media data with the one or more informational links that are determined to relate to the one or more structured elements.
Still other aspects, features, and advantages of the invention are readily apparent from the following detailed description, simply by illustrating a number of particular embodiments and implementations, including the best mode contemplated for carrying out the invention. The invention is also capable of other and different embodiments, and its several details can be modified in various obvious respects, all without departing from the spirit and scope of the invention. Accordingly, the drawings and description are to be regarded as illustrative in nature, and not as restrictive.
The embodiments of the invention are illustrated by way of example, and not by way of limitation, in the figures of the accompanying drawings:
Examples of a method, apparatus, and computer program for providing context attributes and informational links for media data are disclosed. In the following description, for the purposes of explanation, numerous specific details are set forth in order to provide a thorough understanding of the embodiments of the invention. It is apparent, however, to one skilled in the art that the embodiments of the invention may be practiced without these specific details or with an equivalent arrangement. In other instances, well-known structures and devices are shown in block diagram form in order to avoid unnecessarily obscuring the embodiments of the invention.
Although various embodiments are described with respect to mobile communication devices, it is contemplated that the approach described herein may be used with other user equipment, as noted in greater detail below.
Various embodiments described herein provide context attributes and informational links for media data. The media data can include content data and metadata. For example, the media data can include any type of (multi)media content such as textual data, texts, images, audio, video, any combination of multiple types of content, etc., and metadata of the content data, such as a set of textual descriptions that express the semantic meanings of the content data. Such metadata can be obtained from the content data through various state-of-the-art techniques, such as natural language processing (NLP), speech recognition, image/video content analysis, and so on.
As shown in
The UEs 101A . . . 101N, and 103 are any type of mobile terminal, fixed terminal, or portable terminal including a mobile handset, station, unit, device, multimedia computer, multimedia tablet, Internet node, communicator, desktop computer, laptop computer, Personal Digital Assistants (PDAs), or any combination thereof. It is also contemplated that the UEs can support any type of interface to the user (such as “wearable” circuitry, etc.).
The UEs 101A . . . 101N include a user interface widget 109A . . . 109N, while UE 103 does not include such a widget. The user interface widgets 109A . . . 109N can be used to provide context attributes and informational links for media data for the users of the respective UEs 101A . . . 101N when communicating with each other, with UE 103, and/or with service provider 105. The service provider 105 can additionally provide certain media data, context attribute data, informational data, etc. to the UEs 101A . . . 101N, and/or 103 in conjunction with the UEs in order to provide context attributes and informational links for media data thereon. Thus, even UE 103 could be provided with context attributes and informational links for media data provided by the service provider 105 using communication management module 111 in conjunction with UE 103 (e.g., utilizing a web-based application containing such a user interface widget, where the UE 103 is merely acting as a conduit for passing information between the user and the service provider). Additionally, the communication management module 111 can act as a communication session manager between UEs or between a UE and a service provider in order to control any exchange of information between the parties to the communication.
By way of example, the UEs 101A . . . 101N, UE 103, and service provider 105 communicate with each other and other components of the communication network 107 using well known, new or still developing protocols. In this context, a protocol includes a set of rules defining how the network nodes within the communication network 107 interact with each other based on information sent over the communication links. The protocols are effective at different layers of operation within each node, from generating and receiving physical signals of various types, to selecting a link for transferring those signals, to the format of information indicated by those signals, to identifying which software application executing on a computer system sends or receives the information. The conceptually different layers of protocols for exchanging information over a network are described in the Open Systems Interconnection (OSI) Reference Model.
Communications between the network nodes are typically effected by exchanging discrete packets of data. Each packet typically comprises (1) header information associated with a particular protocol, and (2) payload information that follows the header information and contains information that may be processed independently of that particular protocol. In some protocols, the packet includes (3) trailer information following the payload and indicating the end of the payload information. The header includes information such as the source of the packet, its destination, the length of the payload, and other properties used by the protocol. Often, the data in the payload for the particular protocol includes a header and payload for a different protocol associated with a different, higher layer of the OSI Reference Model. The header for a particular protocol typically indicates a type for the next protocol contained in its payload. The higher layer protocol is said to be encapsulated in the lower layer protocol. The headers included in a packet traversing multiple heterogeneous networks, such as the Internet, typically include a physical (layer 1) header, a data-link (layer 2) header, an internetwork (layer 3) header and a transport (layer 4) header, and various application headers (layer 5, layer 6 and layer 7) as defined by the OSI Reference Model.
In the embodiment shown in
The context sensors 213 can include physical sensors (e.g., global positions system (GPS) device, compass, environmental sensors (such as, a temperature sensor, pressure sensor, etc.), body sensors capable of measuring a variety of body conditions, camera, microphone, etc.), software services (e.g., weather, calendar, battery status, memory status, etc.) that are either locally provided by the device or remotely provided by a service provider and received by the device, etc. The user interface 215 can include any type of input or output device that allows the user and the user equipment to communicate with one another, such as a display screen, speaker, microphone, buttons (e.g., keyboard, dedicated physical buttons, touchscreen, etc.), etc. Additionally, the communication module 217 allows the UI widget 109A to communicate with any remote device or server, if needed in order to present objects on the GUI, or to utilize data or applications associated with the objects. Also, the database 219 can be used to store data and applications.
In step 301 of
As noted above,
As shown in
In
In
In
Thus, in
As shown in
In
While
The UI widget can match textual metadata of any media data to context attributes on the apparatus. The textual metadata can be stored separately from the content data, for example, on permanent storage of the apparatus, and the stored textual metadata can be linked to the content data. The keywords/keyphrases in the metadata can then be highlighted when it is viewed by the user of the apparatus, and/or the original content data can directly be highlighted when it is viewed by the user.
The UI widget can provide automatic highlighting of context-correlated keywords/keyphrases, by analyzing every piece of incoming or outgoing text in real time when the text is received or when the text is being composed to be sent. The UI widget can find every keyword/keyphrase in the text that, for example, has matching semantics to a context attribute received by the mobile device. The UI widget can cause the display of keywords/keyphrases as “highlighted” on the device screen each time when the text is read by the user. For example, such highlighting can be based on underlining or using a different font color; it can be activated or deactivated via different ways of user input such as keypad presses, touch screen selections and voice commands.
The UI widget can provide dynamic display of real-time context. For example, whenever a keyword/keyphrase is activated (shown), the real-time value of the corresponding context attribute is shown. The context display is “real-time”, for example, a different and updated context attribute value can be presented each time when the user accesses the text and reads the context display. The context display can be provided in a pop-up text box or an attached sidebar. The displayed contextual information can be for the reference of the user only, and may or may not be useful for the user at the moment, thus the user can decide to whether to view a real-time value or simply ignore it or view it at a later time.
The contextual highlighting can be used for various services, such as context-based service search, automatic message composition, dynamic advertisement dissemination, and so on. The context-based UI and linked services are aimed to provide a better user experience. The UI can automatically associate real-time contextual data of mobile device users to their daily usage of traditional communication services. The UI widget can utilize text analytics and text mining. The UI widget can provide name and instance based semantic matching, for example, by matching a keyword in the text with the names of the context attributes, textual descriptions of the attributes (if any), sample values of the attributes, weighted combination of the above, and so on.
The context/informational link display (e.g., in a pop-up text box or sidebar) can be attached with a command option that allows the mobile device user to use a template to automatically compose a message embedded with the real-time context attribute value and/or the keyword/keyphrase. The template can be either pre-defined by the user or be constructed at run time via natural language processing. Context data can be inserted into proper positions in the template by pre-defined placeholders. The templates are stored at a local template database, such as database 205 in
In other embodiments, the UI widget can provide real-time contextual highlighting for keywords/keyphrases in the textual contents of web browser pages that the user is currently surfing. The UI widget can also display advertisements that are related to the highlighted keywords/keyphrases in a sidebar of the browser. The UI widget can filter these advertisements using the real-time values of context attributes corresponding to the keywords/keyphrases. The UI widget can also be applied for advertisement attachment for various communication services, such as SMS, instant messaging and chat room. For example, in a situation where the user's context environmental_humidity is low, the UI widget highlights the keyword “humidity” in the current browser web page and displays a few advertisements of humidifiers. In another example, in a situation where the user's context number_of_children is not zero, then the UI widget highlights the keyword “children” in the current instant message and provides a dynamic display including corresponding advertisements and links for children's stuff (e.g., toys, clothes, etc.) in the IM software sidebars.
In yet another embodiment, the UI widget can be utilized in a music player to match keywords/keyphrases in lyrics of a song to context attributes. The lyrics can be provided as metadata with the music content, accessed using a service via the web, determined using speech recognition, etc. The keywords/keyphrases can then be highlighted, and real-time context attribute values can be displayed, and corresponding services/advertisements can be searched or displayed.
In another embodiment, speech recognition can be utilized to get a textual summary (i.e. metadata) of a video/voice phone call. Texts in such metadata can then be analyzed and highlighted, context attribute values can be displayed, and corresponding services/advertisements can be searched or displayed.
In step 601 of
As shown in
As shown in
In
Thus, the UI widget can provide intelligent point-of-interest (POI) recommendations for a map service. The UI widget can analyze location-based semantic information from a text communication that is sent or received by a user. Textual communications, especially in mobile devices, provide a key communication method for users. Such communications frequently express a users' intention, and therefore, intelligent messaging services that can automatically mine and apply useful concepts will bring significant amount of value to the customers.
Though a textual communication may contain a rich set of information, to properly use the information, a user currently has to take further action manually. On the other hand, automatic POI recommendations, such as can be provided by the UI widget, based on the user's intentions can enable truly intelligent and personalized services. The UI widget establishes a way to automatically recommend the POIs to individuals from their interactions in textual communications. Recommended POIs represent the true information needs, and are customized towards the user's daily life. Hence, these POIs really matter to the user's life and have been specifically selected because they are more likely to be of interest to those individual users. The UI widget provides dynamic, real-time recommendations in the POI domain that are based on a user's intentions that are directly mined from the textual communications, and not solely from user history.
In various embodiments of the UI widget, named entity recognition (NER) applies natural language processing (NLP) techniques to parse a text message or other media data into a set of structured elements (also referred to herein as keywords/keyphrases), such as a person's name, location name, organization name, events, identifiable phone numbers, card numbers, etc. A variety of the algorithms have been developed in different commercial domains to perform such language processing techniques. In the case of SMS processing, events that a message describes can be retrieved (or parsed out of textual data), which can include time information, location information, person information, and related actions information. For example, Hidden Markov Model (HMM) can be used to tag a message with labels having the greatest probabilities. By parsing textual data of, for example, an SMS message, into structured information with a NER engine, it can enable automatic intelligent recommendation based on a user's intentions from the SMS message.
A given SMS message can, therefore, be parsed into structured elements for further analysis and comparison with available contextual data and/or informational links. For example, for an SMS message that states “Let's have dinner at Wangfujing Street, br, John,” the SMS message can be parsed to include the following keywords/keyphrases: “dinner”, “Wangfujing”, and “John”. In this instance, the keyword “dinner” is tagged as an event category, the keyword “Wangfujing” is tagged as a location, and the keyword “John” is tagged as a name. Given the extracted knowledge on the location and event pair, recommendations become possible by efficiently converting the location and category pair to a set of relevant POIs. Thus, a local search POI query can be formed as follows:
There are several approaches to map the keyword to the concept. For example, taking “restaurant” as one concept entity, the relevant keywords can be found from a large training corpus, such as Wikipedia®. It can be provided as either a bag of words or a similarity measure to the concept entity (e.g., restaurant=restaurant, breakfast, lunch, dinner, supper, eating, cafe, food, etc. . . . ). With this model, the event defined in the text message can be mapped to its normalized category (e.g., mapping from event instance (e.g., dinner) to concept entity (e.g., restaurant)). Clearly, the query pattern (i.e., location+category) can then be set to the local search engine to fetch the recommended POIs. The query can be generalized as (context+category) pattern, where context can be described as temporal features or combination of the context as higher level of context.
Thus, as seen in
The following description presents a technical exemplary implementation of POI recommendations based on messaging channels like SMS; however, other embodiments are not limited to these implementation alternatives. This implementation is only provided as the reference embodiment.
Named entity recognition engine has been developed both with conditional random field (CRF) and Hidden Markov model (HMM). The model can be trained with an annotated text corpus. In one embodiment, the model has been trained using China Daily collected for a few years. Annotation includes the person name, location name, organization name, primary part of speech (POS). The time, number and digit are extracted with regular expression pattern. The NER parse is then built with the supports from a trained model. The performance of NER that has been integrated in the S60 device has been optimized with real time processing capability. The NER recognition accuracy is summarized in Table 1.
The local search engine is specialized in POI searches and place recommendations. It is a flexible, scalable, and efficient local search system for the purposes of context-based search and behavioral ranking. The system can be used to search for POIs and addresses near a specified location with free-text keywords to retrieve suggestions for query completion and interpretation. For example, one can search for “Wangfujing Pizza”. Web-based UI for search and for query completion suggestions are supported. By utilizing the NER parsed structured data as mentioned above, it is possible to automatically and intelligently form the query for the local search engine to obtain the recommended POIs. The obtained POIs can be further used to match a user's behavior models and formalize the final recommendation list being shown to the user.
The UI widget clearly improves the user experience by enhancing, for example, an otherwise plain and dull SMS interface. It offers the user a very intuitive and easy way to access the service. Additionally, the user interface widget can also link together different applications (e.g., an SMS application and a mapping application), which can increase the user base of such application services. The user interface widget provides a solution-oriented offering, to enhance device and service use, in a unified business circle of a variety of different application services.
The processes described herein for providing context attributes and informational links for media data may be advantageously implemented via software, hardware (e.g., general processor, Digital Signal Processing (DSP) chip, an Application Specific Integrated Circuit (ASIC), Field Programmable Gate Arrays (FPGAs), etc.), firmware or a combination thereof. Such exemplary hardware for performing the described functions is detailed below.
A bus 910 includes one or more parallel conductors of information so that information is transferred quickly among devices coupled to the bus 910. One or more processors 902 for processing information are coupled with the bus 910.
A processor 902 performs a set of operations on information as specified by computer program code related to provide context attributes and informational links for media data. The computer program code is a set of instructions or statements providing instructions for the operation of the processor and/or the computer system to perform specified functions. The code, for example, may be written in a computer programming language that is compiled into a native instruction set of the processor. The code may also be written directly using the native instruction set (e.g., machine language). The set of operations include bringing information in from the bus 910 and placing information on the bus 910. The set of operations also typically include comparing two or more units of information, shifting positions of units of information, and combining two or more units of information, such as by addition or multiplication or logical operations like OR, exclusive OR (XOR), and AND. Each operation of the set of operations that can be performed by the processor is represented to the processor by information called instructions, such as an operation code of one or more digits. A sequence of operations to be executed by the processor 902, such as a sequence of operation codes, constitute processor instructions, also called computer system instructions or, simply, computer instructions. Processors may be implemented as mechanical, electrical, magnetic, optical, chemical or quantum components, among others, alone or in combination.
Computer system 900 also includes a memory 904 coupled to bus 910. The memory 904, such as a random access memory (RAM) or other dynamic storage device, stores information including processor instructions for providing context attributes and informational links for media data. Dynamic memory allows information stored therein to be changed by the computer system 900. RAM allows a unit of information stored at a location called a memory address to be stored and retrieved independently of information at neighboring addresses. The memory 904 is also used by the processor 902 to store temporary values during execution of processor instructions. The computer system 900 also includes a read only memory (ROM) 906 or other static storage device coupled to the bus 910 for storing static information, including instructions, that is not changed by the computer system 900. Some memory is composed of volatile storage that loses the information stored thereon when power is lost. Also coupled to bus 910 is a non-volatile (persistent) storage device 908, such as a magnetic disk, optical disk or flash card, for storing information, including instructions, that persists even when the computer system 900 is turned off or otherwise loses power.
Information, including instructions for providing context attributes and informational links for media data, is provided to the bus 910 for use by the processor from an external input device 912, such as a keyboard containing alphanumeric keys operated by a human user, or a sensor. A sensor detects conditions in its vicinity and transforms those detections into physical expression compatible with the measurable phenomenon used to represent information in computer system 900. Other external devices coupled to bus 910, used primarily for interacting with humans, include a display device 914, such as a cathode ray tube (CRT) or a liquid crystal display (LCD), or plasma screen or printer for presenting text or images, and a pointing device 916, such as a mouse or a trackball or cursor direction keys, or motion sensor, for controlling a position of a small cursor image presented on the display 914 and issuing commands associated with graphical elements presented on the display 914. In some embodiments, for example, in embodiments in which the computer system 900 performs all functions automatically without human input, one or more of external input device 912, display device 914 and pointing device 916 is omitted.
In the illustrated embodiment, special purpose hardware, such as an application specific integrated circuit (ASIC) 920, is coupled to bus 910. The special purpose hardware is configured to perform operations not performed by processor 902 quickly enough for special purposes. Examples of application specific ICs include graphics accelerator cards for generating images for display 914, cryptographic boards for encrypting and decrypting messages sent over a network, speech recognition, and interfaces to special external devices, such as robotic arms and medical scanning equipment that repeatedly perform some complex sequence of operations that are more efficiently implemented in hardware.
Computer system 900 also includes one or more instances of a communications interface 970 coupled to bus 910. Communication interface 970 provides a one-way or two-way communication coupling to a variety of external devices that operate with their own processors, such as printers, scanners and external disks. In general the coupling is with a network link 978 that is connected to a local network 980 to which a variety of external devices with their own processors are connected. For example, communication interface 970 may be a parallel port or a serial port or a universal serial bus (USB) port on a personal computer. In some embodiments, communications interface 970 is an integrated services digital network (ISDN) card or a digital subscriber line (DSL) card or a telephone modem that provides an information communication connection to a corresponding type of telephone line. In some embodiments, a communication interface 970 is a cable modem that converts signals on bus 910 into signals for a communication connection over a coaxial cable or into optical signals for a communication connection over a fiber optic cable. As another example, communications interface 970 may be a local area network (LAN) card to provide a data communication connection to a compatible LAN, such as Ethernet. Wireless links may also be implemented. For wireless links, the communications interface 970 sends or receives or both sends and receives electrical, acoustic or electromagnetic signals, including infrared and optical signals, that carry information streams, such as digital data. For example, in wireless handheld devices, such as mobile telephones like cell phones, the communications interface 970 includes a radio band electromagnetic transmitter and receiver called a radio transceiver. In certain embodiments, the communications interface 970 enables connection to the communication network 107 for providing context attributes and informational links for media data to the UEs 101A . . . 101N.
The term “computer-readable medium” as used herein to refer to any medium that participates in providing information to processor 902, including instructions for execution. Such a medium may take many forms, including, but not limited to computer-readable storage medium (e.g., non-volatile media, volatile media), and transmission media. Non-transitory media, such as non-volatile media, include, for example, optical or magnetic disks, such as storage device 908. Volatile media include, for example, dynamic memory 904. Transmission media include, for example, coaxial cables, copper wire, fiber optic cables, and carrier waves that travel through space without wires or cables, such as acoustic waves and electromagnetic waves, including radio, optical and infrared waves. Signals include man-made transient variations in amplitude, frequency, phase, polarization or other physical properties transmitted through the transmission media. Common forms of computer-readable media include, for example, a floppy disk, a flexible disk, hard disk, magnetic tape, any other magnetic medium, a CD-ROM, CDRW, DVD, any other optical medium, punch cards, paper tape, optical mark sheets, any other physical medium with patterns of holes or other optically recognizable indicia, a RAM, a PROM, an EPROM, a FLASH-EPROM, any other memory chip or cartridge, a carrier wave, or any other medium from which a computer can read. The term computer-readable storage medium is used herein to refer to any computer-readable medium except transmission media.
Logic encoded in one or more tangible media includes one or both of processor instructions on a computer-readable storage media and special purpose hardware, such as ASIC 920.
Network link 978 typically provides information communication using transmission media through one or more networks to other devices that use or process the information. For example, network link 978 may provide a connection through local network 980 to a host computer 982 or to equipment 984 operated by an Internet Service Provider (ISP). ISP equipment 984 in turn provides data communication services through the public, world-wide packet-switching communication network of networks now commonly referred to as the Internet 990.
A computer called a server host 992 connected to the Internet hosts a process that provides a service in response to information received over the Internet. For example, server host 992 hosts a process that provides information representing video data for presentation at display 914. It is contemplated that the components of system 900 can be deployed in various configurations within other computer systems, e.g., host 982 and server 992.
At least some embodiments of the invention are related to the use of computer system 900 for implementing some or all of the techniques described herein. According to one embodiment of the invention, those techniques are performed by computer system 900 in response to processor 902 executing one or more sequences of one or more processor instructions contained in memory 904. Such instructions, also called computer instructions, software and program code, may be read into memory 904 from another computer-readable medium such as storage device 908 or network link 978. Execution of the sequences of instructions contained in memory 904 causes processor 902 to perform one or more of the method steps described herein. In alternative embodiments, hardware, such as ASIC 920, may be used in place of or in combination with software to implement the invention. Thus, embodiments of the invention are not limited to any specific combination of hardware and software, unless otherwise explicitly stated herein.
The signals transmitted over network link 978 and other networks through communications interface 970, carry information to and from computer system 900. Computer system 900 can send and receive information, including program code, through the networks 980, 990 among others, through network link 978 and communications interface 970. In an example using the Internet 990, a server host 992 transmits program code for a particular application, requested by a message sent from computer 900, through Internet 990, ISP equipment 984, local network 980 and communications interface 970. The received code may be executed by processor 902 as it is received, or may be stored in memory 904 or in storage device 908 or other non-volatile storage for later execution, or both. In this manner, computer system 900 may obtain application program code in the form of signals on a carrier wave.
Various forms of computer readable media may be involved in carrying one or more sequence of instructions or data or both to processor 902 for execution. For example, instructions and data may initially be carried on a magnetic disk of a remote computer such as host 982. The remote computer loads the instructions and data into its dynamic memory and sends the instructions and data over a telephone line using a modem. A modem local to the computer system 900 receives the instructions and data on a telephone line and uses an infra-red transmitter to convert the instructions and data to a signal on an infra-red carrier wave serving as the network link 978. An infrared detector serving as communications interface 970 receives the instructions and data carried in the infrared signal and places information representing the instructions and data onto bus 910. Bus 910 carries the information to memory 904 from which processor 902 retrieves and executes the instructions using some of the data sent with the instructions. The instructions and data received in memory 904 may optionally be stored on storage device 908, either before or after execution by the processor 902.
In one embodiment, the chip set 1000 includes a communication mechanism such as a bus 1001 for passing information among the components of the chip set 1000. A processor 1003 has connectivity to the bus 1001 to execute instructions and process information stored in, for example, a memory 1005. The processor 1003 may include one or more processing cores with each core configured to perform independently. A multi-core processor enables multiprocessing within a single physical package. Examples of a multi-core processor include two, four, eight, or greater numbers of processing cores. Alternatively or in addition, the processor 1003 may include one or more microprocessors configured in tandem via the bus 1001 to enable independent execution of instructions, pipelining, and multithreading. The processor 1003 may also be accompanied with one or more specialized components to perform certain processing functions and tasks such as one or more digital signal processors (DSP) 1007, or one or more application-specific integrated circuits (ASIC) 1009. A DSP 1007 typically is configured to process real-world signals (e.g., sound) in real time independently of the processor 1003. Similarly, an ASIC 1009 can be configured to performed specialized functions not easily performed by a general purposed processor. Other specialized components to aid in performing the inventive functions described herein include one or more field programmable gate arrays (FPGA) (not shown), one or more controllers (not shown), or one or more other special-purpose computer chips.
The processor 1003 and accompanying components have connectivity to the memory 1005 via the bus 1001. The memory 1005 includes both dynamic memory (e.g., RAM, magnetic disk, writable optical disk, etc.) and static memory (e.g., ROM, CD-ROM, etc.) for storing executable instructions that when executed perform the inventive steps described herein to provide context attributes and informational links for media data. The memory 1005 also stores the data associated with or generated by the execution of the inventive steps.
Pertinent internal components of the telephone include a Main Control Unit (MCU) 1103, a Digital Signal Processor (DSP) 1105, and a receiver/transmitter unit including a microphone gain control unit and a speaker gain control unit. A main display unit 1107 provides a display to the user in support of various applications and mobile terminal functions that perform or support the steps of providing context attributes and informational links for media data. The display 11 includes display circuitry configured to display at least a portion of a user interface of the mobile terminal (e.g., mobile telephone). Additionally, the display 1107 and display circuitry are configured to facilitate user control of at least some functions of the mobile terminal. An audio function circuitry 1109 includes a microphone 1111 and microphone amplifier that amplifies the speech signal output from the microphone 1111. The amplified speech signal output from the microphone 1111 is fed to a coder/decoder (CODEC) 1113.
A radio section 1115 amplifies power and converts frequency in order to communicate with a base station, which is included in a mobile communication system, via antenna 1117. The power amplifier (PA) 1119 and the transmitter/modulation circuitry are operationally responsive to the MCU 1103, with an output from the PA 1119 coupled to the duplexer 1121 or circulator or antenna switch, as known in the art. The PA 1119 also couples to a battery interface and power control unit 1120.
In use, a user of mobile terminal 1101 speaks into the microphone 1111 and his or her voice along with any detected background noise is converted into an analog voltage. The analog voltage is then converted into a digital signal through the Analog to Digital Converter (ADC) 1123. The control unit 1103 routes the digital signal into the DSP 1105 for processing therein, such as speech encoding, channel encoding, encrypting, and interleaving. In one embodiment, the processed voice signals are encoded, by units not separately shown, using a cellular transmission protocol such as global evolution (EDGE), general packet radio service (GPRS), global system for mobile communications (GSM), Internet protocol multimedia subsystem (IMS), universal mobile telecommunications system (UMTS), etc., as well as any other suitable wireless medium, e.g., microwave access (WiMAX), Long Term Evolution (LTE) networks, code division multiple access (CDMA), wideband code division multiple access (WCDMA), wireless fidelity (WiFi), satellite, and the like.
The encoded signals are then routed to an equalizer 1125 for compensation of any frequency-dependent impairments that occur during transmission though the air such as phase and amplitude distortion. After equalizing the bit stream, the modulator 1127 combines the signal with a RF signal generated in the RF interface 1129. The modulator 1127 generates a sine wave by way of frequency or phase modulation. In order to prepare the signal for transmission, an up-converter 1131 combines the sine wave output from the modulator 1127 with another sine wave generated by a synthesizer 1133 to achieve the desired frequency of transmission. The signal is then sent through a PA 1119 to increase the signal to an appropriate power level. In practical systems, the PA 1119 acts as a variable gain amplifier whose gain is controlled by the DSP 1105 from information received from a network base station. The signal is then filtered within the duplexer 1121 and optionally sent to an antenna coupler 1135 to match impedances to provide maximum power transfer. Finally, the signal is transmitted via antenna 1117 to a local base station. An automatic gain control (AGC) can be supplied to control the gain of the final stages of the receiver. The signals may be forwarded from there to a remote telephone which may be another cellular telephone, other mobile phone or a land-line connected to a Public Switched Telephone Network (PSTN), or other telephony networks.
Voice signals transmitted to the mobile terminal 1101 are received via antenna 1117 and immediately amplified by a low noise amplifier (LNA) 1137. A down-converter 1139 lowers the carrier frequency while the demodulator 1141 strips away the RF leaving only a digital bit stream. The signal then goes through the equalizer 1125 and is processed by the DSP 1105. A Digital to Analog Converter (DAC) 1143 converts the signal and the resulting output is transmitted to the user through the speaker 1145, all under control of a Main Control Unit (MCU) 1103—which can be implemented as a Central Processing Unit (CPU) (not shown).
The MCU 1103 receives various signals including input signals from the keyboard 1147. The keyboard 1147 and/or the MCU 1103 in combination with other user input components (e.g., the microphone 1111) comprise a user interface circuitry for managing user input. The MCU 1103 runs a user interface software to facilitate user control of at least some functions of the mobile terminal 1101 to provide context attributes and informational links for media data. The MCU 1103 also delivers a display command and a switch command to the display 1107 and to the speech output switching controller, respectively. Further, the MCU 1103 exchanges information with the DSP 1105 and can access an optionally incorporated SIM card 1149 and a memory 1151. In addition, the MCU 1103 executes various control functions required of the terminal. The DSP 1105 may, depending upon the implementation, perform any of a variety of conventional digital processing functions on the voice signals. Additionally, DSP 1105 determines the background noise level of the local environment from the signals detected by microphone 1111 and sets the gain of microphone 1111 to a level selected to compensate for the natural tendency of the user of the mobile terminal 1101.
The CODEC 1113 includes the ADC 1123 and DAC 1143. The memory 1151 stores various data including call incoming tone data and is capable of storing other data including music data received via, e.g., the global Internet. The software module could reside in RAM memory, flash memory, registers, or any other form of writable storage medium known in the art. The memory device 1151 may be, but not limited to, a single memory, CD, DVD, ROM, RAM, EEPROM, optical storage, or any other non-volatile storage medium capable of storing digital data.
An optionally incorporated SIM card 1149 carries, for instance, important information, such as the cellular phone number, the carrier supplying service, subscription details, and security information. The SIM card 1149 serves primarily to identify the mobile terminal 1101 on a radio network. The card 1149 also contains a memory for storing a personal telephone number registry, text messages, and user specific mobile terminal settings.
While the invention has been described in connection with a number of embodiments and implementations, the invention is not so limited but covers various obvious modifications and equivalent arrangements, which fall within the purview of the appended claims. Although features of the invention are expressed in certain combinations among the claims, it is contemplated that these features can be arranged in any combination and order.
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/CN2010/070484 | 2/3/2010 | WO | 00 | 8/13/2012 |