Some human machine interfaces (HMIs) may overwhelm or frustrate the user. For example, inappropriately timed advertisement insertions, or games that are overly challenging or boring, may alienate customers. In automobile collision alert systems, safety may be an issue if an HMI distracts the user with low-priority messages while also issuing alerts of impending danger. In the field of marketing, the effectiveness of the marketing may be reduced by poor timing and/or messaging. Some approaches to making suggestions to consumers regarding products or content may use inferential data that may not be a good predictor of actual product or content enjoyment or interest. The ability of students to learn material in educational settings may be reduced when a teacher or computer-based training progresses without regard to the student's ability to process new information. Progressing through a lesson this way may result in a loss of efficiency.
Systems, methods, and instrumentalities are disclosed for generating a human machine interface (HMI) that may be aware of and that may adapt to the user's cognitive load and/or emotional affective state. Sensor data may be used to estimate a cognitive state (e.g., cognitive load) and/or affective state of a user, which may be used to prioritize or otherwise affect interactions with the user. The cognitive and/or affective state of the user may include information other than what can be inferred from the context and/or from content that has already been consumed. For example, cognitive and/or affective state information may be used for adaptive gaming, advertisement placement and/or delivery timing, driver or pilot assistance, education, advertisement selection, product and/or content suggestions, and/or video chat applications.
A system may generate a human machine interface (HMI). The system may manage content placement in the HMI. The system may deliver content to a user. The content may include video data, video game data, educational data, training data, or the like. The system may receive sensor data from one or more sensors. The sensor data may be associated with a user. The sensor data from the one or more sensors may include at least one of camera data, galvanic skin response (GSR) data, voice analysis data, facial expression analysis data, body language analysis data, eye movement and gaze tracking analysis data, blink rate analysis data, electroencephalographic data, electrodermal activity data, pupillometry data, heart rate data, blood pressure data, respiration rate data, or body temperature data. The system may determine at least one of a cognitive state or an affective state of the user based on the received sensor data. The cognitive state of the user may include a cognitive load of the user. The affective state of the user may include an arousal measure and a valence measure. The system may analyze the received sensor data. The system may plot the arousal measure and the valence measure on a two-dimensional arousal-valence space. The system may associate the user with one or more predefined affective states based on the plot.
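By way of illustration, the categorization step may be implemented as a nearest-state lookup in the arousal-valence space. The following minimal sketch is one possible implementation; the state labels, their coordinates, and the distance threshold are illustrative assumptions and are not specified by this disclosure.

```python
import math

# Hypothetical predefined affective states, each placed at an assumed
# (arousal, valence) coordinate in the two-dimensional arousal-valence space.
PREDEFINED_STATES = {
    "excited":    (0.8, 0.7),
    "content":    (0.3, 0.8),
    "bored":      (-0.6, -0.3),
    "frustrated": (0.7, -0.7),
}

def categorize_affective_state(arousal, valence, max_distance=0.5):
    """Return the predefined affective states whose Euclidean distance to the
    user's plotted (arousal, valence) point is within max_distance."""
    matches = []
    for name, (a, v) in PREDEFINED_STATES.items():
        distance = math.hypot(arousal - a, valence - v)
        if distance <= max_distance:
            matches.append((name, round(distance, 3)))
    # A user may be associated with more than one predefined state.
    return sorted(matches, key=lambda m: m[1])

# Example: a user plotted at moderately high arousal and high valence.
print(categorize_affective_state(0.6, 0.75))  # [('excited', 0.206), ('content', 0.304)]
```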
The system may determine a timing for delivery of content. The content may include an advertisement. The timing for delivery of the content may be determined based on at least one of the determined cognitive state of the user or the determined affective state of the user. The content may be delivered to the user when the cognitive load of the user is below a predetermined threshold or when the affective state of the user indicates that the user is receptive. The affective state of the user may indicate that the user is receptive when a distance measure from the affective state of the user to a predefined affective state is below a predetermined threshold. The predefined affective state may include a predefined arousal measure and a predefined valence measure. The distance measure may be based on a distance between the affective state of the user and the predefined affective state. The distance measure may include an arousal component and a valence component. The content may be delivered to the user based on the determined timing. The content may be delivered to the user via the HMI or a second HMI.
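The delivery-timing decision described above may be sketched as follows. The threshold values and the coordinates of the predefined receptive state are assumptions made for illustration; the disclosure does not fix them.

```python
import math

# Assumed thresholds; the actual values are left open by the disclosure.
COGNITIVE_LOAD_THRESHOLD = 0.4
RECEPTIVITY_DISTANCE_THRESHOLD = 0.3

# Hypothetical predefined affective state indicating receptivity,
# expressed as a predefined arousal measure and a predefined valence measure.
RECEPTIVE_AROUSAL, RECEPTIVE_VALENCE = 0.6, 0.8

def ready_to_deliver(cognitive_load, arousal, valence):
    """Deliver content when the cognitive load is below a predetermined
    threshold, or when the distance measure from the user's affective state
    to the predefined receptive state (with an arousal component and a
    valence component) is below a predetermined threshold."""
    arousal_component = arousal - RECEPTIVE_AROUSAL
    valence_component = valence - RECEPTIVE_VALENCE
    distance = math.hypot(arousal_component, valence_component)
    return (cognitive_load < COGNITIVE_LOAD_THRESHOLD
            or distance < RECEPTIVITY_DISTANCE_THRESHOLD)

print(ready_to_deliver(cognitive_load=0.7, arousal=0.65, valence=0.75))  # True
```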
The system may select the content for delivery to the user. The content may be selected for delivery to the user based on at least one of the cognitive state of the user or the affective state of the user. The content may be selected for delivery to the user based on a stimulus response model for the user. The stimulus response model may be based on historical user responses to (e.g., historical observations of a user's cognitive and/or affective state in response to) prior content. The user may be associated with a customer category. The user may be associated with the customer category based on a stimulus/response pair. The stimulus/response pair may be based on information presented to the user and at least one of the cognitive state or the affective state of the user in response to the information presented. The system may store the stimulus/response pair. The content may be selected for delivery to the user based on the customer category associated with the user. The content may be selected for delivery to the user based on a stimulus response database of customers in a predefined customer category. The predefined customer category may include the user. The content may be a first advertisement for a first product. The first advertisement for the first product may be selected based on a previous response of the user to a second advertisement for a second product.
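The stimulus/response bookkeeping described above might be represented as in the sketch below. The data shapes, field names, and the use of valence as the summarized response are illustrative assumptions.

```python
from dataclasses import dataclass, field

@dataclass
class StimulusResponsePair:
    """A stimulus (information presented to the user) paired with the
    observed cognitive and/or affective response."""
    stimulus: str          # identifier of the content presented
    cognitive_load: float
    arousal: float
    valence: float

@dataclass
class StimulusResponseModel:
    """Accumulates historical user responses to prior content."""
    pairs: list = field(default_factory=list)

    def record(self, pair: StimulusResponsePair) -> None:
        self.pairs.append(pair)  # store the stimulus/response pair

    def mean_valence(self, stimulus: str) -> float:
        """Historical (mean) valence response to a given stimulus."""
        observed = [p.valence for p in self.pairs if p.stimulus == stimulus]
        return sum(observed) / len(observed) if observed else 0.0

model = StimulusResponseModel()
model.record(StimulusResponsePair("ad_product_A", 0.3, 0.7, 0.6))
print(model.mean_valence("ad_product_A"))  # 0.6
```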
A detailed description of illustrative embodiments will now be described with reference to the various Figures. Although this description provides a detailed example of possible implementations, it should be noted that the details are intended to be exemplary and in no way limit the scope of the application.
As shown in
The communications system 100 may also include a base station 114a and a base station 114b. Each of the base stations 114a, 114b may be any type of device configured to wirelessly interface with at least one of the WTRUs 102a, 102b, 102c, 102d to facilitate access to one or more communication networks, such as the core network 106/107/109, the Internet 110, and/or the networks 112. By way of example, the base stations 114a, 114b may be a base transceiver station (BTS), a Node-B, an eNode B, a Home Node B, a Home eNode B, a site controller, an access point (AP), a wireless router, and the like. While the base stations 114a, 114b are each depicted as a single element, it will be appreciated that the base stations 114a, 114b may include any number of interconnected base stations and/or network elements.
The base station 114a may be part of the RAN 103/104/105, which may also include other base stations and/or network elements (not shown), such as a base station controller (BSC), a radio network controller (RNC), relay nodes, etc. The base station 114a and/or the base station 114b may be configured to transmit and/or receive wireless signals within a particular geographic region, which may be referred to as a cell (not shown). The cell may further be divided into cell sectors. For example, the cell associated with the base station 114a may be divided into three sectors. Thus, in one embodiment, the base station 114a may include three transceivers, e.g., one for each sector of the cell. In another embodiment, the base station 114a may employ multiple-input multiple-output (MIMO) technology and, therefore, may utilize multiple transceivers for each sector of the cell.
The base stations 114a, 114b may communicate with one or more of the WTRUs 102a, 102b, 102c, 102d over an air interface 115/116/117, which may be any suitable wireless communication link (e.g., radio frequency (RF), microwave, infrared (IR), ultraviolet (UV), visible light, etc.). The air interface 115/116/117 may be established using any suitable radio access technology (RAT).
More specifically, as noted above, the communications system 100 may be a multiple access system and may employ one or more channel access schemes, such as CDMA, TDMA, FDMA, OFDMA, SC-FDMA, and the like. For example, the base station 114a in the RAN 103/104/105 and the WTRUs 102a, 102b, 102c may implement a radio technology such as Universal Mobile Telecommunications System (UMTS) Terrestrial Radio Access (UTRA), which may establish the air interface 115/116/117 using wideband CDMA (WCDMA). WCDMA may include communication protocols such as High-Speed Packet Access (HSPA) and/or Evolved HSPA (HSPA+). HSPA may include High-Speed Downlink Packet Access (HSDPA) and/or High-Speed Uplink Packet Access (HSUPA).
In another embodiment, the base station 114a and the WTRUs 102a, 102b, 102c may implement a radio technology such as Evolved UMTS Terrestrial Radio Access (E-UTRA), which may establish the air interface 115/116/117 using Long Term Evolution (LTE) and/or LTE-Advanced (LTE-A).
In other embodiments, the base station 114a and the WTRUs 102a, 102b, 102c may implement radio technologies such as IEEE 802.16 (e.g., Worldwide Interoperability for Microwave Access (WiMAX)), CDMA2000, CDMA2000 1×, CDMA2000 EV-DO, Interim Standard 2000 (IS-2000), Interim Standard 95 (IS-95), Interim Standard 856 (IS-856), Global System for Mobile communications (GSM), Enhanced Data rates for GSM Evolution (EDGE), GSM EDGE (GERAN), and the like.
The base station 114b in
The RAN 103/104/105 may be in communication with the core network 106/107/109, which may be any type of network configured to provide voice, data, applications, and/or voice over internet protocol (VoIP) services to one or more of the WTRUs 102a, 102b, 102c, 102d. For example, the core network 106/107/109 may provide call control, billing services, mobile location-based services, pre-paid calling, Internet connectivity, video distribution, etc., and/or perform high-level security functions, such as user authentication. Although not shown in
The core network 106/107/109 may also serve as a gateway for the WTRUs 102a, 102b, 102c, 102d to access the PSTN 108, the Internet 110, and/or other networks 112. The PSTN 108 may include circuit-switched telephone networks that provide plain old telephone service (POTS). The Internet 110 may include a global system of interconnected computer networks and devices that use common communication protocols, such as the transmission control protocol (TCP), user datagram protocol (UDP) and the internet protocol (IP) in the TCP/IP internet protocol suite. The networks 112 may include wired or wireless communications networks owned and/or operated by other service providers. For example, the networks 112 may include another core network connected to one or more RANs, which may employ the same RAT as the RAN 103/104/105 or a different RAT.
Some or all of the WTRUs 102a, 102b, 102c, 102d in the communications system 100 may include multi-mode capabilities, e.g., the WTRUs 102a, 102b, 102c, 102d may include multiple transceivers for communicating with different wireless networks over different wireless links. For example, the WTRU 102c shown in
The processor 118 may be a general purpose processor, a special purpose processor, a conventional processor, a digital signal processor (DSP), a plurality of microprocessors, one or more microprocessors in association with a DSP core, a controller, a microcontroller, Application Specific Integrated Circuit (ASIC) circuits, Field Programmable Gate Array (FPGA) circuits, any other type of integrated circuit (IC), a state machine, and the like. The processor 118 may perform signal coding, data processing, power control, input/output processing, and/or any other functionality that enables the WTRU 102 to operate in a wireless environment. The processor 118 may be coupled to the transceiver 120, which may be coupled to the transmit/receive element 122. While
The transmit/receive element 122 may be configured to transmit signals to, or receive signals from, a base station (e.g., the base station 114a) over the air interface 115/116/117. For example, in one embodiment, the transmit/receive element 122 may be an antenna configured to transmit and/or receive RF signals. In another embodiment, the transmit/receive element 122 may be an emitter/detector configured to transmit and/or receive IR, UV, or visible light signals, for example. In yet another embodiment, the transmit/receive element 122 may be configured to transmit and receive both RF and light signals. It will be appreciated that the transmit/receive element 122 may be configured to transmit and/or receive any combination of wireless signals.
In addition, although the transmit/receive element 122 is depicted in
The transceiver 120 may be configured to modulate the signals that are to be transmitted by the transmit/receive element 122 and to demodulate the signals that are received by the transmit/receive element 122. As noted above, the WTRU 102 may have multi-mode capabilities. Thus, the transceiver 120 may include multiple transceivers for enabling the WTRU 102 to communicate via multiple RATs, such as UTRA and IEEE 802.11, for example.
The processor 118 of the WTRU 102 may be coupled to, and may receive user input data from, the speaker/microphone 124, the keypad 126, and/or the display/touchpad 128 (e.g., a liquid crystal display (LCD) display unit or organic light-emitting diode (OLED) display unit). The processor 118 may also output user data to the speaker/microphone 124, the keypad 126, and/or the display/touchpad 128. In addition, the processor 118 may access information from, and store data in, any type of suitable memory, such as the non-removable memory 130 and/or the removable memory 132. The non-removable memory 130 may include random-access memory (RAM), read-only memory (ROM), a hard disk, or any other type of memory storage device. The removable memory 132 may include a subscriber identity module (SIM) card, a memory stick, a secure digital (SD) memory card, and the like. In other embodiments, the processor 118 may access information from, and store data in, memory that is not physically located on the WTRU 102, such as on a server or a home computer (not shown).
The processor 118 may receive power from the power source 134, and may be configured to distribute and/or control the power to the other components in the WTRU 102. The power source 134 may be any suitable device for powering the WTRU 102. For example, the power source 134 may include one or more dry cell batteries (e.g., nickel-cadmium (NiCd), nickel-zinc (NiZn), nickel metal hydride (NiMH), lithium-ion (Li-ion), etc.), solar cells, fuel cells, and the like.
The processor 118 may also be coupled to the GPS chipset 136, which may be configured to provide location information (e.g., longitude and latitude) regarding the current location of the WTRU 102. In addition to, or in lieu of, the information from the GPS chipset 136, the WTRU 102 may receive location information over the air interface 115/116/117 from a base station (e.g., base stations 114a, 114b) and/or determine its location based on the timing of the signals being received from two or more nearby base stations. It will be appreciated that the WTRU 102 may acquire location information by way of any suitable location-determination implementation while remaining consistent with an embodiment.
The processor 118 may further be coupled to other peripherals 138, which may include one or more software and/or hardware modules that provide additional features, functionality and/or wired or wireless connectivity. For example, the peripherals 138 may include an accelerometer, an e-compass, a satellite transceiver, a digital camera (for photographs or video), a universal serial bus (USB) port, a vibration device, a television transceiver, a hands free headset, a Bluetooth® module, a frequency modulated (FM) radio unit, a digital music player, a media player, a video game player module, an Internet browser, and the like.
As shown in
The core network 106 shown in
The RNC 142a in the RAN 103 may be connected to the MSC 146 in the core network 106 via an IuCS interface. The MSC 146 may be connected to the MGW 144. The MSC 146 and the MGW 144 may provide the WTRUs 102a, 102b, 102c with access to circuit-switched networks, such as the PSTN 108, to facilitate communications between the WTRUs 102a, 102b, 102c and traditional land-line communications devices.
The RNC 142a in the RAN 103 may also be connected to the SGSN 148 in the core network 106 via an IuPS interface. The SGSN 148 may be connected to the GGSN 150. The SGSN 148 and the GGSN 150 may provide the WTRUs 102a, 102b, 102c with access to packet-switched networks, such as the Internet 110, to facilitate communications between the WTRUs 102a, 102b, 102c and IP-enabled devices.
As noted above, the core network 106 may also be connected to the networks 112, which may include other wired or wireless networks that are owned and/or operated by other service providers.
The RAN 104 may include eNode-Bs 160a, 160b, 160c, though it will be appreciated that the RAN 104 may include any number of eNode-Bs while remaining consistent with an embodiment. The eNode-Bs 160a, 160b, 160c may each include one or more transceivers for communicating with the WTRUs 102a, 102b, 102c over the air interface 116. In one embodiment, the eNode-Bs 160a, 160b, 160c may implement MIMO technology. Thus, the eNode-B 160a, for example, may use multiple antennas to transmit wireless signals to, and receive wireless signals from, the WTRU 102a.
Each of the eNode-Bs 160a, 160b, 160c may be associated with a particular cell (not shown) and may be configured to handle radio resource management decisions, handover decisions, scheduling of users in the uplink and/or downlink, and the like. As shown in
The core network 107 shown in
The MME 162 may be connected to each of the eNode-Bs 160a, 160b, 160c in the RAN 104 via an S1 interface and may serve as a control node. For example, the MME 162 may be responsible for authenticating users of the WTRUs 102a, 102b, 102c, bearer activation/deactivation, selecting a particular serving gateway during an initial attach of the WTRUs 102a, 102b, 102c, and the like. The MME 162 may also provide a control plane function for switching between the RAN 104 and other RANs (not shown) that employ other radio technologies, such as GSM or WCDMA.
The serving gateway 164 may be connected to each of the eNode-Bs 160a, 160b, 160c in the RAN 104 via the S1 interface. The serving gateway 164 may generally route and forward user data packets to/from the WTRUs 102a, 102b, 102c. The serving gateway 164 may also perform other functions, such as anchoring user planes during inter-eNode B handovers, triggering paging when downlink data is available for the WTRUs 102a, 102b, 102c, managing and storing contexts of the WTRUs 102a, 102b, 102c, and the like.
The serving gateway 164 may also be connected to the PDN gateway 166, which may provide the WTRUs 102a, 102b, 102c with access to packet-switched networks, such as the Internet 110, to facilitate communications between the WTRUs 102a, 102b, 102c and IP-enabled devices.
The core network 107 may facilitate communications with other networks. For example, the core network 107 may provide the WTRUs 102a, 102b, 102c with access to circuit-switched networks, such as the PSTN 108, to facilitate communications between the WTRUs 102a, 102b, 102c and traditional land-line communications devices. For example, the core network 107 may include, or may communicate with, an IP gateway (e.g., an IP multimedia subsystem (IMS) server) that serves as an interface between the core network 107 and the PSTN 108. In addition, the core network 107 may provide the WTRUs 102a, 102b, 102c with access to the networks 112, which may include other wired or wireless networks that are owned and/or operated by other service providers.
As shown in
The air interface 117 between the WTRUs 102a, 102b, 102c and the RAN 105 may be defined as an R1 reference point that implements the IEEE 802.16 specification. In addition, each of the WTRUs 102a, 102b, 102c may establish a logical interface (not shown) with the core network 109. The logical interface between the WTRUs 102a, 102b, 102c and the core network 109 may be defined as an R2 reference point, which may be used for authentication, authorization, IP host configuration management, and/or mobility management.
The communication link between each of the base stations 180a, 180b, 180c may be defined as an R8 reference point that includes protocols for facilitating WTRU handovers and the transfer of data between base stations. The communication link between the base stations 180a, 180b, 180c and the ASN gateway 182 may be defined as an R6 reference point. The R6 reference point may include protocols for facilitating mobility management based on mobility events associated with each of the WTRUs 102a, 102b, 102c.
As shown in
The MIP-HA 184 may be responsible for IP address management, and may enable the WTRUs 102a, 102b, 102c to roam between different ASNs and/or different core networks. The MIP-HA 184 may provide the WTRUs 102a, 102b, 102c with access to packet-switched networks, such as the Internet 110, to facilitate communications between the WTRUs 102a, 102b, 102c and IP-enabled devices. The AAA server 186 may be responsible for user authentication and for supporting user services. The gateway 188 may facilitate interworking with other networks. For example, the gateway 188 may provide the WTRUs 102a, 102b, 102c with access to circuit-switched networks, such as the PSTN 108, to facilitate communications between the WTRUs 102a, 102b, 102c and traditional land-line communications devices. In addition, the gateway 188 may provide the WTRUs 102a, 102b, 102c with access to the networks 112, which may include other wired or wireless networks that are owned and/or operated by other service providers.
Although not shown in
A human-machine interface (HMI) may receive data (e.g., sensor data) from one or more sensors. The HMI may determine, based on the received sensor data, a cognitive state of a user and/or an affective state of the user. The HMI may adapt to the cognitive state and/or affective state of the user. The HMI may deliver content to the user. The content may be delivered to the user via a WTRU 102 (e.g., a smart phone handset or a tablet computer, etc.) as depicted in
One or more (e.g., multiple) signals may be captured that may correlate to cognitive load and/or affective state. The one or more signals may include sensor data received from one or more sensors. For example, pupil dilation may be associated with cognitive effort. A change in pupillary dilation elicited by psychological stimuli may be on the order of 0.5 mm. The change in pupillary dilation may occur as the result of a neural inhibitory mechanism by the parasympathetic nervous system.
There may be a number of approaches for measuring cognitive loading and/or affective state with varying levels of invasiveness. The approaches for measuring cognitive loading and/or affective state may include galvanic skin response (GSR) or electrodermal activity, voice analysis, facial expression analysis, body language analysis, eye movement and gaze tracking, blink rate analysis, heart rate analysis, blood pressure analysis, respiration rate analysis, body temperature analysis, and/or electroencephalography. Cognitive load and/or affective state estimation (e.g., determination) may be performed using one or more of the approaches for measuring cognitive loading and/or affective state (e.g., depending on the setting and feasibility of obtaining the data).
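The disclosure leaves the fusion of these measurements open. One simple possibility is a weighted average of whichever normalized measures are available, sketched below; the weights and the 0..1 normalization are assumptions made for illustration.

```python
# Assumed fusion weights for normalized (0..1) sensor-derived measures;
# the combination rule itself is left open by the disclosure.
FUSION_WEIGHTS = {
    "pupillometry": 0.35,
    "gsr": 0.25,
    "blink_rate": 0.15,
    "heart_rate": 0.15,
    "respiration_rate": 0.10,
}

def estimate_cognitive_load(measures):
    """Combine whichever normalized sensor measures are available into a
    single 0..1 cognitive load estimate, reweighting for missing sensors."""
    available = {k: w for k, w in FUSION_WEIGHTS.items() if k in measures}
    total_weight = sum(available.values())
    if total_weight == 0:
        raise ValueError("no supported sensor measures provided")
    return sum(measures[k] * w for k, w in available.items()) / total_weight

# Example with only two of the modalities available.
print(estimate_cognitive_load({"pupillometry": 0.7, "gsr": 0.5}))  # ~0.62
```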
Affective computing may include the study and development of systems and/or devices that can recognize, interpret, process, and/or simulate human affects. Affective computing may be an interdisciplinary field spanning at least computer science, psychology, and/or cognitive science. A machine may interpret an emotional state of a human. The machine may adapt a behavior to the emotional state of the human. The machine may provide an appropriate response to the emotional state of the human.
An affective state may be categorized into one or more predefined affective states. The affective state may be categorized in a space, as shown by way of example in
Arousal and/or valence may be tracked. For example, arousal and/or valence may be tracked via speech analysis, facial expression analysis, body language analysis, electroencephalography, galvanic skin response (GSR) or electrodermal activity (e.g., a measure of activity of the sympathetic nervous system, e.g., fight or flight response), tremor or motion analysis, pupillometry, eye motion/gaze analysis, blink rate analysis, heart rate analysis, blood pressure analysis, respiration rate analysis, and/or body temperature analysis. One or more predefined affective states may be determined and may be plotted on a two-dimensional space plotting arousal and valence. A predefined affective state may be concurrent with one or more other predefined affective states. An arousal measure and a valence measure for the user may be determined at various times. At the various times, the arousal measure and the valence measure may be measured and/or plotted on the two-dimensional arousal-valence space.
A game designer (e.g., a video game designer) may attempt to achieve a balance between making games challenging (e.g., overly challenging), which may frustrate a user, and making games easy (e.g., overly easy), which may bore the user. Sensor data may be captured using gaming platforms that incorporate cameras, accelerometers, motion trackers, gaze trackers, and/or the like. The sensor data may be analyzed to determine an affective and/or cognitive state of the user. Analyzing the sensor data may include analyzing one or more video images captured of a user along with other sensor input data, such as GSR, tremor, body language, and/or facial expressions. Game content may be adjusted based on the determined affective and/or cognitive state (e.g., to increase or maximize the user's engagement and/or reduce or minimize attrition).
For example, game content may be adapted by reducing the level of difficulty when a user's valence and arousal measures indicate excessive anger with the game play. If pupillometric estimates indicate that the user is saturated and/or unable to encode memories, game content may be adjusted based on the pupillometric estimates. A system may approach this trade-off in an open-loop fashion. For example, after a user attempts to achieve an objective more than a threshold number of times, the system may provide the user with a hint or another form of assistance. A closed-loop approach may be more tuned to the user's response to the game content. Some systems may allow the user to select a level of difficulty for the game. An adaptive difficulty mode may base the level of difficulty on the user's measured affective and/or cognitive state. The system may offer assistance and/or hints as the affective and/or cognitive state may indicate that such assistance may improve the user's gaming experience. Adaptive difficulty may be enabled or disabled, for example, based on user preferences.
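A minimal sketch of such a closed-loop adaptive difficulty mode follows. The state thresholds, step sizes, and the open-loop attempt count are assumptions rather than disclosed values.

```python
def adapt_difficulty(difficulty, arousal, valence, cognitive_load,
                     failed_attempts, hint_threshold=3):
    """Closed-loop difficulty adjustment driven by the measured affective
    and/or cognitive state. Returns (new_difficulty, offer_hint)."""
    offer_hint = False
    # High arousal with negative valence: excessive anger with the game play,
    # so reduce the level of difficulty.
    if arousal > 0.7 and valence < -0.5:
        difficulty = max(1, difficulty - 1)
    # Saturated / unable to encode memories (e.g., per pupillometric estimates):
    # offer assistance rather than raising the challenge.
    elif cognitive_load > 0.8:
        offer_hint = True
    # Low arousal: the user may be bored, so raise the challenge.
    elif arousal < -0.5:
        difficulty += 1
    # Open-loop fallback: assist after repeated failed attempts at an objective.
    if failed_attempts >= hint_threshold:
        offer_hint = True
    return difficulty, offer_hint
```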
A timing for delivery (e.g., insertion) of content (e.g., one or more advertisements) may be determined. The timing for delivery may increase (e.g., maximize) an impact of the one or more advertisements. An affective and/or cognitive state of a user may be used to influence advertisement placement (e.g., a timing for delivery). If advertisements are inserted at the wrong time (e.g., when the user is saturated with other activities) during a user's session, the marketing message may be lost, or an advertisement may be bypassed. By tracking a user's affective and/or cognitive state, a system may insert advertisements at a time that may increase or maximize the efficacy of the message delivery and/or reduce or minimize the frequency of advertisement bypasses (e.g., “skip the ad” clicks). An adjustment to the timing of delivery may not impact the overall rate of advertisement insertions. The adjustment to the timing of delivery may optimize the timing of the insertion. For example, a time window may include a duration of time during which an advertisement may be inserted. The timing of the advertisement delivery within the time window may be determined based on the user's cognitive and/or affective state. The advertisement may be inserted at a particular time within the window based on the detection of a receptive cognitive state and/or a receptive affective state of the user at the particular time within the window. The advertisement may be inserted at or toward the end of the time window on a condition that the affective and/or cognitive state of the user did not trigger an advertisement insertion earlier in the time window. A content viewing timeline may be overlaid with or partitioned into one or multiple such time windows such that one or multiple advertisements are inserted as the user views the content. For example, an hour-long video program (or an hour of video viewing, even if not a single video program) may be partitioned into five time windows of twelve minutes each, and the cognitive and/or affective state of the user may be used to adjust the timing of delivery (e.g., insertion) of an advertisement into each of the five time windows. In this way, an advertiser may time the delivery of advertisements to coincide with receptive cognitive and/or affective states of the user, while maintaining a pre-determined overall rate of advertisement insertion (e.g., five advertisements per hour). The time window may be combined with a normalized peak detector. The normalized peak detector may determine an affective and/or cognitive state normalization based on a moving average of the affective and/or cognitive state of the user. A threshold affective and/or cognitive state for advertisement placement may adapt to a user with a lower average response.
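The time-window mechanism combined with a normalized peak detector might be sketched as follows. The window length, history depth, trigger ratio, and end-of-window margin are all assumed values, and the sketch assumes the caller samples the receptivity score regularly (e.g., every few seconds).

```python
from collections import deque

class WindowedAdInserter:
    """Insert one advertisement per time window, triggered by a receptivity
    score normalized against a moving average (a normalized peak detector)."""

    def __init__(self, window_s=720.0, history=120, trigger_ratio=1.25):
        self.window_s = window_s              # e.g., five 12-minute windows/hour
        self.trigger_ratio = trigger_ratio    # peak threshold relative to baseline
        self.history = deque(maxlen=history)  # moving-average normalization
        self.window_start = 0.0
        self.inserted_this_window = False

    def on_sample(self, t, receptivity):
        """Call periodically with a receptivity score derived from the user's
        cognitive and/or affective state. Returns True when an advertisement
        should be inserted at time t."""
        if t - self.window_start >= self.window_s:
            self.window_start = t             # start a new time window
            self.inserted_this_window = False
        self.history.append(receptivity)
        baseline = sum(self.history) / len(self.history)
        # Fallback: insert at or toward the end of the window if no peak
        # triggered an insertion earlier in the window.
        near_window_end = (t - self.window_start) >= 0.95 * self.window_s
        if not self.inserted_this_window and (
                receptivity > self.trigger_ratio * baseline or near_window_end):
            self.inserted_this_window = True
            return True
        return False
```

Because the trigger compares each sample against the user's own moving average, the threshold adapts to a user with a lower average response, as described above.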
The determined cognitive and/or affective state of the user 504 may be provided for an advertisement delivery timing, at 508. The cognitive/affective state estimation 506 may be provided via a network 510. The advertisement delivery timing, at 508, may determine a timing for delivery (e.g., schedule insertion) of one or more advertisements based on the determined cognitive and/or affective state of the user 504. An advertisement insertion may be triggered when the user is receptive. For example, an advertisement may be delivered to the user when the user is receptive. The affective state of the user may indicate when the user is receptive. For example, a distance measure from the affective state of the user to a predefined affective state may be determined. The predefined affective state may include a predefined arousal measure and a predefined valence measure. When the distance measure from the affective state of the user to the predefined affective state is below a predetermined threshold, the user may be receptive. The user may be receptive when the user is exhibiting moderately high arousal and high valence. An advertisement may be delivered to the user when a cognitive load of the user is below a predetermined threshold. The cognitive load of the user may be below the predetermined threshold when the user is able to encode new memories.
The determined timing for delivery of the one or more advertisements, at 508, may be provided to a content publisher 512. The content publisher 512 may deliver the one or more advertisements and/or other content to the user 504 based on the determined timing. The affective- and/or cognitive-adaptive advertisement insertion timing system 500 may be implemented using a system architecture such as the systems depicted in
An affective and/or cognitive state of a user may be used to assist drivers and/or pilots. The cognitive state of the user may be given more weight than the affective state of the user (e.g., cognitive processing may be more important than affective state). Adaptation of infotainment systems may leverage affective state estimation as disclosed herein. For example, music may be suggested to a driver based on the affective state of the driver.
Drivers or pilots may become distracted or saturated with lower priority interface alerts and may miss high priority messages, such as collision alerts. The cognitive load of the user may be monitored, for example, via pupillometry (e.g., using a rear-view mirror or dashboard mounted camera), GSR (e.g., using GSR sensors incorporated in the steering wheel or aircraft yoke), voice analysis (e.g., as captured by a vehicle communications system), and/or information from a vehicle navigation system (e.g., based on GPS, etc.). These inputs may be synthesized into a cognitive load assessment that may be used to provide one or more triggers (e.g., dynamic triggers for user interface alerts). For example, if the fuel level drops below a threshold, an alert may be timed for delivery to avoid distraction and/or allow the driver to maintain focus on higher priority tasks, such as avoiding collisions. The alert may be timed for delivery in a prioritized manner based on a cognitive and/or affective state of a driver or pilot. The prioritized manner may enable one or more critical alerts to be processed immediately (e.g., without delay). A lower priority alert, in the prioritized manner, may be delivered while a cognitive bandwidth (e.g., cognitive load) of the driver or pilot enables the driver or pilot to focus on the lower priority alert. The vehicle navigation system may be used to keep track of key points on a route and/or to trigger alerts of upcoming maneuvers.
Cognitive loading may provide an input to the timing of one or more user interface messages. For example, if the driver or pilot is (e.g., according to a preplanned navigation route or a flight plan) approaching a location that may involve the execution of a maneuver, such as an exit from a highway or a vector change, the HMI may deliver one or more nonessential messages while monitoring cognitive load and may provide one or more indications based on the cognitive load of the user. For example, if pupillometric and/or galvanic skin response (GSR) measurements indicate that the driver or pilot is not saturated with mental activity, in advance of a critical maneuver, the interface may indicate the presence of lower priority interrupts, e.g., maintenance reminders, etc. As another example, if pupillometric and/or GSR measurements indicate that the driver or pilot may be saturated with mental activity, the interface may omit or delay indicating the presence of lower priority interrupts.
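Delivering alerts in such a prioritized manner might be sketched as follows. The priority levels, the cognitive load threshold, and the maneuver flag are assumed for illustration.

```python
import heapq

CRITICAL = 0  # e.g., collision alert: deliver immediately, without delay
LOW = 2       # e.g., maintenance reminder: deliver when load permits

class AlertScheduler:
    """Deliver critical alerts immediately; hold lower priority alerts until
    the driver's or pilot's cognitive load permits focusing on them."""

    def __init__(self, load_threshold=0.5):
        self.load_threshold = load_threshold
        self.pending = []  # min-heap ordered by (priority, arrival order)
        self.seq = 0

    def post(self, priority, message):
        heapq.heappush(self.pending, (priority, self.seq, message))
        self.seq += 1

    def poll(self, cognitive_load, approaching_maneuver=False):
        """Return the alerts that should be presented right now."""
        deliver = []
        while self.pending:
            priority, _, message = self.pending[0]
            if priority == CRITICAL:
                # Critical alerts are processed immediately.
                deliver.append(heapq.heappop(self.pending)[2])
            elif cognitive_load < self.load_threshold and not approaching_maneuver:
                # Cognitive bandwidth permits a lower priority alert.
                deliver.append(heapq.heappop(self.pending)[2])
            else:
                break  # defer lower priority alerts for now
        return deliver
```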
Affective and/or cognitive state may be used in educational settings, such as computer-based training sessions and/or a classroom environment (e.g., a live classroom environment). One or more cues may be used to determine student engagement and retention. The one or more cues may be used to control the flow of information and/or the timing of breaks in the flow of information. A cognitive and/or affective state of a student may be used to determine a timing for (e.g., a pace of) the presentation of material. The cognitive state of the student may include the cognitive load of the student. The cognitive and/or affective state of the student may be used as a trigger for repetition and/or reinforcement. For example, if the first presentation of a topic results in a negative affect (e.g., high arousal and negative valence), the topic may be clarified and/or additional examples may be provided. The cognitive and/or affective state of the student may be used by a teacher (e.g., a live teacher) or in the context of computer-based training. For example, an ability of a student to absorb material may be calculated based on the cognitive and/or affective state of the student. As another example, the timing of a break (e.g., appropriate breaks) may be calculated based on the cognitive and/or affective state of the student. The computer-based or live training system may monitor one or more students in a class. The computer-based or live training system may provide one or more indications (e.g., reports) of the cognitive and/or affective states of the one or more students. For example, the one or more indications may be presented to a teacher in a live classroom via an HMI during the class. Based on the cognitive and/or affective state of the one or more students, the computer-based or live training system may determine an efficacy of a rate (e.g., a current rate) of teaching and/or may monitor incipient frustrations that may be developing. In the live training (e.g., classroom) system, the teacher may be presented (e.g., via an HMI) with a recommendation to change the teaching pace, to spend additional time to reinforce material associated with low (e.g., poor) cognitive and/or affective states, and/or to spend additional time with one or more students identified as having a low (e.g., poor) cognitive and/or affective state. In the computer-based training (e.g., learning) system, the teaching pace (e.g., the pace of the lessons) may be adjusted, and/or the reinforcement of material for a student may be triggered automatically in response to detection of a low (e.g., poor) cognitive and/or affective state of the student.
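For the computer-based training case, pace adjustment and reinforcement triggering might be sketched as follows; the thresholds, step sizes, and pace bounds are illustrative assumptions.

```python
def adjust_lesson(pace, cognitive_load, arousal, valence):
    """Return an updated lesson pace plus flags for reinforcement and a break,
    based on the student's measured cognitive and/or affective state."""
    reinforce = False
    take_break = False
    # Negative affect (high arousal, negative valence) after a topic:
    # clarify the topic and/or provide additional examples.
    if arousal > 0.6 and valence < -0.4:
        reinforce = True
    # Saturated: the student may be unable to absorb new material,
    # so schedule a break and slow the pace.
    if cognitive_load > 0.8:
        take_break = True
        pace = max(0.5, pace - 0.1)
    # Low load with neutral-to-positive affect: the pace may be increased.
    elif cognitive_load < 0.3 and valence >= 0.0:
        pace = min(2.0, pace + 0.1)
    return pace, reinforce, take_break
```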
Affective and/or cognitive state may be used for product suggestion (e.g., advertisement selection). A content provider may provide a user with a suggestion based on feedback (e.g., explicit feedback). The feedback may include a click of a “like” button and/or prior viewing choices by the user. A retailer may suggest (e.g., select for delivery) one or more products based on a browsing and/or purchase history of the user. The cognitive and/or affective state of the user may be tracked (e.g., to facilitate selecting advertisements for products and/or content that the user may enjoy). For example, when the user exhibits a high level of arousal (e.g., a high arousal measure) and positive valence (e.g., joyfulness) following the presentation (e.g., delivery) of a certain type of video, audio, or product, the retailer may select a product for delivery that has historically elicited positive affective responses from similar users. Similar users may include users whose affective responses to content have been similar to those of the current user of interest. Users who have had similar responses may be used as proxies for the expected response of the current (e.g., target) user to new content.
An affective- and/or cognitive-adaptive product suggestion system may track a cognitive and/or affective state of a user as the user consumes content (e.g., an advertisement). The affective- and/or cognitive-adaptive product suggestion system may categorize the cognitive and/or affective state of the user as the user consumes content. The affective- and/or cognitive-adaptive product suggestion system may categorize one or more stimulus/response pairs based on the cognitive and/or affective state of the user as the user consumes content. The stimulus in a stimulus/response pair may include information presented to the user (e.g., content). The response in the stimulus/response pair may include a cognitive state and/or an affective state of the user in response to the information presented to the user. The user may be associated with a customer category based on the one or more stimulus/response pairs. The one or more stimulus/response pairs may indicate how the content or product made the user feel. The one or more stimulus/response pairs may be stored in a database. An advertisement may be selected for delivery to the user based on the customer category associated with the user. The affective- and/or cognitive-adaptive product suggestion system may observe how the user responds to different content or products and, over time, develop a stimulus response model. The stimulus response model may be based on historical user responses to one or more prior advertisements. The stimulus response model may be used to categorize one or more preferences of the user. The stimulus response model may be used to select an advertisement for delivery to the user. The stimulus response model may select an advertisement for delivery to the user based on one or more previous responses by the user to one or more advertisements for one or more products. For example, a first advertisement for a first product may be selected for delivery to the user based on a previous response of the user to a second advertisement for a second product.
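Selection from a stimulus response database of customers in a predefined customer category might be sketched as follows. The database schema, category labels, and the mean-valence scoring rule are assumptions for illustration.

```python
from collections import defaultdict

class SuggestionEngine:
    """Select content using a stimulus response database keyed by
    customer category."""

    def __init__(self):
        # category -> stimulus -> list of observed valence responses
        self.db = defaultdict(lambda: defaultdict(list))

    def record(self, category, stimulus, valence):
        """Store a stimulus/response observation for a customer category."""
        self.db[category][stimulus].append(valence)

    def select(self, category, already_seen=()):
        """Pick the stimulus that historically elicited the most positive
        affective response from customers in the same category."""
        candidates = {
            stimulus: sum(responses) / len(responses)
            for stimulus, responses in self.db[category].items()
            if stimulus not in already_seen
        }
        return max(candidates, key=candidates.get) if candidates else None

engine = SuggestionEngine()
engine.record("outdoor-enthusiast", "ad_hiking_boots", 0.8)
engine.record("outdoor-enthusiast", "ad_office_chair", -0.2)
print(engine.select("outdoor-enthusiast"))  # ad_hiking_boots
```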
For example, a user may be placed in (e.g., associated with) a category associated with enjoying certain content or products and being unhappy with other content or products. Content (e.g., specific content) and/or one or more products that are widely consumed (e.g., popular content and/or popular products) may be used as markers for user affinity. The content and/or one or more products that are widely consumed may be used to associate one or more advertisements with one or more customer categories. One or more advertisement selections (e.g., product suggestions) may be determined based on a stimulus/response pair (e.g., by extrapolating one or more positive responses) from one or more other users in the same or a similar customer category.
An affective- and/or cognitive-adaptive product suggestion system, such as the affective- and/or cognitive-adaptive product suggestion system 800, may indicate that a user who watched a first content or advertisement and/or bought a first product may be interested in a second content and/or a second product. A person who watched and/or bought the first content and/or the first product may not have enjoyed or been pleased with the first content and/or the first product. A user who watched and/or bought the first content and/or the first product may not be pleased with the second content and/or the second product.
By monitoring the affective and/or cognitive state of the user, an affective- and/or cognitive-adaptive product suggestion system (e.g., the affective- and/or cognitive-adaptive product suggestion system 800) may be able, without the need for explicit user feedback, to indicate (e.g., report) to a user that one or more users with similar tastes or interests who enjoyed or were pleased with a first content and/or a first product also enjoyed or were pleased with a second content and/or a second product. The affective- and/or cognitive-adaptive product suggestion system may not require (e.g., may avoid the need for) explicit user feedback. The affective- and/or cognitive-adaptive product suggestion system may provide a faster (e.g., more immediate) and/or more direct measure of consumer satisfaction than explicit feedback. For example, a user may not be consciously aware of the user's own enjoyment level. As another example, explicit feedback from the user may be influenced by one or more transient events that do not reflect the overall user experience.
Affective and/or cognitive state may be used for human-machine-machine-human interactions, such as video chat. Affective and/or cognitive analysis of one or more participants may be performed in a real-time video chat (e.g., video call) between two or more participants. The affective and/or cognitive analysis of the one or more participants may provide information to one or more participants to enhance the flow and/or content of information. The affective and/or cognitive analysis may determine a cognitive state of one or more participants and/or an affective state of the one or more participants based on sensor data. Affective and/or cognitive state analysis may assist in interpersonal relationships. For example, a participant in a conversation may offend (e.g., unknowingly offend) another participant. When affective state analysis is performed, an interaction between two or more participants may be improved (e.g., issues may be addressed rather than left unaddressed). A user interface on an end (e.g., each end) of the real-time video chat may incorporate cognitive and/or affective state analysis. A user video stream may be processed for cognitive and/or affective state analysis at a client (e.g., each client) or in a central server that processes the session video (e.g., the one or more session video streams).
Examples of a video annotation may include an indication, to a first party, that a second party (e.g., the other party) in a call may be confused and/or may desire clarification or reiteration. For example, the second party may desire additional time before continuing the discussion (e.g., while processing information). The second party may be offended and may desire an apology or other form of reconciliation. The second party may be overloaded and may desire a pause or break from the conversation. The second party may be detached (e.g., may not be paying attention). A cognitive/affective estimation subsystem may determine (e.g., estimate) that the other party may be deceptive based, for example, on facial expressions and/or cognitive and/or affective analysis. When a party is determined to be deceptive, the cognitive/affective estimation subsystem may indicate that the party is being deceptive and that the party be handled with caution (e.g., be wary of responses provided by the party, be alert for any deceptive tactics, etc.).
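The mapping from an estimated state of the other party to such annotations might be sketched as follows. The thresholds, the attention input, and the annotation texts are assumed for illustration; deception estimation is omitted from the sketch.

```python
def annotate(arousal, valence, cognitive_load, attention):
    """Map the estimated state of the other party in a video chat to
    on-screen annotations for the first party."""
    annotations = []
    if cognitive_load > 0.8:
        annotations.append("Overloaded: consider a pause or break.")
    elif cognitive_load > 0.6 and valence < 0.0:
        annotations.append("May be confused: clarify or reiterate.")
    if valence < -0.5 and arousal > 0.5:
        annotations.append("May be offended: consider reconciliation.")
    if attention < 0.3:
        annotations.append("Detached: may not be paying attention.")
    return annotations

# Example: the other party shows high load and mildly negative valence.
print(annotate(arousal=0.2, valence=-0.1, cognitive_load=0.7, attention=0.9))
```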
An interpretation may be performed. The interpretation may be performed by leveraging one or more reference pairs, each of which may associate a cognitive and/or affective state with an interpretation, to provide interpretations of the cognitive and/or affective state to one or more users.
The processes and instrumentalities described herein may apply in any combination, and may apply to other wireless technologies and other services.
A WTRU may refer to an identity of the physical device, or to the user's identity, such as subscription-related identities, e.g., MSISDN, SIP URI, etc. A WTRU may refer to application-based identities, e.g., user names that may be used per application.
The processes described above may be implemented in a computer program, software, and/or firmware incorporated in a computer-readable medium for execution by a computer and/or processor. Examples of computer-readable media include, but are not limited to, electronic signals (transmitted over wired and/or wireless connections) and/or computer-readable storage media. Examples of computer-readable storage media include, but are not limited to, a read only memory (ROM), a random access memory (RAM), a register, cache memory, semiconductor memory devices, magnetic media such as, but not limited to, internal hard disks and removable disks, magneto-optical media, and/or optical media such as CD-ROM disks, and/or digital versatile disks (DVDs). A processor in association with software may be used to implement a radio frequency transceiver for use in a WTRU, UE, terminal, base station, RNC, and/or any host computer.
This application claims the benefit of U.S. Provisional Patent Application No. 61/943,467, filed Feb. 23, 2014; the contents of which are incorporated by reference herein.