This invention concerns a method for measuring cognitive load. In other aspects the invention can be expressed as a computer system and as software that are used to perform the method.
The concept of cognitive load has been used in a variety of fields that deal with the human mind interacting with some external stimulants. The definition of cognitive load is slightly different in each field. For instance, in pedagogical literature cognitive load refers to the total amount of mental activity imposed on working memory at any instance in time; while in ergonomics literature it is described as the portion of operator information processing capacity, or resources that are required to meet cognitive task demands. Each field provides different methods to measure cognitive load.
In this specification the phrase “cognitive load” is defined as in the cognitive psychology literature, and its meaning is not a measurement of attention span, work load, stress, engagement or other external elements to a task. Cognitive load is defined here as the mental effort or demand required for a particular user to comprehend or learn some material, or complete some task [1]. Cognitive load is relative to both the user (i.e. their ability to process novel information) and the task being completed (i.e. complexity), at any single point in time. It is attributable to the limited capacity of a person's working memory and their ability to process novel information.
Conventional methods for measuring cognitive load, include:
There are a number of problems with these methods for measuring cognitive load, including:
Objectively quantifying cognitive load that can be applied uniformly across fields as a standard, or to allow comparison between subjects remains an open problem. In fact, historically, the most consistent results for cognitive load assessments have been achieved through self-rating subjective measures. These allow users to describe in fine detail their perceived level of cognitive load induced by various types of task.
In a first aspect the invention provides a method for measuring a person's cognitive load, the method comprising the steps of:
Grammar is the branch of linguistics that deals with syntax (the use of words in the construction of phrases and sentences), morphology (internal structure of words), and also with semantics (meaning and interpretation) [2], [3]. It covers the rules governing the use of any given natural language and includes morphology and syntax, often complemented by semantics, phonetics (physical sounds of human speech), phonology (use of sound to encode meaning), and pragmatics (ability of a speakers to communicate more than that which is explicitly stated) [4].
The invention concerns the discovery that grammatical features can be used to assess a person's cognitive load, and how they can be assessed by identification, weighting and combination. The use of grammatical features represents a divergence from the known methods of providing a measure of a person's cognitive load. It is an advantage of the invention that by concentrating on the grammatical features it is able to provide an objective measure of the person's cognitive load. It is a further advantage that it provides this measure in a non-intrusive manner. Since the invention only requires word based input, it can be applied to any task where the input is word based.
The method may also comprise the step of classifying the task, such as task type and task complexity. In this case, the predetermined grammatical features of step (b) may be weighted and combined in step (c) according to the task classification for more accurate cognitive load measurement.
The predetermined grammatical features of step (b) may include one or more of:
The word based input may be produced from speech of the person when performing the task. The speech may be automatically converted to text to form the word based input.
The method may further comprise classifying the task based on predetermined task and user profiles. These predetermined task and user profiles may be generated based on a training cycle. The training cycle may comprise identifying the grammatical features that are indicators of the person's cognitive load, and may be based on the specific task.
Steps (b) and (c) may be performed in substantially real time.
The word based input of step (a) may be text input that is typed by the person when performing the task. Alternatively, it may be written words that are automatically converted to electronic text.
The weighting and/or combining of step (c) may be based on the task, such as task domain or classification. The weighting may also be person dependent.
The method may further comprise the step of receiving physiological input about the person while performing the task. The method may further comprise in step (b) identifying predetermined physiological features from the physiological input that are indicative of the person's cognitive load. Alternatively, the weighting of step (c) may be influenced by the physiological input.
The method may further comprise the step of receiving environmental data about the environment of the person while performing the task. In that case, the weighting of step (c) may be influenced by the environmental data.
In a further aspect, the invention provides a computer system to measure a person's cognitive load while performing a task comprising:
In yet a further aspect, the invention provides software operable when installed on a computer that can interface with a person to perform the method of measuring the person's cognitive load as described above.
An example of the invention will now be described with reference to the accompanying drawings, in which:
In complex, data-intense situations, users can experience high levels of cognitive load. This can interfere with their ability to complete a task and also adversely affect their performance of the task.
The intelligent user interface system of this example, which is aware of the user's changes in cognitive load can alleviate this problem by implementing output strategies to modulate the pace, content, and format of the output interaction in real-time and/or by determining the resources needed by the user to complete the task effectively and efficiently. The interface is aware of the user's cognitive load based on an assessment of the grammatical features of the user's speech input. This is to be contrasted with acoustic and phonetic features of the user's speech input.
The method of measuring the user's cognitive load from which these output strategies will be based will now be described with reference to
In this example the user 200, which is a student, is performing a task that is domain dependent, the domain in this case being on-line distance learning.
In performing the learning task, the user 200 interacts with the human-computer interface by providing speech. In alternative examples, the interface can be part of a computer assisted human-human interaction system such as the systems typically deployed in air traffic control, emergency services and call centres.
The components of the example interface of this example include:
In this example, during training, the user 200 is required to complete a set of tasks using the speech-enabled interactive interface 202. They are able to use mainly their speech to interact with the system 202. As they complete the tasks, their speech, which includes their unconsciously-generated grammatical features, is recorded and fed 100 into a cognitive load analysis unit 216.
More specifically, the user 200 is assigned a set of tasks for the user to complete while interacting with the system 202. These tasks must have definite variation in complexity so that the measurements at each cognitive load level can be taken separately. Different methods of feature classification will be derived but not all may be applied depending on the task application instance.
As the user 200 completes the tasks, their speech is recorded 212 and converted to text transcriptions using a speech-to-text conversion component 214. This is non-intrusive, objective, and can be performed in real-time.
The cognitive load analysis component 216 receives the transcriptions (including time alignment information) and identifies/extracts any applicable set of selected grammatical feature categories 104 for each level of cognitive load. The feature categories that are chosen depend on the classified task application area. Therefore, only a few task specific features will be extracted by the cognitive load analysis unit.
Users 200 exhibit several grammatical changes in their natural language usage while interacting with the system that can be used to determine their cognitive load. A selection of grammatical features can be used for particular application domain and at particular time intervals for measuring cognitive load of a user. They are highly unobtrusive as the data can be collected while the subject completes some other tasks without them being aware of the cognitive load analysis taking place in real-time.
The grammatical features can be extracted from the spoken language, or the language that is typed directed into the system by the classifier. The grammatical features include:
These features are then calculated and weighted 106 for that user 200 and combined 106 together by the combiner using a feature combination method to acquire a reliable measure of the cognitive load.
Referring now to
In control measurement mode (i.e. training), the module extracts 302 and records the relevant set of grammatical features from the parsed transcription at each level of cognitive load along with the user profile 210 and task 208 characteristics. All available user and task attributes are divided into homogeneous clusters based on clustering algorithm, such as Decision Tree. Each cluster can then be assigned a task ID where corresponding grammatical features can be tagged accordingly. Features from the higher load tasks can be assigned higher weights and given a larger significance during the actual measurement process later.
In actual measurement mode (i.e. real-time assessment of cognitive load) the task is classified in 304, where a fast search is done to retrieve the task ID, for example, for an online learning or examination task with its difficulty level, subject type etc., from the Decision Tree built during training as shown in
Emphasis will be given to the Negative Emotions and Cognitive Words in this case. On the other hand when the task is classified as a reading task, the predetermined features to be extracted can include:
Here the emphasis will be given to the Pause features. Similarly, when the task is classified as a collaborative problem solving task, the predetermined features to be extracted from the transcription can include:
The module 216 then extracts 302 these features from the parsed 300 transcription and calculates and assigns weights 106 to the extracted features 310.
Then the module 216 combines 312 the weighted features 310 to achieve the measure of cognitive load based on the speech input of the user 200. The combination mechanism is based on a probabilistic framework that combines various classifiers 310 together. This combination mechanism can be performed as described in the co pending patent application, published as WO2006AU00914. Various models can be used to build the feature classifiers. Some of the more feasible ones include Gaussian mixture models, hidden Markov models, linear discriminate analysis, decision trees, and artificial neural networks. For example, classification scores for word-duration features can be combined with those scores for the word-category-frequency features, to improve the accuracy and robustness of the cognitive load measurement.
The result of the combination 312 is output and it represents the measure of cognitive load 218 being experienced by a user 200 for a particular task.
Depending on the proximity of this measure to an optimized pre-set cognitive load target level for that task, the next task or system output or response is verified for appropriateness or changed. For example, in the case of a distance education tutorial, if the cognitive load is too low, this feedback may be provided in real time to the tutor who can then accelerate the progress of the tutorial. If the cognitive load is too high, the interface can be automatically programmed to minimise all open applications displayed to the user (i.e. graphs on display) and to only show the video display of the tutor.
A further example will now be described where the input speech was recorded from a bush fire incident control room. The features that were identified and extracted included:
Speech data sets were input into the system for various types of task. All the tasks involved different levels of difficulty or complexity to induce different levels of cognitive load. This analysis showed that the user's cognitive load while performing the different tasks was statistically different.
The table of
This table shows that, for all sets of speech data, the trend is stable for all selected features regardless of whether the resulting trend is significant. Additionally, for many of the features, the trend is statistically significant which confirms that these features can be used to robustly determine the user's level of cognitive load.
It will be appreciated by persons skilled in the art that numerous variations and/or modifications may be made to the invention as shown in the specific embodiments without departing from the scope of the invention as broadly described.
The cognitive load measurement method and system presented is both flexible and extensible. It is customisable and has the ability to be applied to various application areas and adapt its processing by selecting task and application specific grammatical features and perform cognitive load measurements accordingly.
Since the invention has the capability to process and analyse text transcriptions, its functionality can be easily extended to many other input modalities, not limited to just speech. For example, the user could just use the keyboard for text input, instead of the speech, to interact with the system and the system can be easily extended to record the text input and measure the user's cognitive load level normally by performing data cleaning, parsing, feature extraction, classification, and combination processes.
Similarly another example could be the use of pen input by writing the interaction text, either using a digital pen on a computer screen or using a traditional pen and a paper. The written text then can be easily converted to digital transcriptions by using some optical character recognition (OCR) technology and fed into the cognitive load module for normal cleaning, parsing, feature extraction, classification, and combination processes to evaluate load levels.
The selection of grammatical features listed above can be varied in used in any possible combination. Using Machine Learning, certain features could be weighted more heavily for particular users for particular task type, for example, if the machine learning indicates that these features provide better clues for the user's cognitive load fluctuations.
As shown in
The assessment of grammatical features could be integrated with other cognitive load measurement systems and applications. This includes multi-modal applications with cognitive load assessment capability to supplement and complement their overall performance.
The system could also be provided with any online web-based speech-enabled applications.
A text summarisation engine can be incorporated in the system to further enhance the measurement of the cognitive load of the user.
Another extension is to enable the system to perform the semantic analyses, i.e. to understand the meaning and interpretation of what the user is saying under a particular task situation. This will provide with more contextual information to improve the cognitive load measurement accuracy.
A further example of the invention is a business call centre. The users at the call centres interact with numerous customers on daily basis to satisfy their queries and provide timely support to solve their technical problems. They often work under very high pressure situations. They need to recall and retrieve relevant information in a matter of seconds and react appropriately. In such a highly stressful situation, the operators should be able to interact with the technology impeccably. The computer systems the operators use, should make the access to relevant information as easy as possible while allowing the operator to be in control of the interaction. If the system were able to customise the interface based on user's cognitive load and current situation to manage the flow of information to the operators, they would be able to handle their queries more effectively and support the customers efficiently.
The present embodiments are, therefore, to be considered in all respects as illustrative and not restrictive.
[1] F. Paas, et. al., “Cognitive load measurement as a means to advance cognitive load theory”. Educational Psychologist, 2003, 38, 63-71.
[2] WordNet, A lexical database for the English language; Cognitive Science Laboratory, Princeton University, Princeton, N.J.; http://wordnet.princeton.edu; Last accessed on Sep. 16, 2008.
[3] Language Glossary, Dept of Language and Linguistics, University of Essex, Wivenhoe Park, Colchester C04 3SQ, United Kingdom; http://www.essex.ac.uk/linguistics/; Last accessed on Sep. 16, 2008.
[4] Wikipedia, the free encyclopedia; http://en.wikipedia.org/wiki/Grammar; Last accessed on Sep. 16, 2008.
Number | Date | Country | Kind |
---|---|---|---|
2008905089 | Sep 2008 | AU | national |
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/AU2009/001289 | 9/29/2009 | WO | 00 | 5/9/2011 |