Hearing loss is one of the most common handicapping conditions today. It has been estimated that as much as 10 percent of the population have a hearing loss of some kind (Schein and Delk, 1974). This proportion has not changed significantly over the years. A recent estimate by the hearing aid industry indicates that, as of 2009, 31.5 million Americans were hard of hearing (Kotchkin, 2009). What has changed significantly in recent years is the rapidly growing use of cochlear implants by people with severe or profound hearing loss. The number of cochlear implants has grown by 20 percent per year over the past five years. One company alone has already provided over 120,000 individuals with a cochlear implant (see Cochlear Corporation website www.cochlearamericas.com).
The incidence and severity of hearing loss increase with advancing age. In particular, the percentage of people using hearing aids increases rapidly after 45 years of age. La Plante et al. (1992) found that less than 10 percent of hearing aid users are under 45 years of age and that this percentage doubles to almost 20 percent for hearing-aid users in the 45-to-64 age range. The percentage doubles once again for hearing aid users over 75 years of age.
Only a small proportion of the population that could benefit from acoustic amplification actually use hearing aids. Many older people who have acquired hearing losses later in life are reluctant to obtain hearing aids. A common problem is that adults who have been fitted with hearing aids are often too busy (and/or reluctant) to attend classes in auditory rehabilitation. For those who are motivated but are too busy to attend regular classes at a clinic, a home-based system for practice and self training with a few well placed visits to a clinic to monitor progress would be of great value. Another problem is that structured training programs can be boring and a high degree of motivation is needed to use training programs of this type over long periods of time.
Visual speech cues play an important role in face-to-face communication when the auditory signal is degraded as a result of a hearing loss and/or the addition of noise, reverberation or other distortions. Visual speech cues complement, to a large extent, the auditory cues that are lost in the impoverished speech signal. The auditory cues signaling place of articulation, for example, are quite subtle while the corresponding visual speech cues are far more prominent. The auditory cues signaling manner of articulation are not as subtle as those for place of articulation and are less subject to error. The visual cues for manner of articulation, in contrast, are more subtle than those for place of articulation. The complementary nature of visual and auditory speech cues is even more striking with respect to voicing. The voicing contrast is relatively easy to perceive auditorially even for people with some degree of hearing loss, but is very difficult to detect visually.
There are two basic approaches to speech reception training, the analytic approach and the synthetic approach (Schow and Nerbonne, 2002). In the analytic approach, the emphasis is on training the student to recognize the individual sounds of speech (or in the case of visual speech recognition, to recognize the articulatory gestures associated with individual sounds). It is assumed that with improved recognition of the components of speech, improved recognition of words, phrases and sentences will follow. The synthetic approach, in contrast, emphasizes training to recognize the meaning of what is said even if individual speech components are not recognized. This approach places much greater emphasis on the use of context.
There are also different approaches with respect to the use of auditory and/or visual cues in rehabilitation training. The approaches vary from the use of audition only (Pollack, 1970) to the opposite extreme of withholding or attenuating auditory cues during speechreading training (Jeffers and Barley, 1971) to an emphasis on integrating visual and auditory cues (Garstecki, 1982). A training technique that emphasizes the integration of all available cues, audition, vision, and context, as well as possible repair strategies, is that of Continuous Discourse Tracking ((DeFilippo and Scott, 1978; DeFilippo, 1988).
The experimental evidence on the effectiveness of auditory training in improving speech reception skills is strongest for hearing impaired individuals who have great difficulty understanding speech and who receive substantial amounts of auditory training. In a classic study sponsored by the National Research Council, an average increase on the order of 30 percentage points in speech recognition scores was obtained for both “partially deaf” and “profoundly deaf” students over a five year period, most of the improvement being obtained after two years of training (Hudgins, 1954). Similarly, substantial long term improvements in speech reception ability have been obtained with cochlear implant patients receiving significant amounts of auditory training (Clark, 2003).
The effect of long-term training on speech reception skills with moderately hearing impaired adults has not been studied to the same extent. Small gains in speechreading ability have been reported by Walden et al (1977, 1981) for intensive training after a relatively short period of time. Similarly, Kricos and Holmes (1996) have shown small improvements in auditory training with older adults. Recent research by Humes, et. al. (2009) has also shown significant improvements in speech recognition in noise with auditory training. A related issue is that of acclimatization to acoustic amplification. Gatehouse (1993) has shown that at least six weeks of exposure to a new hearing aid is needed before acclimatization shows significant improvements in speech reception.
There have been few experimental studies comparing the analytic and synthetic approaches to speech reception training. Of the few studies that have been reported, Boothroyd, et al. (1987) compared the two procedures with 5 adult subjects fitted with cochlear implants. The subjects were found to benefit more from the synthetic approach using sentence materials. There is, however, a substantial overlap between the two approaches in practice. Implementation of the analytic approach is often followed by the synthetic approach once some degree of mastery at the analytic level has been achieved. Similarly, implementation of the synthetic approach usually begins with an introduction to the different sounds of speech.
Blarney and Alcantara (1994) in a review of auditory training procedures point out that success in auditory training is dependent on a number of complex variables, the most important being the degree of difficulty in understanding speech, the motivation for improved understanding, evidence of progress (i.e., success breeds success), the amount of auditory training and the appropriateness of the auditory training program for each individual's needs. In view of the above it is surprising that more attention has not been given to the problem of auditory training in a noisy environment for adult hearing aid users. Difficulty in understanding speech in noise is extremely common among hearing aid users, hearing aids are not particularly helpful in a noisy environment, and many hearing aid users simply switch off their instruments in a noisy environment. In this case, lack of success breeds lack of success. This lack of success is also partly responsible for the large number of people with significant hearing loss who do not use hearing aids in the belief that hearing aids will be of little if any benefit.
Computer-based techniques provide an efficient means of speech reception training for students with hearing loss. Such training methods have many advantages (Sims and Gottermeier, 2000). These include adaptive matching of the training program to the user's needs and level of competence, using novel and interesting displays, providing unobtrusive objective monitoring of student progress and, in addition, providing the designer of the training system information on those aspects of the system that need to be improved.
The Dynamic Audio Visual Interactive Device (DAVID) first developed in 1973 has been used successfully at the National Technical Institute for the Deaf for many years. A more recent application of computer based training is Seeing and Hearing Speech developed by Sensimetrics.
Another recent development using a computer-interactive training technique is the Listening and Auditory Communication Enhancement (LACE) program developed by Sweetow and his colleagues. Significant improvements in objective evaluations with appropriate controls have been reported by. Sweetow and Sabes (2004, 2006) in evaluating this training program and a major hearing aid company, Starkey Laboratories, has begun to distribute LACE. This is a welcome development and it is a sign that major hearing aid companies are beginning to pay more attention to the importance of aural rehabilitation in acoustic amplification.
The revival of interest in aural rehabilitation has been driven in part by pressure from funding sources to demonstrate cost-effectiveness with appropriate outcome measures. Chisholm et al (2004) and Hawkins (2005) have reviewed the short- and long-term outcomes of adult aural rehabilitation and have documented the positive outcomes, such as reduction in self-perception of hearing handicap and better use of hearing aids and communication strategies.
To summarize the background, people who use hearing aids or hearing amplifiers to improve their ability to hear are faced with several issues, and in some instances disadvantages, including a) reliance on use of a device for all hearing improvements; b) inability to understand conversations in crowded and/or noisy environments; c) the number of people with hearing loss requiring aural rehabilitation is substantial, on the order of millions; and d) conventional aural training programs are prohibitively expensive. As a result, many hearing-disabled individuals are falling behind the curve on improving their comprehension skills, and unfortunately, skills that are not utilized on a regular basis tend to atrophy. There is a growing awareness of the need for more effective methods of aural rehabilitation. At the same time, there is also a need for the development of practical computer-based techniques that offer the means for achieving this goal.
An effective way of improving speech reception skills, including cognitive skills, is to exercise these skills as much as possible. To this end, it would be ideal to develop a system that makes methods of auditory training entertaining so that students will want to use the system for long periods of time, thereby getting substantial practice in honing auditory and auditory-visual speech reception skills without being bored. A system that is fun to use will not only maintain motivation but will also serve as an enticement for students to use the system more often and for long periods of time thereby improving their speech reception skills. In addition, it would be ideal to make such a system an interactive adaptive procedure that will be used so as to automatically match the training material to the user's needs and level of performance.
The user's rate of progress will be monitored objectively, which in some embodiments can include online tracking of progress that may be monitored by a professional, and the system will provide appropriate positive reinforcement to continue to maintain motivation. The system could be configured around a moderately priced personal computer, or internet enabled device so that it would be affordable and convenient to use at home, such as an inexpensive DVD- or web-browser-based version of the system. To meet these goals, a system has been developed and evaluated experimentally. It is described herein and addresses all of the needs and provides additional functionality.
Auditory training systems are disclosed and include at least one speech perception activity, and at least one audio signal, at least one video signal or a combination thereof, wherein the system is user interactive, performance adaptive or a combination thereof.
An executable software system is disclosed for auditory training and provides at least one speech perception activity, and at least one audio signal, at least one video signal or a combination thereof, wherein the software is user interactive, performance adaptive or a combination thereof.
An auditory training system that includes a computer, internet-based system, server-based system or another hardware device, an executable software system that provides at least one speech perception activity, and at least one audio signal, at least one video signal or a combination thereof, wherein the system is user interactive, performance adaptive or a combination thereof and wherein the computer, internet-based system, server-based system or another hardware device executes the software system to initiate and run the auditory training system.
Methods of training and improving auditory skills, cognitive skills or a combination thereof, includes: providing at least one speech perception activity, providing at least one audio signal, at least one video signal or a combination thereof, wherein the system is user interactive, performance adaptive or a combination thereof; and utilizing the at least one speech perception activity in combination with the at least one audio signal, at least one video signal or a combination thereof in order to improve auditory skills, cognitive skills or a combination thereof.
Methods of training and improving auditory skills, cognitive skills or a combination thereof includes providing a computer, internet-based system, server-based system or another hardware device, providing an executable software system that provides at least one speech perception activity, providing at least one audio signal, at least one video signal or a combination thereof, wherein the system is user interactive, performance adaptive or a combination thereof and wherein the computer, internet-based system, server-based system or another hardware device executes the software system to initiate and run the auditory training system; and utilizing the computer, internet-based system, server-based system or another hardware device to execute the executable software system that provides at least one speech perception activity in combination with the at least one audio signal, at least one video signal or a combination thereof.
Specifically, auditory training systems are disclosed and include at least one speech perception activity, and at least one audio signal, at least one video signal or a combination thereof, wherein the system is user interactive, performance adaptive or a combination thereof. As used herein, the term “auditory” refers to speech reception or speech comprehension, and an “auditory training system” is one that uses audio, video or a combination thereof to identify, assess and ultimately improve auditory and auditory-visual speech reception skills and cognitive skills, such as speech and language. In some contemplated embodiments, auditory training systems also include at least an amount of interference, distortion or a combination thereof of the speech signal so as to adjust the difficulty with which speech is perceived, such as background noise.
The concepts of user interactive and performance adaptive are important in the scope of contemplated embodiments. Contemplated systems and software are not merely “read only” or one-way systems, but are designed to be two-way systems, in that the user responds to and interacts with the system and/or software. Therefore, information is being provided to the user from the system and in response, the user is providing responsive information to the system. Contemplated systems may also be performance adaptive. This important concept is one of the keys to the ability of the user to improve during the use of the system and/or software. Performance adaptation means that the system and/or software processes the information provided by the user and utilizes that information to both a) determine the current skill level of the user and b) chart an individual course of action for the user to move the use to the next milestone. In the case of manual embodiments, as described later, the information provided by the user includes information that the user is ready to transition to the next difficulty level. In the case of internet or server-based systems, the information provided by the user comprises either or both of the answers and feedback to the activities and/or information that the user is ready to transition to the next difficulty level.
An executable software system is disclosed for auditory training and provides at least one speech perception activity, and at least one audio signal, at least one video signal or a combination thereof, wherein the software is user interactive, performance adaptive or a combination thereof.
An auditory training system that includes a computer, internet-based system, server-based system or another hardware device, an executable software system that provides at least one speech perception activity, and at least one audio signal, at least one video signal or a combination thereof, wherein the system is user interactive, performance adaptive or a combination thereof and wherein the computer, internet-based system, server-based system or another hardware device executes the software system to initiate and run the auditory training system. It is contemplated that the term “executes” means either executes directly or executes indirectly through a series of other commands, language or software.
Methods of training and improving auditory skills, cognitive skills or a combination thereof, includes: providing at least one speech perception activity, providing at least one audio signal, at least one video signal or a combination thereof, wherein the system is user interactive, performance adaptive or a combination thereof; and utilizing the at least one speech perception activity in combination with the at least one audio signal, at least one video signal or a combination thereof in order to improve auditory skills, cognitive skills or a combination thereof.
Methods of training and improving auditory skills, cognitive skills or a combination thereof includes providing a computer, internet-based system, server-based system or another hardware device, providing an executable software system that provides at least one speech perception activity, providing at least one audio signal, at least one video signal or a combination thereof, wherein the system is user interactive, performance adaptive or a combination thereof and wherein the computer, Internet-based system, server-based system or another hardware device executes the software system to initiate and run the auditory training system; and utilizing the computer, internet-based system, server-based system or another hardware device to execute the executable software system that provides at least one speech perception activity in combination with the at least one audio signal, at least one video signal or a combination thereof.
In some embodiments, an executable software system is disclosed for auditory training and provides at least one puzzle, and at least one audio signal, at least one video signal or a combination thereof, wherein the software is user interactive and performance adaptive. Also, in some embodiments, contemplated puzzles comprise at least one “cross-sentence puzzle”.
In some embodiments, an auditory training system includes a computer, an internet-based system, a server-based system or another hardware device, an executable software system that provides at least one puzzle, and at least one audio signal, at least one video signal or a combination thereof, wherein the software is user interactive and performance adaptive and wherein the computer, internet-based system, server-based system or another hardware device executes the software system to initiate and run the auditory training system.
As mentioned, a core component of a contemplated system, as described herein, is a speech perception activity, which is designed to engage and challenge the user through at least one puzzle, game or other interactive activity in order to improve auditory perception and overall auditory and/or cognitive skills. These contemplated speech perception activities may comprise audio signals, video signals or a combination thereof.
In some embodiments, a contemplated speech perception activity is a “cross-sentence” puzzle. A contemplated puzzle is similar to a crossword puzzle except that answers to the puzzle items consist of short sentences or phrases instead of single words. Each clue in the puzzle consists of text in which a phrase or sentence is missing. The subject is provided with the answer in the form of a video recording of the text including the missing sentence or phrase. In order to solve the item, it is necessary for the listener to integrate the auditory and visual information conveyed by the video recording with the contextual cues provided by the text. For auditory training without visual cues, the video signal is suppressed and the audio channel only is used. Background noise is one contemplated method or technique that is used to control the difficulty of each puzzle item. Other methods of controlling the difficulty of a puzzle item is to add reverberation to the speech signal, or speed up the rate at which the speech is produced, or to distort the speech signal in other ways.
The technique is best described through an example.
The subject's task is to combine the auditory-visual information provided by the noisy video recording with the sentence context contained in the caption in order to solve the item. The subject enters what he/she perceives as the missing words via the computer keyboard. The solution to this puzzle item is: “It takes a lot of time”. If the correct words have been entered by the subject, they appear in bold black letters in the appropriate squares of the puzzle. Incorrect words appear in red. If the subject is unable to solve this item at this stage, he/she may try another item, as is the usual strategy in solving crossword puzzles. If the subject goes on to solve items #1, #2 and #3 down, the words “it”, “lot” and “time” will appear in the row for #4 across, thereby making this item easier to solve the next time it is attempted.
The nine puzzle items and their solutions are presented here for the reader's interest.
#1 down
Clue: “Where there's a will I want to ______”
Solution: “be in it”
#2 down
Clue: “A blunder at the right moment is better than cleverness at ______”
Solution: “the wrong time” (Carolyn Wells, author)
#3 down
Clue: “Delusions of grandeur make me feel ______”
Solution: “a lot better about myself” (Jane Wagner, humorist)
#4 across
Clue: “ ______ to prepare a good impromptu speech”
Solution: “It takes a lot of time” (after Mark Twain)
#5 across
Clue: “Competition brings out the best products and in people ______”
Solution: “about the worst” (David Sarnoff)
#6 down
Clue: “Good taste is the ______”
Solution: “worst vice ever invented” (Edith Sitwell)
#7 across
Clue: “Suppose you were an idiot. And suppose you were a member of Congress ______”
Solution: “but I repeat myself” (Mark Twain)
#7 down
Clue: “Conservatives believe in reform, ______”
Solution: “but not now” (Mort Sahl)
#8 down
Clue: “Experience lets you recognize a mistake when you ______”
Solution: “repeat it”
#9 across
Clue: “ ______ to satisfy our deep need to complain”
Solution: “Language was invented” (Lily Tomlin)
In order to solve the speech perception activity, which is a cross-sentence puzzle in this embodiment, the subject is required to make maximum use of his/her speech reception skills. Therefore, it is important for the puzzle to be challenging but not overly difficult, which is accomplished by adjusting the level of background noise (cafeteria noise that had been equalized in level over time) adaptively to match the subject's performance level. Initially, this was done by decreasing the noise level after each incorrect entry and increasing the noise level if all of the missing words are entered correctly. Subsequently, a number of more refined adaptive methods were evaluated. These include:
A further refinement of a contemplated system has been the development and inclusion of instructional material focused on the synthetic approach to training. The listener is periodically presented with short paragraphs telling her how to improve her listening skills, which she can then practice as she works through the puzzles.
The software also keeps a detailed record of each student's progress. The data record includes the puzzle item chosen for each trial, the number of times each video recording is played back on each trial, the response time for each playback, the speech-to-noise ratio on each playback and the words correctly identified on each playback. A test score equal to a weighted average of the speech-to-noise ratio when the puzzle item is solved and the time taken to solve the item. This test score is provided as feedback to the subject to indicate how well they are doing. All the collected data are analyzed to fine-tune the system by determining relative difficulty of the items and speakers, and the kind of instructional material provided. The data can also be analyzed and presented to rehabilitation professionals, thus helping them to monitor the progress and customize the treatment of their patients.
The problem of spelling errors in the listener's typed responses was addressed by using a public domain spell-check engine in an innovative way. In this application, we want to accept misspellings of the target word, but only if they do not represent a misunderstanding of the utterance; that is, only if they are not valid words in themselves. For example, “bin” and “bun” are spelled similarly, but represent very different concepts. On the other hand, “bin” and “bim” are also spelled similarly, but “bim” isn't a word. Most spell-check engines provide a metric of confusion which describes how easily confused lexically one word is with another. Knowing this, and the target word, we were able to evaluate a response as follows:
A large number of speech perception activities are needed, including cross-sectional puzzles, to provide many hours of intensive speech-reception training. In order to prepare large numbers of activities and/or puzzles rapidly and efficiently a computer program was developed for generating cross-sentence puzzles from a large database of puzzle items. The program operated in the following way. An initial puzzle item is entered into the computer. The computer then searches for short sentences or phrases in the database containing words that are common to the initial item. The possible items are then ranked in order of how many words each item contains that is common to other words in the database. The entry with the highest number of common words with other words in the database is then selected to be the next puzzle item since it is likely to have more intersections than any other possible item. The computer then searches the database for the next set of possible items, and so on. The process is continued until a puzzle with 20 to 30 items is obtained.
A compendium of thousands of witty sayings, clever quotations and comedic punch lines was compiled and indexed. Then, 15 puzzles were generated for use in a feasibility study. Subsequently, 24 additional puzzles were generated for a much larger study and product deployment. All the puzzles were double-checked to make sure that there were no inappropriate items (i.e., items with racist, sexist or off-color content were excluded). Video recordings were made of the selected puzzle items by professional actors.
Pilot experiments were performed on a group of 10 adult hearing aid users with sensorineural hearing losses ranging from 14 to 57 dB (pure tone average at 500, 1000, 2000 Hz). The subjects had no additional handicapping conditions or evidence of cognitive problems. Their ages ranged from 50 to 80 years of age. All of the subjects had English as a first language. The subjects were recruited from audiological clinics and senior citizen centers in the San Francisco area. All but one of the subjects were experienced hearing aid users (more than 1 year of hearing-aid use). Subject #10 had just been provided with acoustic amplification for the first time.
Each subject was given an IBM T-30 laptop computer to use over a period of three weeks. Each computer was loaded with 15 puzzles. Two of the puzzles were used to explain the procedure and to allow the subject to get used to using the system. The subject then used the remaining puzzles over a 3-week period. The subjects were asked to use the system whenever it was convenient. It was recommended that the subjects use the training system for about ½ hour each day, but they could use it for longer or shorter periods if they wished. At the start of the experiment a video recording of the IEEE sentence test (Hawley, et al. 1999) was administered in noise using an up-down adaptive paradigm to estimate the speech-to-noise ratio at 50% correct sentence identification. The test was administered twice to obtain an estimate of test-retest variability. The IEEE sentence test was administered again at the end of the 3-week training period followed by a structured interview. The video recordings of the IEEE sentences were made by the same speaker who recorded the test items for the cross-sentence puzzle.
The results of the experiment are shown in
A statistical analysis was performed on the data of the experienced hearing aid users (Subjects 1 to 9). Subject 10 was omitted from the statistical analysis since the large improvement shown by this subject (15.5 dB) resulted from the joint effect of auditory training and adaptation to acoustic amplification. A repeated measures analysis of variance for the experienced hearing-aid users showed a statistically significant decrease in the speech-to-noise ratio corresponding to 50% intelligibility (F=8.8 (df=1.8), p=0.017). The average improvement was 2.8 dB. A statistical analysis of each subject's performance showed that three subjects (Subjects 1 to 3) did not show a statistically significant change in their speech-to-noise ratio for 50% intelligibility at the end of the training program. The remaining subjects showed a significant decrease in the speech-to-noise ratio, p ranging from <0.1 to <0.0001. For the 6 experienced hearing-aid users showing a significant improvement, the average improvement was 4.1 dB.
An important feature of a contemplated training program is that it is intended to be entertaining so as to motivate students to use the system over long periods of time. The time spent on the system by each subject for each training session was monitored by the computer.
It is revealing to note that the system was used frequently for periods of time in excess of 40 minutes. The average time spent on the system in a training session was 45 minutes, but there were large differences among subjects. On several occasions, the more highly motivated subjects used the system continuously for periods of up to two hours or more, as shown by the bars to the right of the figure. Not surprisingly, the subjects showing the largest improvements in performance were the ones who spent more time using the system.
The system also monitored the student's progress during training. Whenever a puzzle was completed, feedback was provided to the student in the form of an index between 0 and 100 which served as a measure of the student's performance. The performance index was based on a weighted average of the speech-to-noise ratio required for solving the puzzle and the average number of attempts at solving the test items.
The final stage of the evaluation consisted of a structured interview with each subject. With one exception, all of the subjects responded positively regarding the value of the system, whether they enjoyed using the system and whether they felt that their ability to understand speech in noise improved as a result of the training program.
The responses to the open ended questions indicated that all but one of the subjects enjoyed using the system and that they felt that their speech reception ability in noise had improved and that they would continue using the system if it was available. The one subject who did not enjoy using the training system did not like the style of humor that was used. This problem can be rectified by including different types of humor for different tastes. Several very useful suggestions were also made for improving the system, such as speeding up the rate of adjustment for matching the difficulty of the training program to each student's level of performance; each test item in a puzzle should be a challenge to the student, either too difficult or too easy. The feasibility study showed significant improvements in speech recognition in noise using the computer-based speech-reception training system.
The average improvement for the 9 experienced hearing-aid users was 2.8 dB. A much greater improvement (15.5 dB) was obtained with a first-time user of acoustic amplification. This result, however, should be considered as no more than a promising case study and that more detailed experiments are needed with new hearing-aid users to determine how much of an improvement can be attributed to the training program and how much is a result of acclimatization to acoustic amplification. It should be noted that the training period was of short duration (3 weeks) and that there was no evidence of a slowing down in the improvement over time for the subjects showing significant improvements in performance. It should also be noted that the feasibility study focused on improving speech reception with both visual and auditory cues. The system can also be used with less sophisticated instrumentation for improving listening skills for audition only.
One feature of contemplated training programs that distinguishes them from traditional auditory training programs is that contemplated training programs are intentionally designed to be entertaining. A training system that is fun to use will not only maintain motivation, it will also be used more intensively and for longer periods of time.
The underlying assumption is that increased use of the training system will result in greater improvements in speech reception ability. The results of the feasibility study support both of these points. All but one of the subjects enjoyed using the system and most of the subjects used the system for longer periods of time per training session than was initially recommended (45 minutes, on average, as opposed to the recommended period of ½ hour per training session). Several of the subjects used the system continuously for periods exceeding an hour at a time and, on average, the subjects who used the system for longer periods of time showed larger improvements in performance. The one subject who did not enjoy using the system did not like the style of humor that was used. This problem can be addressed by developing training materials with different styles of humor to suit different tastes. The entertaining aspect of the training system is expected to be a major plus in marketing the system to prospective users.
An additional strength of a contemplated training system is its adaptive format in which the difficulty of the puzzle items is adjusted automatically to match the student's level of performance. This concept has already been discussed as performance adaptivity. Some of the subjects complained that the rate of adjustment was too slow resulting in some puzzle items being either much too difficult or much too easy. This problem has been addressed in a subsequent study in which more efficient adaptive methods are used to speed up the rate of convergence to each student's level of performance. The use of computer-interactive techniques also allowed for detailed, unobtrusive record keeping as well as on-line computation of relative performance for providing helpful feedback to the student.
These feasibility studies demonstrated that the proposed method of speech-reception training and contemplated systems can improve speech-reception skills in experienced hearing-aid users. In addition, a case study with a first-time user of acoustic amplification showed promise of much larger improvements in speech-reception skills for new hearing-aid users.
It is contemplated that a significant improvement will be obtained over and above that to be expected from acclimatization to acoustic amplification without auditory training. More extensive experiments have been undertaken. Contemplated experimental results not only serve to emphasize the importance of auditory training when hearing aids are first fitted, they also demonstrate that contemplated systems can provide this training in a convenient, efficient and enjoyable way.
Major hearing aid companies are beginning to pay more attention to the importance of providing auditory training for new hearing-aid users and are interested in cost-effective ways of providing this training.
Additional contemplated versions of a contemplated speech-reception training program are currently being implemented and experimentally evaluated:
Two versions of the DVD-based speech-reception training program are contemplated. One contemplated version (manual version) requires the user to solve each puzzle with pencil and paper in much the same way as a conventional crossword puzzle is solved. Another contemplated version, referred to as the adaptive version, has been also developed. It does not require the user to record his/her responses manually, except for a single index on completion of a puzzle. This index specifies the subject's average level of performance in solving the puzzle and is needed to evaluate progress in the training program. An automated up-down adaptive strategy is used to find the difficulty level at which each puzzle item is solved. An honor system is used in which the subject decides if he/she has solved the puzzle item on each trial. The method is described in greater detail below
A conventional DVD player is the only equipment needed to implement the system; however, any system that can play a DVD is contemplated and can be utilized. Standard or Blu-Ray™ DVD discs are prepared containing video recordings of the puzzle items. For the purpose of evaluating the system, there is one puzzle per disc, but most contemplated systems will have several puzzles per disc. Each disc will be programmed using a standard disc authoring system. The following is a description of the contents of the discs for the two versions of the DVD-based system.
A contemplated set of DVD discs for the manual version will have a set of 7 menus on each disc. The menus will be of the form shown in
The following example illustrates how the manual system is used. The student is given a booklet containing a set of cross-sentence puzzles. The booklet has the same format as a booklet of crossword puzzles except that in order to solve the puzzle, words rather than letters need to be entered in each open box in the puzzle. This is done manually using a pencil (with an eraser nearby to correct wrong entries as the student proceeds to solve the puzzle). The student or user typically begins by selecting a menu of average difficulty; e.g., DIFFICULTY LEVEL 4 (the default condition). The student then clicks on one of the clues shown in the menu. The student might begin by clicking on Clue 4 ACROSS. The video recording corresponding to this clue then appears on the screen. The speech-to-noise ratio for this recording corresponds to an average level of difficulty (i.e., level 4 of 7 possible levels). For this particular puzzle item the video recording would show a speaker producing the sentence “It takes a lot of time to prepare a good impromptu speech”. The clue for this item is “ ______ to prepare a good impromptu speech” which appears as a caption below the speaker's face.
If the student or user is able to recognize the missing words without difficulty, (s)he enters the answer in the booklet showing the puzzle and then clicks on INCREASE LEVEL in order for the next puzzle item to be more difficult. If some, but not all of the missing words are ‘recognized, the student can either attempt to solve the puzzle item again, or try another puzzle item without changing the level of difficulty. If the student cannot recognize the missing words after several attempts, (s)he would then click on DECREASE LEVEL so as to reduce the difficulty of the puzzle. On entering the solution to a puzzle item, the student also enters the difficulty level at which the puzzle item was solved. On completing a puzzle, the student clicks on SHOW SOLUTION in order to view each puzzle item knowing the solution. This will not only serve to reinforce the student's auditory-visual perception of each puzzle item, but will also allow the student to check the accuracy of his/her answers.
The DVD-based speech-reception training system maintains the interactive nature of the computer-based training system, but depends on an honor system for adjusting the level of difficulty to match the student's level of performance. It is also less convenient in that the answers to each puzzle item are entered manually using pencil and paper. Although less convenient than the computer-based system, this procedure has been found to work quite well in a preliminary trial, which is not surprising considering that the use of pencil and paper is widely used in solving crossword puzzles.
The adaptive version of the DVD-based training system has a similar structure. There are 7 menus for each level of difficulty, as used for the Manual version, but with one difference. The two icons showing INCREASE LEVEL and DECREASE LEVEL are replaced with icons showing I HAVE SOLVED THE ITEM and I HAVE NOT SOLVED THE ITEM, respectively. The subject attempts to solve a given puzzle item by clicking on the screen and playing the video recording corresponding to the puzzle item. The subject can play the video recording as often as needed and then decides whether or not s(he) has solved the item and clicks on the appropriate icon. If the subject has not solved the item, s(he) clicks on I HAVE NOT SOLVED THE ITEM. The menu then changes to one corresponding to a less difficult level of difficulty (i.e., the speech-to-noise ratio of the video recording for that menu is increased). The subject then attempts to solve the puzzle item at the easier level of difficulty. The sequence is repeated with the level of difficulty becoming progressively easier until the subject solves the puzzle item and clicks on the icon I HAVE SOLVED THE ITEM. The next menu then shows the correct solution. If the subject judged correctly that s(he) had solved the puzzle item, the subject clicks on an icon indicating this and the next menu will appear with a new test item at a higher level of difficulty; i.e., at a poorer speech-to-noise ratio. If the subject, on seeing the menu with the correct solution, realizes that s(he) had, in fact, not solved the puzzle item, the subject then clicks on an icon indicating this and the next menu then appears with a new test item at an easier level of difficulty; i.e., the associated video recording will be at a higher speech-to-noise ratio.
The procedure continues until all the items on a given puzzle have been solved. The subject then records the level of difficulty at which the last puzzle item was solved. Since the level of difficulty is adjusted adaptively, the final level of difficulty in solving the puzzle is a measure of the subject's level of performance for that puzzle and this information is used in tracking the subject's performance in the training program. Note that in the Manual version, the subject is free to choose the puzzle items in any order; in the Adaptive version, the puzzle items are presented in a predetermined order.
Experiment 1 is designed to evaluate the speech-reception training program with new hearing-aid users. Comparative data with experienced hearing-aid users will also be obtained. A crossover AB/BA experimental design will be used with two groups of new hearing-aid users. Group 1 will use the speech-reception training program for 6 weeks immediately after being fitted with a hearing aid followed by 6 weeks without using the training program. Group 2 will not receive speech-reception training until they have worn their hearing aids for 6 weeks after being fitted. This will allow them time to acclimatize to acoustic amplification after which they will use the speech-reception training program for a 6-week period. A third group of subjects will consist of experienced hearing-aid users. The experienced hearing aid users will begin with 6 weeks without training followed by 6 weeks with training. There will be 24 subjects in each group (see statistical power analysis below).
Subjects who are due to receive 6 weeks of speech-reception training will be given a lap top computer on loan so that they can use the system at home at their own convenience. The subject will be shown how to use the speech-reception training program and will be given two puzzles to practice with under supervision. The audio output of the computer will drive a pair of good quality stereo loudspeakers. The subjects will be instructed to adjust the loudness to a comfortable level and listen with their personal hearing aids as they would in every day listening to television.
The subject will the be given 24 puzzles to work on for the next 6 weeks. The subject will be asked to solve 4 puzzles per week. It is estimated that this will require an average of ½ hour of training each day. Puzzles in three categories of humor will be available for the subjects to choose from. On completing a puzzle, the subject will be asked to rate the humor of each puzzle item on a 5-point scale (e.g., 1=not at all amusing, 5=highly amusing). The ratings will be used to check that an appropriate category of humor has been selected. If low humor ratings are obtained, the humor category will be changed to a more appropriate one. On completion of the experiment, the ratings will be analyzed so as to develop a finer categorization of the humor for the final product.
Arrangements will be made to contact the subjects on a weekly basis, either by telephone or personal visit, to check on progress and resolve any problems that may occur. The computer will be returned at the end of the 6-week training period, the data on the computer will then be downloaded and the system prepared for the next subject. The experimental evaluations will be staggered with 12 subjects using computers at the same time. The estimated time to complete the experiment is 9 months allowing several weeks for slippage and unexpected problems.
All three groups of subjects will be evaluated on four occasions:
The subjects will be recruited from physicians’ and audiologists' offices and local speech and hearing centers, senior retirement homes and senior citizens' service centers and organizations, of which there are many in close proximity to SKERI. Dr Simon has a well established network of contacts with these groups and has not had any difficulty obtaining volunteers for her research. Subjects will be evenly divided between male and female with appropriate representation of minority groups. All subjects will be native speakers of English since the training materials are in English.
The subjects will be in the age range between 20 and 70 years of age and will have moderate to severe sensorineural hearing losses typical of the majority of hearing aid users. The new hearing aid users will have been fitted with a hearing aid for the first time. The experienced hearing aid users will have used a hearing aid for at least two years. All subjects will be screened on the basis of pure tone threshold audiometry, speech reception and recognition testing and acoustic-immittance testing. In addition to these audiologic evaluations, measures of auditory processing [the Test of Basic Auditory Capabilities [TBAC], (Watson, 1987)] and cognitive function (Wechsler Adult Intelligence Scale-Revised [WAIS-R], and the Wechsler Memory Scale-Revised [WMS-R], (Wechsler, 1981, 1987)) will also be obtained. These latter tests will screen subjects to ensure intact cognition. Subjects will also demonstrate relatively good physical health and normal daily functioning.
The IEEE sentence test, and the Abbreviated Profile of Hearing Aid Benefit (APHAB) and an interview will be administered at the end of each 6-week period (Times 1 to 4). Table 1 shows the history of training and testing times for each subject group. The interviews will be modeled on the COSI technique (Dillon et al, 1997). Prior to the start of training, the interview will explore each subject's expectations regarding the speech-reception-training program. On completion of a 6-week training period, the interview will focus on the extent to which the training program has met their expectations. The final interview at the end of the experiment will focus on the subject's overall evaluation of the training program and their assessment of its long-term benefits.
In addition to the measurements obtained at 6-weekly intervals, a detailed log will be kept of the use of the system by each subject on each puzzle. This log will include the speech-to-noise ratios when each puzzle item is solved. These speech-to-noise ratios are expected to be correlated with the speech-to-noise ratios for sentence recognition in noise as obtained with the IEEE sentence test. The data are analyzed using a multi-variate repeated-measures analysis of variance.
The Abbreviated Profile of Hearing Aid Benefit (APHAB) is a 24-item item self-assessment inventory in which the respondents report the amount of trouble they are having with communication or noises in various everyday situations. The APHAB has four subscales: Ease of Communication, Reverberation, Background Noise, and Aversiveness. A multivariate correlation analysis will be performed with the subscales of APHAB and the speech recognition in noise tests.
The data obtained on the Background Noise subscale will be of particular interest. It is anticipated that improvements in speech recognition in noise measured with the IEEE sentence test will show concomitant improvements in the Background Noise subscale and possibly also in the Reverberation subscale.
The data obtained by the computer in monitoring the details of each subject's performance (e.g., speech-to-noise ratio required to solve a puzzle time, number of attempts at solving a puzzle item, time taken for each attempt) will provide a large body of useful information. Note that there are 24 subjects, each of whom will complete 24 puzzles and that there are 6 speakers. If the training program is divided into 4 quarters, it is possible to design a 12×12×12 Latin square with subjects, puzzles and speaker x quarter as the three factors. This design will allow efficient, unbiased estimates of the average performance of each subject during each quarter of the training program, the average difficulty in understanding each speaker, and the average difficulty of each puzzle. Each of these averages can be expressed in terms of several measures, such as the average speech-to-noise required to solve a puzzle, the average number of attempts per puzzle, or the time taken to solve a puzzle. An efficient, objective index of relative performance will be obtained from a weighted average of these measures, the weighting to be determined from a multivariate discriminant analysis of the data.
This performance index will be useful in assessing the progress of each subject during the training program, for analyzing inter-speaker differences (important for both basic research and for developing training strategies for difficult speakers) and for identifying puzzles that are either too easy or too difficult. These outliers will be omitted from the final product.
The ratings of the humor in each puzzle will be analyzed using non-parametric techniques. The median and quartile ratings across subjects will be obtained for each puzzle item. If the 1st and 3rd quartiles do not differ by more than 2 points on the 5-point scale, that item will be defined as being homogenous. A non-homogenous item is one in which a large proportion of the subjects rated the humor one way (e.g., low ratings of 1 or 2) and another large proportion of the subjects rated the humor another way (e.g., high ratings of 4 or 5). The puzzle items will then be sub-divided into categories that are homogenous for different groups of subjects. It is anticipated that these subject groupings will reflect differences such as age, gender, and level of education. The categorization of the puzzle items obtained in this way will be much finer than the 3-categories used in Experiment 1.
The anticipated results for Experiment 1 are:
Experiment 2 will evaluate two versions of the DVD-based speech-reception training program. The same crossover AB-BA experimental design will be used as in Experiment 1. Two groups of new hearing aid users will participate. One group will begin with the Manual version, the other with the Adaptive version. After 6 weeks, the group using the Manual version will switch to the Adaptive version and the group using the Adaptive version will switch to the Manual version for another 6 weeks of training. The experiment will be terminated after a third 6-week interval without training. New hearing aid users will participate in this experiment since they are expected to show large training effects and will thus provide a sensitive test of any differences in the effectiveness of the two versions of the DVD-based training system. As in Experiment 1, there will be 24 subjects in each group, the statistical power analysis being the same for the two experiments.
Each of the subjects will be given a DVD player with a set of discs containing the puzzles to be worked on. On receiving the DVD player, the subject will be shown how to use the training program and will be given two puzzles to practice with under supervision. As in Experiment 1, the subject will be asked to solve 4 puzzles per week. Arrangements will be made to contact the subjects every week, either by telephone or personal visit, to check on progress and resolve any problems that may occur. The DVD player will need to be returned at the termination of training. The puzzle booklets used with the Manual version and the indices of performance recorded by the subjects using the Adaptive version will be picked up at weekly intervals. This procedure will allow for data analysis to be performed on an ongoing basis and will also allow for the early detection of any problems during training.
As in the first experiment, the subjects will be evaluated on four occasions. 1) At the start of the training program, 2) after 6 weeks of training with one version of training system, 3) after 6 weeks of training with the second version of the system, and 4) 6 weeks after termination of training. The same test battery as in Experiment 1 will be used. On completion of training (week 12), the subjects will be given a questionnaire focusing on a comparative assessments of the Manual and Adaptive versions from the subjects' perspective. The experiment will be staggered with half of the subjects in each group receiving training at the same time. The estimated time to complete the experiment is 9 months allowing several weeks for slippage and unexpected problems.
The data analysis will be analogous to that of Experiment 1. A multi-variate repeated-measures analysis of variance will be performed on data obtained with the IEEE test and a multivariate correlation analysis will be performed with the subscales of APHAB and the speech recognition in noise tests.
The anticipated results for Experiment 2 are:
1) Both versions of the DVD-based system will show significant improvements in speech recognition in noise.
2) The two versions of the DVD-based system are expected to show similar improvements in speech recognition in noise, but user preference will strongly favor one of the two versions.
3) As in Experiment 1, a small reduction in speech recognition ability is anticipated after 6 weeks without training.
Experiment 3 is designed to evaluate the computer-based speech-reception training program for audition only. The scope of this experiment is much narrower than the previous two experiments. Only one group of 24 subjects will be tested over a 12 week period in order to obtain data on the improvement in speech recognition in noise for a computer-based speech-reception training program using audition only. The same computer-based will be used as in Experiment 1, but with the video signal suppressed.
The subjects will be evaluated on three occasions: 1) At the start of the training program, 2) after 6 weeks of training, and 3) 6 weeks after termination of training. The anticipated duration of this experiment is 6 months. As in the previous experiments, a multi-variate repeated-measures analysis of variance will be used to analyze the data.
The anticipated results for Experiment 3 are:
1) Speech-reception training using audition only will show significant improvements in auditory speech recognition in noise
2) Auditory-visual speech recognition in noise will also improve, but by a smaller amount than that obtained with auditory-visual speech-reception training (comparative data obtained from Experiments 1 and 2)
The least sensitive comparison of interest is that between two groups of subjects for two experimental conditions. For example, in Experiment 1 a contrast of interest is the difference in speech-to-noise ratio between Week 0 and Week 6 for the group of subjects receiving speech-reception training compared to the difference between Weeks 0 and 6 for the group of subjects not receiving speech-reception training. This comparison involves a linear combination of 4 measurements (2 for each group) so that the variance of the data for this comparison is 4 times the variance of each measurement. The repeated measures analysis of variance in the feasibility study showed a between-subject variance of 1.4 dB2 (excluding the outlier). For a linear combination of 4 group means the variance will be (4×1.4/n) dB2 where n is the number of subjects within each subject group. Assuming a normal distribution, a minimum of 22 subjects is needed in order to detect a difference as small as 1 dB with an error probability of 0.05. Increasing the number of subjects in a group to 24 will allow for tests of more than sufficient sensitivity for the least sensitive comparisons of interest.
Methods 700 of training and improving auditory skills, cognitive skills or a combination thereof, includes: providing at least one speech perception activity 710, providing at least one audio signal, at least one video signal or a combination thereof 720, wherein the system is user interactive, performance adaptive or a combination thereof; and utilizing 730 the at least one speech perception activity in combination with the at least one audio signal, at least one video signal or a combination thereof in order to improve auditory skills, cognitive skills or a combination thereof, as shown in
Methods of training and improving auditory skills, cognitive skills or a combination thereof 800 includes providing a computer, internet-based system, server-based system or another hardware device 810, providing an executable software system 820 that provides at least one speech perception activity, providing at least one audio signal, at least one video signal or a combination thereof, wherein the system is user interactive, performance adaptive or a combination thereof and wherein the computer, internet-based system, server-based system or another hardware device executes the software system to initiate and run the auditory training system; and utilizing the computer, internet-based system, server-based system or another hardware device 830 to execute the executable software system that provides at least one speech perception activity in combination with the at least one audio signal, at least one video signal or a combination thereof, as shown in
It should be noted that this contemplated system user manual is directed to one version of the contemplated system, but as other contemplated embodiments are developed, such as 64-bit, iPad, other tablets, hand held devices, and the like, additional user manuals or amended user manuals will be and/or are drafted. For example, another contemplated embodiment provides that contemplated systems are distributed via internet download and the content is stored on and provided from central or regionally located servers. Another embodiment provides that contemplated systems are fully internet browser-based and therefore, no downloading is needed by the user. Each of these contemplated embodiments would be fully described in an overall user manual or have individual user manuals directed to their individual capabilities.
Enhanced Communication with ReadMyQuips™
Perhaps the most common complaint among people with hearing loss is that speech is difficult to understand in a noisy environment. Often, because of this difficulty, people isolate themselves (and their significant others) from many social situations that they previously enjoyed. ReadMyQuips™ is an innovative training program specially designed to help you face these challenges in a fun and creative way. Clinical studies indicate that with dedicated practice using ReadMyQuips™, many people can significantly improve their speech perception capabilities, thus gaining the confidence they need to tackle everyday, difficult listening situations.
Face-to-face communication makes use of a skill known as lip reading. Actually, the more correct term for this skill is speech reading, because although it is true that the lips convey valuable information, they are not the only source of clues to comprehension. When we speech read, we also get information from facial expressions, body language, and the situational context of the conversation.
There are two basic approaches to improving speech reading. One approach involves focusing on the individual sounds of speech and the associated facial patterns. This is known as the Analytic approach. Unfortunately, parts of speech look and sound different when they are used in natural conversation as opposed to the way they look and sound when produced individually. Also, by focusing too intensely on one sound, it is likely that another sound will be missed. This makes the Analytic approach problematic.
The other approach—and the one we recommend—is to try to understand the message that is being communicated and then let your brain put the pieces together. This method is known as the Synthetic or Global approach, since the focus is on understanding the message as a whole—the big picture—and not on deciphering individual sounds. Besides being more effective, this technique is also generally less stressful because it involves intuition and imagination.
One of the most remarkable aspects of human communication is that everybody speechreads (lipreads) to some extent, even those with perfectly normal hearing. It is an intuitive process, sort of like learning to run. And, as with running, training can help improve your skills significantly. The key to the global approach to speechreading is to get lots of practice.
And practice is what we hope to inspire you to do with ReadMyQuips™. A unique aspect of this training program is that it is entertaining. Your task is to solve a puzzle similar to a crossword puzzle except that you have to speechread the answer to each clue. While you are having fun solving the puzzle, you are also exercising and improving your speechreading skills.
ReadMyQuips™ is also unique in that it is adaptive. As you work through the puzzles, the difficulty level changes to match your growing proficiency. As a result, each puzzle continues to be challenging but solvable, even as your skills improve.
The first few puzzles may be relatively easy for you to comprehend, but as you go on the noise level will increase and you will be forced to depend upon speechreading cues more and more. This is as it should be. In real-life, varying levels of background noise are a ubiquitous presence, and the information gained through speechreading becomes more and more important as the noise increases.
Each time you play one of the video clips, fill in as many words of the sentence as you can, guessing when you are not exactly sure (the program will inform you whether or not you're correct). As in real-life, the more of a sentence you guess correctly, the easier (generally) the rest of the sentence should be for you. This is because the more you know the context of a sentence—any type of context (e.g., the situation, topic, speaker, etc.)—the more likely it is that you will comprehend the rest of the utterance. Remember: your goal is to comprehend the sentence, and you should use whatever cues you can to accomplish this purpose. You will find that the blank response boxes (where you type your answer) are usually framed by preceding and/or following visible words; these are part of the sentence and thus are also cues, much as you would find in real life (don't you often get just part of a sentence, but not all of it?). Therefore, before you click on the video image to play the sentence, first read these framing words and then try to fill in the remaining words in the response boxes.
If there is one key element to speechreading, it is that you must see the lips in order to do it. Now this may sound like a facetious statement, but it is not. Many people with hearing loss seem to focus their eyes everywhere but where they should: The fact of the matter is that we have often been conditioned to look our conversational partners “right in the eye.” Now that's well and good in most situations, but it's not conducive to maximizing the information you can get from a person's lip movements. So, in going through these lessons, lower your eyes a bit; some people focus somewhere around the nose and some directly on the lips. As it happens, your ability to perceive the tiny and rapid movements of the moving lips is better at the center point of your eye focus than at positions even slightly off. And even when you look directly at the lips, you should still be able to appreciate the broader facial expressions. After some practice, you'll soon determine which focus point is best for you.
ReadMyQuips™ also allows you to display the speaker's face in full-screen mode. Think of this as comparable to the real-life situation of first being some distance from a speaker (the usual screen display) and then being only a few feet away (for more personal conversations).
Seeing the lips as best you can implies that your corrected vision (if you wear eyeglasses) is accurate and up to date. We have often seen people with uncorrected or improperly corrected vision struggle to speechread when they could barely see a person's lips six or eight feet away. So before you begin ReadMyQuips, be sure that your eyeglass prescription is up to date; you're not going to do very well if the lip movements are simply a blur.
In real-life, it sometimes takes a bit of assertiveness in order for you to see a person's lips as well as possible. For example, the person who is talking to you while eating, smiling, or otherwise distorting their lips during the conversation is going to be much harder to understand. It's up to you to inform such people what they have to do in order for you to understand them (and presumably, they do want to be understood or else why talk to you at all?). As for the surgeons in the operating room taking out your appendix: forget it! Just be sure they know you're not going to be able to understand them very well when they have their mouth covered with a mask (though you should depend upon your hearing as much as you can in this situation).
We emphasize that ReadMyQuips™ is not a typical speechreading program, but that it targets background noise as a primary training factor. This is how it is in real life. We expect that most people using this program wear hearing aids or cochlear implants. These are marvelous devices and there is no question that they help most people with hearing loss hear better. But though they are undoubtedly necessary, they are often insufficient, at least in noisy places. This is where vision and speech reading come in. It turns out that precisely those speech sounds that are the most difficult to hear (like /f/and/th/) are the ones easiest to see. The reverse is true as well; those sounds that can easily be confused visually (try to see the difference between a/p/, /b/, and /m/—there is none!) can be differentiated through hearing. So with hearing and vision working together, the person with a hearing loss has a much better chance of comprehending speech than with either alone.
What this program is designed to do is to give you practice in comprehending speech under increasingly noisy conditions. Your mission (and you've decided to undertake it!) is to tolerate the loudest noise you can while still being able to completely understand an utterance. Not all of the speakers are equally intelligible; some are easier to understand than others. This, too, is what is normally found in real life. Some people, like ventriloquists, hardly move their lips at all. Others over-exaggerate each sound in trying to be “helpful” to you. Neither is very desirable, but both types are commonly found. You will not find these extremes in any of our four speakers, but you will (and likely already have) in real-life.
The ultimate goal of any type of any audio/visual training program is to enhance your overall communication skills. Sometimes, in spite of your best use of the visual and auditory cues available to you, you still don't understand what someone is saying. You can simply say “what?” but then the person is likely to repeat the sentence exactly as they said if before. Sometimes this is enough (though it is better for you to ask the person “can you say that again?”). When communication breaks down, it is often helpful if you can inform the person you're talking to what he/she has to do to make it easier for you to understand. That is, if they're talking too quickly or too softly, ask the person to “slow down a bit” or to “say that again just a little louder.” Think of this as grandma's scolding you and telling you to “speak more clearly” (which really means for you to pronounce your words a bit more precisely). This is called “clear speech” (repeating, rephrasing, slowing down, more precise pronunciation). Years ago, at MIT, researchers found that clear speech was a very effective way of improving speech perception, compared to the way a person normally spoke. Fortunately, you won't have to scold any of the speakers in this program!
Don't get discouraged as you go through the exercises in the ReadMyQuips™ program, and don't give up. You should have difficulty; if you didn't, you wouldn't be undertaking the training in the first place. It is necessary for you to keep at it, keep practicing, trying to tolerate the loudest background sounds you can while still understanding the utterances. We suggest that you devote about 30 minutes for each lesson four or five times a week. You don't have to finish a lesson in one day; it will be there waiting for you when you're ready to continue. By the time you finish the last puzzle, you should be able to tolerate louder levels of noise while still understanding the sentence. And this improvement will carry over to the noisy situations you confront in real-life.
To install ReadMyQuips™, insert this DVD in your computer's DVD drive. Depending on your system configuration, the setup program may start automatically, or you may be presented with the option to start it. If not:
Once started, the setup program will guide you through the installation process. You will be prompted for an install location, and given the option to install the software alone, or together with all video.
The full installation (softwares-video) requires approximately 4 GB of space on your hard-drive. However, if you do not install the video, you will need to insert the DVD every time you wish to use the program. In addition, on some systems, playing the video from DVD may result in intermittent skipping and stuttering. For best results, we recommend installing the video onto your computer.
Note: You must have administrator privileges on your computer to install this software. Also, you may encounter a number of security warnings, either from Windows itself or from your anti-virus software. Click Run, Open, or Allow when prompted.
Once you have installed the software, you can start the program by double-clicking on its icon on the desktop, or by selecting it from the Start Menu.
Important Note: Support for ReadMyQuips™ under Mac OS/X is still in its preliminary phase. We are making every effort to improve the functionality of the program in OS/X, but you should expect some issues, particularly regarding video playback.
To install ReadMyQuips™, insert this DVD in your computer's DVD drive, double-click on the RMQ icon on your desktop. Before installing, please double-click on the file called LICENSE.PDF to read the License Agreement. By installing this software you are accepting this agreement. Now, open the OSX folder and drag the ReadMyQuips™ application to your Applications folder (or wherever you wish on your hard drive). If you like, you can also install the video by dragging the RMQMedia folder somewhere on your hard drive.
The full installation (software+video) requires approximately 4 GB of space on your hard-drive. However, if you do not install the video, you will need to insert the DVD every time you wish to use the program. In addition, on some systems, playing the video from DVD may result in intermittent skipping and stuttering of the video. For best results, we recommend installing the video onto your computer.
Once you have installed the software you can start the program by double-clicking on its icon in your Applications folder. You can also add ReadMyQuips™ to your dock by dragging it there from the Applications folder.
The first time you run ReadMyQuips™ you will have to tell the system where the video is located (either on the DVD or on your hard drive). You will be prompted to do this at program startup. You can also do it later by taking the following steps:
5. Click OK.
Important Note: In order to use the update feature under OSX, you may have to give yourself write permission to the ReadMyQuips™ bundle by following the following steps after copying ReadMyQuips™ to your hard disk:
6. Click OK.
You should now be able to update ReadMyQuips™ normally.
You can uninstall ReadMyQuips™ at any time. Under Windows, just select the Uninstall option from the ReadMyQuips™ folder on the Start Menu. Under OS-X, just drag the ReadMyQuips™ application to the trash. Note that if you install the video on your computer, you will have to uninstall it separately.
Guess the quips . . . . . . as you learn to read lips!
ReadMyQuips™ consists of a set of puzzles. Each is like a crossword puzzle, except that each blank box represents a word rather than a letter. The words go together across or down to form witty or wise quotations (quips) which you must guess. The clues are video recordings of the quips spoken in a noisy environment.
To work on a quip, double-click on one of the boxes which comprise it. A new window will open in which you can play the video and try to guess the missing words.
Guess the missing words.
Click anywhere on the image to play the video. Type your guesses into the blank boxes at the bottom of the screen, and press ENTER or the SPACE BAR to move to the next box. If you are right, the box will turn gray, and you won't be able to type in it any more. Otherwise, the text you type will show in red. To reveal the word in any box, right click in the box.
To expand the image to fill the screen, click Fullscreen; to return again, click Exit Fullscreen
Click Make Easier or Make Harder to change the level of the speech relative to the noise.
Click Return to Puzzle if you want to try another item. If you guess all the words correctly, you will return to the puzzle automatically.
As you work on a puzzle, the difficulty (i.e., the loudness of the background noise relative to the speech) will automatically adapt to your progress. The quips will get harder to understand as you guess them correctly, and easier as you miss them.
You will also be able to adjust the difficulty of an individual quip while you are viewing the video.
To set the overall audio levels, use the volume controls on your computer.
ReadMyQuips™ comes with several puzzles built-in. You can either work on one puzzle until you've finished, or move back and forth between puzzles by clicking on the Next Puzzle and Previous Puzzle buttons at the bottom of the screen.
If a puzzle is too big to fit in the window, you can use the scroll bars on the bottom and right sides of the puzzle to view the hidden portion.
In some puzzles, each quip has been recorded by more than one speaker. When you open a quip, a speaker will be chosen at random from those available.
You can limit the available speakers for a puzzle at any time by selecting Choose Speakers from the Settings menu.
Note that this option will not be enabled if there is no choice of speakers for the current puzzle.
You can make the text of a puzzle bigger or smaller by selecting Zoom In or Zoom Out from the Settings menu. If a puzzle is too big to fit in the window, you can use the scroll bars on the bottom and right sides of the puzzle to view the hidden portion.
You can also control whether video recordings play on the full screen or in a window by selecting Play video using full screen from the Settings menu.
When you quit the program, your progress and settings will be saved automatically so you can pick up where you left off.
If you'd like to save your current progress and start over, you can choose Save Session from the File menu at the top o the screen. You will be prompted for a location in which to save the current session information.
Then choose New Session to clear all your answers and start over, or Open Session to choose a saved session from your disk and restore it.
Please help us make ReadMyQuips™ better by reporting any software errors you encounter, and telling us what you think. Just choose Send Feedback from the Help menu.
You can also automatically download and install the latest updates to ReadMyQuips™ by choosing Update from the Help menu*.
Note that you must have an active internet connection. Also, if you use a firewall, you may have to configure it to allow ReadMyQuips™ to access the internet.
* Under OSX, if you receive a “Permission Denied” error when trying to update, please see the Installation Instructions for a solution.
Many aspects of ReadMyQuips™ are configurable. To view these options, select Advanced Configuration from the Settings menu. You will then have access to an advanced dialog. You can restore all settings to their initial values at any time by selecting Restore Defaults from the Settings menu.
Note: You should use caution when modifying most of the settings below. If you have doubts or questions, leave the settings at their default values.
The advanced configuration dialog is organized into four tabs.
The font size (in points) is for all puzzles. Take care when setting this that it does not make any puzzles larger than the screen. Using a 1024×768 monitor, a font size of 11 seems to work well for all the puzzles provided with the software.
An adjustment to the font size which can be used to make the font larger in the player window (the window in which video is displayed and you can guess answers). The font in this window will be equal to “Font Size+Player Font Adjustment”
Check this box to display a picture as background for each puzzle. The program will look in the media folder for each puzzle for a file called [puzzlename].JPG. If such a file does not exist, then the default wallpaper file specified below will be displayed.
The default wallpaper file to be displayed when wallpaper is enabled and no wallpaper file specific to the current puzzle can be found.
The color of the puzzle background when no wallpaper is selected.
Check this box to enable the video when solving clues. If unchecked, only the audio will be played.
The default extension (eg MP4 or AVI) of the video files used by ReadMyQuips™. Note that most puzzles (including those provided with ReadMyQuips™) specify their own video file type automatically, so this parameter is generally ignored.
Check this box to override any video file type information included with puzzles. ReadMyQuips™ will only look for video files with the “Default Video File Extension” specified above.
Check this box to cause ReadMyQuips™ to play a lower resolution version of the video. Useful if you are experiencing intermittent problems with video playback.
Use this box to control the way in which ReadMyQuips™ renders video. Can be useful if you are having video rendering issues.
Levels
Check this box to enable the use of background noise.
Selects the “way” file containing the background noise. The software ships with this file set to “NST NOISE.WAV” which contains actual cafeteria noise.
Selects the file containing the speech to be played when testing levels. Can be a MPG video file or a WAV file, but only the audio will be played. Initially set to “TEST SPEECH.WAV”
Sets the calibrated signal level relative to system maximum. This determines the minimum signal-to-noise level. The raw signal level (i.e the system gain in dB relative to system maximum) is equal to the calibrated signal level minus the calibration offset for that file, plus the reference offset (see below).
Sets the current signal-to-noise ratio. If adaptation is enabled, this will be modified as you works through the puzzles, according to the adaptive rules defined below.
Click this button to play the specified noise file and test speech file at the specified levels.
Check this box to keep the signal level constant. When unchecked, the signal level set above will be maintained unless it is necessary to reduce it in order to achieve a target SIN ratio.
Check this box to use audio level calibration information in setting levels. If this box in unchecked, then actual signal level and SIN may vary depending on the power of the various audio files.
Specify the audio level calibration information file to use. By default, the software uses “CALIBRATION.TXT” which contains pre-generated calibration information for all media in the ReadMyQuips™ video pack, as well as any calibration files packaged with additional content.
A constant used to ensure that a maximum calibrated signal level of 0dB will not exceed the system maximum gain. See above for more information.
Check this box to allow you to manually change the S/N ratio when solving an item.
Check this box to enable adaptive modification of the S/N level.
The amount by which the speech-to-noise ratio will be increased when you don't guess all the words of an item correctly, or decreased when you get them all on the first try.
By default, the noise level will be decreased every time you fail to identify all the words in a given item. This parameter lets you specify a minimum percent correct above which the S/N will remain unchanged. For example, if you set this to 50%, and you identify 6 out of 10 words, the S/N will remain unchanged on the following attempt.
Determines the number of times you may play the video before the first attempt on an item is considered complete (ie before the noise level is changed according to the adaptive rules). Applies only to the first attempt after an item is opened for solving.
Determines the number of times you may play the video before an attempt on an item is considered complete (i.e before the noise level is changed according to the adaptive rules). Applies to all attempts after the first.
S/N Offset for New Item
When you open an item for the first time (an item never before attempted), the S/N is computed based on the S/N for all other open items in the puzzle. Use this offset to raise or lower the computed starting S/N by a fixed amount.
When disabled, the S/N for a new attempt is determined by the S/N at which you completed the last item. When enabled, the starting S/N is computed by a more complex algorithm which should more accurately track your level of proficiency.
When using smart averaging, only the most recently solved items are counted toward the average. This parameter specifies how many attempts to use.
When enabled, takes the amount of context available (ie the proportion of visible words to hidden words) into account, by adjusting the effective S/N of a presentation according to the context level.
The context level which is considered normal, expressed as a proportion of visible words to hidden words.
The difference in context level which will establish a 1 dB change in the effective S/N of a presentation. For example, if the context baseline is 50% and there are 6 out of 10 words visible, and the context delta is 10, then the presentation will have an effective S/N that is 1 dB higher than the actual S/N—that is, it will be treated as though it had been presented at a higher S/N because it is relatively easier than the norm.
When you install ReadMyQuips™ you have the option of installing the video to your hard drive, or leaving it on the DVD. By default, ReadMyQuips™ searches all mounted DVD drives for video, in alphabetical order. If your video is located elsewhere (on a network drive, for example, or an external hard drive), you can inform ReadMyQuips of the location of the video using this dialog.
A list of places ReadMyQuips™ should look for video. Locations will be searched in the order in which they appear in the list. Use Move Up and Move Down to change the search order. Use Add to navigate to a folder containing video and add it to the list
When this box is checked (the default), ReadMyQuips™ will search automatically for video on all mounted DVD drives and in all installed video folders. If you uncheck this box, only the folders you explicitly specify in the list will be searched.
If you experience problems with video playback (for example, skipping or freezing of the picture, sound and picture out of sync, etc.), one of the following solutions may remedy the problem.
Install Video to your Hard Drive
If you have not already done so, try installing the video to your hard drive rather than trying to play it from DVD. To do so, follow the installation instructions at the beginning of this manual.
Adjust Power Settings on your Computer
Many computers, especially laptops, automatically limit the performance of their processors to conserve power when running on batteries. This may cause problems with playback of the video included with ReadMyQuips™. You should check to make sure your computer's power management software is set to maximize performance. For Macintosh computers, this option is available in the Energy Saver section of System Preferences. For most windows computers, a similar option is usually available on the control panel, although some computers may have their own, proprietary power management software. For more information, consult the documentation that came with your computer
ReadMyQuips™ includes video encoded in both high and low resolution formats. The low resolution format may work better on some older computers. To use the lowers format, select Advanced Configuration from the Settings menu, switch to the Display tab, and click on the checkbox labeled Use Low-Resolution Video If Available.
You can also try to use an additional format which is slightly better quality than the included low-resolution video. On the Display tab of the Advanced Configuration dialog, verify that the Video File Extension is set to “mq800.mpg”, and then click on the checkbox labeled Always use this format to select that format for all puzzles.
If you experience problems running ReadMyQuips™ on a Macintosh computer, please try some of the following fixes.
ReadMyQuips™ requires the Java Virtual Machine 5.0. This is included with OS/X 10.5 (Leopard). To verify that it is present on your system, open Java Preferences (from the Finder choose Go and then Utilities, and then double click on Java Preferences), and make sure J2SE 5.0 32-bit is listed in the Java Applications section of the General tab. If you are running Leopard and this version of Java is not present, you can download and install the latest Java Updates directly from Apple at http://support.apple.com/downloads/Java for Mac OS X 10 5 Update 4.
More recent versions of OS/X 10.5 will attempt to run applications in 64-bit mode on compatible computers. However, ReadMyQuips™ will only function when run in 32-bit mode. To verify that this is configured properly, right-click (control-click) on the ReadMyQuips™ application in the Finder, and select Get Info (or highlight the ReadMyQuips™ application and choose Get Info from the File menu). Make sure the checkbox labeled “Open in 32-bit Mode” is selected.
Mac OS/X 10.6 (Snow Leopard) does not include the correct version of Java to run ReadMyQuips™. However, it is possible to download the correct version and install it manually. Good, step-by-step instructions for this process are available at http://chxor.chxo.com/post/183013153/installing-lava-1-5-on-snow-leopard.
If you experience problems with ReadMyQuips™ that are not resolved by any of the suggestions above, you can request assistance directly from within the program by, selecting Send Feedback from the Help menu. Describe the problem you are having in the box provided, and be sure to provide your e-mail address. Someone from technical support will contact you as soon as possible.
This United States Utility Application claims priority to U.S. Provisional Patent Application Serial No. 61/310,145 filed on Mar. 3, 2010, which is commonly-owned and incorporated herein in its entirety by reference.
Number | Date | Country | |
---|---|---|---|
61310145 | Mar 2010 | US |