The present invention relates to apparatus, methods and computer readable media for providing an environment with secure speech capabilities. More particularly, the invention relates to providing such an environment by using noise canceling and noise covering techniques. The present invention can be deployed on various types of portable devices including, but not limited to phones, PDAs, pagers, and portable computers.
Most business travelers bring work with them on trips to maximize the useful time away from the office, and to give them something to do on an airplane or train. They usually bring along a portable computer of some type so that they can edit or create documents and presentations, or write software. Most of these travelers also carry a cellular phone, and some carry a dictating device such as a tape recorder or digital voice recorder. The recording is used to make notations or to dictate documents for a secretary to transcribe when the traveler returns to the office.
Dictating information in an airplane can be distracting to the other passengers and can also be uncomfortable for the speaker as some of the dictated information may be sensitive. No one wants to sit next to someone that is continually dictating, especially on a late “red-eye” flight. It would be desirable for there to be no noise or distraction while speaking or dictating.
There is a considerable amount of prior art in the fields of noise cancellation and echo cancellation. Some of these methods employ algorithms and technologies to remove background noise, echo effects, and other undesirable content from the desired audio. Although these systems have been developed to remove the noise component from speech, almost none of them have been used to cancel the entire speech audio content.
One approach that does attempt to cancel the speech content is set forth in U.S. Pat. No. 5,526,421 to Berger et al. In Berger et al., a voice transmission system, such as a telephone, includes a microphone in combination with active sound cancellation means comprising a speaker and a signal processor for generating a mirror-image wave form with respect to the signal generated when speaking into the microphone. The mirror-image wave form is used to activate the speaker, thereby canceling the user's voice.
Unfortunately this approach does not necessarily provide the user with complete privacy, since the cancellation is never perfect and does not prevent the user's voice from being overheard by others. This is because the loudspeaker or other sound source which provides the canceling audio is not identical to the original source, and is at a location which is at least slightly displaced therefrom. While the speaker's voice may be at least partially canceled, it is virtually impossible to match frequency, phase and amplitude so as to have complete cancellation in all regions of the space surrounding the speaker. Under these circumstances, there is a large probability that someone in the vicinity of the speaker will be able to hear and understand what the speaker is saying. The result is a lack of complete privacy, and the very purpose of the system is defeated.
It is therefore an aspect of the present invention to provide a system and a method for canceling a speaker's voice and masking it, so that complete privacy is assured.
It is another aspect of the invention to provide a masking signal, and apparatus for generating it, that is related to the sounds that are to be masked.
In accordance with the invention, sound is converted into an electrical signal by a microphone and the electrical signal is analyzed. The electrical signal also is converted into another signal that is the exact opposite of the incoming electrical signal. This resulting signal is them digitally modulated, amplified, and sent back by a speaker or speakers mounted in the device, canceling out the audible portion of the sound.
The invention may be implemented by a specialized electronic circuit and associated software algorithms embodied by the specialized circuitry and other programs installed on the device. The circuit may employ techniques and methods normally found in the area of noise reduction. In accordance with the invention, as in some noise reduction applications, the noise component of a particular signal is detected, often using a statistical characterization. The noise component is then removed from the signal, either by introducing an identical noise component having a phase opposite to that of the original, or by some other means. The signal components that are 180 degrees out of phase with each other cancel each other out, removing the noise component.
The present invention provides a similar type of method, but instead of applying it to the noise component, the method is applied to the main voice component. In this case, the person speaking into a microphone causes a signal to be produced. This signal is immediately analyzed, then inverted, and retransmitted using speakers built into the device. The speakers are designed to create an acoustic field that is directed in the same way as the person's voice, except that the field is minimized in the direction of the microphone. Thus the microphone receives the person's voice alone; all other listeners receive both the person's voice and the output of the speakers. Since these are equal in amplitude and opposite in phase, they cancel, so that other listeners hear nothing. The user is then free to dictate or speak into a device without others being able to discern the speaker's content.
The invention employs the phenomenon called “acoustic masking” in recognizing that one sound, if of sufficient amplitude, can make another sound difficult or impossible to hear. Using this phenomenon, the device generates a noise component that disguises or hides the speaker's content. That noise is subtracted from the voice component. The frequency of the noise can also be shifted within the audible range to discourage eavesdropping.
Thus, in accordance with the invention, a method for masking speech, a system having components for masking speech, and an article of manufacture having computer readable program code thereon, the code causing the computer to perform, the method are all provided. The method comprises generating an electrical signal representative of the speech, using the electrical signal to provide an audio signal which cancels the speech; and providing a speech masking signal to mask any speech not canceled. The speech masking signal is supplied by a babble generator. The speech masking signal may be produced by rearranging the speech so that it is not intelligible.
The speech masking signal may be provided by pitch inverting the speech. The pitch may be inverted by low pass filtering the electrical signal representative of the speech to provide a low pass filtered signal; mixing the low pass filtered signal with a carrier frequency to produce an output signal including the carrier frequency and two side bands; and low pass filtering the output signal to produce the masking signal.
The speech masking signal may also be produced reading out digital representations of signals disruptive to the understanding of speech from a memory; and converting the digital representations to the speech masking signal. The digital representations may be in the form of one of pulse code modulation and adaptive pulse code modulation, and may be representative of multiple persons speaking simultaneously. The speech masking signal may be produced by sampling portions of the speech; and providing the portions in reverse order. At least one of amplitude and frequency of the masking signal may be changed at intervals. The intervals may be fixed, variable or random.
The speech masking signal may be produced by sampling portions of the speech; and providing the portions with a predetermined time delay. In this case also, the speech masking signal may be produced by sampling portions of the speech; and providing the portions in reverse order and with a predetermined time delay. Again, at least one of amplitude and frequency of the masking signal may be changed at fixed, variable or random intervals.
The speech masking signal may be produced by sampling portions of the speech; and providing the portions in a random manner. The portions may be provided so as to be random relative to amplitude and frequency of the speech.
These and other aspects, features, and advantages of the present invention will become apparent upon further consideration of the following detailed description of the invention when read in conjunction with the drawing figures, in which:
One form of preferred embodiment is a circuit consisting of an amplifier, a analog-to-digital converter, an inverter, a digital-to-analog converter, a power amplifier and two or more speakers. The invention may then be embodied as a computer program (or as computer readable program code stored on a computer readable storage medium) in a computing device that inverts the phase of an incoming signal and uses that inversion to cancel or disguise the sound of the speaker's voice, thus preventing other individuals in the general vicinity of the speaker from hearing clearly what the user is saying. This system can also be implemented using a microphone coupled to an analog-to-digital converter and then to the circuitry described in
It will be understood that the system of
Referring specifically to
The digital signal at Amp/Inverter 105 may be optionally time shifted a small amount to compensate for latencies in components and software, and is then converted to analog at DAC/Delay 112. The signal is then routed to the cancellation speaker 107.
The system in accordance with the present invention also contains a babble generator 110 (
Referring to
Referring to
In
In another form of the present invention, which may utilize a portion of the approach of
In yet another form of the present invention, the phonemes detected by the phoneme recognizer 210 and subsequently generated by the phoneme generator 214 are randomly replaced or modified using a form of a phoneme order generator 216 that selects at random, or in a reverse order, a phoneme or phonemes, respectively, to substitute for existing phonemes, prior to output to the speaker. If operating in a random mode, the order of the generated phonemes is a random variation of the order of the recognized phonemes. If operating in a reverse order mode, then during a given fixed or random, but relatively short time interval, phonemes are delayed and played back in reverse order. The amplitudes of the phonemes may also be varied in a random manner, at fixed or random intervals. These approaches all serve to further assist in rendering the speech unintelligible.
The embodiments of the babble generators 110 and 110A of
Referring to
Variations described for the present invention can be realized in any combination desirable for each particular application. Thus particular limitations, and/or embodiment enhancements described herein, which may have particular advantages to the particular application need not be used for all applications. Also, it should be realized that not all limitations need be implemented in methods, systems and/or apparatus including one or more concepts of the present invention.
The present invention can be realized in hardware, software, or a combination of hardware and software. Any kind of computer system, or other apparatus adapted for carrying out the methods and/or functions described herein, is suitable. A typical combination of hardware and software could be a general purpose computer system with a computer program that, when being loaded and executed, controls the computer system such that it carries out the methods described herein. The present invention can also be embedded in a computer program product, which comprises all the features enabling the implementation of the methods described herein, and which, when loaded in a computer system, is able to carry out these methods.
Computer program means or computer program in the present context include any expression, in any language, code or notation, of a set of instructions intended to cause a system having an information processing capability to perform a particular function either directly or after conversion to another language, code or notation, and/or reproduction in a different material form.
Thus the invention includes an article of manufacture which comprises a computer usable medium having computer readable program code means embodied therein for causing a function described above. The computer readable program code means in the article of manufacture comprises computer readable program code means for causing a computer to effect the steps of a method of this invention. Similarly, the present invention may be implemented as a computer program product comprising a computer usable medium having computer readable program code means embodied therein for causing a function described above. The computer readable program code means in the computer program product comprising computer readable program code means for causing a computer to effect one or more functions of this invention. Furthermore, the present invention may be implemented as a program storage device readable by machine, tangibly embodying a program of instructions executable by the machine to perform method steps for causing one or more functions of this invention.
It is noted that the foregoing has outlined some of the more pertinent objects and embodiments of the present invention. The concepts of this invention may be used for many applications. Thus, although the description is made for particular arrangements and methods, the intent and concept of the invention is suitable and applicable to other arrangements and applications. It will be clear to those skilled in the art that other modifications to the disclosed embodiments can be effected without departing from the spirit and scope of the invention. The described embodiments ought to be construed to be merely illustrative of some of the more prominent features and applications of the invention. Other beneficial results can be realized by applying the disclosed invention in a different manner or modifying the invention in ways known to those familiar with the art. Thus, it should be understood that the embodiments has been provided as an example and not as a limitation. The scope of the invention is defined by the appended claims.