The present invention generally relates to microphone calibration and, more particularly, to a method, apparatus and system for removing ambient noise from microphone signals in an array of microphones.
Noise suppression is often required in many communication systems and content distribution devices to suppress noise to improve communication quality and media comprehension. Noise suppression can be achieved using various techniques, some of which can be classified as single microphone techniques and array microphone techniques.
Array microphone noise reduction techniques use multiple microphones placed at different locations and separated from each other by some minimum distance to form a beam. Conventionally, the beam is used to pick up speech that is then used to reduce the amount of noise picked up outside the beam. Thus, the array microphone techniques can suppress non-stationary noise. Multiple microphones, however, also themselves create more noise. In addition, such techniques do not use configuration parameters of a system and known audio signals to enable noise cancellation as described herein.
Embodiments of the present invention address the deficiencies of the prior art by providing a method, apparatus and system for microphone array calibration.
In an embodiment of the present invention a method for microphone array calibration includes recording audio using at least two microphones, which comprise an array of microphones, using a target microphone of the array of microphones, determining an attenuation factor for audio originating from respective locations of other microphones of the array of microphones, using a target microphone of the array of microphones, determining a delay factor for audio originating from respective locations of other microphones of the array of microphones and implementing the determined attenuation factor and delay factor for removing audio originating from respective locations of the other microphones of the array of microphones from an audio signal captured by the target microphone.
In an alternate embodiment of the present invention, an apparatus for microphone array calibration includes a memory for storing program routines and data and a processor for executing the program routines. In such an embodiment, the apparatus is configured to record audio using at least two microphones, which comprise an array of microphones, use a target microphone of the array of microphones to determine an attenuation factor for audio originating from respective locations of other microphones of the array of microphones, use a target microphone of the array of microphones to determine a delay factor for audio originating from respective locations of other microphones of the array of microphones, and implement the determined attenuation factor and delay factor for removing audio originating from respective locations of the other microphones of the array of microphones from an audio signal captured by the target microphone.
In yet an alternate embodiment of the present invention, an operator can communicate with a display using NFC communication for purposes of, for example, configuring the display or receiving information regarding the status of the display.
In one embodiment of the present invention, a method for interacting with content on a display device using near-field communication (NEC) includes determining an identity of a NFC-enabled mobile communications device in proximity to the display device, the display device including near-field communication capabilities and selecting content to communicate to the display device based on the determined identity of the NEC-enabled mobile communications device.
In an alternate embodiment of the present invention, an apparatus for interacting with content on a display device using near-field communication (NFC) includes a memory for storing program routines and data and a processor for executing the program routines. The processor of the apparatus, when executing the program routines is configured to perform the steps of determining an identity of a NFC-enabled mobile communications device in proximity to the display device, the display device including near-field communication capabilities and selecting content to communicate to the display device based on the determined identity of the NFC-enabled mobile communications device.
In an alternate embodiment of the present invention, a system for microphone array calibration includes at least two microphones comprising an array of microphones, at least one audio source, an apparatus including a memory for storing program routines and data, and a processor for executing the program routines. In such a system, the apparatus is configured to record audio using at least two microphones, which comprise an array of microphones, use a target microphone of the array of microphones to determine an attenuation factor for audio originating from respective locations of other microphones of the array of microphones, use a target microphone of the array of microphones to determine a delay factor for audio originating from respective locations of other microphones of the array of microphones and implement the determined attenuation factor and delay factor for removing audio originating from respective locations of the other microphones of the array of microphones from an audio signal captured by the target microphone.
The teachings of the present invention can be readily understood by considering the following detailed description in conjunction with the accompanying drawings, in which:
It should be understood that the drawings are for purposes of illustrating the concepts of the invention and are not necessarily the only possible configuration for illustrating the invention. To facilitate understanding, identical reference numerals have been used, where possible, to designate identical elements that are common to the figures.
The present invention advantageously provides a method, apparatus and system for microphone array calibration. Although the present invention will be described primarily within the context of an in-store retail advertising network environment and advertising content distribution and specifically a check-out application, the specific embodiments of the present invention should not be treated as limiting the scope of the invention. It will be appreciated by those skilled in the art and informed by the teachings of the present invention that the concepts of the present invention can be advantageously applied to any content distribution or communications network to calibrate an array of microphones as described herein.
The functions of the various elements shown in the figures can be provided through the use of dedicated hardware as well as hardware capable of executing software in association with appropriate software. When provided by a processor, the functions can be provided by a single dedicated processor, by a single shared processor, or by a plurality of individual processors, some of which can be shared. Moreover, explicit use of the term “processor” or “controller” should not be construed to refer exclusively to hardware capable of executing software, and can implicitly include, without limitation, digital signal processor (“DSP”) hardware, read-only memory (“ROM”) for storing software, random access memory (“RAM”), and non-volatile storage. Moreover, all statements herein reciting principles, aspects, and embodiments of the invention, as well as specific examples thereof, are intended to encompass both structural and functional equivalents thereof. Additionally, it is intended that such equivalents include both currently known equivalents as well as equivalents developed in the future (i.e., any elements developed that perform the same function, regardless of structure).
Thus, for example, it will be appreciated by those skilled in the art that the block diagrams presented herein represent conceptual views of illustrative system components and/or circuitry embodying the principles of the invention. Similarly, it will be appreciated that any flow charts, flow diagrams, state transition diagrams, pseudocode, and the like represent various processes which may be substantially represented in computer readable media and so executed by a computer or processor, whether or not such computer or processor is explicitly shown.
Although in the system 100 of
In one embodiment of the present invention, the content distribution system 100 of
In one embodiment of the present invention, software for controlling the various elements of the in-store advertising network 200 and the content distribution system 100 can include a 32-bit operating system using a windowing environment (e.g., MSWindows™ or X-Windows operating system) and high-performance computing hardware. The advertising network 200 can utilize a distributed architecture and provides centralized content management and distribution control via, in one embodiment, satellite (or other method, e.g., a wide-area network (WAN), the Internet, a series of microwave links, or a similar mechanism) and in-store modules.
As depicted in
A recording company 204 can be a record label, music publisher, licensing/publishing entity (e.g., BMI or ASCAP), individual artist, or other such source of music-related content. The recording company 204 provides audiovisual content such as music clips (short segments of recorded music), music video clips, and the like. The movie studio 206 can be a movie studio, a film production company, a publicist, or other source related to the film industry. The movie studio 106 can provide movie clips, pre-recorded interviews with actors and actresses, movie reviews, “behind-the-scenes” presentations, and similar content.
The other content provider 208 can be any other provider of video, audio or audiovisual content that can be distributed and displayed via, for example, the content distribution system 100 of
In one embodiment of the present invention, content is procured via the network management center 210 (NMC) using, for example, traditional recorded media (tapes, CD's, videos, and the like). Content provided to the NMC 210 is compiled into a form suitable for distribution to, for example, the local distribution system 100, which distributes and displays the content at a local site.
The NMC 210 can digitize the received content and provide it to a Network Operations Center (NOC) 220 in the form of digitized data files 222. It will be noted that data files 222, although referred to in terms of digitized content, can also be streaming audio, streaming video, or other such information. The content compiled and received by the NMC 210 can include commercials, bumpers, graphics, audio and the like. All files are preferably named so that they are uniquely identifiable. More specifically, the NMC 210 creates distribution packs that are targeted to specific sites, such as store locations, and delivered to one or more stores on a scheduled or on-demand basis. The distribution packs, if used, contain content that is intended to either replace or enhance existing content already present on-site (unless the site's system is being initialized for the first time, in which case the packages delivered will form the basis of the site's initial content). Alternatively, the files may be compressed and transferred separately, or a streaming compression program of some type employed.
The NOC 220 communicates digitized data files 222 to, in this example, the content distribution system 100 at a commercial sales outlet 230 via a communications network 225. The communications network 225 can be implemented in any one of several technologies. For example, in one embodiment of the present invention, a satellite link can be used to distribute digitized data files 222 to the content distribution system 100 of the commercial sales outlet 230. This enables content to easily be distributed by broadcasting (or multicasting) the content to various locations. Alternatively, the Internet can be used to both distribute audiovisual content to and allow feedback from commercial sales outlet 230. Other ways of implementing communications network 225, such as using leased lines, a microwave network, or other such mechanisms can also be used in accordance with alternate embodiments of the present invention.
The server 110 of the content distribution system 100 is capable of receiving content (e.g., distribution packs) and, accordingly, distribute them in-store to the various receivers such as the set-top boxes 120 and displays 130. That is, at the content distribution system 100, content is received and configured for streaming. The streaming can be performed by one or more servers configured to act together or in concert. The streaming content can include content configured for various different locations or products throughout the sales outlet 230 (e.g., store). For example, respective set-top boxes 120 and displays 130 can be located at specific locations throughout the sales outlet 230 and respectively configured to display content and broadcast audio pertaining to products located within a predetermined distance from the location of each respective set-top box and display.
The various embodiments of the present invention provide a method, apparatus and system for microphone array calibration. That is various embodiments of the present invention described herein are directed towards removing ambient noise from the signal of a microphone existing in a commercial checkout environment such that a audio or sounds originating at a respective checkout counter can be isolated. More specifically, various embodiments of the present invention described herein are directed towards calibrating microphones contained in an array, for example in a plurality of display screens as depicted in
In one embodiment of the present invention, a process for determining noise, such as sounds and other audio signals generated in adjacent checkout lanes of the content distribution system of
Again, in equation (1) wji is the attenuation factor from counter j to counter i and Tij is the delay parameter from counter j to counter i. As a result, to isolate the sound coming from counter i, the following processing takes place. Each display broadcasts the recorded signals yi(t) to, for example, a processing device which, in various embodiments of the present invention, can reside at the set-top box 120, or a local or remote server such as the server 110 of the content distribution system 100 of
Although the processing apparatus of
Returning to equation one (1) above, in one embodiment of the present invention, to determine the attenuation factor wij and the delay factor Tij, known checkout sounds or tones generated by, for example, the scanners at the checkout counters are used. That is, in such an embodiment, a checkout scanner tone is a known sound and comprises a predetermined volume. If each scanner generates a checkout tone at a known time (t1), the microphone of a target display can detect the tones and communicate such information to, in one embodiment, an audio circuit in, for example a processing device or server of the present invention as described above.
In an alternate embodiment of the present invention in which local sounds are not known (i.e., the type and volume of audio generated locally is not known), a local microphone such as the microphone 1321 of a respective checkout lane 1341 can be used to record audio signals in its vicinity and using known techniques, such as beam-forming techniques and other audio signal processing techniques, can determine which audio signals are generated local to its vicinity and can also determine the volume and other physical properties of such locally generated audio signals. These determined parameters of the locally generated audio signals can then be used by a target microphone to determine attenuation and delay factors of such signals as described above. That is, in such embodiments, the locally generated audio signals as determined by respective microphones of an array can be used by a target microphone as known signals as described above to determine attenuation and delay factors of such signals as described above.
In one embodiment of the present invention, the audio circuit may comprise a discrete circuit card in, for example a display or a server of the present invention or may comprise a dedicated device such as the Network Audio Processor as described in co-pending U.S. patent application Ser. No. 12/733,214. The audio circuit of the present invention can compute the attenuation factor wij and the delay factor Tij for each scanner at each checkout counter having the information regarding known sounds generated at the checkouts.
More specifically, in one embodiment of the present invention, given that the scanning signal at position i is generated at time t1, Tij can be computed as the number of timeslots between t1 and the timeslot at which the scanning signal is first recorded at the microphone j. Alternatively, in an alternate embodiment of the present invention, the difference in timeslots between the first/highest peak across different recorded signals, rather than the beginning of the signal, can be used.
In one embodiment of the present invention, the attenuation factor wij is computed similarly. In particular, wij can be taken to be equal to one for all i. The factor wij is computed as the ratio of the signal at microphone j, at time t1+Tij over the signal at microphone i at time t1+Tii. In alternative embodiment of the present invention, the ratio of peaks or other positions in the waveform of the scanning sound can be used.
Once the attenuation factor wij and the delay factor Tij are computed, beam forming technique can be used so that sounds from the other checkout counters are removed from the audio signal received by the target microphone at, for example, the target display 100.
At step 404, an attenuation factor for sounds from all other microphones of the array other than a microphone being calibrated (i.e., a target microphone) is determined using, for example, known sounds from locations of the other microphones of the array. The method 400 proceeds to step 406.
At step 406, a delay factor for sounds from all other microphones of the array other than a microphone being calibrated (i.e., a target microphone) is determined using, for example, known sounds from locations of the other microphones of the array. The method 400 proceeds to step 408.
At step 408, the determined attenuation factor and delay factor are implemented for removing audio from an audio signal captured by the target microphone originating from respective locations of the other microphones of the array of microphones for example by, in one embodiment of the present invention, using beam forming processes/techniques. The method 400 can then be exited.
It should also be noted that, in accordance with alternate embodiments of the present invention, beam-forming processes/techniques can also be used when there is a wall of screens (such as a wall of screens in the TV section of a store). In this environment, embodiments of the present invention can be used to detect in front of which TV is audio/sound originating to determine, for example, in front of which screen is a viewer/customer standing. In addition, data could be collected concerning which screens are most viewed (e.g., stood in front of) to determine which displayed advertisements are most popular.
Having described various embodiments of a method, apparatus and system for microphone array calibration (which are intended to be illustrative and not limiting), it is noted that modifications and variations can be made by persons skilled in the art in light of the above teachings. It is therefore to be understood that changes may be made in the particular embodiments of the invention disclosed which are within the scope and spirit of the invention. While the forgoing is directed to various embodiments of the present invention, other and further embodiments of the invention may be devised without departing from the basic scope thereof.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/US12/72083 | 12/28/2012 | WO | 00 |