METHOD, APPARATUS AND SYSTEM FOR MICROPHONE ARRAY CALIBRATION

Description

FIELD OF THE INVENTION

The present invention generally relates to microphone calibration and, more particularly, to a method, apparatus and system for removing ambient noise from microphone signals in an array of microphones.

BACKGROUND OF THE INVENTION

Noise suppression is often required in many communication systems and content distribution devices to suppress noise to improve communication quality and media comprehension. Noise suppression can be achieved using various techniques, some of which can be classified as single microphone techniques and array microphone techniques.

Array microphone noise reduction techniques use multiple microphones placed at different locations and separated from each other by some minimum distance to form a beam. Conventionally, the beam is used to pick up speech that is then used to reduce the amount of noise picked up outside the beam. Thus, the array microphone techniques can suppress non-stationary noise. Multiple microphones, however, also themselves create more noise. In addition, such techniques do not use configuration parameters of a system and known audio signals to enable noise cancellation as described herein.

SUMMARY OF THE INVENTION

Embodiments of the present invention address the deficiencies of the prior art by providing a method, apparatus and system for microphone array calibration.

In an embodiment of the present invention a method for microphone array calibration includes recording audio using at least two microphones, which comprise an array of microphones, using a target microphone of the array of microphones, determining an attenuation factor for audio originating from respective locations of other microphones of the array of microphones, using a target microphone of the array of microphones, determining a delay factor for audio originating from respective locations of other microphones of the array of microphones and implementing the determined attenuation factor and delay factor for removing audio originating from respective locations of the other microphones of the array of microphones from an audio signal captured by the target microphone.

In an alternate embodiment of the present invention, an apparatus for microphone array calibration includes a memory for storing program routines and data and a processor for executing the program routines. In such an embodiment, the apparatus is configured to record audio using at least two microphones, which comprise an array of microphones, use a target microphone of the array of microphones to determine an attenuation factor for audio originating from respective locations of other microphones of the array of microphones, use a target microphone of the array of microphones to determine a delay factor for audio originating from respective locations of other microphones of the array of microphones, and implement the determined attenuation factor and delay factor for removing audio originating from respective locations of the other microphones of the array of microphones from an audio signal captured by the target microphone.

In yet an alternate embodiment of the present invention, an operator can communicate with a display using NFC communication for purposes of, for example, configuring the display or receiving information regarding the status of the display.

In one embodiment of the present invention, a method for interacting with content on a display device using near-field communication (NEC) includes determining an identity of a NFC-enabled mobile communications device in proximity to the display device, the display device including near-field communication capabilities and selecting content to communicate to the display device based on the determined identity of the NEC-enabled mobile communications device.

In an alternate embodiment of the present invention, an apparatus for interacting with content on a display device using near-field communication (NFC) includes a memory for storing program routines and data and a processor for executing the program routines. The processor of the apparatus, when executing the program routines is configured to perform the steps of determining an identity of a NFC-enabled mobile communications device in proximity to the display device, the display device including near-field communication capabilities and selecting content to communicate to the display device based on the determined identity of the NFC-enabled mobile communications device.

In an alternate embodiment of the present invention, a system for microphone array calibration includes at least two microphones comprising an array of microphones, at least one audio source, an apparatus including a memory for storing program routines and data, and a processor for executing the program routines. In such a system, the apparatus is configured to record audio using at least two microphones, which comprise an array of microphones, use a target microphone of the array of microphones to determine an attenuation factor for audio originating from respective locations of other microphones of the array of microphones, use a target microphone of the array of microphones to determine a delay factor for audio originating from respective locations of other microphones of the array of microphones and implement the determined attenuation factor and delay factor for removing audio originating from respective locations of the other microphones of the array of microphones from an audio signal captured by the target microphone.

BRIEF DESCRIPTION OF THE DRAWINGS

The teachings of the present invention can be readily understood by considering the following detailed description in conjunction with the accompanying drawings, in which:

FIG. 1 depicts a high level block diagram of a content distribution system in which an embodiment of the present invention can be applied;

FIG. 2 depicts a high level block diagram of an in-store advertising network for providing in-store advertising in which an embodiment of the present invention can be applied;

FIG. 3 depicts a high level block diagram of an apparatus for microphone array calibration in accordance with an embodiment of the present invention; and

FIG. 4 depicts a flow diagram of a method for microphone array calibration in accordance with an embodiment of the present invention.

It should be understood that the drawings are for purposes of illustrating the concepts of the invention and are not necessarily the only possible configuration for illustrating the invention. To facilitate understanding, identical reference numerals have been used, where possible, to designate identical elements that are common to the figures.

DETAILED DESCRIPTION OF THE INVENTION

The present invention advantageously provides a method, apparatus and system for microphone array calibration. Although the present invention will be described primarily within the context of an in-store retail advertising network environment and advertising content distribution and specifically a check-out application, the specific embodiments of the present invention should not be treated as limiting the scope of the invention. It will be appreciated by those skilled in the art and informed by the teachings of the present invention that the concepts of the present invention can be advantageously applied to any content distribution or communications network to calibrate an array of microphones as described herein.

The functions of the various elements shown in the figures can be provided through the use of dedicated hardware as well as hardware capable of executing software in association with appropriate software. When provided by a processor, the functions can be provided by a single dedicated processor, by a single shared processor, or by a plurality of individual processors, some of which can be shared. Moreover, explicit use of the term “processor” or “controller” should not be construed to refer exclusively to hardware capable of executing software, and can implicitly include, without limitation, digital signal processor (“DSP”) hardware, read-only memory (“ROM”) for storing software, random access memory (“RAM”), and non-volatile storage. Moreover, all statements herein reciting principles, aspects, and embodiments of the invention, as well as specific examples thereof, are intended to encompass both structural and functional equivalents thereof. Additionally, it is intended that such equivalents include both currently known equivalents as well as equivalents developed in the future (i.e., any elements developed that perform the same function, regardless of structure).

Thus, for example, it will be appreciated by those skilled in the art that the block diagrams presented herein represent conceptual views of illustrative system components and/or circuitry embodying the principles of the invention. Similarly, it will be appreciated that any flow charts, flow diagrams, state transition diagrams, pseudocode, and the like represent various processes which may be substantially represented in computer readable media and so executed by a computer or processor, whether or not such computer or processor is explicitly shown.

FIG. 1 depicts a high level block diagram of a content distribution system in which an embodiment of the present invention can be applied. The content distribution system 100 of FIG. 1 illustratively comprises a checkout advertising distribution system including, illustratively one server 110, a plurality of receiving devices such as tuning/decoding means (illustratively set-top boxes (STBs)) 120₁-120_n, and a respective display 130₁-130_nfor each of the set-top boxes 120₁-120_n. As depicted in FIG. 1, the displays 130 each include a respective microphone 132₁-132_nand at least one speaker 133₁-133_nand are located in the proximity of a respective checkout lane 134₁-134_n. In the content distribution system 100 of FIG. 1, the microphones 132 of the displays 130 comprise an array of microphones. In such a system as the system 100 of FIG. 1, the microphones 132 are typically used to verify playout of the content on the displays 130 and can further be used for noise cancellation purposes.

Although in the system 100 of FIG. 1, each of the plurality of set-top boxes 120₁-120_n, is illustratively connected to a single, respective display, in alternate embodiments of the present invention, each of the plurality of set-top boxes 120₁-120_n, can be connected to more than a single display. That is, in alternate embodiments of the invention, displays of a plurality of checkout lanes can be controlled and in communication with a single set-top box. In addition, although in the content distribution system 100 of FIG. 1 the tuning/decoding means are illustratively depicted as set-top boxes 120, in alternate embodiments of the present invention, the tuning/decoding means of the present invention can comprise alternate tuning/decoding means such as a tuning/decoding circuit integrated into the displays 130 or other stand alone tuning/decoding devices and the like. Even further, receiving devices of the present invention can include any devices capable of receiving content such as audio, video and/or audio/video content.

In one embodiment of the present invention, the content distribution system 100 of FIG. 1 can be a part of an in-store advertising network. For example, FIG. 2 depicts a high level block diagram of an in-store advertising network 200 for providing in-store advertising. In the advertising network 200 of FIG. 2, the advertising network 200 and distribution system 100 employ a combination of software and hardware that provides cataloging, distribution, presentation, and usage tracking of music recordings, home video, product demonstrations, advertising content, and other such content, along with entertainment content, news, and similar consumer informational content in an in-store setting. The content can include content presented in compressed or uncompressed video and audio stream format (e.g., MPEG4/MPEG4 Part 10/AVC-H.264, VC-1, Windows Media, etc.), although the present system should not be limited to using only those formats.

In one embodiment of the present invention, software for controlling the various elements of the in-store advertising network 200 and the content distribution system 100 can include a 32-bit operating system using a windowing environment (e.g., MSWindows™ or X-Windows operating system) and high-performance computing hardware. The advertising network 200 can utilize a distributed architecture and provides centralized content management and distribution control via, in one embodiment, satellite (or other method, e.g., a wide-area network (WAN), the Internet, a series of microwave links, or a similar mechanism) and in-store modules.

As depicted in FIG. 2, the content for the in-store advertising network 200 and the content distribution system 100 can be provided from an advertiser 202, a recording company 204, a movie studio 206 or other content providers 208. An advertiser 202 can be a product manufacturer, a service provider, an advertising company representing a manufacturer or service provider, or other entity. Advertising content from the advertiser 202 can consist of audiovisual content including commercials, “info-mercials”, product information and product demonstrations, and the like.

A recording company 204 can be a record label, music publisher, licensing/publishing entity (e.g., BMI or ASCAP), individual artist, or other such source of music-related content. The recording company 204 provides audiovisual content such as music clips (short segments of recorded music), music video clips, and the like. The movie studio 206 can be a movie studio, a film production company, a publicist, or other source related to the film industry. The movie studio 106 can provide movie clips, pre-recorded interviews with actors and actresses, movie reviews, “behind-the-scenes” presentations, and similar content.

The other content provider 208 can be any other provider of video, audio or audiovisual content that can be distributed and displayed via, for example, the content distribution system 100 of FIG. 1.

In one embodiment of the present invention, content is procured via the network management center 210 (NMC) using, for example, traditional recorded media (tapes, CD's, videos, and the like). Content provided to the NMC 210 is compiled into a form suitable for distribution to, for example, the local distribution system 100, which distributes and displays the content at a local site.

The NMC 210 can digitize the received content and provide it to a Network Operations Center (NOC) 220 in the form of digitized data files 222. It will be noted that data files 222, although referred to in terms of digitized content, can also be streaming audio, streaming video, or other such information. The content compiled and received by the NMC 210 can include commercials, bumpers, graphics, audio and the like. All files are preferably named so that they are uniquely identifiable. More specifically, the NMC 210 creates distribution packs that are targeted to specific sites, such as store locations, and delivered to one or more stores on a scheduled or on-demand basis. The distribution packs, if used, contain content that is intended to either replace or enhance existing content already present on-site (unless the site's system is being initialized for the first time, in which case the packages delivered will form the basis of the site's initial content). Alternatively, the files may be compressed and transferred separately, or a streaming compression program of some type employed.

The NOC 220 communicates digitized data files 222 to, in this example, the content distribution system 100 at a commercial sales outlet 230 via a communications network 225. The communications network 225 can be implemented in any one of several technologies. For example, in one embodiment of the present invention, a satellite link can be used to distribute digitized data files 222 to the content distribution system 100 of the commercial sales outlet 230. This enables content to easily be distributed by broadcasting (or multicasting) the content to various locations. Alternatively, the Internet can be used to both distribute audiovisual content to and allow feedback from commercial sales outlet 230. Other ways of implementing communications network 225, such as using leased lines, a microwave network, or other such mechanisms can also be used in accordance with alternate embodiments of the present invention.

The server 110 of the content distribution system 100 is capable of receiving content (e.g., distribution packs) and, accordingly, distribute them in-store to the various receivers such as the set-top boxes 120 and displays 130. That is, at the content distribution system 100, content is received and configured for streaming. The streaming can be performed by one or more servers configured to act together or in concert. The streaming content can include content configured for various different locations or products throughout the sales outlet 230 (e.g., store). For example, respective set-top boxes 120 and displays 130 can be located at specific locations throughout the sales outlet 230 and respectively configured to display content and broadcast audio pertaining to products located within a predetermined distance from the location of each respective set-top box and display.

The various embodiments of the present invention provide a method, apparatus and system for microphone array calibration. That is various embodiments of the present invention described herein are directed towards removing ambient noise from the signal of a microphone existing in a commercial checkout environment such that a audio or sounds originating at a respective checkout counter can be isolated. More specifically, various embodiments of the present invention described herein are directed towards calibrating microphones contained in an array, for example in a plurality of display screens as depicted in FIG. 1, such that noise received or detected by the microphones in the array of display screens can be removed from the audio signal detected at a target display screen. Again, although various embodiments of the present invention will be described primarily within the context of a commercial advertising network environment and advertising content distribution, the specific embodiments of the present invention should not be treated as limiting the scope of the invention. It will be appreciated by those skilled in the art and informed by the teachings of the present invention that the concepts of the present invention can be advantageously applied in any content distribution or communications environment to calibrate an array of microphones as described herein.

In one embodiment of the present invention, a process for determining noise, such as sounds and other audio signals generated in adjacent checkout lanes of the content distribution system of FIG. 1, to be removed from at least one microphone in an array of microphones can be accomplished through, in one embodiment of the present invention, a beam-forming process/technique. For describing an embodiment of the present invention, let t be a timeslot at which microphones record a sound (e.g., every msec), y_i(t) be the signal received or detected by microphone at screen i at timeslot t, x_i(t) be the sound signal generated at counter i at timeslot t (including, for example, the conversation between the cashier and the customer at counter i, the scanning sounds made by the checkout machine, etc.), T_ijbe a weight value (delay parameter) based on time delay from counter i to counter j, and w_ijbe a weight value (attenuation factor) based on the distance between counter i to counter j. As such, a microphone at position i receives a signal y_ithat includes sounds from all counters which can be determined according to equation one (1), which follows:

$\begin{matrix} y_{i} (t) = \sum_{j = 1}^{n} w \times (t - T) . & (1) \end{matrix}$

Again, in equation (1) w_jiis the attenuation factor from counter j to counter i and T_ijis the delay parameter from counter j to counter i. As a result, to isolate the sound coming from counter i, the following processing takes place. Each display broadcasts the recorded signals y_i(t) to, for example, a processing device which, in various embodiments of the present invention, can reside at the set-top box 120, or a local or remote server such as the server 110 of the content distribution system 100 of FIG. 1 or the NMC 210 or NOC 220 of the in-store advertising network 200 of FIG. 2. Having these signals, to isolate the sound at counter i at time t (i.e. x_i(t)), the processing device solves the linear system of equation one (1). The unknowns in this system are the signals x_iat different timeslots t.

FIG. 3 depicts a high-level block diagram of a processing apparatus, which in various embodiments of the present invention can be a set-top box 120, or a local or remote server such as the server 110 of the content distribution system 100 of FIG. 1 or the NMC 210 or NOC 220 of the in-store advertising network 200 of FIG. 2. More specifically, the processing device of FIG. 3 illustratively comprises a processor 310 as well as a memory 320 for storing control programs, file information, stored signals and the like. The processor 310 cooperates with conventional support circuitry 330 such as power supplies, clock circuits, cache memory and the like as well as circuits that assist in executing the software routines stored in the memory 320. As such, it is contemplated that some of the process steps discussed herein as software processes may be implemented within hardware, for example, as circuitry that cooperates with the processor 310 to perform various steps. The processing apparatus also contains input-output circuitry 340 that forms an interface between various functional elements communicating with the processing apparatus.

Although the processing apparatus of FIG. 3 is depicted as a general purpose computer that is programmed to perform various control functions in accordance with the present invention, the invention can be implemented in hardware, for example, as an application specified integrated circuit (ASIC). As such, the process steps described herein are intended to be broadly interpreted as being equivalently performed by software executed by a processor, hardware, or a combination thereof. In addition, although the processing apparatus of FIG. 3 is depicted as a separate component, the functionalities of the processing device in accordance with the concepts and embodiments of the present invention described herein can be incorporated into an existing system component such as a set-top box, server and the like.

Returning to equation one (1) above, in one embodiment of the present invention, to determine the attenuation factor w_ijand the delay factor T_ij, known checkout sounds or tones generated by, for example, the scanners at the checkout counters are used. That is, in such an embodiment, a checkout scanner tone is a known sound and comprises a predetermined volume. If each scanner generates a checkout tone at a known time (t₁), the microphone of a target display can detect the tones and communicate such information to, in one embodiment, an audio circuit in, for example a processing device or server of the present invention as described above.

In an alternate embodiment of the present invention in which local sounds are not known (i.e., the type and volume of audio generated locally is not known), a local microphone such as the microphone 132₁of a respective checkout lane 134₁can be used to record audio signals in its vicinity and using known techniques, such as beam-forming techniques and other audio signal processing techniques, can determine which audio signals are generated local to its vicinity and can also determine the volume and other physical properties of such locally generated audio signals. These determined parameters of the locally generated audio signals can then be used by a target microphone to determine attenuation and delay factors of such signals as described above. That is, in such embodiments, the locally generated audio signals as determined by respective microphones of an array can be used by a target microphone as known signals as described above to determine attenuation and delay factors of such signals as described above.

In one embodiment of the present invention, the audio circuit may comprise a discrete circuit card in, for example a display or a server of the present invention or may comprise a dedicated device such as the Network Audio Processor as described in co-pending U.S. patent application Ser. No. 12/733,214. The audio circuit of the present invention can compute the attenuation factor w_ijand the delay factor T_ijfor each scanner at each checkout counter having the information regarding known sounds generated at the checkouts.

More specifically, in one embodiment of the present invention, given that the scanning signal at position i is generated at time t₁, T_ijcan be computed as the number of timeslots between t₁and the timeslot at which the scanning signal is first recorded at the microphone j. Alternatively, in an alternate embodiment of the present invention, the difference in timeslots between the first/highest peak across different recorded signals, rather than the beginning of the signal, can be used.

In one embodiment of the present invention, the attenuation factor w_ijis computed similarly. In particular, w_ijcan be taken to be equal to one for all i. The factor w_ijis computed as the ratio of the signal at microphone j, at time t₁+T_ijover the signal at microphone i at time t₁+T_ii. In alternative embodiment of the present invention, the ratio of peaks or other positions in the waveform of the scanning sound can be used.

Once the attenuation factor w_ijand the delay factor T_ijare computed, beam forming technique can be used so that sounds from the other checkout counters are removed from the audio signal received by the target microphone at, for example, the target display 100.

FIG. 4 depicts a flow diagram of a method for microphone array calibration in accordance with an embodiment of the present invention. The method 400 of FIG. 4 begins at step 402 during which environmental sounds/audio is recorded by at least two microphones, which comprise an array of microphones. The method 400 proceeds to step 404.

At step 404, an attenuation factor for sounds from all other microphones of the array other than a microphone being calibrated (i.e., a target microphone) is determined using, for example, known sounds from locations of the other microphones of the array. The method 400 proceeds to step 406.

At step 406, a delay factor for sounds from all other microphones of the array other than a microphone being calibrated (i.e., a target microphone) is determined using, for example, known sounds from locations of the other microphones of the array. The method 400 proceeds to step 408.

At step 408, the determined attenuation factor and delay factor are implemented for removing audio from an audio signal captured by the target microphone originating from respective locations of the other microphones of the array of microphones for example by, in one embodiment of the present invention, using beam forming processes/techniques. The method 400 can then be exited.

It should also be noted that, in accordance with alternate embodiments of the present invention, beam-forming processes/techniques can also be used when there is a wall of screens (such as a wall of screens in the TV section of a store). In this environment, embodiments of the present invention can be used to detect in front of which TV is audio/sound originating to determine, for example, in front of which screen is a viewer/customer standing. In addition, data could be collected concerning which screens are most viewed (e.g., stood in front of) to determine which displayed advertisements are most popular.

Having described various embodiments of a method, apparatus and system for microphone array calibration (which are intended to be illustrative and not limiting), it is noted that modifications and variations can be made by persons skilled in the art in light of the above teachings. It is therefore to be understood that changes may be made in the particular embodiments of the invention disclosed which are within the scope and spirit of the invention. While the forgoing is directed to various embodiments of the present invention, other and further embodiments of the invention may be devised without departing from the basic scope thereof.

Claims

1. A method, comprising: recording audio using at least two microphones, which comprise an array of microphones;using a target microphone of the array of microphones, determining an attenuation factor for audio originating from respective locations of other microphones of the array of microphones;using a target microphone of the array of microphones, determining a delay factor for audio originating from respective locations of other microphones of the array of microphones; andimplementing said determined attenuation factor and delay factor for removing audio originating from respective locations of the other microphones of the array of microphones from an audio signal captured by the target microphone.
2. The method of claim 1, comprising using known parameters of audio generated by audio sources at respective locations of the other microphones of the array of microphones to determine an attenuation factor for the audio at a location of the target microphone.
3. The method of claim 1, comprising using known parameters of audio generated by audio sources at respective locations of the other microphones of the array of microphones to determine a delay factor for the audio at a location of the target microphone.
4. The method of claim 1, wherein audio originating locally to a respective microphone of the array of microphones is recorded by the respective microphone to establish a first time of recording for determining a delay factor for the audio at a location of the target microphone.
5. The method of claim 1, wherein audio originating locally to a respective microphone of the array of microphones is recorded by the respective microphone to establish a first magnitude of recording for determining an attenuation factor for the audio at a location of the target microphone.
6. The method of claim 1, wherein the attenuation factor and the delay factor are determined using the following equation:
7. The method of claim 6, wherein wij is an attenuation factor from a location of one microphone, j, of the array of microphones, to a location of another microphone, i, of the array of microphones, Tij is a delay parameter from a location of one microphone, j, of the array of microphones to the location of another microphone, i, of the array of microphones, and xi(t) is audio generated at a location of a microphone, i, of the array of microphones at timeslot t.
8. The method of claim 1, comprising: positioning a microphone of the array of microphones at each location of a source of audio.
9. The method of claim 8, wherein the source of audio comprises a check-out counter in a retail environment.
10. The method of claim 1, comprising: removing audio originating from respective locations of the other microphones of the array of microphones from an audio signal captured by the target microphone using beam-forming techniques.
11. The method of claim 1, comprising, using beam-forming techniques to determine physical properties of audio originating from respective audio sources at respective locations of a microphone of the array of microphones.
12. An apparatus for microphone array calibration, comprising: a memory for storing program routines and data; anda processor for executing said program routines;said apparatus configured to: record audio using at least two microphones, which comprise an array of microphones;use a target microphone of the array of microphones to determine an attenuation factor for audio originating from respective locations of other microphones of the array of microphones;use a target microphone of the array of microphones to determine a delay factor for audio originating from respective locations of other microphones of the array of microphones; andimplement said determined attenuation factor and delay factor for removing audio originating from respective locations of the other microphones of the array of microphones from an audio signal captured by the target microphone.
13. The apparatus of claim 12, wherein said apparatus comprises an audio circuit.
14. The apparatus of claim 12, wherein said apparatus comprises an integrated component of at least one of a server and a set-top box.
15. A system for microphone array calibration, comprising: at least two microphones comprising an array of microphones;at least one audio source;an apparatus including a memory for storing program routines and data, and a processor for executing said program routines, said apparatus configured to: record audio using at least two microphones, which comprise an array of microphones;use a target microphone of the array of microphones to determine an attenuation factor for audio originating from respective locations of other microphones of the array of microphones;use a target microphone of the array of microphones to determine a delay factor for audio originating from respective locations of other microphones of the array of microphones; andimplement said determined attenuation factor and delay factor for removing audio originating from respective locations of the other microphones of the array of microphones from an audio signal captured by the target microphone.
16. The system of claim 15, wherein said at least two microphones comprise microphones of at least one network audio processor.
17. The system of claim 15, wherein said at least two microphones comprise microphones in a check-out lane of a retail environment.
18. The system of claim 15, wherein said audio source comprises a scanner.

PCT Information

Filing Document	Filing Date	Country	Kind
PCT/US12/72083	12/28/2012	WO	00

METHOD, APPARATUS AND SYSTEM FOR MICROPHONE ARRAY CALIBRATION

Information

Publication Number

Date Filed

Date Published

Inventors

Original Assignees

CPC

International Classifications

Abstract

Description

Claims

PCT Information