This disclosure generally relates to acoustic devices that include microphone arrays for capturing acoustic signals.
An array of microphones can be used for capturing acoustic signals along a particular direction.
In general, in one aspect, this document features an apparatus that includes an array of multiple microphones, and a passive directional acoustic element disposed between at least two of the multiple microphones. The passive directional acoustic element includes a pipe having an elongated opening along at least a portion of the length of the pipe, and an acoustically resistive material covering at least a portion of the elongated opening. One or more structural characteristics of the passive acoustic element is configured for capturing a target frequency range in accordance with a target beam pattern associated with the array.
In another aspect, this document features a device that includes one or more acoustic transducers, an array of multiple microphones, and a passive directional acoustic element disposed between at least two of the multiple microphones. The passive directional acoustic element includes a pipe having an elongated opening along at least a portion of the length of the pipe, and an acoustically resistive material covering at least a portion of the elongated opening. One or more structural characteristics of the passive acoustic element is configured for capturing a target frequency range in accordance with a target beam pattern associated with the array, the target beam-pattern being selected in accordance with a threshold amount of spatial aliasing. The device also includes one or more processing devices configured to process signals captured by the array.
In another aspect, this document features a method that includes receiving an input signal from an array of multiple microphones, wherein the input signal includes acoustic signals received through a passive directional acoustic element disposed between at least two of the multiple microphones. The passive directional acoustic element includes a pipe having an elongated opening along at least a portion of the length of the pipe, and an acoustically resistive material covering at least a portion of the elongated opening. The method also includes generating, by one or more processing devices from the input signal, a beamformed signal that represents signals captured by the array in accordance with one or more directional sensitivity patterns of the array, and generating an output signal based on the beamformed signal.
In another aspect, the document features one or more machine-readable storage devices having encoded thereon computer readable instructions for causing one or more processing devices to perform various operations. The operations include receiving an input signal from an array of multiple microphones, wherein the input signal includes acoustic signals received through a passive directional acoustic element disposed between at least two of the multiple microphones. The passive directional acoustic element includes a pipe having an elongated opening along at least a portion of the length of the pipe, and an acoustically resistive material covering at least a portion of the elongated opening. The operations also include generating a beamformed signal that represents signals captured by the array in accordance with one or more directional sensitivity patterns of the array, and generating an output signal based on the beamformed signal.
Implementations of the above aspects may include one or more of the following features. The array can be a linear array or a non-linear array. The multiple microphones can be disposed around the periphery of an acoustic device. The pipe can have a substantially uniform hollow cross-section along its length. The acoustically resistive material can include at least one of: wire mesh, sintered plastic, or fabric. The array can include six or more microphones separated by passive directional acoustic elements. The array of multiple microphones can be disposed along a substantially circular path, and the passive directional acoustic element disposed between at least two of the multiple microphones can have a curved shape. The array of multiple microphones can be disposed on a top surface or sidewall of the apparatus. The target beam-pattern can be selected in accordance with a threshold amount of spatial aliasing. The target frequency range can have a bandwidth substantially equal to 20 KHz.
Various implementations described herein may provide one or more of the following advantages. By using passive directional acoustic elements together with microphones, wideband acoustic signals (e.g., over a bandwidth of 20 KHz) can be captured with high fidelity. For example, on one hand, the use of passive directional acoustic elements may allow for significantly mitigating effects of spatial aliasing on the captured signals without using a large number of microphones. On the other hand, by using a limited number of discrete microphones in an array, the resultant sensitivity pattern of the array may be prevented from becoming overly directional. Therefore, in some cases, using multiple microphones in combination with one or more passive directional acoustic elements may allow for implementing arrays that are sensitive over a large bandwidth without being overly directional. Such arrays may be useful, for example, in small form factor devices usable for recording high-fidelity audio such as that captured or recorded for virtual reality (VR) applications.
Two or more of the features described in this disclosure, including those described in this summary section, may be combined to form implementations not specifically described herein.
The details of one or more implementations are set forth in the accompanying drawings and the description below. Other features, objects, and advantages will be apparent from the description and drawings, and from the claims.
This document describes technology in which multiple microphones are employed with passive directional acoustic elements in capturing acoustic signals over a wide bandwidth and with a target amount of directionality. For example, a circular array of six microphones, with passive directional acoustic elements disposed between each pair of microphones, may be used to capture signals in a bandwidth of up to 20 KHz. In the absence of the passive directional acoustic elements, the spacing between the microphones needed for capturing such a bandwidth without substantial spatial aliasing would be very small (e.g., in the order of a fraction of a centimeter), which typically would lead to a requirement of a large number of microphones that may not be practically feasible in many applications. On the other hand, in some cases, the directionality associated with the passive acoustic elements may be too high for some practical applications. The technology described herein can be used for obtaining a combination of microphones and passive directional acoustic elements that yields a target sensitivity pattern (e.g., with a target amount of directionality) over a large bandwidth. This can be used, for example, to implement wideband arrays using a small number of microphones. Such arrays may be implemented on small form-factor devices (e.g., personal acoustic devices or small VR cameras) to capture high fidelity audio over large bandwidths.
Microphone arrays can be used for capturing acoustic signals along a particular direction. For example, signals captured by multiple microphones in an array may be processed to generate a sensitivity pattern that emphasizes the signals along a “beam” in the particular direction and suppresses signals from one or more other directions. An example of such a device 100 is shown in
In some implementations, a directional audio capture device may also be realized using a single microphone together with a slotted interference tube. An example of such a device 150 is shown in
Nyquist criterion dictates that in order to reconstruct an audio signal from a set of spatial samples (e.g., with uniform sampling occurring along a given spatial dimension), the sampling period must be equal to less than half of the wavelength corresponding to the smallest wavelength (or highest frequency) present in the audio signal. If this criterion is not satisfied, different components of the reconstructed signal may become indistinguishable from (or aliases of) one another, causing the reconstructed audio signal to be potentially distorted due to the effect known as spatial aliasing. Higher the bandwidth of the audio intended to be captured without spatial aliasing, smaller the spacing required between microphones in the corresponding array. In devices and applications directed to speech capture only (corresponding, for example, to a bandwidth of about 4 KHz-8 KHz), the spacing between microphones is typically high enough for microphone arrays to be implemented on various devices. However, for high-fidelity applications, where audio over a much larger bandwidth (e.g., 20 KHz) is intended to be captured, the spacing between the microphones in an array becomes small (e.g., in the order of a fraction of a centimeter), thereby requiring a large number of microphones that may not be feasible to implement in many applications. For example, in recording high-fidelity audio for virtual reality (VR) applications, it may be desirable to capture audio not just over the frequency range corresponding to speech, but over the entire human audible range spanning approximately 20 KHz. The large number of microphones and/or the low spacing requirement may make it unfeasible for implementing a suitable microphone array for the purpose, for example, on small form-factor devices.
Because the acoustically resistive material 210 acts as an array of microphones or sensors, and the arrival of an acoustic signal at two consecutive sensors are delayed by approximately d/co (where d is the distance between the holes and co is the speed of sound), the resultant array acts as a directional microphone in a manner similar to the shotgun microphone described above with reference to
In some implementations, using passive directional acoustic elements 205 in conjunction with an array of microphones may allow for achieving a desired tradeoff between bandwidth and directionality. For example, a device that includes a microphone array with a passive directional acoustic element 205 disposed between one or more pairs of microphones (such as the device 200 shown in
Various types and forms of passive directional acoustic elements 205 may be used for the technology described herein. In some implementations, a passive directional acoustic element 205 may have a substantially uniform and cylindrical hollow cross-section. In some implementations, the passive directional acoustic element 205 may have a rectangular cross section, and shaped to reduce impedance mismatch between the interior and exterior of the pipe or tubular structure. In some implementations, this may prevent the formation of standing waves within the passive directional acoustic elements 205. Various views of examples of passive directional acoustic elements 205 with a rectangular cross section are shown in
In some implementations, using an array of microphone in conjunction with passive directional acoustic elements 205 allows high-fidelity wideband audio capture systems to be implemented on small form-factor devices. Examples of such devices include personal acoustic devices, video capture devices such as VR cameras, teleconference microphones, or other audio-visual devices used for capturing high-fidelity wideband audio. In some implementations, in addition to the microphones and passive directional acoustic elements, a device can also include one or more acoustic transducers for generating audio signals and/or one or more processing devices configured to process, for example, signals captured or recorded using the microphones and passive directional acoustic elements.
The technology described herein may be implemented on devices of various shapes and sizes. Examples of such devices are illustrated in
The examples of
In some implementations, one or more structural characteristics of passive acoustic elements 205 disposed between microphones can be configured for capturing a target frequency range in accordance with a target beam pattern associated with the array. The target beam-pattern can be selected, for example, in accordance with a threshold or target amount of spatial aliasing. In some implementations, a length of a passive directional acoustic element can be determined based on a frequency above which the element effectively captures audio signals. In some cases, this can be determined as a fraction (e.g., substantially equal to ¼) of the corresponding wavelength. If six microphones are disposed on the sidewall of a cylindrical device of 8 cm diameter, the separation between two consecutive microphones (and hence the length of a curved passive directional acoustic element disposed between the microphones) is approximately 4 cm. The wavelength below which such a passive directional acoustic element may be found to be effective is substantially equal to 16 cm which corresponds to a frequency of about 2 KHz.
Therefore, a passive directional acoustic element of 4 cm may be used in capturing audio signals of frequencies 2 KHz or more.
Operations of the process 700 includes receiving an input signal from an array of multiple microphones, wherein the input signal includes acoustic signals received through a passive directional acoustic element disposed between at least two of the multiple microphones (702). The passive directional acoustic element can include a pipe having an elongated opening along at least a portion of the length of the pipe, and an acoustically resistive material covering at least a portion of the elongated opening. In some implementations, the passive directional acoustic element is substantially similar to those described above with reference to
Operations of the process 700 also includes generating a beamformed signal that represents signals captured by the array in accordance with one or more directional sensitivity patterns of the array (704). The beamformed signal can be generated, for example, using a delay-and-sum beamforming process such as one described above with reference to
The functionality described herein, or portions thereof, and its various modifications (hereinafter “the functions”) can be implemented, at least in part, via a computer program product, e.g., a computer program tangibly embodied in an information carrier, such as one or more non-transitory machine-readable media or storage device, for execution by, or to control the operation of, one or more data processing apparatus, e.g., a programmable processor, a computer, multiple computers, and/or programmable logic components.
A computer program can be written in any form of programming language, including compiled or interpreted languages, and it can be deployed in any form, including as a stand-alone program or as a module, component, subroutine, or other unit suitable for use in a computing environment. A computer program can be deployed to be executed on one computer or on multiple computers at one site or distributed across multiple sites and interconnected by a network.
Actions associated with implementing all or part of the functions can be performed by one or more programmable processors executing one or more computer programs to perform the functions of the calibration process. All or part of the functions can be implemented as, special purpose logic circuitry, e.g., an FPGA and/or an ASIC (application-specific integrated circuit). In some implementations, at least a portion of the functions may also be executed on a floating point or fixed point digital signal processor (DSP) such as the Super Harvard Architecture Single-Chip Computer (SHARC) developed by Analog Devices Inc.
Processors suitable for the execution of a computer program include, by way of example, both general and special purpose microprocessors, and any one or more processors of any kind of digital computer. Generally, a processor will receive instructions and data from a read-only memory or a random access memory or both. Components of a computer include a processor for executing instructions and one or more memory devices for storing instructions and data.
Other embodiments and applications not specifically described herein are also within the scope of the following claims. For example, the parallel feedforward compensation may be combined with a tunable digital filter in the feedback path. In some implementations, the feedback path can include a tunable digital filter as well as a parallel compensation scheme to attenuate generated control signal in a specific portion of the frequency range.
Elements of different implementations described herein may be combined to form other embodiments not specifically set forth above. Elements may be left out of the structures described herein without adversely affecting their operation. Furthermore, various separate elements may be combined into one or more individual elements to perform the functions described herein.
This application is a continuation application to U.S. patent application Ser. No. 15/406,045, filed on Jan. 13, 2017, the entire contents of which are hereby incorporated by reference.
Number | Name | Date | Kind |
---|---|---|---|
6084973 | Green | Jul 2000 | A |
6321428 | Toda | Nov 2001 | B1 |
8351630 | Ickler et al. | Jan 2013 | B2 |
8358798 | Ickler et al. | Jan 2013 | B2 |
8447055 | Jankovsky et al. | May 2013 | B2 |
20060126865 | Blamey | Jun 2006 | A1 |
20090310811 | Inoda et al. | Dec 2009 | A1 |
20100098266 | Mukund | Apr 2010 | A1 |
20120275621 | Elko | Nov 2012 | A1 |
20130287225 | Niwa | Oct 2013 | A1 |
20160161588 | Benattar | Jun 2016 | A1 |
20160353201 | Akino | Dec 2016 | A1 |
Number | Date | Country | |
---|---|---|---|
20180359565 A1 | Dec 2018 | US |
Number | Date | Country | |
---|---|---|---|
Parent | 15406045 | Jan 2017 | US |
Child | 16107464 | US |