Microphone arrays have been used to improve fidelity and reduce effects of ambient noise. Arrays of two or more microphones may be used to capture specific audio signals while reducing the effects of background noise and other undesirable sounds. Various beamforming algorithms may be used to combine the signals from each of the microphones in the array so that audio signals originating from a particular direction constructively interfere and generate a highest magnitude response over audio signals originating from other directions.
These beamforming algorithms were originally implemented in signal processing devices, which required each of the microphones to be individually wired to separate inputs of the signal processing device, typically an integrated circuit. The number of wires therefore increases proportionally with the number of microphones in the array. Incorporating these extra wires requires additional expense and space in the device. Additionally, the extra wires may affect the overall reliability of the system as the likelihood of a defect, malfunction, or problem with one or more of the wire sets tends to increases as the number of wires increases.
The inventors perceive a need for a microphone array supporting a large number of microphones with a limited number of connecting wires.
Embodiments of the invention provide a system having an array of microphone stages in which each of the microphone stages may be coupled together in a daisy chain. Each microphone stage may include a microphone, an analog to digital converter, a decimation unit, a receiver, an adder, and a transmitter. The analog to digital converter may convert sampled audio signals from the microphone into digital codes. The decimation unit may decimate digital codes from the analog to digital converter output to an audio frame rate of a serial interface used to transmit data between microphone stages. The decimation unit may also include a delay unit to implement a delay at each microphone stage.
The transmitter in the first microphone stage may transmit the output of the decimation unit to the receiver of the second microphone stage in the array. The adder in the second microphone stage may add the transmitted digital codes from the first stage to the output of the decimation unit in the second microphone stage. The transmitter in the second microphone stage may then transmit the sum of these digital codes from the adder to the receiver in the third microphone stage, where the adder in the third microphone stage may add the summed digital codes from the second microphone stage to the output of the decimation unit in the third microphone stage, and so on.
Since each stage may add its audio codes to the cumulative sum of digital audio codes from the prior stages, a smaller and fixed amount of bandwidth may be allocated to the transmission of audio data than if each microphone stage were to transmit its audio data to a final device in a separate channel.
A microphone circuit stage in a daisy chained microphone array may include a microphone, an analog to digital converter coupled to the microphone, a decimation unit coupled to the analog to digital converter, a first input of an adder coupled to the decimation unit, a receiver coupled to a second input of the adder, and a transmitter coupled to an output of the adder.
A serial data stream may be used to transmit audio data and configuration data between the transmitters and receivers in each of the microphone stages. The audio data that is transmitted may represent a cumulative sum of sampled audio data obtained from each of the microphones at prior microphone stages in the daisy chain. At each microphone stage, the audio data obtained from the microphone may be added to the sum of the audio data at the prior stages and this sum may replace the audio data being transmitted over the serial data stream.
The configuration data may include microphone address assignment data to assign an address to microphone in the array, delay data to set a variable delay associated with one or more microphones in the array, gain data to set a variable gain associated with one or more microphones in the array and/or other configuration settings pertaining to the microphone array. In some instances, the delay data and gain data may be transmitted over a separate channel than the audio data and address assignment data.
Each delay unit may have a delay selected to insure that the audio data from its associated microphone corresponding to sound from a particular direction is time-aligned with the data represented by the cumulative sum arriving at the receiver of that microphone stage. To achieve precise control over the steering direction, the delay at each microphone node may need to be adjusted in finer steps than can be accomplished by using an integer number of sample-periods, and therefore it may be necessary to set both an integer as well as a fractional delay amount. In these conditions it is possible to use the higher frequency clock rates used in the oversampling analog-to-digital converters to implement the fractional part of the delay, and the integer part of the delay can be implemented at the lower sample rates that correspond to the audio frame rate. The integer and the fractional delays can be set using the configuration channel transmitted to each microphone.
The analog digital converter 112 may oversample the analog audio signal at higher frequencies, such as 2.4 MHz or higher in some embodiments, than are used to transmit the serial audio data between microphone circuits 100 in the microphone array, which may transmit data at 44 kHz.
A decimation unit 118 may be coupled to the converter 112 and may decimate the digital code words generated at the converter 112. In some instances, the decimation unit 118 may decimate the digital code words by first filtering or otherwise transforming the ADC output codes before downsampling the codes. In other instances, the decimation unit 118 may decimate the digital code words by directly downsampling the ADC output codes. The decimation unit 118 may include a delay unit with one or more delay elements 113 and/or 115 that may implement a delay using memory or storage units at various locations within the structure of the decimation filter to ensure that the output codes from the decimation unit 118 align with the decimated code words at the other microphone stages from audio signals received at different times at each of the other microphones.
In an embodiment, the decimation unit 118 may include a fine integer delay element 113, a coarse integer delay element 115, a decimator 114, and/or a gain 116.
A fine variable integer delay 113 may be coupled to the analog to digital converter 112. The fine variable integer delay 113 may include a fine delay input 121, which may be used to select a whole number of cycles at the higher frequency of the analog to digital converter 112 to delay the generated digital code words outputted by the converter 112. Since the analog to digital converter 112 may operate at frequencies many times higher than the serial data stream, the fine variable integer delay 113 may enable the selection of a precise delay amount to maximize alignment of the audio signals received at different times at each microphone in the microphone array.
A decimator 114 may be coupled to the output of the fine variable integer delay 113. The decimator 114 may be used to reduce the sampling rate of the digital code words outputted by the converter 112 so that the remaining digital code words correspond to the serial data stream frame rate.
A coarse variable integer delay 115 may be coupled to the output of the decimator 114. The coarse variable integer delay 115 may include a coarse delay input 122, which may be used to select a whole number of clock cycles of the serial data stream frame clock to delay the decimated code words.
A variable gain 116 may be coupled to the output of the coarse variable integer delay 122. The variable gain 116 may include a gain set input 123, which may be used to specify the magnitude of gain applied to the delayed output of the decimator 114.
An adder 117 may be coupled to the output of the decimation unit 118. The adder 117 may add the output of the decimation unit 118 to a cumulative sum 131 of prior microphone circuit stages 0 to N−1 (assuming the microphone stage 100 in
In addition to the cumulative audio data 131 and 141 that may be received 130 and transmitted 140 over a serial data stream, certain configuration data may also be transmitted. In this example, a microphone address may be assigned to each microphone stage 100 in the daisy chain as part of the configuration data. A first microphone circuit in the microphone array may be assigned address ‘00001’, which may be transmitted as part of the serial data stream input to the first microphone circuit. The first microphone circuit may then store its assigned address in a memory 119 and its adder circuitry may then add a ‘1’ to the address and output the new address ‘00010’, which may be assigned to the next microphone circuit in the array and so on. Thus, the microphone circuit at stage N may be assigned address N 132. Its adder 117 may add a ‘1’ to the address N and a microphone address N+1 142 to be assigned to the next microphone stage may be transmitted 140 to the next stage.
In the embodiment shown in
In some embodiments, the fine delay 121, coarse delay 122, and gain set 123 inputs in each microphone circuit 100 may be pre-configured before the microphone array is used. During the pre-configuration, each circuit 100 may be supplied with predetermined input values that have been optimized based on the intended use of the microphone array. In other embodiments, these inputs 121, 122, and 123 may be dynamically configured. In some instances this configuration data may be stored in a memory 119.
An Integrated Interchip Sound protocol (I2S) may be used to transmit, encode, decode, and process audio data in the serial data stream. Other protocols may also be used in other embodiments.
Each of the components shown in
As shown in this figure, a serial data stream input to the stage N circuit 100 may include cumulative audio data 131 representing the sum of digital audio codes from the prior stages (0 to N−1). The serial data stream input may also include a microphone address assignment to address N 132.
The adder 117 in the stage N circuit 100 may then add the output digital audio code originating from the stage N microphone 110 to the cumulative sum from stages 0 to N−1 131 to create a new cumulative sum for stage 0 to N 141, which may be outputted to the next N+1 microphone stage 100.
The adder 117 in the stage N circuit 100 may also add 1 to the assigned microphone address N 132 and output a new assigned microphone address N+1 142 that may be sent to the next N+1 stage circuit 100.
The adder 117 in the stage N+1 circuit 100 may then add the output digital audio code originating from the stage N+1 microphone 110 to the cumulative sum of stages 0 to N 141 outputted by the stage N circuit 100 to create a new cumulative sum for stage 0 to N+1 151, which may be outputted 140 to the next N+2 microphone stage 100.
The adder 117 in the stage N+1 circuit 100 may also add 1 to its assigned microphone address N+1 142 and output a new assigned microphone address N+2 152 that may be sent to the next N+2 stage circuit 100.
The data stream channel 135 may be used to transmit the cumulative sum of audio data 131 and 141 from each microphone circuit 100 as well as microphone circuit address assignment data 132 and 142 to assign unique addresses to each microphone circuit 100 in the array.
The command stream channel 165 may be used to transmit configuration data and commands to each of the microphone circuits 100 in the array. The packets transmitted over the command stream channel 165 may be formatted to include a microphone address 161, a command code word 162, and a data word 163.
The microphone address 161 may specify the assigned address of the microphone circuit 100 for which the corresponding command code word 162 and data word 163 is intended.
The command code word 162 may specify a command that the addressed microphone circuit 161 is to execute. These commands may specify instructions such as setting a digital delay 118, setting a gain, bypassing a microphone circuit, muting a microphone, or setting an analog preamplifier gain.
The data word 163 may specify a value associated with the command code word 162. For example, the value specified in data word 163 may be an amount of delay or gain that is to be set.
Each microphone circuit 100 may passively listen to the packets transmitted over the command stream. When a microphone circuit 100 identifies packets having a microphone address 161 corresponding to its assigned microphone address 132, the microphone circuit 100 may then execute the command 162 and apply or set the value specified in the data word 163. Each microphone may also passively listen for packets that contain an address that is designated as a “global broadcast” address. When such an address is transmitted, all microphones receive the same command at the same time.
In some embodiments, the amount of delay to be applied may be calculated in advance and then encoded into the data words 163 associated with respective set delay commands 162 for each microphone address 161. For example, if a microphone array has a total length of 3 feet, sound may arrive at the first microphone at the beginning of the array approximately 3 ms before arriving at the last microphone at the end of a linear array (assuming sound travels at roughly 1 ms/ft). In the case of a decimated sample rate of 44.1 KHz, a one sample delay would last approximately 22 μs. To obtain the 3 ms delay, a delay of 136 sample cycles would be needed (3 ms/22 μs). Thus, the data word 163 may be set to ‘10001000’ (corresponding to the decimal number 136), to specify a delay value of 136 cycles associated with the set delay command 162 for the last microphone in the array.
In some instances buffering the cumulative audio data 131 from prior stages in order to perform the addition at adder 117 may cause an additional delay in each node. In these instances the programmed delay may be adjusted to account for this delay. For example, if a one sample delay per stage is caused by the buffering of the cumulative data, a corresponding sample offset could be subtracted from the programmed delay depending on the position of the stage in the array.
Adding a FIR/IIR filter 401 may enable more precise control over the resulting beamforming output pattern though the use of customizable weighted sums of audio output codes from prior microphone stages instead of a simple sum. The FIR/IIR filter 401 may also enable customized control of the frequency response and related dynamics of each microphone circuit 100, as well as enabling the use of adaptive nulls in the beamforming output pattern.
The FFT circuit 501 may be configured with a set of coefficients 502. The gain 504 may be configured with a corresponding binary set of complex coefficients 503. With FFT circuit 501, the serial data stream 130 and 140 may correspond to a serial scan of some or all of the FFT outputs. Each output from the FFT circuit 501 may include a real and imaginary component. These outputs may be added 117 to a corresponding cumulative sum 131 of similar outputs from the prior stages (0 to N−1). The result of each of these additions at adder 117 may then be outputted 140 and sent to the next microphone stage.
Since each of the FFT outputs are complex, additional bandwidth may be needed over the previously mentioned embodiments that include outputs with only a real component. Additionally, since each of the outputs are individually combined to create the cumulative sum, some data frame may be overlapped which may also require additional bandwidth.
Each of the complex coefficients may be used to implement a complex rotation of the output codes from FFT circuit 501 at the gain 504. Additionally, the use of the FFT circuit 501 may require additional bandwidth to output the different components. These coefficients may be uploaded to the microphone circuits 100, such as through the use of commands 162 and data word 163 in the command stream 165. Other uploading techniques may also be used and in some instances, each filter 401 may be preconfigured with a predetermined set of coefficients.
The analog to digital converter 112 may oversample the analog audio signal at higher frequencies, such as 2.8 MHz or higher in some embodiments, than are used to transmit the serial audio data between microphone circuits 100 in the microphone array, which may transmit data in each channel at 44 kHz. A fine variable integer delay unit 113 may be coupled to the converter 112 to delay the digital output from the converter 112 an integer number of converter clock cycles.
A decimator 114 may be coupled to the output of the fine integer variable delay 113. The decimator 114 may be used to reduce the sample-rate of the digital code words outputted by the converter 112 to equal the sample-rate of the serial data stream frame-clock rate.
The output from the decimator 114 may be coupled to circuits associated with two or more separate channels. For example, coarse integer variable delay circuits 615 and 625 may be coupled to the output of the decimator 114. Each of these delay circuits 615 and 625 may include individually configurable delay inputs 622, which may be used to select a whole number of clock cycles of the serial data stream 130 to delay the decimated code words. In some instances, the selected delay at the coarse integer delay circuit 615 associated with the decimated code words intended for a first channel may be different than that selected for the coarse integer delay circuit 625 associated with a second channel.
Separate variable gains 616 and 626 may be coupled to the respective outputs of the coarse integer variable delay circuits 615 and 625. Each variable gain 616 and 626 may include a gain set input 623, which may be used to individually configure the magnitude of gain applied to the delayed output of the decimator 114 associated with each channel.
Separate adders 617 and 627 may be coupled to the respective outputs of variable gains 616 and 626. Each adder 617 and 627 may add the output from its respective variable gain 616 and 626 to a respective cumulative sum 631 and 632 of prior microphone circuit stages 0 to N−1 (assuming the microphone stage 100 in
After the adders 617 and 627 add the output of their respective variable gains 616 and 626 to the respective cumulative sums 631 and 632 of their respective channels, the resulting sum may then be outputted as respective new cumulative sums 651 and 652 of microphone stages 0 to N on each respective channel. These new cumulative sums may then be transmitted to the next microphone stage N+1. Thus, each microphone stage in the microphone array may add its output code of each channel to that of the prior stages to eventually create a single cumulative beamformed output code for each channel.
The preamplifier 702 may initially boost analog signals from the microphone 701. The preamplifier 702 may be coupled to the analog delay and/or filter circuit 704. The preamplifier 702 may be supplied with a gain set input 703 which may be used to specify an amount of gain to be applied to the analog output signals from the microphone 701.
The analog delay/filter circuit 704 may include an analog delay, a finite impulse response (FIR) filter, an infinite impulse response (IIR) filter, or both a delay and a filter. The analog delay, if included, may have a switched capacitor or charge-coupled-device (CCD) analog delay line. The switched capacitors in the analog delay may also be used to perform the analog summing at amplifiers 712 and/or 713, which may save power in some instances. The analog delay 704 may be supplied with a delay set input 705 which may be used to specify the capacitors to be switched in the delay line to obtain a desired delay. The analog delay 704 may also have an analog equivalent architecture to that shown in
The output of the delay circuit 704 may be coupled to a first input of an amplifier 712. A resistor may be coupled between the output of the delay circuit 704 and the amplifier input 712.
An input 711 to the microphone stage 700 may be coupled to the first input of amplifier 712, the output of amplifier 712, and the output of delay circuit 704. A resistor may be coupled between the input 711 to the microphone stage 700 and the first input of amplifier 712, the output of amplifier 712, and the output of delay circuit 704. Another resistor may be coupled between the first input of amplifier 712 and the output of amplifier 712.
The output of amplifier 712 may be coupled to another amplifier 713. Both these amplifiers 712 and 713 may be inverting amplifiers.
The output of amplifier 713 may be coupled to an output 714 of the microphone stage 700. This output may be coupled to an input of a next microphone stage (not shown) in the microphone array.
The coupling of stage inputs to the respective outputs of prior stages and the stage outputs to the respective inputs of subsequent stages may form the daisy chain configuration of the stages in the microphone array.
In some embodiments, each analog stage may also include a receiver 720 connected to a separate analog control channel(s) 725. The receiver 720 may monitor the control channel(s) 725 and upon detecting a gain setting 703 or delay setting 705 intended for the microphone stage 700, may supply the detect gain setting 703 and/or delay setting 705 as a gain set input 703 to the preamplifier 702 and/or a delay set input 705 to the delay circuit 704.
An Inter-Integrated Circuit protocol (I2C) may be used to transmit, decode, and process gain set 703 and delay set 705 signals over the control channel 725 and by the receiver 720. Other protocols may also be used in other embodiments.
In box 801, audio sampled from a microphone may be converted to digital codes. An analog to digital converter may converted the analog audio signals from the microphone into digital codes. The converter may sample the analog audio signals at higher frequencies than may be used to transmit audio data between microphone stages.
In box 802, a subset of the digital codes may be selected. During the code selection process a delay may be implemented to ensure that the decimator output codes at each stage correspond to codes at the other stages from similar audio signals arriving at different times at the microphones of the other stages. The digital codes may also be decimated to a rate corresponding to the frequency used to transmit audio data between microphone stages.
In box 803, the decimated digital output code from the current stage may be added to a cumulative sum of digital audio codes from prior microphone stages.
In box 804, the resulting sum of adding the digital code from the current stage to the cumulative sum of digital audio codes from prior microphone stages may be sent to a next microphone stage in the array, where the digital code from the next stage may be added to the cumulative sum, and so on. The process may repeat until the digital codes from each stage in the microphone array have been added to the cumulative sum.
Once the digital codes from each of the stages in the array have been added to the cumulative sum, in box 806 the resulting grand total of the digital codes from each of the stages may be outputted by the microphone array as the final beamformed output code.
An input of a variable amplifier 1011 may be coupled to each microphone 110 in each microphone circuit stage 100. The output of the variable amplifier 1011 may be coupled to an analog to digital converter 1012, which may convert sampled amplified analog audio signals into digital codes.
The output of the analog to digital converter 1012 may be coupled to both a level detector 1013 and an adder 1014. The adder 1014 may add the digital code generated at a current stage to a sum of codes from prior stages. The resulting total may then be sent to an adder in a next stage 100 to add the digital code from the next stage 100 to the sum of the digital codes from the prior stages, and so on, until a grand total sum of all the digital codes 1020 is outputted at the final stage in the array.
The level detector 1013 may identify an audio level of the audio signal corresponding to the digital code outputted by the analog to digital converter 1012 in each stage 100. The output of the level detector 1013 in each stage may be coupled to a logic circuit 1015. The logic circuit 1015 may also be coupled to a output of a logic circuit from a prior stage 100 or to the output of a level detector 1013 from a prior stage 100.
The logic circuit 1015 may compare the identified audio level from a current stage to that of a prior stage to identify a highest or maximum audio level. The identified highest audio level from the comparison may be then selected and sent on to logic 1015 in a next stage 100. The logic 1015 in the next stage 100 may then compare the identified audio level from the level detector 1013 in the next stage to the previously selected highest audio level to identify a new highest audio level, which may then be selected and sent to the next stage, and so on. At the end of the array, the maximum audio level may be identified.
The output of the last logic circuit 1015 in the last circuit stage 100 may be coupled to a microcontroller. The microcontroller may calculate a subsequent gain 1030 to be applied to each of the variable amplifiers 1011 based on the identified highest audio level. The calculated gain 1030 may then be sent to a gain set input of each variable amplifier 1011 coupled to the microcontroller. A digital to analog converter 1016 may be coupled between the microcontroller and the gain set input of each variable amplifier 1011 to convert the digital output from the microcontroller into an analog gain set signal for the variable amplifier 1011.
In this example, the decimation factor indicates that 128 samples of the original ADC output code may cycled through in same time as one sample of the serial data stream. Thus, if the fractional delay is associated with the original ADC output sample rates 1120, it would be possible to select a fractional delay amount in increments of 1/128th the sample rate of the serial data stream.
In the first stage, the sample rate 1120 of the ADC output codes may be decimated by a factor of 16. This may reduce the delay step increments 1130 from 128 to 8, as now only 8 samples may by cycled through in the same time as one sample of the serial data stream. Thus, if the fractional delay is associated with the output of the first stage, it would be possible to select a fractional delay amount in increments of ⅛th the sample rate of the serial data stream.
In the second stage, the sample rate of the output of the first stage may be decimated by a factor of 2. This may reduce the delay step increments 1130 from 8 to 4, as now only 4 samples may by cycled through in the same time as one sample of the serial data stream. Thus, if the fractional delay is associated with the output of the first stage, it would be possible to select a fractional delay amount in increments of ¼th the sample rate of the serial data stream.
In the third stage, the sample rate of the output of the second stage may be decimated by a factor of 2. This may reduce the delay step increments 1130 from 4 to 2, as now only 2 samples may by cycled through in the same time as one sample of the serial data stream. Thus, if the fractional delay is associated with the output of the first stage, it would be possible to select a fractional delay amount in increments of ½th the sample rate of the serial data stream.
In the fourth stage, the sample rate of the output of the third may be decimated by a factor of 2 generating the final desired sample rate of the serial data stream.
Logic may be used to determine which of the filter stages a delay unit should be associated with. The logic may be configured to select the filter stage having a lowest possible sample rate to provide a predetermined minimum fractional step size. For example, if an application requires a fractional delay step size that is at least one third the size of the serial data stream, the delay unit may be coupled to the output of the second filter stage. However, if the application requires a fractional delay step size that is at least one fifth that of the serial data stream, the delay unit may be coupled to the output of the first filter stage, and so on. Providing the ability to select the filter stage having the lowest sample rate may, in some instances, reduce the size of the delay unit and conserve power.
The foregoing description has been presented for purposes of illustration and description. It is not exhaustive and does not limit embodiments of the invention to the precise forms disclosed. Modifications and variations are possible in light of the above teachings or may be acquired from the practicing embodiments consistent with the invention. For example, some of the described embodiments refer to a decimation circuit with a decimation factor of 128 that may include four filter stages, but in other embodiments, different decimation factors and/or numbers of stages may be used.
This application claims the benefit, under 35 U.S.C. §119(e), of U.S. Provisional Patent Application No. 61/559,435, filed Nov. 14, 2011, the contents of which is hereby incorporated by reference in its entirety.
Number | Date | Country | |
---|---|---|---|
61559435 | Nov 2011 | US |