This disclosure generally relates to microphone arrays. More particularly, the disclosure relates to a microphone array for a speaker system, such as a voice-enabled speaker system.
Voice-enabled devices such as speaker systems (also referred to as, “smart speakers”) are increasingly present in homes, offices and other environments. These devices allow users to control various functions using voice commands. However, given their portability and size, it can be challenging to configure microphones in these devices to effectively process vocalized user input.
All examples and features mentioned below can be combined in any technically possible way.
Various implementations include a microphone array for a speaker system. In some implementations, the microphone array has an asymmetric configuration of microphones.
In some particular aspects, a microphone array is mounted in a housing having a primary X axis, a primary Y axis perpendicular to the primary X axis, and a primary Z axis perpendicular to the primary X axis and the primary Y axis. The microphone array can include: a set of microphones positioned in a single plane perpendicular to the primary Z axis and axially asymmetric with respect to both the primary X axis and the primary Y axis.
In other particular aspects, a system includes: a speaker housing having a primary X axis, a primary Y axis perpendicular to the primary X axis, and a primary Z axis perpendicular to the primary X axis and the primary Y axis; and a microphone array contained within the speaker housing, the microphone array having a set of microphones positioned in a single plane perpendicular to the primary Z axis and axially asymmetric with respect to both the primary X axis and the primary Y axis.
In additional particular aspects, a microphone array is mounted in a housing having a primary X axis, a primary Y axis perpendicular to the primary X axis, and a primary Z axis perpendicular to the primary X axis and the primary Y axis. The microphone array can include: a set of microphones positioned in a single plane perpendicular to the primary Z axis of the housing and axially asymmetric with respect to both the primary X axis of the housing and the primary Y axis of the housing, where all microphones in the set of microphones are stationary relative to the housing.
In other particular aspects, a system includes: a speaker housing having a primary X axis, a primary Y axis perpendicular to the primary X axis, and a primary Z axis perpendicular to the primary X axis and the primary Y axis; and a microphone array contained within the speaker housing. The microphone array can include a set of microphones positioned in a single plane perpendicular to the primary Z axis of the housing and axially asymmetric with respect to both the primary X axis of the housing and the primary Y axis of the housing, where all microphones in the set of microphones are stationary relative to the housing.
In additional particular aspects, a system includes: a speaker housing having a primary X axis, a primary Y axis perpendicular to the primary X axis, and a primary Z axis perpendicular to the primary X axis and the primary Y axis; and a microphone array contained within the speaker housing. The microphone array includes: a set of microphones positioned in a single plane perpendicular to the primary Z axis of the housing and axially asymmetric with respect to both the primary X axis of the housing and the primary Y axis of the housing. The system further includes: a printed wiring board coupled with the set of microphones; and a core section contained within the speaker housing, where the printed wiring board is coupled with the core, and where the core includes a set of recesses each at least partially housing one of the set of microphones.
Implementations may include one of the following features, or any combination thereof.
In some cases, the set of microphones is rotationally symmetric about the Z axis.
In certain implementations, the set of microphones is rotationally asymmetric about the Z axis.
In particular cases, the microphone array includes a printed wiring board coupled to the set of microphones.
In some implementations, the set of microphones includes at least two microphones. In certain cases, the set of microphones includes six microphones.
In particular cases, a cross-section of the housing along the single plane is a non-circular shape. In certain implementations, the cross-section of the housing along the single plane has a substantially rectangular shape.
In some cases, the set of microphones yields beams with a directivity index substantially equal to a directivity index of beams from a reference set of microphones positioned symmetrically about a perimetric boundary line with respect to the housing.
In certain implementations, the speaker system further includes a core section contained within the speaker housing, where the printed wiring board is coupled with the core, and the core includes a set of recesses each at least partially housing one of the set of microphones. In some cases, the printed wiring board is located between the set of microphones and a top section of the speaker housing, and the printed wiring board further includes a set of apertures extending therethrough for receiving the set of microphones. In particular implementations, the speaker system also includes an acoustically transparent screen between the printed wiring board and the top section of the speaker housing.
In some aspects, the housing has an ellipsoidal cross-section having length along the primary X axis that is distinct from a length along the primary Y axis.
Two or more features described in this disclosure, including those described in this summary section, may be combined to form implementations not specifically described herein.
The details of one or more implementations are set forth in the accompanying drawings and the description below. Other features, objects and benefits will be apparent from the description and drawings, and from the claims.
It is noted that the drawings of the various implementations are not necessarily to scale. The drawings are intended to depict only typical aspects of the disclosure, and therefore should not be considered as limiting the scope of the implementations. In the drawings, like numbering represents like elements between the drawings.
This disclosure is based, at least in part, on the realization that an asymmetric microphone array can be beneficially incorporated into a speaker system. For example, an array of microphones can be positioned asymmetrically relative to a speaker housing to provide a directivity index substantially equal to a symmetric array having a greater number of microphones. The array of microphones can be positioned to enhance the directivity index of several beams with different look directions. In various implementations, microphone arrays are located in a speaker housing having a horizontal cross-section that is non-circular in shape.
Commonly labeled components in the FIGURES are considered to be substantially equivalent components for the purposes of illustration, and redundant discussion of those components is omitted for clarity.
A microphone array, e.g., in a speaker system such as a voice-enabled speaker system, can include a set of microphones arranged to detect voice commands from a user.
In the data flow of
In some cases, as depicted in phantom, the microphone signals 40 can be initially processed by the echo canceller 60 and subsequently processed by the beam former 50, however, in this example depiction, those microphone signals 40 are initially sent to the beam former 50. The beam former 50 can be configured to filter particular microphone signals 40 according to the configuration of the array 10 in order to achieve a desired directionality. Formed beams 70 are sent from the beam former 50 to the echo canceller 60 in order to remove self-playback from the microphone signals 40 or the formed beams 70. These filtered beams 80 are then sent to a beam selector 90 in order to select the beam attributable to the voice input 20 from the user 30. This selected beam 100 is then processed by the wake word identifier 110 to determine whether the voice input 20 includes that wake word (e.g., “Alexa” or “Siri”). After determining that the voice input 20 includes the correct wake word (or phrase), a command identifier and processor 120 can parse and/or analyze the selected beam 100 from the voice input 20 for one or more particular commands (e.g., “play songs by the band ‘Boston’”) and identify an appropriate response (e.g., by playing the first song listed alphabetically in a list of stored songs by the artist “Boston”). An application processor 130 can receive playback instructions 140 from the command identifier and processor 120, and provide output signals 150 to a transducer 160 (e.g., via digital signal processor, not shown) for providing an audio output, such as audio content or a voice response (e.g., back to user 30).
It is understood that one or more of the above-noted functions described with reference to
As seen in
It is understood that the terms “upper” and “lower” are merely intended to provide examples of relative positional information in one configuration of a speaker system. These terms can be interchanged, and may refer to distinct portions of a speaker system, depending upon its orientation and intended use. As such, they are not intended to be limiting to particular orientations.
Overlying the core section 250, as shown more clearly in
In various implementations, as shown in
In some example implementations, the microphones 300 can be positioned in an axially asymmetric pattern with respect to both the primary X axis and the primary Y axis, but can be rotationally symmetric about the Z axis. That is, the microphones 300 in the array 10 can be positioned such that a full rotation about the Z axis results in two or more matching positions to an original position, e.g., an order of two (2) or more.
In other example implementations, the microphones 300 can be positioned asymmetrically with respect to both the primary X axis and the primary Y axis, and can additionally be rotationally asymmetric about the Z axis. In these cases, a complete rotation about the Z axis only results in one matching position (i.e., the original position), or an order of one (1).
As illustrated in
In an additional example implementation, as shown in the graphical depiction of
As described with reference to
The microphone array 10 disclosed according to various implementations can yield beams (e.g., formed beams 70,
Additionally, the microphone array configurations disclosed according to various implementations can be used to adapt an array in a circular (cross-sectional) housing to a non-circular (cross-sectional) housing, such as a housing have an elliptical shape or rectangular shape in order to provide beams with a substantially equivalent directivity index.
Locations of microphones (e.g., microphones 300 in the array 10) can be based upon known locations of interference between voice input(s) 20, environmental sounds, and the physical construction of the speaker system (e.g., speaker system 200). That is, this asymmetric configuration of microphones 300 in the array 10 can be based at least in part upon a consistency in directivity index across all beams formed from the audio input at microphones 300 in the array 10. In some cases, the number of beams formed from microphone inputs is fixed, and can be used to iteratively calculate directivity index for all beams at a plurality of positions. According to some example implementations, twelve (12) beams are formed using the array 10. Locations of microphones can be based upon an acceptable deviation in directivity index from a reference array, such as an array generating twelve beams with equally azimuthal spaced microphones (e.g., at look directions every 30 degrees around a circle). In a particular example, microphone locations are determined such that a plane wave arriving at each microphone 300 from any direction will have different path lengths, such that the magnitude and phase differences between the microphones 300 support beamforming for each desired look direction.
Additionally, acoustic shadowing resulting from sound scattered off of a housing having a distinct cross-sectional shape from its corresponding microphone array can negatively affect beamforming, e.g., where an azimuthal symmetrical arrangement of microphones is employed in non-circular housing. As such, the asymmetric configuration of microphones 300 in array 10 (within a non-circular housing) can enhance beamforming when compared with the conventional, symmetrical array within a non-circular housing.
In various implementations, components described as being “coupled” to one another can be joined along one or more interfaces. In some implementations, these interfaces can include junctions between distinct components, and in other cases, these interfaces can include a solidly and/or integrally formed interconnection. That is, in some cases, components that are “coupled” to one another can be simultaneously formed to define a single continuous member. However, in other implementations, these coupled components can be formed as separate members and be subsequently joined through known processes (e.g., soldering, fastening, ultrasonic welding, bonding). In various implementations, electronic components described as being “coupled” can be linked via conventional hard-wired and/or wireless means such that these electronic components can communicate data with one another. Additionally, sub-components within a given component can be considered to be linked via conventional pathways, which may not necessarily be illustrated.
A number of implementations have been described. Nevertheless, it will be understood that additional modifications may be made without departing from the scope of the inventive concepts described herein, and, accordingly, other implementations are within the scope of the following claims.
This application is a continuation filing that claims priority to U.S. patent application Ser. No. 15/799,021, filed on Oct. 31, 2017, which is hereby incorporated by reference in its entirety.
Number | Name | Date | Kind |
---|---|---|---|
7269263 | Dedieu | Sep 2007 | B2 |
7792313 | Dedieu | Sep 2010 | B2 |
8868413 | Siotis | Oct 2014 | B2 |
9479627 | Rung | Oct 2016 | B1 |
9659577 | Langhammer | May 2017 | B1 |
9812128 | Mixter et al. | Nov 2017 | B2 |
20030112984 | Graumann | Jun 2003 | A1 |
20040120532 | Dedieu et al. | Jun 2004 | A1 |
20050058300 | Suzuki | Mar 2005 | A1 |
20110019857 | Matsui | Jan 2011 | A1 |
20110137109 | Zahnert | Jun 2011 | A1 |
20120019689 | Zurek et al. | Jan 2012 | A1 |
20120259628 | Siotis | Oct 2012 | A1 |
20170188141 | Rung | Jun 2017 | A1 |
20170263268 | Rumberg et al. | Sep 2017 | A1 |
20180206039 | Vilermo | Jul 2018 | A1 |
20180352324 | Choisel | Dec 2018 | A1 |
Number | Date | Country |
---|---|---|
101548554 | Sep 2009 | CN |
102708874 | Oct 2012 | CN |
102860039 | Jan 2013 | CN |
104412616 | Mar 2015 | CN |
2981097 | Feb 2016 | EP |
Entry |
---|
International Search Report and Written Opinion for International Application No. PCT/US2018/057480, dated Dec. 19, 2018, 13 pages. |
Number | Date | Country | |
---|---|---|---|
20190149913 A1 | May 2019 | US |
Number | Date | Country | |
---|---|---|---|
Parent | 15799021 | Oct 2017 | US |
Child | 16244636 | US |