This application relates to the field of acoustics processing technologies, and in particular, to a sound acquisition component array and a sound acquisition device.
Smart devices that support far-field speech interaction are usually equipped with sound acquisition components array to strengthen speech recognition performance. Therefore, a structure and a directional ability of a sound acquisition component array become important in a far-field speech interaction solution.
To take a product appearance design of a smart device into consideration, an oval sound acquisition component array formed by using eight sound acquisition components arranged in an oval shape is provided in the related art. In the eight sound acquisition components, two groups of sound acquisition components, each group including three sound acquisition components, are disposed at two sides of the x-axis of a rectangular coordinate system respectively, and the remaining two sound acquisition components are disposed on the x-axis. The eight sound acquisition components are symmetrical about the x-axis and the y-axis of the rectangular coordinate system and an overall structure of the eight sound acquisition components is of an elongated shape.
However, during processing of audio signals acquired by the oval sound acquisition component array in the related art, audio signals acquired by the eight sound acquisition components need to be processed, which results in a larger volume of to-be-processed data and affects processing efficiency.
A sound acquisition component array and a sound acquisition device are provided according to various embodiments of this application.
According to a first aspect of this application, a sound acquisition component array is provided, including two first sound acquisition components, two second sound acquisition components, and two third sound acquisition components,
According to a second aspect of this application, a sound acquisition component array is provided, including two first sound acquisition components, two second sound acquisition components, and two third sound acquisition components,
A sound acquisition device is provided, including the foregoing sound acquisition component arrays.
Details of one or more embodiments of this application are provided in the accompanying drawings and descriptions below. Other features, objectives, and advantages of this application become apparent from the specification, the accompanying drawings, and the claims.
To describe the technical solutions in the embodiments of this application more clearly, the accompanying drawings required for describing the embodiments are briefly described hereinafter. Apparently, the accompanying drawings in the following description show merely some embodiments of this application, and a person of ordinary skill in the art may obtain other accompanying drawings according to these accompanying drawings without creative efforts.
The following clearly and comprehensively describes the technical solutions in the embodiments of this application with reference to the accompanying drawings in the embodiments of this application. Apparently, the described embodiments are some embodiments of this application rather than all of the embodiments. All other embodiments obtained by a person of ordinary skill in the art based on the embodiments of this application without creative effects shall fall within the protection scope of this application. When the following descriptions are made with reference to the accompanying drawings, unless otherwise indicated, the same numbers in different accompanying drawings represent the same or similar elements.
With the popularity of smart speakers and derivatives thereof, speech interaction between humans and machines, especially far-field speech interaction, has gradually become an important man-machine interaction interface and is considered as the most important user traffic entrance in the future. A sound acquisition device provided with a sound acquisition component may acquire audio signals from surrounding space and process the audio signals in a pre-determined manner, to implement applications such as speech-based man-machine interaction.
According to different specific application scenarios, the sound acquisition device may also have different product forms. For example, the sound acquisition device may include but is not limited to at least one of a smart speaker, a smart television, a smart television set-top box, a smart robot, and a smart in-vehicle device.
For example,
With the development of application scenarios of sound acquisition and processing technologies, requirements on sound acquisition components are increasingly high. Currently, a sound acquisition component array formed by a plurality of sound acquisition components is proposed in the industry, so as to improve audio signal acquisition performance and support more functions. A sound acquisition component array taking both performance and product appearance into consideration is provided in the solutions shown in this application.
Before the solutions shown in this application are described, several terms in the solutions of this application are described first.
1. Sound Acquisition Component
In this application, a sound acquisition component refers to a hardware device component used for transforming a sound (waves generated by vibrations of an object) into an analog signal (electrical signal). In some embodiments, some sound acquisition components may further transform the obtained analog signal into a digital sampling signal.
The sound acquisition component may include a microphone, a pickup, a sound sensor, or the like according to various circuit structures.
2. Sound Acquisition Component Array
Generally, the sound acquisition component may acquire audio signals at only one point, and acquisition performance and functions that can be implemented are relatively limited. Therefore, to improve the performance and functions of sound acquisition, a solution in which a plurality of sound acquisition components are arranged at different spatial positions to form a sound acquisition component array is provided in the related art. Audio signals acquired by the plurality of sound acquisition components in the sound acquisition component array are centrally processed by using an audio signal processing chip, so that the performance of sound acquisition can be improved and new functions can be developed. For example, in a smart device having a speech recognition function, by using a plurality of sound acquisition component arrays formed by a plurality of sound acquisition components, a speech of a target user may be strengthened, noise in an environment may be suppressed, and a sound source direction may be positioned, thereby finally improving speech recognition performance in a speech interaction (especially far-field speech interaction) scenario.
In the related art, an annular array is a common array in sound acquisition component arrays.
{(xi=r·cos((i−1)*60°),yi=r·sin((i−1)*60°))|i=1,2, . . . ,6},
A steering vector of the sound acquisition component array is defined as a(θ, φ, f), and an expression of the a(θ, φ, f) is as follows:
In this case, an “array intrinsic lobe pattern” of the sound acquisition component array is defined as B(θ0, φ0, θ, φ, f), and an expression of the B(θ0, φ0, θ, φ, f) is as follows:
∥a(θ0,φ0,f)Ha(θ,φ,f)∥2/N2,0≤θ≤90, 0≤φ≤360,
To ensure that a direct sound transmission path between the sound acquisition component array and a target speaker is not blocked, the sound acquisition component array often needs to be arranged on a top surface or a front surface of a smart device. Therefore, the shape and an occupied area of the sound acquisition component array constitute a limitation on a product appearance and structure. By using an annular array widely used in products such as a smart speaker on the market currently as an example, an occupied area thereof is a circle with a radius of approximately 3.5 cm. Therefore, the appearance design of a smart speaker equipped with such a sound acquisition component array usually adopts a cylindrical shape or a shape similar to a cylinder, while the thickness of a hardware product cannot be reduced, and it is difficult to place such a hardware product in people's home.
To take a product appearance design of a square device into consideration, an oval array with eight sound acquisition components is further provided in the related art.
(x1,yi)=(dx,dy), (x2,y2)(0,dy), (x3,y3)=(−dx,dy),
(x4,y4)=(−2dx,0), (x5,y5)=(−dx,−dy), (x6,y6)=(0,−dy),
(x7,y7)=(dx,−dy), (x8,y8)=(2dx,0),
There is a small quantity of sound acquisition components in the annular array with six sound acquisition components shown in
Therefore, an array with six sound acquisition components structure that occupies an elongated area (such as a rectangle area or an oval area) is provided in this application. A sound acquisition component array of such a structure may be arranged on smart hardware having an elongated top or front surface, and can maintain a similar spatial distinguishing capability (especially in a main direction 270° used by a user).
The two second sound acquisition components 620 are located at a first side of a line connecting the two first sound acquisition components 610, and the two third sound acquisition components 630 are located at a second side of the connecting line opposite to the first side of the connecting line. The two second sound acquisition components 620 are symmetrical about a perpendicular bisector of the connecting line, and the two third sound acquisition components 630 are symmetrical about the perpendicular bisector. A distance between the two first sound acquisition components 610 is greater than a distance between the two second sound acquisition components 620, and the distance between the two first sound acquisition components 610 is greater than a distance between the two third sound acquisition components 630. The distance between the two second sound acquisition components 620 is different from the distance between the two third sound acquisition components 630.
In order to more intuitively describe a relative position relationship of the foregoing six sound acquisition components, a rectangular coordinate system is used as a reference in
The two second sound acquisition components 620 are located in a first quadrant and a second quadrant of the rectangular coordinate system respectively. A vertical distance between the second sound acquisition component 620 and the y-axis of the rectangular coordinate system is a second length, and a vertical distance between the second sound acquisition component 620 and the x-axis is a third length.
The two third sound acquisition components 630 are respectively located in a third quadrant and a fourth quadrant of the rectangular coordinate system. A vertical distance between the third sound acquisition component 630 and the y-axis of the rectangular coordinate system is a fourth length, and a vertical distance between the third sound acquisition component 630 and the x-axis is a fifth length.
The first length is greater than the second length, the first length is greater than the fourth length, and the second length is different from the fourth length.
The solution in this embodiment of this application provides an array with six sound acquisition components, which is symmetrical about a perpendicular bisector of a connecting line between two sound acquisition components, but is not symmetrical about the connecting line between the two sound acquisition components. The array with six sound acquisition components can adapt to an elongated appearance design extending along a direction of the connecting line between the two sound acquisition components. Moreover, the array with six sound acquisition components has fewer sound acquisition components compared with an array with eight sound acquisition components, and less data needs to be processed during audio signal processing, thereby improving the efficiency of audio signal processing while achieving adaptability to the elongated appearance design.
The two second sound acquisition components 720 are located at a first side of a line connecting the two first sound acquisition components 710, and the two third sound acquisition components 730 are located at a second side of the connecting line opposite to the first side of the connecting line. The two second sound acquisition components 720 are symmetrical about a perpendicular bisector of the connecting line, and the two third sound acquisition components 730 are symmetrical about the perpendicular bisector. A distance between the two first sound acquisition components 710 is greater than a distance between the two second sound acquisition components 720, and the distance between the two first sound acquisition components 710 is greater than a distance between the two third sound acquisition components 730.
The distance between the two second sound acquisition components 720 is different from the distance between the two third sound acquisition components 730.
In order to more intuitively describe a relative position relationship of the foregoing six sound acquisition components, a rectangular coordinate system is used as a reference in
The two first sound acquisition components 710 are located at two sides of an origin on the x-axis of the rectangular coordinate system respectively. A distance between the first sound acquisition component 710 and the y-axis of the rectangular coordinate system is a first length.
The two second sound acquisition components 720 are located in a first quadrant and a second quadrant of the rectangular coordinate system respectively. A vertical distance between the second sound acquisition component 720 and the y-axis of the rectangular coordinate system is a second length, and a vertical distance between the second sound acquisition component 720 and the x-axis is a third length.
The two third sound acquisition components 730 are located in a third quadrant and a fourth quadrant of the rectangular coordinate system respectively. A vertical distance between the third sound acquisition component 730 and the y-axis of the rectangular coordinate system is a fourth length, and a vertical distance between the third sound acquisition component 730 and the x-axis is a fifth length.
The first length is greater than the second length, the first length is greater than the fourth length, and the second length is different from the fourth length.
In this embodiment of this application, the distance between the two first sound acquisition components 710, the distance between the two second sound acquisition components 720, and the distance between the two third sound acquisition components 730 may follow a certain ratio.
For example, in a possible implementation, the distance between the two first sound acquisition components 710 is three times the distance between the two second sound acquisition components 720, and the distance between the two third sound acquisition components 730 is twice the distance between the two second sound acquisition components 720.
Accordingly, corresponding to the sound acquisition component array disposed according to the rectangular coordinate system shown in
For example, in another possible implementation, a ratio of the distance between the two first sound acquisition components 710 to the distance between the two second sound acquisition components 720 and/or a ratio of the distance between the two third sound acquisition components 730 to the distance between the two second sound acquisition components 720 may be other values. For example, the distance between the two first sound acquisition components 710 may be 2.8 times or 3.2 times the distance between the two second sound acquisition components 720, and the distance between the two third sound acquisition components 730 may be 1.8 times or 2.2 times the distance between the two second sound acquisition components 720.
In this embodiment of this application, a vertical distance between the second sound acquisition component 720 and a connecting line of the two first sound acquisition components 710 and a vertical distance between the third sound acquisition component 730 and the connecting line may follow a certain proportional relation.
For example, in a possible implementation, the vertical distance between the second sound acquisition component 720 and the connecting line of the two first sound acquisition components 710 is the same as the vertical distance between the third sound acquisition component 730 and the connecting line.
Accordingly, corresponding to the sound acquisition component array disposed according to the rectangular coordinate system shown in
Alternatively, in another possible implementation, the vertical distance between the second sound acquisition component 720 and the connecting line of the two first sound acquisition components 710 may be different from the vertical distance between the third sound acquisition component 730 and the connecting line. For example, corresponding to the sound acquisition component array disposed according to the rectangular coordinate system shown in
For example, the sound acquisition component is a microphone (mic), the first length is three times the second length, the fourth length is twice the second length, and the third length is the same as the fifth length. The array shown in
(x1,y1)=(3dx,0), (x2,y2)=(dx,dy), (x3,y3)=(−dx,d dy), and
(x4,y4)=(−3dx,0), (x5,y5)=(−2dx,−dy), (x6,y6)=(2dx,−dy),
In some embodiments, a ratio of the distance between the two second sound acquisition components 720 to the vertical distance between the second sound acquisition component 720 and the connecting line (that is, the connecting line between the two first sound acquisition components 710) is 5:2, that is, a ratio of the second length to the third length is 5:4.
For example, in a possible implementation, the second length is 1.5 cm, and the third length is 1.2 cm.
In some embodiments, the sound acquisition component may be a microphone assembly or a pickup assembly.
In some embodiments, the six sound acquisition components are located in the same plane.
In this embodiment of this application, to achieve a better audio signal acquisition effect and reduce the complexity of audio signal processing, the six sound acquisition components shown in
Using the sound acquisition component being a microphone as an example, it can be learned from a comparison of the intrinsic lobe patterns of
In view of this, the asymmetric array with six mics shown in this application is more adaptive to layout in a narrow plane than the annular array with six mics, and can support a more flexible appearance design of a smart hardware product. In addition, by using fewer microphones than that in the oval array with eight mics, hardware costs and calculation complexity are reduced. Moreover, better spatial separation performance is obtained around a main direction (270°) used by a user.
Based on the above, the solution in this embodiment of this application provides an array with six sound acquisition components, which is symmetrical about a perpendicular bisector of a connecting line between two sound acquisition components, but is not symmetrical about the connecting line between the two sound acquisition components. The array with six sound acquisition components can adapt to an elongated appearance design extending along a direction of the connecting line between the two sound acquisition components. In this case, the array with six sound acquisition components has fewer sound acquisition components compared with an array with eight sound acquisition components, and less data needs to be processed during audio signal processing, thereby improving the efficiency of audio signal processing while achieving better adaptability to the elongated appearance design.
In another exemplary embodiment of this application, a sound acquisition device is further provided and includes the foregoing sound acquisition component array shown in
In some embodiments, the sound acquisition component array is horizontally disposed on a top surface of the sound acquisition device or is vertically disposed on a front surface of the sound acquisition device.
The top surface is an outer surface facing upward when the sound acquisition device is placed according to a designated pose, and the front surface is a designated outer surface among outer surfaces facing in a horizontal direction when the sound acquisition device is placed according to a designated pose.
The designated pose is an installation or placement pose of the sound acquisition device for normal use according to a design requirement.
For example, the designated pose may be an installation or placement pose of the sound acquisition device according to instructions. In another example, the designated pose may be an installation or placement pose of the sound acquisition device according to instructions of a device operating manual.
Alternatively, the designated pose may be a determined installation or a placement pose according to an installation/placement instruction assembly (for example, a support frame, an anti-slip mat, and mounting holes reserved for wall mount components) in the sound acquisition device. For example, when a surface of the sound acquisition device is provided with a support frame or an anti-slip mat, the designated pose is a pose in which the surface where the support frame or the anti-slip mat is located is vertically downward; or when a mounting hole reserved for a wall mount component is provided on a surface of the sound acquisition device, the designated pose is a pose in which the surface where the mounting hole is located is perpendicular to a horizontal plane.
In some embodiments, when the sound acquisition component array is horizontally disposed on a top surface of the sound acquisition device, a first direction pointed to by a perpendicular bisector of the connecting line between the two first sound acquisition components is the same as or opposite to a second direction that a front surface of the sound acquisition device faces towards.
In this embodiment of this application, the sound acquisition device may be a smart device having a long elongated top surface. To achieve the best sound acquisition effect, in such a smart device, a direction of a symmetry axis of the oval array with six sound acquisition components (that is, the vertical coordinate direction of the rectangular coordinate system corresponding to the sound acquisition component array shown in
For example, in
Alternatively, in
Alternatively, when the sound acquisition component array is vertically disposed on the front surface of the sound acquisition device, a third direction pointed to by the perpendicular bisector of the connecting line between the two first sound acquisition components is a vertical upward direction or a vertical downward direction.
For example, in
For example, in
Other embodiments of this application will be apparent to a person skilled in the art from consideration of the specification and practice of the disclosure here. This application is intended to cover any variations, uses or adaptive changes of this application. Such variations, uses or adaptive changes follow the general principles of this application, and include well-known knowledge and conventional technical means in the art that are not disclosed in this application. The specification and the embodiments are considered as merely exemplary, and the scope and spirit of this application are pointed out in the following claims.
It is to be understood that this application is not limited to the precise structures described above and shown in the accompanying drawings, and various modifications and changes can be made without departing from the scope of this application. The scope of this application is described by the appended claims.
The foregoing descriptions are merely exemplary embodiments of this application, but are not intended to limit this application. Any modification, equivalent replacement, or improvement made within the spirit and principle of this application shall fall within the protection scope of this application.
Number | Date | Country | Kind |
---|---|---|---|
201811610594.2 | Dec 2018 | CN | national |
This application is a continuation application of PCT Patent Application No. PCT/CN2019/128338, entitled “SOUND COLLECTION ASSEMBLY ARRAY AND SOUND COLLECTION DEVICE” filed on Dec. 25, 2019, which claims priority to Chinese Patent Application No. 201811610594.2, filed with the State Intellectual Property Office of the People's Republic of China on Dec. 27, 2018, and entitled “SOUND ACQUISITION COMPONENT ARRAY AND SOUND ACQUISITION DEVICE”, all of which are incorporated herein by reference in their entirety.
Number | Name | Date | Kind |
---|---|---|---|
9294860 | Carlson | Mar 2016 | B1 |
20120263333 | Akino | Oct 2012 | A1 |
20130201162 | Cavilia | Aug 2013 | A1 |
20140219472 | Huang et al. | Aug 2014 | A1 |
20170094223 | Burenius | Mar 2017 | A1 |
20180182392 | Li | Jun 2018 | A1 |
20190132672 | Dick | May 2019 | A1 |
Number | Date | Country |
---|---|---|
106723736 | May 2017 | CN |
107943449 | Apr 2018 | CN |
108156559 | Jun 2018 | CN |
207495509 | Jun 2018 | CN |
108471561 | Aug 2018 | CN |
108632653 | Oct 2018 | CN |
109660918 | Apr 2019 | CN |
Entry |
---|
Tencent Technology, ISR, PCT/CN2019/128338, Mar. 25, 2020, 2 pgs. |
Zhongguancun Online, “8-Inch HD Display Tencent Dingdong Smartscreen First Evaluation”, Dec. 18, 2018, 9 pgs., Retrieved on the Internet: https://baijiahao.baidu.com/s?id=1620183630263608564&wfr=spider&for=pc. |
Tencent Technology, WO, PCT/CN2019/128338, Mar. 25, 2020, 4 pgs. |
Tencent Technology, IPRP, PCT/CN2019/128338, Jun. 16, 2021, 5 pgs. |
Number | Date | Country | |
---|---|---|---|
20210266664 A1 | Aug 2021 | US |
Number | Date | Country | |
---|---|---|---|
Parent | PCT/CN2019/128338 | Dec 2019 | US |
Child | 17319024 | US |