1. Field of the Invention
The present invention relates to a method for producing an optimum sound field of loudspeakers, especially to a method for producing an optimum sound field of loudspeakers in which reference points are set around a required sound field to get audio signals and related gradient on the boundary and an optimization method is used to calculate output signals of each loudspeaker for producing a sound field required. Thus the listeners can still get best sounds even they turn their heads or move around.
2. Description of Related Art
The conventional stereophonic recording methods were firstly developed by Blumlein. Later, an integrated and multi-level environment such as Dolby Digital or digital theater system (DTS) surrounding technology is produced by a plurality set of loudspeakers so as to pursue a virtual reality stereo system and a better acoustic environment. Generally, stereophonic sound can be achieved by using at least two audio channels, through a configuration of several loudspeakers, to provide sound heard from various directions, as in natural hearing. In theaters or cinemas, a plurality of loudspeakers is placed around a performance space to surround the audience, allowing the audience to hear sounds coming from all around them. This is wave field synthesis (WFS) technique that creates virtual acoustic environments by a large number of loudspeakers.
Moreover, due to fast progress of technology and higher standard of living, people have higher requirements for electronic devices that display images and sounds. They concern not only the image quality, but also the quality of the sound field. However, the surround loudspeaker distribution formed by placement of a plurality of loudspeakers is of high cost. In a confined or narrow area such as parlors or rooms, the optimal placement of the loudspeakers may not easily be achieved. In 1992, Dolby laboratory has developed surround sound multichannel audio system 5.1 by using digital audio encoding technologies. The 5.1 surround sound system is the most common in home theaters. The 5.1 surround sound speaker system includes five speakers-two left and right front speakers, two left and right rear speakers, a quality center speaker, and a powered subwoofer for deep, rumbling bass tones. Except the subwoofer, other speakers are highly directional so as to create a surround sound field. When users are in such sound field, they have an optimum surround sound experience at home as in the theater. In order to provide optimum sound quality, the placement of the speakers needs to meet certain requirements of angles and distances to the listener. However, these requirements are often at odds with the space constraints such as a limited space in vehicles or a compact room arrangement. Thus the best placement of the speakers is unable to be achieved and users can't get an even, all-surrounding audio experience.
A technique that reproduces optimum stereo sound by small-scale loudspeaker array units has been developed. It's based on the ability of humans to localize sound sources by two binaural cues-interaural time differences of arrival and interaural intensity differences. The time difference is defined as the difference in arrival time of a sound at the left and right ears while the intensity difference is the amplitude difference generated between the right and left ears. A head related transfer function (HRTF) is used to simulate surround sound playback from loudspeakers fixed at certain positions. By adjusting audio frequency and delay of HRTF, stereo sounds are simulated and played by the fixed loudspeakers. The conventional HRTF describes time and amplitude differences of binaural response for a listener. The differences are generated due to anatomical structure of the listener's head and pinnae and are used to detect the position of the sound source. The technique by using binaural time differences and amplitude differences, together with reverberation, chorus to have auditory perception is called duplex theory. Refer to
Thus there is a room for improvement and a need to provide a novel method for producing a sound field.
The primary object of the present invention is to provide a method for producing an optimum sound field by loudspeakers in which reference points are set up around a sound field to get audio signals and related gradient on boundary of the sound field. An optimization method is used to calculate output signals of each loudspeaker for producing the sound field required. Thus the listeners can still get best sounds even when they turn their heads a bit or move their bodies.
In order to achieve the above object, a method for producing an optimum sound field of loudspeakers of the present invention includes a plurality of steps. Firstly, enclose a surface of a human head region with a first closed geometric shape. The first closed geometric shape can be a triangle, a square, a circle, an ellipse, etc. Then enclose the first closed geometric shape with at least one second closed geometric shape, outside the human head region. The first closed geometric shape and the second closed geometric shape are corresponding to each other with similar geometric shape. Next set up a plurality of reference points on the first closed geometric shape and on the second closed geometric shape respectively. The position of each reference point on the first closed geometric shape is corresponding to the position of each reference point on the second closed geometric shape. Later, use signal strength of the reference point on the second closed geometric shape, and a gradient of the signal strength of the reference points obtained from the corresponding reference points on the first and the second closed geometric shapes to obtain a transfer function from a virtual loudspeaker/a real loudspeaker to each point of the sound field respectively. Then create a virtual sound field and a real sound field according to the transfer function of the virtual loudspeaker and the transfer function of the real loudspeaker respectively. At last, obtain an optimum signal of the real loudspeaker by minimizing an error between the virtual sound field and the real sound field with a boundary condition. In the method mentioned above, the method for obtaining an optimum signal of the real loudspeaker includes a pseudo-inverse matrix method.
In the method mentioned above, the boundary condition is that the convolution of the transfer function from the real loudspeaker to each point of the sound field and sound source signal of the real loudspeaker is equal to the real sound field.
In the method mentioned above, the number of the real loudspeakers is smaller than the number of the reference points.
In the method mentioned above, the intervals between two adjacent reference points are all the same.
In the method mentioned above, the first closed geometric shape and the second closed geometric shape are preferred to be concentric circles with different radii.
Thereby a virtual loudspeaker is synthesized within a limited range around the listener's head and is in a desired direction. Thus no matter the listener's head is turned or the body is shifted or moved a bit but the head is still within a certain range, the direction of the virtual loudspeaker the listener perceived remains the same
Moreover, there is no need to place a plurality of loudspeakers on the boundary for producing the sound field required. Thus not only the problem of the placement of loudspeakers restricted by limited space at home can be solved, the huge cost involved in buying the loudspeakers is also saved. Therefore users have a higher acceptance and usage of the present invention and the present invention can be widely adopted.
The structure and the technical means adopted by the present invention to achieve the above and other objects can be best understood by referring to the following detailed description of the preferred embodiments and the accompanying drawings, wherein
Refer to
Step one (S1): enclosing a surface of a human head region with a first closed geometric shape 1. The first closed geometric shape 1 can be a triangle, a square, a circle, an ellipse, etc. In this embodiment, the first closed geometric shape 1 is a circle enclosing the head region. Also refer to
Step two (S2): enclosing the first closed geometric shape 1 with at least one second closed geometric shape 2, outside the human head region. The first closed geometric shape 1 and the second closed geometric shape 2 are corresponding to each other with similar geometric shape. In this embodiment, the first closed geometric shape 1 and the second closed geometric shape 2 are concentric circles with different radii.
Step three (S3): setting up a plurality of reference points P on the first closed geometric shape 1 and on the second closed geometric shape 2 respectively. The position of each reference point P on the first closed geometric shape 1 is corresponding to the position of each reference point P on the second closed geometric shape 2.
Step four (S4): using signal strength of the reference point P on the second closed geometric shape 2 and a gradient of the reference points P corresponding to each other on the first and the second closed geometric shapes 1, 2 to form a transfer function from a virtual loudspeaker/a real loudspeaker to each point of a sound field respectively;
Step five (S5): creating a virtual sound field and a real sound field according to the transfer function of the virtual loudspeaker and the transfer function of the real loudspeaker respectively; and
Step six (S6): obtaining an optimum signal of the real loudspeaker by minimizing an error between the virtual sound field and the real sound field with a boundary condition.
Moreover, the number of the real loudspeakers is smaller than the number of the reference points P. The following embodiment further reveals application range of the present invention but not intended to limit the present invention.
The audio signals enclosing the surface of the human head region and their gradients represent the sound people can hear. Thus there are three circles in this embodiment. One is the first closed geometric shape 1 while the rest two are the second closed geometric shapes 2. The gradient of the middle circle is computed by finite-difference approximations. Moreover, due to the ability to be discretized, several reference points P around the human head are taken to perform approximation. The radius of the first closed geometric shape 1 is ranging from 6 to 8.5 centimeters (cm) while the radius of the two second closed geometric shapes 2 are respectively between 9.5-13 cm and 11.5-18 cm. The number of the reference point P is ranging from 12 to 36. In this embodiment, the first closed geometric shape 1 whose radius is 7.5 cm is enclosing the human head region while one second closed geometric shape 2 whose radius is 12.5 cm is around the first closed geometric shape 1 and the other second closed geometric shape 2 whose radius is 17.5 cm is enclosing the above second closed geometric shape 2. Moreover, the first closed geometric shape 1 and each second closed geometric shape 2 are respectively set up 24 reference points P. The interval between two adjacent reference points P is the same as that of any other two adjacent reference points P, a 15 degree interval. The parameters mentioned above are of only an embodiment of the present invention, not intended to limit the present invention. People skilled the filed know that the radius of the first closed geometric shape 1, the radius of the second closed geometric shape 2, and the number of the reference points P can be modified.
The following mathematic equation is used to represent whole process of producing an optimum sound field of loudspeakers. It is assumed that a sound source signal of a virtual sound field SF, is represented by v. A transfer function from a virtual loudspeaker to each point of a sound field is represented by G. A sound source signal of a real sound field SF, is represented by O. A transfer function from a real loudspeaker to each point of the sound field is represented by H, M is the number of the virtual loudspeaker. N is the number of the real loudspeaker, and L is the number of the reference point P. Thus the virtual sound field and the real sound field can be respectively represented by SFv=vG and SFr=oH. The vector hereof vεR1×M and oεR1×N respectively represent an input signal of the virtual loudspeaker and an input signal of the real loudspeaker while the matrices GεRM×L and HεRN×L respectively represent the transfer function from the virtual loudspeaker to each point of the sound field, and the transfer function from the real loudspeaker to each point of the sound field. Since audio signals in a sound field having no noise can be determined by signals at the boundary and their gradients, the transfer function from the virtual loudspeaker/real loudspeaker to each point of the sound field is formed by the signal strength and gradient of each reference point P on the middle circle in this embodiment. The functions of v, o, G, H are represented by the followings:
wherein G1 and H1 are respectively the transfer function of the signal strength from the virtual loudspeaker to each point of the sound field and the transfer function of the signal strength from the real loudspeaker to each point of the sound field; G1(l,r) and H1(l,r) represent respective elements. G2 and H2 are respectively the transfer function of the gradient from the virtual loudspeaker to each point of the sound field and the transfer function of the gradient from real loudspeaker to each point of the sound field; G2(l,r) and H2(l,r) represent respective elements; lv is the location of the virtual loudspeaker; lr is the location of the real loudspeaker; bεR1×M and eεR1×N respectively represent vector magnitude of the virtual loudspeaker and vector magnitude of the real loudspeaker; φ is phase difference required for sound field reconstruction of each real loudspeaker; c is sound speed, xl
Find a solution by minimizing the real sound field and the virtual sound field.
The inverse matrix of H is unable to get directly since H is not square. Thus pseudo-inverse matrix H+ is used to get the least square error solution. The singular value decomposition of H is used to construct the pseudo-inverse matrix as the following:
HN×L=UN×NΣN×LVL×LT·
Thus, the pseudo-inverse matrix H+ is obtained.
Both sides of the constraint are multiplied by conjugate transpose matrix of H as the following: oH
o=vG
An optimal sound source signal of the real sound field is obtained: o*=vA, wherein the matrix A is represented as:
A=GHT(HHT)−1
Thereby we can observe the time differences and the strength to differences of the real loudspeaker that can produce a sound field similar to the virtual sound field.
At last, signals of the real loudspeaker are obtained which minimizes the square error:
error=∥SFr−SFv∥22.
In summary, the virtual sound source signal is substituted into the transfer function from the virtual loudspeaker to each point of the sound field to form the virtual sound field while the sound source signal of the real sound field (signal of the real loudspeaker) is obtained by the virtual sound field and opposite transfer function from the real loudspeaker to each point of the sound field. The signal of the real loudspeaker is calculated by the minimization of the error between the real sound field and the virtual sound field around the listener. Thus the optimum sound field can be obtained in the least square error sense under any conditions. Thereby the listener can get an optimal sound effect within a sound field produced by loudspeakers even they slightly turn their heads or shift their bodies. Moreover, the present invention can be applied to home theaters or car audio systems. The loudspeakers can produce a sound field around the listener and generate a sound signal required.
Refer to
In accordance with the above embodiments, it is learned that the present invention has following advantages
Additional advantages and modifications will readily occur to those skilled in the art. Therefore, the invention in its broader aspects is not limited to the specific details, and representative devices shown and described herein. Accordingly, various modifications may be made without departing from the spirit or scope of the general inventive concept as defined by the appended claims and their equivalent.
Number | Date | Country | Kind |
---|---|---|---|
101125003 A | Jul 2012 | TW | national |
Number | Name | Date | Kind |
---|---|---|---|
3824342 | Christensen et al. | Jul 1974 | A |
4119798 | Iwahara | Oct 1978 | A |
5862227 | Orduna-Bustamante et al. | Jan 1999 | A |
8638946 | Mahabub | Jan 2014 | B1 |
20080056517 | Algazi et al. | Mar 2008 | A1 |
20110249825 | Ise | Oct 2011 | A1 |
20110261973 | Nelson et al. | Oct 2011 | A1 |
20110293108 | Mejia et al. | Dec 2011 | A1 |
20120020485 | Visser et al. | Jan 2012 | A1 |
20120288124 | Fejzo et al. | Nov 2012 | A1 |
Number | Date | Country | |
---|---|---|---|
20140016801 A1 | Jan 2014 | US |