The invention relates to sound reproduction systems and more specifically to the reproduction of sound in two sound zones within a listening domain.
In today's media-driven society, there are ever more ways for users to access audio, with a plethora of products producing sound in the home, car or almost any other environment. Potential audio programmes include a large variety of music, speech, sound effects and combinations of the three. It is also increasingly common for products producing audio to be portable. This wide range of increasingly portable products which produce audio coupled with the ubiquity of audio in almost all facets of society naturally leads to an increase in situations in which there is some degree of audio-on-audio interference.
Examples of such situations might include audio produced by a laptop computer in a room with a television; a mobile phone conversation whilst a car radio is on, or in the presence of piped music in a shopping centre; or competing workstations in an office environment. It is therefore of interest in a number of areas, within the audio industry and beyond, to evaluate the perceived effect of audio interference upon a target audio programme.
A system for reproduction of different sound signals in a plurality of independent sound zones is described in GB 2472092 A. However, contrary to the method and system according to the present invention, the system described in this document uses loudspeakers placed in or adjacent each different zones. Furthermore the system divides the total frequency band into a high frequency band and a low frequency band and directs the high frequency components into the appropriate zone by using a directional loudspeaker array, whereas the amplitude, phase and delay of the low frequency components are adjusted according to the specific sound zone.
On the above background, it is an object of the invention to implement methods in audio renderings systems that are enabled to eliminate the undesired interference among sound zones identified in a listing domain. According to the invention this can be achieved by a traditional setup of loudspeakers, i.e. the present invention does not require that the loudspeakers be placed in or adjacent to each different sound zone.
The invention includes a control system configured to adjust primary parameters, like amplification, filtering, and delay of the individual sound rendering systems present in the listening area and alternatively or supplemental to preprocess the audio signal programme and thereby to obtain a predefined “threshold of acceptability” for an interfering audio programme.
The invention is based on research results documented in the following document, which is hereby incorporated by reference:
Audio Engineering Society—Convention Paper Presented at the 132nd Convention 2012 Apr. 26-29 Budapest, Hungary
“Determining the Threshold of Acceptability for an Interfering Audio Programme”,
In the above document there is described an experiment that was performed in order to establish the threshold of acceptability for an interfering audio programme on a target audio programme, varying the following physical parameters: target programme, interferer programme, interferer location, interferer spectrum, and road noise level. Factors were varied in three levels in a Box-Behnken fractional factorial design. The experiment was performed in three scenarios: information gathering, entertainment, and reading/working. Nine listeners performed a method of adjustment task to determine the threshold values. Produced thresholds were similar in the information and entertainment scenarios, however there were significant differences between subjects, and factor levels also had a significant effect: interferer programme was the most important factor across the three scenarios, whilst interferer location was the least important.
More specifically the invention addresses the problem a user has when listening to a target programme in one sound zone and is annoyed by an interfering sound coming from another source, appearing randomly or continuously, this sound perceived as noise by the user.
The methods applied for creating and controlling virtual sound zones are disclosed in a patent from the applicant U.S. Pat. No. 7,813,933: “Method and Apparatus for Multichannel Upmixing and Downmixing” which is hereby incorporated by reference.
According to a first aspect, the present invention relates to a method for the reproduction of multi-channel sound signals in virtual sound zones, where the method comprises the following steps:
(i) providing one or more sound rendering systems comprising one or more sound emitting transducers, amplifier means, filtering means and delay means, which means are controllable by external control signals, and microphone means;
(ii) providing system controller means configured to provide said control signals for said one or more sound rendering systems;
(iii) providing means for defining one or more sound zones that are perceived as different sound areas by human listeners;
(iv) based on said definitions of sound zones, controlling said amplifier means, filter means and delay means such that said sound emitting transducers produce said different sound zones;
where the gain of each respective amplifier means is chosen such that the resultant sound pressure level in said first sound zone is at least equal to the sound pressure level in the first zone produced by the total acoustic output from the second sound zone plus an acceptance factor that is generally a function of at least a mode of operation of a listener in the first sound zone and the interferer programme, interferer location and interferer spectrum.
According to a specific embodiment of the invention, the acceptance factor is furthermore a function road noise, which yields the inventive method particularly applicable for sound reproduction in the cabin of a vehicle.
According to a second aspect, the invention relates to a system for the reproduction of multichannel sound signals in virtual sound zones, wherein the system comprises
(a) A system controller enabled to receiving multichannel sound signals;
(b) where the system controller is enabled to provide sound signals and control data to one or more sound rendering systems, such as a number of loudspeakers as for instance in a standard 2.0 or 2.1 stereophonic or 5.0 or 5.1 surround sound system;
(c) where at least one of the one or more sound rendering systems includes one or more active sound transducers, each including control of amplifier—, filtering and delay means and microphone means;
(d) where the system controller is enabled to configure and control a first sound zone and a second sound zone, which two sound zones are being perceived as two different sound areas by listeners;
(e) where the system controller configures each of the individual sound rendering systems so that a specific sound isolation is obtained between the first- and the second sound zone.
A system where the sound isolation between the first- and the second sound zone is characterized as a level of interference from an audio programme provided in the second zone to an active listener in the first zone.
The term “threshold of acceptability” is important to note, and it is point where the listener is happy with the situation, or the interferer is ‘no longer annoying’. In an informal listening test, this task seemed much more natural than trying to quantify the extent of the annoyance experienced. In addition the task being performed by the user has a pronounced effect on the acceptability threshold.
It has been found that a number of variable parameters like target programme material, interferer programme material, spectrum of program and interferer material and the location of the sound zones, has an effect on the experience of listening to the target audio in the presence of the interfering audio.
Thus, in the method and system of the present invention, the sound isolation parameters are based on the findings done in the above mentioned study, i.e. the parameters for “the threshold of acceptability”, which basically is the dB level the “interfering/noise signal” must be suppressed, with reference to the target programme.
As audio-on-audio interference is a relatively novel research area, there is little in the way of research looking into the acceptable level of interfering audio. The current invention applies the actual findings into methods and practical operational functionalities by introducing modes of operations and factors having different impact in the alternative modes of operation. The modes of operation include: a range of scenarios, programme material and other parameters that may affect the situation, and combines:
The modes of operation include use scenarios, where the scenarios reflect realistic tasks that people may carry out in the presence of an interfering audio programme:
Thus, an aspect of the invention is a system where the listener in the first zone is active in alternative modes of operation by listening to a target programme:
and with each mode having individual values of sound isolation to obtain specific threshold of acceptability for an interfering audio programme provided in the second zone.
The modes of operation may include user subjects, thus different individual may react different on an interfering sound, this reaction being dependent on gender, age, gender, experience, education and alike.
The modes of operation include influencing factors at the target programme:
and with each mode having individual values of sound isolation to obtain specific threshold of acceptability for the interfering audio programme perceived in the first zone.
The mode of operation include influencing factors related to the interfering programme and is the same in all scenarios, as interference could potentially come from any source regardless of the target task:
The data identified as the Information Scenario main effect are listed below.
The effect of all of the factors is fairly intuitive: speech-on-speech interference has a lower threshold of acceptability than music-on-speech; low-pass filtering increases the threshold (possibly because of a decrease in sibilance or transients); adding road noise increases acceptability (presumably as the interferer becomes more masked); and sports commentary targets produce a slightly higher threshold (possibly due to the consistent crowd noise).The target programme was included as independent variable (three levels: male news speech, sports commentary, female news speech).The effect of location was less pronounced.
For the influence of factors in Information Scenario, the difference in acceptability threshold between the conditions producing the highest and lowest thresholds for each factor, detailing the factor levels producing the extreme threshold values, are listed below.
The ‘Difference’ indicates the difference in dB between the levels producing the highest and lowest thresholds.
The data identified as the Entertainment Scenario main effect are listed below, and illustrates the error bar plots for the most influential factors.
For the influence of factors in Entertainment Scenario, the difference in acceptability threshold between the conditions producing the highest and lowest thresholds for each factor, detailing the factor levels producing the extreme threshold values, are listed below.
It can be seen that the interferer programme is again the most influential factor with a difference of 8 dB between the highest and lowest thresholds; the factor levels producing the highest and lowest threshold are the same as in the information task. Target programme has a larger effect in the entertainment task; this could be attributed to the nature of the programme material used in this scenario, with vocal pop music more heavily compressed and therefore masking the interfering programme more consistently. The magnitude of the effect of road noise is similar to that in the information scenario and that of spectrum slightly lower. Again, interferer location had the smallest effect on threshold.
The ‘Difference’ indicates the difference in dB between the levels producing the highest and lowest thresholds.
The data identified as the Reading/Working Scenario main effect are listed below, and illustrates the error bar plots for the four influential factors.
The order of importance of the factors is somewhat different to the previous scenarios, and the magnitude of the important effects is much larger. Introducing road noise at 70 mph increases the threshold of acceptability by approximately 19 dB; this can be attributed to the extra masking provided by the road noise when there is no target programme. The magnitude of the effect of interferer programme is similarly inflated to 15 dB, with the same programme items as in the previous scenarios producing the lowest and highest thresholds. The interferer spectrum and location have similar effects to the information and entertainment scenarios.
For the influence of factors in Reading/Working Scenario, the difference in acceptability threshold between the conditions producing the highest and lowest thresholds for each factor, detailing the factor levels producing the extreme threshold values, are listed below.
The ‘Difference’ indicates the difference in dB between the levels producing the highest and lowest thresholds.
Conclusively the disclosed experimental data for the threshold of acceptability is derived with the 50% and 95% acceptable points for each scenario as displayed below:
These results provide useful information as to the level of audio interference which may be considered acceptable during the performance of certain tasks.
There are pronounced differences between subjects: inexperienced listeners produced median threshold values between 10 dB and 18 dB above those of experienced listeners. Some of these differences were attributed to a different understanding of the task between subjects. At the same time, some of these differences may be attributed to personal differences between listeners (e.g. temperament, mood, prior experience etc.).
The effect of physical parameters is somewhat determined by the task, and is seemingly heavily influenced by the target programme. In the reading/working scenario, there is up to 19 dB difference between thresholds produced at different levels of road noise and for different interferer programmes. The effect of each factor is less pronounced in the information and entertainment scenarios, with the most influential parameters being interferer programme (approximately 8 dB between the means for the highest and lowest threshold groups). In conclusion, it seems that interferer programme has the greatest effect on threshold, followed by road noise level, spectrum and target programme which are more or less important depending on scenario. Interferer location was found to be the least influential parameter in all cases.
A user selected and activated target programme as provided in the virtual first sound zone (2) and delivering a certain sound pressure level SPL(t).
An interferer active programme may be another sound source provided in the second sound zone (3) and delivering a certain sound pressure level SPL(i) in the first sound zone (2).
In modes in which the interferer pressure level may be controlled by the system controller (4) the virtual second sound zone is adjusted accordingly to accommodate to the pre-defined threshold of acceptability parameter values:
Thus according to an embodiment of the invention:
The values of the sound isolation related to experienced users are typically:
Alternatively or supplemental to adjusting the SPL to an acceptable level, it may be possible to change the signal by increasing the level at which the interference is acceptable.
In a preferred embodiment of the invention, the variables for modes of operation, scenario and factors and isolation are enumerated in a constraint domain table including all legal combinations of the defined variables; the table to be processed by a constraint solver to find the actual parameters settings for amplifiers, filters, and delays related to the addressed sound zone.
The constraint solver processing enables an arbitrary access mode to information with no order of sequences required.
According to the invention, the constraint solver domain table is organized as relations among variables in the general mathematical notation of ‘Disjunctive Form’: Variable 1.1 and Variable 1.2 and Variable 1.3 and Variable 1.n
Or Variable 2.1 and Variable 2.2 and Variable 2.3 and Variable 2.n
Or . . .
Or . . .
Or Variable m.1 and Variable m.2 and Variable m.3 and Variable m.n
An alternatively definition term is the ‘Conjunctive Form’:
Variable 1.1 or Variable 1.2 or Variable 1.3 or Variable 1.n
And Variable 2.1 or Variable 2.2 or Variable 2.3 or Variable 2.n
And . . .
And . . .
And Variable m.1 or Variable m.2 or Variable m.3 or Variable m.n
With this method of defining the problem/solution domain, it becomes a multi-dimensional state space enabling equal and direct access to any point in the defined set of solutions.
The present invention addresses an area with a wide range of applications and may be applied to any system which aims to mitigate the effects of audio-on-audio interference, for example, noise-cancellation systems or source separation algorithms.
Number | Date | Country | Kind |
---|---|---|---|
2012 00167 | Mar 2012 | DK | national |
Number | Name | Date | Kind |
---|---|---|---|
7813933 | Martin | Oct 2010 | B2 |
8111836 | Graber | Feb 2012 | B1 |
20030002680 | Akiyama | Jan 2003 | A1 |
20040223620 | Horbach | Nov 2004 | A1 |
20060034467 | Sleboda | Feb 2006 | A1 |
20060262935 | Goose | Nov 2006 | A1 |
20070189549 | Scheel | Aug 2007 | A1 |
20080101620 | Horbach | May 2008 | A1 |
20100076793 | Goldstein et al. | Mar 2010 | A1 |
20100124337 | Wertz | May 2010 | A1 |
20100135503 | Shin et al. | Jun 2010 | A1 |
20100284544 | Kim et al. | Nov 2010 | A1 |
20100290635 | Shridhar | Nov 2010 | A1 |
20100329488 | Holub | Dec 2010 | A1 |
20120020486 | Fried | Jan 2012 | A1 |
20120140945 | Harris | Jun 2012 | A1 |
20120327304 | Kashi | Dec 2012 | A1 |
20140014525 | Smith | Jan 2014 | A1 |
20150100991 | Risberg | Apr 2015 | A1 |
Number | Date | Country |
---|---|---|
05-344584 | Dec 1993 | JP |
Entry |
---|
Danish Search Report for DK PA 2012 00167 dated Sep. 18, 2012. |
Terence Betlehem and Paul D. Teal: “A constrained optimization approach for multi-zone surround sound”, International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2011, IEEE, May 22, 2011, pp. 437-440. |
Wu Y. J. and Abhayapala T. D.: “Multizone 2D soundfield reproduction via spatial band stop filters”, Workshop on Applications of Signal Processing to Audio and Acoustics, WASPAA '09, IEEE, Nov. 18, 2009, Piscataway, NJ, USA, pp. 309-312. |
Yan Jennifer Wu and Thushara D. Abhayapala: “Spatial Multizone Soundfield Reproduction: Theory and Design”, IEEE Transactions on Audio, Speech and Language Processing, Aug. 1, 2011, IEEE Service Center, New York, NY, USA, vol. 19, No. 6, pp. 1711-1720. |
Number | Date | Country | |
---|---|---|---|
20130230175 A1 | Sep 2013 | US |