Typically, acoustic scenes are recorded using a set of microphones. Each microphone outputs a microphone signal. For example, 25 microphones may be used for an audio scene of an orchestra. A sound engineer then mixes the 25 microphone output signals, e.g., into a standard format such as a stereo format, a 5.1 format, a 7.1 format, a 7.2 format, or any other corresponding format. In case of a stereo format, e.g., the sound engineer or an automatic mixing process generates two stereo channels. In the case of a 5.1 format, mixing results in five channels and one subwoofer channel. Analogously, in case of a 7.2 format, e.g., mixing results in seven channels and two subwoofer channels. If the audio scene is to be rendered in a reproduction environment, a mixing result is applied to electrodynamic loudspeakers. In a stereo reproduction scenario, there are two loudspeakers, the first loudspeaker receiving the first stereo channel and the second loudspeaker receiving the second stereo channel. For example, in a 7.2 reproduction format, there are seven loudspeakers at predetermined positions, and two subwoofers, which can be placed relatively arbitrarily. The seven channels are applied to the corresponding loudspeakers, and the subwoofer channels are applied to the corresponding subwoofers.
The use of a single microphone arrangement when capturing audio signals, and the use of a single loudspeaker arrangement when reproducing the audio signals typically neglects the true nature of the sound sources. European patent EP 2692154 B1 describes a set for capturing and reproducing an audio scene, in which not only the translation but also the rotation and, in addition, the vibration is captured and reproduced. Thus, a sound scene is not only reproduced by a single capturing signal or a single mixed signal but by two capturing signals or two mixed signals that, on the one hand, are recorded simultaneously, and that, on the other hand, are reproduced simultaneously. This ensures that different emission characteristics of the audio scene are recorded compared to a standard recording, and are reproduced in a reproduction environment.
To this end, as is illustrated in the European patent, a set of microphones is placed between the acoustic scene and a (imaginary) listener space to capture the “conventional” or translation signal that is characterized by a high directionality, or high quality.
In addition, a second set of microphones is placed above or to the side of the acoustic scene to record a signal with lower quality, or lower directionality, that is intended to represent the rotation of the sound sources in contrast to the translation.
On the reproduction side, corresponding loudspeakers are placed at the typical standard positions, each of which has a omnidirectional arrangement to reproduce the rotation signal, and a directional arrangement to reproduce the “conventional” translational sound signal. In addition, there is a subwoofer at each of the standard positions, or there is only a single subwoofer at an arbitrary location.
European patent EP 2692144 B1 discloses a loudspeaker for reproducing, on the one hand, the translational audio signal and, on the other hand, the rotatory audio signal. Thus, the loudspeaker has, on the one hand, an arrangement that emits in an omnidirectional manner, and, on the other hand, an arrangement that emits in a directional manner.
European patent EP 2692151 B1 discloses an electret microphone that can be used for recording the omnidirectional or the directional signal.
European patent EP 3061262 B1 discloses earphones and a method for manufacturing earphones that generate both a translational sound field and a rotatory sound field.
European patent EP 3061266 B1 discloses earphones and a method for producing earphones configured to generate the “conventional” translational sound signal by using a first transducer, and to generate the rotatory sound field by using a second transducer arranged perpendicular to the first transducer.
Recording and reproducing the rotatory sound field in addition to the translational sound field leads to a significantly improved and therefore high-quality audio signal perception that almost conveys the impression of a live concert, even though the audio signal is reproduced by the loudspeaker or headphones or earphones.
This achieves a sound experience that can almost not be distinguished from the original sound scene in which the sound is not emitted by loudspeakers but by musical instruments or human voices. This is achieved by considering that the sound is emitted not only translationally but also in a rotary manner and possibly also in a vibrational manner, and is therefore to be recorded and reproduced accordingly.
A disadvantage of the concept described is that recording the additional signal that reproduces the rotation of the sound field represents a further effort. In addition, there are many pieces of music, for example classical pieces or pop pieces, where only the conventional translational sound field has been recorded. Typically, the data rate of these pieces is heavily compressed, e.g., according to the MP3 standard or the MP4 standard, contributing to an additional deterioration of quality, however, which is typically only audible for experienced listeners. On the other hand, there are almost no audio pieces that have not been recorded at least in the stereo format, i.e. with a left channel and a right channel. Rather, the development goes towards generating more channels than only a left and a right channel, i.e. generating surround recordings with five channels or even recordings with higher formats, for example, which is known under the keyword MPEG surround or Dolby Digital in the technology.
Thus, there are many pieces that have been recorded at least in the stereo format, i.e. with a first channel for the left side and a second channel for the right side. There are even more and more pieces where recording has been done with more than two channels, e.g., for a format with several channels on the left side and several channels on the right side and one channel in the center. Even higher level formats use more than five channels in the horizontal plane and in addition also channels from above or channels from obliquely above and possibly also, if possible, channels from below.
However, these formats all have in common that they only reproduce the conventional translatory sound by applying the individual channels to corresponding loudspeakers with corresponding transducers.
An embodiment may have an apparatus for supplying sound in a space with a first loudspeaker and a second loudspeaker, comprising: a control signal generator for generating a first control signal for the first loudspeaker and a second control signal for the second loudspeaker, wherein the control signal generator comprises: a mix signal generator stage for generating a first mix signal for the first control signal and a second mix signal for the second control signal from a first channel signal or a second channel signal such that the first mix signal and the second mix signal comprise a phase difference; a mixer stage for mixing the first channel signal with the first mix signal to acquire the first control signal, and for mixing the second channel signal with the second mix signal to acquire the second control signal; and an interface for transmitting the first control signal to the first loudspeaker and for transmitting the second control signal to the second loudspeaker.
Another embodiment may have a method for supplying sound in a space with a first loudspeaker and a second loudspeaker, comprising: generating a first control signal for the first loudspeaker and a second control signal for the second loudspeaker, wherein generating comprises: generating a first mix signal for the first control signal and a second mix signal for the second control signal from a first channel signal or a second channel signal such that the first mix signal and the second mix signal comprise a phase difference; mixing the first channel signal with the first mix signal to acquire the first control signal, and for mixing the second channel signal with the second mix signal to acquire the second control signal; and transmitting the first control signal to the first loudspeaker and for transmitting the second control signal to the second loudspeaker.
Another embodiment may have a space, comprising: a first loudspeaker and a second loudspeaker; and an apparatus for supplying sound in the space according the invention.
An inventive apparatus for suppling sound in a space (or in a room) with a first loudspeaker and a second loudspeaker includes a control signal generator for generating a first control signal for the first loudspeaker and a second control signal for the second loudspeaker. In particular, the control signal generator includes a mix signal generator stage and a mixer stage. The mix signal generator stage generates a first and a second mix signal having a phase difference with respect to each other. The mixer stage mixes the two mix signals with the first and the second channel signal, respectively. The inventive apparatus further includes an interface for transmitting the first control signal to the first loudspeaker and for transmitting the second control signal to the second loudspeaker. The interface may be configured to be wired or wireless and, depending on the implementation, may already include power amplifiers or not.
In addition, depending on the implementation, the interface may perform further measures for the control signals, e.g. equalizer processing of the signals, source coding of the signals or source coding and transmitter processing of the signals, so as to transmit the signals, e.g. in a wireless manner by means of a wireless protocol such as Bluetooth or DECT, to an input interface of a loudspeaker module which then typically also includes a power amplifier.
The present invention is based on the finding that a difference wave field around the two loudspeakers and therefore around a person irradiated by the loudspeakers can already be generated by generating a first and a second mix signal, both derived from the first channel signal, the second channel signal or both channel signals, said difference wave field representing, in addition to the translational sound output by the two loudspeakers, also the rotatory sound leading to a significant quality improvement of the subjective audio perception. In particular, no separate loudspeakers are required for generating the difference sound field, but the difference wave field is generated by applying the control signals for the loudspeakers accordingly with signals having a phase difference with respect to each other, wherein this phase difference is advantageously 180°, however, it may be in a range between 160° and 200°, where an almost identical effect is achieved as in the case of the signals having the advantageously best phase shift of 180°.
The effect of the difference wave field is better the closer the first and the second loudspeaker are arranged with respect to each other. Preferably, the loudspeakers should have a distance of at least 10 cm and at most 1 m, wherein distances in the range of 20 cm (e.g. 15-30 cm) are advantageous, as is the particular case in headrests of vehicle seats or other seats, e.g., in waiting areas. The relatively close spatial arrangement of the two loudspeakers particularly achieves that separate sound generators are not necessary for generating the difference wave field. Instead, it is sufficient that the two loudspeakers obtain the special inventive control signals.
In order to generate the control signals, only one channel signal, i.e. either the left channel signal or the right channel signal, can be used. Alternatively, a sum of the two channel signals, i.e. a mono signal, can be used. Alternatively and advantageously, the calculation of the mix signals is based on using a difference between the two channel signals, said difference dominating the mix signals. Depending on the implementation, this difference may be used directly, or it may be combined with a sum signal, or it may be combined with the left channel signal or the right channel signal. However, it is advantageous to either use the difference signal alone for calculating the mix signals, or to use the difference signal in a combination with the sum signal from both channels, wherein the proportion of the difference signal and the proportion of the sum signal in the final mix signals is adjustable, and is advantageously set such that the difference signal determines at least ⅔ of the two mix signals with respect to the corresponding energy in the signals.
In advantageous embodiments, in addition, a distance measurement is provided to determine the distance of the head of the person to be irradiated, or the two ears of the person to be irradiated. These distance sensors are advantageously configured as ultrasound sensors. Such a distance sensor is provided next to each loudspeaker, e.g. in a headrest of a passenger seat or a seat in a waiting area. Thus, the distance of the head side in the vicinity of this loudspeaker may be determined. The distance measurement is used to perform a volume compensation, a bass compensation, or a delay compensation. If the distance between the head and the loudspeaker increases, the level is increased on the side where the distance increase was measured, or the basses are increased for this loudspeaker. Optionally, a delay adaptation for the loudspeaker may be performed, such that the delay for this loudspeaker decreases compared to other loudspeakers.
However, if the distance between the head and the loudspeaker increases, the level and/or the basses for this loudspeaker are decreased. Furthermore, the delay adjustment for this loudspeaker is again optionally performed compared to other loudspeakers such that the delay for the loudspeaker having an increased distance to the head of the user increases.
According to the invention, the loudspeakers are mounted in a space, such as an interior space in a vehicle, e.g. a land vehicle (car, train, sled, motor vehicle, . . . ), an air vehicle (“passenger” aircraft, helicopters, zeppelin, etc.), a water vehicle (boat, ferry, yacht, sailboat, etc.), or a spacecraft, and advantageously in a headrest for a driver of the vehicle or for a passenger or a co-pilot of the vehicle. Alternatively, the space may also be a waiting room, e.g. in a train station, an airport, or a civil service office, or a doctor's office, etc., having arranged therein, for additional comfort or for transmitting information, seats with headrests or seats with apparatuses through which of a person in the room may be supplied with sound relatively close to the head of the person.
The headrest, or the loudspeakers, comprise at least one left and one right loudspeaker arranged left and right, respectively, of a respective ear of the driver or of the passenger or of the person in general.
Preferably, the other sound system in the vehicle or in the space (or room) continues to run in addition and performs, e.g., a localization of sound sources in the space (or room), possibly by amplitude panning, etc. The additional acoustic irradiation by the loudspeakers takes place in parallel to the conventional acoustic irradiation.
Preferably, loudspeakers are provided at several locations in the vehicle, wherein loudspeakers for an unoccupied seat are deactivated, e.g., which may be achieved by a sensor or alternative means.
The loudspeakers generate difference soundwave fields. They may be generated via an oscillating surface (planar transducer) or via two neighboring piston converters oscillating in push-pull mode (loudspeakers) or via other described generators. Mono signals and/or difference signals (L-R and/or R-L) may serve as source signals for the generation of the difference soundwave field.
A synthetic generation of the rotation signal is possible if there is an audio piece with more than one channel, i.e. already having two, e.g. stereo, channels or even more channels. According to the invention, calculating an at least approximate difference obtains at least an approximation with respect to the difference signal, or rotation signal, which may then be used to drive the respective loudspeakers together with the respective channel signal. To this end, a calculation of two mix signals having a phase difference with respect to each other is performed.
In a further embodiment, in which there are more than two channels, e.g. in case of a 5.1 signal, a down mixer for the first channel signal, e.g., i.e. for the left channel, and a further down mixer for the second channel signal (i.e. for the right channel) are connected upstream of the control signal generator. However, if the signal is available as an original microphone signal, such as an ambisonics signal with several components, each down mixer is configured to accordingly calculate, from the ambisonics signal, a left channel or a right channel which is then used by the control signal generator to calculate the control signals.
According to a first aspect of the present invention, the loudspeakers are arranged separately from the control signal generator. In such an embodiment, the loudspeakers have signal inputs that may be wired or wireless, wherein a signal for a sound generator in the loudspeaker is generated at each signal input. The control signal generator providing the control signals for the sound generator is arranged away from the actual loudspeaker and is connected to the loudspeakers via a communication link such as a wired connection or a wireless connection.
In another embodiment, the control signal generator is integrated into the loudspeakers or into a loudspeaker or into the vehicle. In such a case, in the loudspeaker with an integrated signal processor, the common mode signal is derived, and, depending on the implementation and the embodiment, the push-pull signal is derived separately, or is derived from the common mode signal. An aspect of the present invention therefore concerns the loudspeaker without a signal processor. Another aspect of the present invention therefore concerns the signal processor without a loudspeaker, and a further aspect of the present invention concerns the loudspeaker with an integrated signal processor.
In a further embodiment of the present invention, when a multi-signal is available, e.g. as a stereo signal or as a signal with three or more channels, the control signals are derived from this multi-channel representation. In case of a stereo signal, e.g., a side signal representing the difference of the left and the right channel is calculated, this side signal then possibly being attenuated or amplified accordingly, and, depending on the implementation, being mixed with a non-high pass filtered or a high pass filtered common mode signal. If the output signal has several channels, the mix signals may be generated from differences between any two channels of the multi-channel representation. Thus, e.g., a difference between the left and the right rear (right surround) may be generated, or alternatively, a difference between the center channel and any one of the other four channels of a five-channel representation. In case of such a five-channel representation, however, as is the case in a stereo representation, a difference between left and right may be determined to generate the side signal. In a further embodiment, certain channels of the five-channel representation may be added, i.e., a two-channel down mix may be determined. An exemplary implementation for generating a two-channel down mix signal consists of adding, possibly with weighting factors, left rear (left surround), left, and center, to generate a left down mix channel. To generate the right down mix channel, the right rear channel (right surround) is added with the right channel and the center channel, possibly again with weighting factors. The mix signal may then be determined on the basis of a difference formation from the left down mix channel and the right down mix channel.
Embodiments of the present invention will be detailed subsequently referring to the appended drawings, in which:
The headrest, or the sound generators, comprises at least one left and one right loudspeaker arranged left and right, respectively, of a respective ear of the driver or the passenger.
Preferably, the other sound system in the vehicle continues to run in addition and, e.g., performs a localization of sound sources in the space, possibly by means of amplitude panning, etc. The additional acoustic irradiation through the loudspeakers takes place in parallel to the conventional acoustic irradiation.
Preferably, loudspeakers are provided at several seats in the vehicle, wherein loudspeakers for an unoccupied seat are deactivated, e.g. which may be achieved by means of a sensor or alternative means.
The loudspeakers generate difference soundwave fields. These may be generated via an oscillating surface (planar transducer) or via two neighboring piston transducers oscillating in push-pull mode (loudspeakers), or via other described transducers. Mono signals and/or difference signals (L-R and/or R-L) may serve as source signals for the generation of the difference soundwave field.
The mix signal generator stage 12 is configured to generate the mix signals such that the first mix signal 13a and the second mix signal 13b comprise a phase difference with respect to each other. This phase difference of the mixed signals is advantageously between 1600 and 200°, and most advantageously at 180°, so as to obtain, by means of the sound field generated through the two loudspeakers in the space to be supplied, a listening impression that is as psycho-acoustically and qualitatively positive as possible. The mixer stage 14 is configured to mix the first channel signal 6 with the second mix signal 13a to obtain the first control signal 15a. In addition, the mixer stage 14 is configured to mix the second channel signal 8 and the second mix signal 13b to obtain the second control signal 15b. The control signal generator means 10 has connected downstream thereof an interface 16 to transmit the first control signal 15a and the second control signal 15b to the first loudspeaker 21 and the second loudspeaker 22, respectively.
Depending on the embodiment, the interface may be a wired interface so that the first loudspeaker 21 and the second loudspeaker 22 are arranged at an output of the interface 16, possibly via amplifiers 46, 48 illustrated in
Preferably, the space is an interior space of a vehicle, and the apparatus further comprises the first loudspeaker and the second loudspeaker 21, 22, wherein these two loudspeakers are arranged in the vehicle. Alternatively, the space may be the interior room of a waiting area of a civil service office, an airport, a ferry terminal, or any other “station”, or also the waiting area of a doctor's office, having arranged therein seats provided with headrests or the like to achieve a sound supply for a user of the seats.
In particular, as shown in
By means of the loudspeaker arranged in the vicinity of the two ears of the user, a wave field comprising the common mode portion and comprising a push-pull mode portion, or differential mode portion, due to the mix signals is generated around the user.
Through this, a particularly natural and high quality sound impression is generated for the space to be supplied with the sound, not only having a pleasant and natural sound quality, but also a high speech comprehensibility in case the main aspect is the transmission of messages.
In advantageous embodiments of the present invention, the mix signal generator stage 12 includes an input stage 12a and a branching stage 12b, wherein the input stage is configured to generate a common signal, as can be seen in
The branching stage 12b downstream of the input stage 12a includes a branching point 35 and a first downstream 90° phase shifter 34, or a second downstream −90° phase shifter 36. Both phase shifters 34, 36 are configured to generate at their output a signal pair having the phase difference. In addition, the downstream stage 12b in
In the embodiment shown in
At the output of the branching stage, which is also the output of the control signal generator 12, there are the two control signals 15a, 15b that are amplified within the interface 16, comprising the two amplifiers 46, 48 in the embodiment shown in
In the embodiment shown in
In the embodiment shown in
In the embodiment shown in
Furthermore, it is to be noted that the interface design, which includes a wireless implementation in the embodiment shown in
In the embodiment shown in
Even though the branching stage illustrated in
In addition, it is to be noted in advantageous embodiments that the difference signal is a direct difference between left and right, i.e. a difference that is achieved if one of the two signals is inverted and the addition between the one signal and the inverted other signal is then calculated. Alternatively, other differences may be calculated, e.g., achieved by mathematically calculating a difference without having available an explicit phase shifter. In addition, the corresponding “difference signals” may be calculated in the time range or in the frequency range or in an LPC range. In other embodiments, there is a phase shifter that does not generate a phase shift of 180°, but only with a value of between 90° and 180°. There is still a difference, however, it does not correspond to the actual “mathematical” difference. Such a difference signal is also useful in an embodiment of the present invention so as to generate a difference wave field at the output of the two loudspeakers if mix signals are derived from such a “difference signal”.
In particular, if the distance between the head and the loudspeaker increases, the level of the loudspeaker signals is increased by a sound processor not shown in
However, if the distance between the head and the loudspeaker decreases, and is measured by one of the ultrasound sensors 51, 52, a sound processor is configured to reduce the level of the corresponding loudspeaker signal and/or the basses for the loudspeakers. Optionally, the delay for this loudspeaker, compared to the other loudspeaker, may be increased if a distance reduction is detected between the head and the loudspeaker.
In addition, in advantageous embodiments, in which the rooms/spaces to be supplied have several seating positions, such as several seats in a vehicle or several seats in a waiting area in a stationary space, a detector is arranged at each seat so as to detect if the seat is actually occupied by a listener. If it detects that the seat is occupied, the loudspeakers are driven as is illustrated in
A advantageous embodiment of the present invention is located within a mobile device, such as a mobile phone, a tablet, a notebook, etc. In particular, the control apparatus, or the apparatus for generating a control signal, is loaded onto the mobile phone as a hardware element or as an app, or program. The mobile phone is configured to receive the first audio signal and the second audio signal or the multi-channel signal from any source that may be local or in the internet, and to generate the control signals as a function thereof. These signals are transmitted from the mobile phone to the sound generator with the sound generator elements either in a wired manner or in a wireless manner, e.g. via Bluetooth or WiFi. In the latter case, the sound generator elements have to have a battery supply, or a power supply in general, to achieve corresponding amplifications for the received wireless signals, e.g. according to the Bluetooth format or the WiFi format.
Even though some aspects have been described within the context of a device, it is understood that said aspects also represent a description of the corresponding method, so that a block or a structural component of a device is also to be understood as a corresponding method step or as a feature of a method step. By analogy therewith, aspects that have been described within the context of or as a method step also represent a description of a corresponding block or detail or feature of a corresponding device. Some or all of the method steps may be performed while using a hardware device, such as a microprocessor, a programmable computer or an electronic circuit. In some embodiments, some or several of the most important method steps may be performed by such a device.
Depending on specific implementation requirements, embodiments of the invention may be implemented in hardware or in software. Implementation may be effected while using a digital storage medium, for example a floppy disc, a DVD, a Blu-ray disc, a CD, a ROM, a PROM, an EPROM, an EEPROM or a FLASH memory, a hard disc or any other magnetic or optical memory which has electronically readable control signals stored thereon which may cooperate, or cooperate, with a programmable computer system such that the respective method is performed. This is why the digital storage medium may be computer-readable.
Some embodiments in accordance with the invention thus comprise a data carrier which comprises electronically readable control signals that are capable of cooperating with a programmable computer system such that any of the methods described herein is performed. Generally, embodiments of the present invention may be implemented as a computer program product having a program code, the program code being effective to perform any of the methods when the computer program product runs on a computer.
The program code may also be stored on a machine-readable carrier, for example.
Other embodiments include the computer program for performing any of the methods described herein, said computer program being stored on a machine-readable carrier.
In other words, an embodiment of the inventive method thus is a computer program which has a program code for performing any of the methods described herein, when the computer program runs on a computer.
A further embodiment of the inventive methods thus is a data carrier (or a digital storage medium or a computer-readable medium) on which the computer program for performing any of the methods described herein is recorded. The data carrier, the digital storage medium, or the recorded medium are typically tangible, or non-volatile.
A further embodiment of the inventive method thus is a data stream or a sequence of signals representing the computer program for performing any of the methods described herein. The data stream or the sequence of signals may be configured, for example, to be transmitted via a data communication link, for example via the internet.
A further embodiment includes a processing unit, for example a computer or a programmable logic device, configured or adapted to perform any of the methods described herein.
A further embodiment includes a computer on which the computer program for performing any of the methods described herein is installed.
A further embodiment in accordance with the invention includes a device or a system configured to transmit a computer program for performing at least one of the methods described herein to a receiver. The transmission may be electronic or optical, for example. The receiver may be a computer, a mobile device, a memory device or a similar device, for example. The device or the system may include a file server for transmitting the computer program to the receiver, for example.
In some embodiments, a programmable logic device (for example a field-programmable gate array, an FPGA) may be used for performing some or all of the functionalities of the methods described herein. In some embodiments, a field-programmable gate array may cooperate with a microprocessor to perform any of the methods described herein. Generally, the methods are performed, in some embodiments, by any hardware device. Said hardware device may be any universally applicable hardware such as a computer processor (CPU), or may be a hardware specific to the method, such as an ASIC.
While this invention has been described in terms of several embodiments, there are alterations, permutations, and equivalents which fall within the scope of this invention. It should also be noted that there are many alternative ways of implementing the methods and compositions of the present invention. It is therefore intended that the following appended claims be interpreted as including all such alterations, permutations and equivalents as fall within the true spirit and scope of the present invention.
Number | Date | Country | Kind |
---|---|---|---|
10 2021 207 693.9 | Jul 2021 | DE | national |
10 2021 211 051.7 | Sep 2021 | DE | national |
This application is a continuation of copending International Application No. PCT/EP2022/069667, filed Jul. 13, 2022, which is incorporated herein by reference in its entirety, and additionally claims priority from German Applications Nos. DE 10 2021 207 693.9, filed Jul. 19, 2021, and DE 10 2021 211 051.7, filed Sep. 30, 2021, all of which are incorporated herein by reference in its entirety. The present invention concerns electroacoustic and in particular concepts for generating and reproducing audio signals in a space, such as in a vehicle or a stationary space, such as a hall, a waiting area, etc.
Number | Date | Country | |
---|---|---|---|
Parent | PCT/EP2022/069667 | Jul 2022 | US |
Child | 18402911 | US |