1. Priority Claim
This application claims the benefit of priority from European Patent Application No. EP12002030.0-1247, filed Mar. 22, 2012, which is incorporated by reference.
2. Technical Field
The present invention relates to a method for retrieving and a system for reproducing an audio signal.
3. Related Art.
Convolution can be used in acoustics such as in a sound recording studio. The increasing computing power of special DSPs (DSP—Digital Signal Processor) and the home computer also permits the use of convolution in sound studios. When one excites a room with a short (broadband) pulse, one hears an echo that is characteristic for this room and that emphasizes or damps specific frequency components of the pulse as a result of the room's geometry and dimensions, the room's basic structure, the room's interior, and other specific characteristics. If the echo is now recorded, one thus obtains the impulse response of this room. The impulse response contains the complete characteristic of the (linear) room. In the technique of convolution, this impulse response is now utilized in order to combine any other desired acoustic signals with the impulse response through the mathematical process of convolution. For example, a discrete, fast convolution (FFT—Fast Fourier Transformation) for discrete (digitized) periodic signals is used to generate the acoustic characteristic of the room. As an alternative to determining impulse responses for a specific room, the impulse response can also be obtained through modeling, such as ray tracing and a source image model.
When a room is bounded by flat surfaces, the reflected sound components can be calculated by means of the source image method by constructing mirror-image sound sources. By means of modeling, it is possible to alter the position of the sound source and thus generate a new impulse response. By means of the impulse response, a signal for reproduction is faded out using an associated filter. The spatial impression is the auditory perception that one receives from the room itself when a sound event occurs. The spatial impression augments the acoustic information that comes directly from the sound source with important information about the environment, about the size and character of the room. The spatial impression consists of multiple components: the perception of the width and depth of the room, (room size); the perception of liveliness, which prolongs each sound event and can fuse a sound event with a following sound event; and the perception of space. Digital filters are a tool used in digital signal processing. One implementation of a filter can be achieved using convolution. This type of filter is called an FIR filter (Finite Impulse Response). Small rooms can be simulated based on an approximate image expansion, such as for rectangular non-rigid wall enclosures.
An audio signal reproduction system can retrieve an audio signal. The system may include a selection mode for retrieving the audio signal. The selection mode may be started based on a first user input. In the selection mode, a first audio signal can be reproduced. In addition, in the selection mode, a second audio signal can be reproduced substantially simultaneously with the first audio signal. In the selection mode, the first audio signal and the second audio signal can be filtered. The first audio signal may be at a first location and the second audio signal may be at a second location when separated acoustically in a virtual acoustic space by the filtering. In selection mode, in order to select the first audio signal, the first location in the virtual acoustic space can be positioned acoustically closer to a listening position of the user than the second location of the second audio signal.
In the audio signal reproduction system, the first audio signal can be automatically selected and the selection mode can end if a timing out of a time counter is detected. In order to end the selection mode, the first audio signal can be reproduced and the second audio signal can be faded out. The selection of the audio signal can occur conveniently. A selection of the audio signal can also be made solely acoustically. For example an occupant in a vehicle need not look at a display to make a selection, and may instead use audible sound a hands free command, such as voice commands to make a selection.
The system for reproducing an audio signal can be connected to electroacoustic transducers, such as loudspeakers or headphones. The system for reproducing an audio signal can be configured to start a selection mode for retrieving an audio signal. The starting of the selection mode can be based on a first user input. The system may be configured to reproduce a first audio signal in the selection mode. The system may be configured to reproduce a second audio signal substantially simultaneously with the first audio signal in the selection mode. The system may be configured to filter the first audio signal and the second audio signal in the selection mode by means of a filter. The first audio signal may be at a first location and the second audio signal may be at a second location. The first location and the second location can be separated acoustically in a virtual acoustic space based on the filtering. Here the filtering can position the first audio signal at a different three dimensional location in the virtual acoustic space than the second audio signal, such as by using convolution techniques. The filter may be realized by a digital signal processor.
The system for reproducing an audio signal can be configured to position the first location in the virtual acoustic space acoustically closer to a listening position of a listener/user than the second location of the second audio signal in order to select the first audio signal. The system can be configured to position the first location in the selection mode. The system for reproducing an audio signal can be configured to end the selection mode, if a timing out of a time counter is detected. The system can be configured to reproduce the first audio signal and to fade out the second audio signal in order to end the selection mode.
In an example, in the selection mode, a second user input may be detected before the timing out of the time counter. The first audio signal may be selected and the selection mode may end, if the second user input for selecting the first audio signal is detected. Therefore the system for reproducing an audio signal may operate to end the selection mode, if the second user input for selecting the first audio signal is detected.
The first audio signal and the second audio signal may be digital signals, which have a number of channels, for example, two stereo channels. A reproduction mode for the reproduction of audio signals can be provided before the selection mode and after the selection mode. In the reproduction mode, in contrast to the selection mode, a single audio signal, for example, a stereo signal of a radio receiver may be reproduced at a given time. In addition, other modes, for example, for outputting navigation instructions, telephone, or other audible sound can be provided. Additionally, in other examples, at least a third audio signal and/or a fourth audio signal may be reproduced during selection mode. The user can hear the first, second, third and fourth audio signals concurrently, but acoustically separated. In some examples, two or more audio signals are reproduced during selection mode.
In another example, the first audio signal and the second audio signal can be reproduced substantially simultaneously, but separated in a virtual acoustic space. This example operation can also be called spatialization. In this case, there are several possibilities for separation of the first audio signal and the second audio signal. For example, the first audio signal can be reproduced exclusively by at least one first loudspeaker, whereas substantially simultaneously the second audio signal can be reproduced exclusively by at least one second loudspeaker. In this case, the distance of the arrangement of the first loudspeaker and of the second loudspeaker can provide the distance between the first location of the first audio signal and the second location of the second audio signal in the virtual acoustic space. In other examples, additional audio signals and loudspeakers may be used.
In another example, more than two audio signals can be output over at least two loudspeakers arranged at a distance from one another. The audio signals can be reproduced by both loudspeakers at different volume such that an audio image perceived by a listener can be closer to one loudspeaker or the other loudspeaker, such as further left or further right in the virtual acoustic space. In addition, an audio signal can be reproduced such that an audio image is perceived as being in the middle between the loudspeakers, such as when both loudspeakers are at the same volume. Separation in the virtual acoustic space of the perceived audio image to be in several intermediate positions between far left and far right is also called panning
In another example, the first audio signal and the second audio signal can be arranged in different spatial depths of the virtual acoustic space. For this purpose, convolution is used in that each audio signal is filtered with different filter coefficients. For example, an FIR filter (Finite Impulse Response Filter), sometimes also called a transversal filter, can be used for the convolution. The location of the audio signal can be positioned as desired in the virtual acoustic space by means of the filter parameters, where the filter parameters may be developed by the convolution. A number of first filter coefficients can be loaded in a first filter block of a filter for filtering for the first location and a number of second filter coefficients can be loaded in a second filter block of a filter for the second location. In this case, the location in the virtual acoustic space is the perceived source position at which the listener locates the corresponding audio signal acoustically.
To end the selection mode, if the timing out of a time counter is not detected, the user need not wait for the time counter to lapse, but can end the selection mode at any time using a second user input for selecting the first audio signal. However, the timing out of the time counter provides an automatic ending of the selection mode, so that the selection mode may be manually concluded by a user, or automatically concluded based on the time counter. For example, in certain conditions in a vehicle, such as a traffic situation, a driver has the choice of making or not making the second user input. There also could be other events, such as acoustic events, interrupting or terminating the selection mode, like receipt of a phone call or issuance of navigation instructions.
In an example, in the selection mode, the first location of the first audio signal in the virtual acoustic space and/or the second location of the second audio signal in the virtual acoustic space can be changed based on a third user input. The first audio signal and/or the second audio signal at a location in the virtual acoustic space can be perceived as being positioned closer to a listening position of the user or farther from the listening position of the user by using the third user input. Advantageously, audio signals, positioned acoustically closer to the listening position, can be defined for a selection. In the selection mode, preferably for selection of the first audio signal, the first location in the virtual acoustic space can have a perceived position closer to a listening position of the user than the second location of the second audio signal.
In the selection mode, the second and/or third user input may occur via touching of a touch screen or actuation of a button or actuation of a selector wheel or any other form of user input for adjusting a perceived position.
In an example, the selection mode the first audio signal and the second audio signal can be reproduced at a different volume in the virtual acoustic space. Advantageously, in the selection mode for the selection of the first audio signal a first volume of the first audio signal can be controlled to be higher than a second volume of the second audio signal in the virtual acoustic space. In this case, the first audio signal in the virtual acoustic space is perceived to be closer to a listening position of the user than the second audio signal.
According to an example, the first audio signal is associated with a first audio source and the second audio signal is associated with a second audio source. For example, the first audio signal originates from a first radio receiver and the second audio signal originates from a second radio receiver. With the first radio receiver a first radio station is received, whereas with the second receiver a second radio station is received. In selection mode, the user can hear both radio stations concurrently but acoustically separated. With the second input the user may decide which radio station is selected for continuous playback, and exit the selection mode.
According to an example, the first audio signal can be associated with a first database entry and the second audio signal can be associated with a second database entry. The first audio signal in this case is advantageously generated from a first audio file and the second audio signal in this case is advantageously generated from a second audio file.
In an example, a first visual information item, associated with the first audio signal, can be displayed. Moreover, a second visual information item, associated with the second audio signal, can be displayed. One of the first or second visual information item can be, for example, a text and/or a picture and/or a video. The other of the first or second visual information item can be, for example, a cover or title or a station name or the like. An acoustic arrangement of the first location of the first audio signal and the second location of the second audio signal can correspond to a visual arrangement of the first visual information item and the second visual information item on a display. The display can be any mechanism or device for providing visual images, such as, for example, a screen or a projector.
There can be several options for arranging the first visual information item with respect to the second visual information item. For example, if the first visual information item is arranged in a position in front of the second visual information item, such as if the first visual information item partially conceals from view the second visual information item, then the first location of the first audio signal can also be arranged in a similar corresponding perceived acoustic position in front of a perceived acoustic position of the second location of the second audio signal in the virtual acoustic space. For example, if the first visual information item is arranged to the left of the second visual information item, then the first location of the first audio signal can also be arranged to the left of the second location of the second audio signal in the virtual acoustic space. For example, if the first visual information item is arranged above the second visual information item, then the first location of the first audio signal can also be arranged above the second location of the second audio signal in the virtual acoustic space.
Other systems, methods, features and advantages of the invention will be, or will become, apparent to one with skill in the art upon examination of the following figures and detailed description. It is intended that all such additional systems, methods, features and advantages be included within this description, be within the scope of the invention, and be protected by the following claims.
The invention may be better understood with reference to the following drawings and description. The components in the figures are not necessarily to scale, emphasis instead being placed upon illustrating the principles of the invention.
a and 2b is another block diagram example of the audio signal reproduction system, loudspeakers, and a virtual acoustic space that also shows an example display of selection from a database.
a and 3b is yet another block diagram example of the audio signal reproduction system, loudspeakers, and a virtual acoustic space that also shows an example display for selecting an individual title.
In
The controller 141 may include a processor. The processor may be one or more general processors, digital signal processors, application specific integrated circuits, field programmable gate arrays, digital circuits, analog circuits, combinations thereof, and/or other now known or later developed devices for analyzing and processing data.
The arithmetic unit 140 may also include memory. The memory may include a main memory, a static memory, and/or a dynamic memory. The memory may include, but is not limited to computer readable storage media, or machine readable media, such as various types of non-transitory volatile and non-volatile storage media, including but not limited to random access memory, read-only memory, programmable read-only memory, electrically programmable read-only memory, electrically erasable read-only memory, flash memory, magnetic tape or disk, optical media and the like. In one example, the memory includes a cache or random access memory for the processor. In addition or alternatively, the memory may be separate from the processor, such as a separate cache memory of a processor, the system memory, or other memory. The memory may also include (or be) an external storage device or database for storing data. Examples include a hard drive, compact disc (“CD”), digital video disc (“DVD”), memory card, memory stick, floppy disc, universal serial bus (“USB”) memory device, or any other device operative to store data. The memory is operable to store data and instructions executable by the processor. The functions, acts or tasks illustrated in the figures or described may be performed by or in connection with the programmed processor executing the instructions stored in the memory. The functions, acts or tasks may be independent of the particular type of instructions set, storage media, processor or processing strategy and may be performed by software, hardware, integrated circuits, firm-ware, micro-code and the like, operating alone or in combination. Likewise, processing strategies may include multiprocessing, multitasking, parallel processing and the like.
The filter 142 is configured, for example, as a DSP (Digital Signal Processor). Further, the system 100 can include, or be coupled with a first output circuit 180, which can be connected to a first loudspeaker 810. Further, the system 100 can include, or be coupled with a second output circuit 190, which can be connected to a second loudspeaker 820. Output circuit 180, 190 may include, for example, a digital-to-analog converter and an amplifier for amplifying and outputting an analog signal to loudspeaker 810, 820. In other examples, any number of output circuits and loudspeakers may be included.
If the loudspeakers are arranged at a distance to one another, a surround sound is generated by the system 100 by means of spatialization. The surround sound can be generated, for example, by outputting the third audio signal SC exclusively by first loudspeaker 810 and the second audio signal SB exclusively by second loudspeaker 820. In this case, the third audio signal SC for user X at a listening position is heard exclusively from the direction “far left,” whereas the second audio signal SB for user X is heard exclusively from the direction “far right.” The first audio signal SA may be output through both loudspeakers 810, 820 at the same volume and therefore is perceived as being heard by user X from the middle M between the two loudspeakers 810, 820. The middle M in this case designates a first location PA in a virtual acoustic space 890, “right” designates a second location PB in acoustic space 890, and “left” designates a third location PC in acoustic space 890. In addition, the user X designates a listening position in the virtual acoustic space. In other examples, any other locations and listening positions may be used.
The substantially simultaneous reproduction of the first audio signal SA, the second audio signal SB, and the third audio signal SC, as shown in
In the example of
For the system to enter the selection mode, a user input received from user X may direct the system to leave, for example, a reproduction mode in which a radio station is reproduced as a stereo signal. Receipt of the input can be the result of, for example, the actuation of a button, touching of the touch screen, performing a gesture command, or by a voice command recognized by speech recognition, e.g., “search function.” With the start of the selection mode, the stereo reproduction of the reproduction mode is deactivated.
In the example of
The selection mode of the system shown in
In addition to the acoustic retrieval of the audio signal SA, in the example of
The radio station list is generated by controller 141. In the example of
The visual arrangement of the first visual information item VA, the second visual information item VB, and the third visual information item VC can correspond to an acoustic arrangement of the first location PA of the first audio signal SA, the second location PB of the second audio signal SB, and the third location PC of the third audio signal SC in the virtual acoustic space 890. Thus, in the
In addition, filter 142 can be configured to influence additional audio properties of the first audio signal SA, the second audio signal SB, and the third audio signal SC. For example, the frequency response of the audio signals SA, SB, SC can be changed by the filtering. In addition, filter 142 can be configured to selectively boost or cut the frequency ranges in the first audio signal SA and/or in the second audio signal SB and/or in the third audio signal SC. For example, in
Another example of an infotainment system is shown schematically in
A system for reproducing an audio signal 200 included in the infotainment system in the example of
In the example of
The audio signals SA, SB, SC, SD, SE, SF, SG can be filtered in the selection mode by filter 242 in such a way that, as shown schematically in
The acoustically perceived location PA, PB, PC, PD, PE, PF, PG of the respective audio signal SA, SB, SC, SD, SE, SF, SG in virtual acoustic space 890 corresponds thereby to the position of the associated visual information items, such as cover image VA, VB, VC, VD, VE, VF, VG on touch screen 900. The arrangement of the acoustically perceived locations PA, PB, PC, PD, PE, PF, PG of the audio signals SA, SB, SC, SD, SE, SF, SG in virtual space 890 and accordingly the arrangement of the visual information items, such as cover images VA, VB, VC, VD, VE, VF, VG on touch screen 900 can be changed by means of an input by user X, for example, by moving a finger across touch screen 900. For example, if the finger slides from right to left across touch screen 900, the cover VG is faded out far left, a new cover is faded in on the right (not shown), and instead of the cover VA, the cover VB is moved to the foreground (indicated by an arrow in
The filter coefficient sets in the filter blocks are correspondingly re-loaded such that, the audio signal SG is faded out, a new audio signal associated with the new cover is faded in (not shown), and instead of the audio signal SA, the audio signal SB is acoustically positioned three dimensionally in the foreground (indicated by an arrow) of the virtual acoustic space 890.
In addition to the three dimensional virtual placement in the virtual acoustic space 890, each of the audio signals SA, SB, SC, SD, SE, SF, SG can be reproduced at a different volume in accordance with the desired acoustically perceived location. For example, in the situation of
At the end of the selection mode, the display on touch screen 900 of
A further example of an infotainment system is shown schematically in
A process flow in the form of a schematic flow diagram with example operational process steps 1 to 8 of the system for reproducing audio signals is shown schematically in
In a third step 3, a user input “Input1” is detected. The user input in this case, for example, is made by selection of the audio signal SA for continuous play, for example, by touching the associated title VA on a touch screen 900. If there is a user input, the selection mode SM is ended in a sixth step 6, then the filter coefficients for reproducing the selected audio signal SA are changed in the filter blocks in step 7, and the selected audio signal SA is reproduced continuously in an eighth step 8.
If, in contrast, the user input “Input1” does not occur in the third step 3, in a fourth step 4 another user input “Input2” is detected. The additional user input is, for example, a moving of a visual information item, such as a cover on touch screen 900 by a sliding movement of the finger. If the further user input is detected, the filter coefficients in the filter blocks are correspondingly changed in the fifth step 5, so that in the next step 2 the specific audio signal SA, SB, SC is heard at moved acoustically perceived locations in virtual space 890 or faded out and/or a new audio signal is faded in. If no further user input occurs in the fourth step 4, the filtering occurs unchanged with the previous filter coefficients.
The invention is not limited to the shown embodiment variants in
While various embodiments of the invention have been described, it will be apparent to those of ordinary skill in the art that many more embodiments and implementations are possible within the scope of the invention. Accordingly, the invention is not to be restricted except in light of the attached claims and their equivalents.
Number | Date | Country | Kind |
---|---|---|---|
EP12002030.0-1247 | Mar 2012 | EP | regional |