The present disclosure relates generally to audio processing, and more specifically to speaker and room virtualization for audio signal that is to be provided to headphones.
When a user listens to music with headphones, audio signals that are mixed to come from the left or right side sound to the user as if they are located adjacent to the left and right ears. Audio signals that are mixed to come from the center sound to the listener as if they are located in the middle of the listener's head. This placement effect is due to the recording process, which assumes that audio signals will be played through speakers that will create a natural dispersion of the reproduced audio signals within a room, where the room provides a sound path to both ears. Playing audio signals through headphones sounds unnatural because there is no sound path to both ears. Also, the lack of room reflections concentrates the audio signals in the listener's head.
In accordance with the present disclosure, a system for audio processing for headphones is disclosed. The system includes a room reflection emulation system for emulating sound reflections in a room, and a room acoustics emulation system for emulating acoustic properties of the room. A head, shoulder and ear emulation system for emulation sound reflections near the head is also provided.
Other systems, methods, features, and advantages of the present disclosure will be or become apparent to one with skill in the art upon examination of the following drawings and detailed description. It is intended that all such additional systems, methods, features, and advantages be included within this description, be within the scope of the present disclosure, and be protected by the accompanying claims.
Aspects of the disclosure can be better understood with reference to the following drawings. The components in the drawings are not necessarily to scale, emphasis instead being placed upon clearly illustrating the principles of the present disclosure. Moreover, in the drawings, like reference numerals designate corresponding parts throughout the several views, and in which:
In the description that follows, like parts are marked throughout the specification and drawings with the same reference numerals. The drawing figures might not be to scale and certain components can be shown in generalized or schematic form and identified by commercial designations in the interest of clarity and conciseness.
The present disclosure implements an algorithm that emulates speakers placed in a room for use with stereo headphones, to simulate the existence of sound paths to both ears, and also to add stereo reverberation for a realistic room effect. The location of the virtual speakers and the associated room size (which is reflected in the reverberation effect) are user selectable. This disclosure uses delay and cross-mixing of the left and right channel audio signals to the headphone speakers, but extensions to N-channel sound with additional audio signals (such as left front, left rear, right front and right rear) are also possible. The delay and mixing amplitude is based on a physical environment.
The present disclosure includes a tuned stereo reverb algorithm that emulates room reflections. There is very little coloration of the sound so it is basically unnoticeable.
Some previous simple reverb solutions cause metallic sound. The density of the disclosed reverb is high enough to not cause unnatural sound. Likewise, some previous reverb solutions use identical reverb on both sound channels, but such applications do not emulate the reflections that would normally be heard by a listener. In contrast, the disclosed system uses tuned non-identical reverb to generate a stereo room effect.
The disclosed cross-mixing, delay and reverb processing is efficiently configured so as to be within the processing capability of a general purpose processor, such as a personal computer or tablet computer, or of other embedded systems, such as those used in personal electronic devices, cellular telephones or other common devices.
The present disclosure can be used to emulate a room environment with virtual speakers for use with headphones. The user can select the angle to the center where the virtual speakers should be located. A head-related transfer function (HRTF) algorithm is applied to each audio channel so as to cause the sound to appear to the user to come from that angle. The user can also select the room size, which can be used by the reverb engine for intensity and duration of the reverberation effect.
As used herein, “hardware” can include a combination of discrete components, an integrated circuit, an application-specific integrated circuit, a field programmable gate array, or other suitable hardware. As used herein, “software” can include one or more objects, agents, threads, lines of code, subroutines, separate software applications, two or more lines of code or other suitable software structures operating in two or more software applications or on two or more processors, or other suitable software structures. In one exemplary embodiment, software can include one or more lines of code or other suitable software structures operating in a general purpose software application, such as an operating system, and one or more lines of code or other suitable software structures operating in a specific purpose software application.
The first stage of system 100 includes HRTF emulation, which emulates sound reflections that would normally occur when the audio signals travel around the head to the ears, such as to model reflection of audio signals by the listener's shoulders. Each channel of audio pulse code modulated (PCM) signals passes through a pair of HRTF emulation engines. Each HRTF engine emulates the sound coming in as having a predetermined azimuth and elevation angle with respect to the user. The second stage of system 100 includes a stereo reverberation generator, which is discussed in greater detail herein.
1. Head shadow filter—the head shadow filter provides attenuation on higher frequency audio components when the source is within the shadow of the head, i.e., on the opposite side from the channel being processed.
2. Head delay filter—the head delay filter emulates the delay for sound to pass around head to the ear.
3. Shoulder reflection processor—the shoulder reflection processor emulates reflections when sound is reflected from shoulder to ear.
4. Pinnae reflection processor—the pinnae reflection processor emulates reflections that occur within the pinnae.
For the head shadow filter, the azimuth angle θ of sound is used to generate a variable α, where:
The transfer function of the 1-tap infinite impulse response (IIR) filter that emulates head shadowing can then be calculated by:
where
ωO=speed of sound/radius of head, and
FS=sampling rate
The head shadow filter can be implemented using this algorithm in conjunction with a first order IIR digital filter.
The head delay filter can be implemented using a first order all-pass digital filter. The group delay for the azimuth angle θ can be defined as:
where the gain can be defined as:
g
sh=cos(+90)*0.15
A
n={1,5,5,5,5}
B
n={2,4,7,11,13}
D
n={1,0.5,0.5,0.5,0.5}
Delay can be defined as:
where
φ is the elevation angle.
In one exemplary embodiment, the gain for the 5 stages can be G={0.5, −0.4, 0.5, −0.25, 0.25}
In one exemplary embodiment, 5 stages of nested all-pass filters can be used to create reverb. An exemplary nested delay structure timeline is shown in
It should be emphasized that the above-described embodiments are merely examples of possible implementations. Many variations and modifications may be made to the above-described embodiments without departing from the principles of the present disclosure. All such modifications and variations are intended to be included herein within the scope of this disclosure and protected by the following claims.
The present application claims benefit of U.S. Provisional patent application 61/598,267, entitled “Speaker and Room Virtualization Using Headphones,” filed Feb. 13, 2012, which is hereby incorporated by reference for all purposes.
Number | Date | Country | |
---|---|---|---|
61598267 | Feb 2012 | US |