Aspects disclosed herein generally relate to a system and method for providing a virtual sound effect with one or more loudspeakers. In particular, the embodiments disclosed herein may provide, but not limited to, a combination of a virtualizer and in-wall loudspeakers to provide a full surround sound experience without any visible loudspeakers.
Multi-channel systems are known to have a complex setup procedure and may be deeply influenced by the placement of loudspeakers. These issues may lead to an inconsistent sound field reproduction along with a notorious setup procedure and wires in the space. Moreover, literature on a spatial cross model suggests that visual modality may influence information from other senses. Some research indicates that vision dominates what we hear, when varying the degree of spatial congruency. Thus, vision may have a greater influence on integrated localization than hearing. If the listener can see the loudspeaker(s) and wires, the overall sound location perception may be greatly influenced.
A well-designed, all-in-one system, such as soundbar may reduce setup complexity. However, normally, these systems, due to form factor constrains, may suffer from lack of low frequency. Also, recent soundbars also tend to provide wider stereo image by using side wall reflections. Performance of such technology may be deeply influenced by the side walls and normally may require additional calibration procedure.
In at least one embodiment, an apparatus for providing a virtual sound effect in a listening environment is provided. The apparatus includes at least one controller and an audio playback device. The audio playback device includes the at least one controller that is programmed to receive an audio input signal from an audio input source and to apply a head related transfer function (HRTF) to the audio input signal. The at least one controller is further programmed to apply crosstalk cancellation to the audio input signal and to generate an audio output signal after applying the HRTF and the crosstalk cancellation to the audio input signal for playback by at least one loudspeaker that is invisible to a listener in the listening environment.
In at least one embodiment, an apparatus for providing a virtual sound effect in a listening environment is provided. The apparatus includes at least one controller and an audio playback device. The audio playback device includes the at least one controller that is programmed to receive an audio input signal from an audio input source and to apply a head related transfer function (HRTF) to the audio input signal. The at least one controller is further programmed to apply crosstalk cancellation to the audio input signal and to generate an audio output signal after applying the HRTF and the crosstalk cancellation to the audio input signal. The at least one controller being further programmed to transmit the audio output signal to a first concealed loudspeaker and to a second concealed loudspeaker for playback in the listening environment with a surround sound experience.
In at least another embodiment, a method for providing a virtual sound effect in a listening environment is provided. The method includes receiving an audio input signal at an audio playback device from an audio input source and applying at least head related transfer function (HRTF) to the audio input signal. The method further includes applying crosstalk cancellation to the audio input signal and generating an audio output signal after applying the HRTF and the crosstalk cancellation to the audio input signal. The method further includes generating a location for at least one virtual loudspeaker in the listening environment to provide the effect that the at least one virtual loudspeaker is playing back the audio output signal while at least one concealed loudspeaker that is physically located in the listening environment plays back the audio output signal.
The embodiments of the present disclosure are pointed out with particularity in the appended claims. However, other features of the various embodiments will become more apparent and will be best understood by referring to the following detailed description in conjunction with the accompany drawings in which:
As required, detailed embodiments of the present invention are disclosed herein; however, it is to be understood that the disclosed embodiments are merely exemplary of the invention that may be embodied in various and alternative forms. The figures are not necessarily to scale; some features may be exaggerated or minimized to show details of particular components. Therefore, specific structural and functional details disclosed herein are not to be interpreted as limiting, but merely as a representative basis for teaching one skilled in the art to variously employ the present invention.
It is recognized that the controllers/devices as disclosed herein and in the attached Appendix may include any number of microprocessors, integrated circuits, memory devices (e.g., FLASH, random access memory (RAM), read only memory (ROM), electrically programmable read only memory (EPROM), electrically erasable programmable read only memory (EEPROM), or other suitable variants thereof), and software which co-act with one another to perform operation(s) disclosed herein. In addition, such controllers as disclosed utilizes one or more microprocessors to execute a computer-program that is embodied in a non-transitory computer readable medium that is programmed to perform any number of the functions as disclosed. Further, the controller(s) as provided herein includes a housing and the various number of microprocessors, integrated circuits, and memory devices ((e.g., FLASH, random access memory (RAM), read only memory (ROM), electrically programmable read only memory (EPROM), electrically erasable programmable read only memory (EEPROM)) positioned within the housing. The controller(s) as disclosed also include hardware-based inputs and outputs for receiving and transmitting data, respectively from and to other hardware-based devices as discussed herein.
It is recognized that the audio input source 102, the HRTF block 104, and the crosstalk cancellation block 106 may be incorporated into a single device such as an audio playback device 110. Alternatively, the audio play back device 100 may be distributed into a plurality of devices. The audio playback device 110 includes at least one controller 103 (“the controller 121”) to execute any number of the operations as disclosed herein. In one example, the audio playback device 110 may correspond to a mobile device such as, but not limited to, a cell phone (e.g., smartphone, i-Phone S, etc.), a handheld computer (e.g., a personal digital assistant (“PDA”), etc.), a tablet (e.g., i-Pad @, etc.), a portable audio device (e.g., i-Pad®, etc.) or other suitable variant thereof. It is also recognized that the audio playback device 110 may be used in connection with a home audio system (e.g., TV, a media player such as, for example, a Blu-ray player, etc.) or for any system for that matter that generally plays back audio in a surround sound formal. In general, the crosstalk cancellation block 106 is configured to reproduce a desired signal at a single target position while cancelling out the sound at all remaining target positions.
The audio playback device 110 also includes a user interface 111 to enable listeners the ability to assign virtual locations for the loudspeaker(s) 108. This aspect will be discussed in more detail below. The audio playback device 110 may include any number of transceivers 112 to facilitate wireless communication such as the wireless receipt of audio data and/or to facilitate wireless transmission of audio data to the loudspeaker 108 for playback in a listening environment 115. The audio playback device 110 may utilize any number of wireless protocols to facilitate wireless communication. For example, the wireless protocol may include Bluetooth @, WiFi @D, etc. The audio playback device 110 includes a controller 114 for executing code to enable the transmission of the audio data to the speakers 108. The audio data may be in the form of, but not limited to, the following file formats: wav, mp3, wma, etc. The audio playback device 110 is further configured to communicate via the WiFi connection to a server 116 in order to retrieve and store any number of the foregoing audio data for playback.
The audio playback device 110 may also be configured to transmit data to the loudspeaker 108 when the audio playback device 110 receives the data from an external source via the one or more of the transceivers 112. For example, the audio playback device 110 may receive the audio data as broadcast from a radio station (or tower) via frequency modulation (FM) or amplitude modulation (AM), etc. It is also recognized that the audio playback device 110 and the loudspeakers 108 may be integrated with at least one of the loudspeakers 108 and wirelessly communicate with any remaining loudspeakers 108.
The audio playback device 110 may be implemented in any system that utilizes, but not limited to, a surround sound format. The audio playback device 110 may be used as a virtual upmixer and create any number of artificial multi-channel sources. Various nonlimiting examples include 5.1 channels or 7.1 channels. With respect to surround sound, this may involve various loudspeakers that surround the listener. Surround sound may involve a technique that enriches the fidelity and depth of audio reproduction by using multiple audio channels from loudspeakers that surrounds one or more listeners. In general, aspects disclosed herein may provide a virtualizer 109 that is provided by the audio playback device 108 (e.g., via the HRTF block 104 and the crosstalk cancellation block 106) to deliver a full surround sound experience without visible loudspeakers 108. It is also recognized that aspects disclosed herein may apply to any number of multi-channel encoding techniques such as, but not limited to, Dolby®, THX®, etc.
For example, the virtualizer 109 may be defined as block that includes crosstalk cancellation (via the crosstalk cancellation block 106) and HRTF (via the HRTF block 108) to provide a stereo output that mimics a similar sensation of a fully calibrated multi-channel audio system. In one example, the audio input source 102 may decode audio input sources into multi-channel audio (e.g., 5.1, 7.1, etc.). The HRTF block 104 may position audio objects to a corresponding location in space utilizing HRTF. The HRTF generally corresponds to a transfer function that describes the manner in which sound from a sound source will arrive at an eardrum of a user. This may also include influencing the shape of the listener's outer ear, the shape of the listener's head and body, and the acoustic characteristics of the environment. The HRTF may also affect if a listener can accurately perceive what direction sound is coming from. The crosstalk cancellation block 106 may cancel out the stereo cross contamination terms to widen a sound field in the listening environment. In general, audio signals that include directional cues are to be reproduced at the ear's of the listener. However, crosstalk may smear these cues and adversely affect the localization of sound. Thus, crosstalk cancellation may be used so that sound from the loudspeakers 108 to contralateral ears may be minimized.
As noted above, in order to achieve wider sound field perception, the audio playback device 110 may employ the crosstalk cancellation block 106 to perform crosstalk cancellation. Thus, let us assume that G(rk) serves as the crosstalk cancellation function between the kth speaker and an optimized position r. The signals received by two cars are given by s,
s=Hq Eq. (1)
where H is the transfer function between the cars of the listener 150 and the loudspeakers 108a, 108b, and q is the source strength, which may be written as,
q=Gd Eq. (2)
where G is the matrix of G(rk) and d is the input signal. The error between the input signal and the received signal may be,
e=d−s Eq. (3)
To minimize the error signals e, G may be given by:
G=[HHH]−1HH Eq. (4)
To position audio objects to the corresponding location in space, the Eq. (4) may be modified as,
G
T
=C
F[HHH]−1HH Eq. (5)
where CF is the matrix of the head-related transfer functions.
In operation 404, the audio playback device 110 applies the HRTF to the audio input signal. In operation 406, the audio playback device 110 applies the crosstalk cancellation to the audio input signal. In operation 408, the audio playback device 110 generates an audio output signal after applying the HRTF and the crosstalk cancellation to the audio input signal. In operation 410, the audio playback device 110 generates a location for at least one virtual loudspeaker 200a-200e in the listening environment 115 to provide the effect that the at least one virtual loudspeaker 200a-200e is playing back the audio output signal while the loudspeaker 108 in the listening environment 115 plays back the audio output signal.
While exemplary embodiments are described above, it is not intended that these embodiments describe all possible forms of the invention. Rather, the words used in the specification are words of description rather than limitation, and it is understood that various changes may be made without departing from the spirit and scope of the invention. Additionally, the features of various implementing embodiments may be combined to form further embodiments of the invention.
This application claims the benefit of U.S. provisional application Ser. No. 62/955,844 filed Dec. 31, 2019, the disclosure of which is hereby incorporated in its entirety by reference herein.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/US2020/067466 | 12/30/2020 | WO |
Number | Date | Country | |
---|---|---|---|
62955844 | Dec 2019 | US |