1. Field of the Invention
The present invention relates to audio products. More particularly, the present invention relates to methods for enhancing audio transmitted from a portable media player.
2. Description of the Related Art
With the advancements in audio technology, increased memory storage, and increased computer processing power, attention has been placed on providing advanced audio effects on computers through software and in some cases add-in boards such as PCI boards.
Concurrently, the field of portable audio has also expanded dramatically. Current portable devices are capable of storing and rendering thousands of music and other audio tracks over headphones or other sound reproduction transducers connected to the portable players. Many of these devices are small enough to be carried in a user's pocket. Unfortunately, many of the advanced audio effects presently available and expected to be available in the future require processing demands so large that they cannot be easily integrated within the portable devices without adversely affecting the portability of the devices. Further constraints are imposed on the portable device by power requirements and the need to load or manage content in the device. Content is conventionally loaded into the device by tethering the device to a host PC, for example by a USB cable. Typically, a content management program is run on the PC in order to download the content.
In order to conserve storage space in the portable device and to minimize the bandwidth requirements for transmission of the content, audio content loaded from the host PC is compressed, for example according to an mp3 or wma codec. These codecs provide good fidelity when decoded but nonetheless provide music that is lacking in some attributes. Unfortunately, portable media players provide very few options for the user to modify the perceptual attributes of the sound rendered by the portable device. These typically include nothing more than equalization and fail to suitably address the customized listening needs of the user or the environment.
Portable media devices often place heavy demands on power from batteries integrated into the device. These batteries are often rechargeable but need at least intermittent connection to a power supply for recharging.
The typical user's experience thus involves listening to “flat” music through headphones, periods where the device's batteries are connected to a power supply for recharging, and periods of connecting the portable device to a host PC for managing the device's content. Unfortunately, these steps are often incompatible with one another, hence requiring multiple cables and familiarity with various software programs.
It is therefore desirable to provide a device and methods capable of improving the user experience for the user of a portable media player.
The present invention provides a module that incorporates advanced audio enhancement effects. These advanced effects cover a wide range including but not limited to 3D audio spatialization (virtualization) and transient enhancement. The device is configured for receiving an audio input signal, in analog or digital form, enhancing the received signal, preferably through a user control on the portable device, and transmitting an output audio signal having the enhanced effects embedded in the signal. The output signal may be either digital or analog. Further, the output signal may be a mono signal, a stereo signal, a MIDI signal, or a multichannel signal (such as 2.1, 5.1, 7.1, etc.).
In accordance with one embodiment, an external audio enhancement module is provided. The module is configured to receive an audio input signal from a portable media player such as, for a non-limiting example, a Zen Vision M portable media player manufactured and distributed by Creative Technologies LTD. The module is further configured to process the received audio signal using advanced audio processing techniques and to transmit the processed signal to an audio reproduction system. Preferably, the module is further configured to receive the device in a docking connector and to provide synchronization (content management), battery power management (recharging and external supply), and transmission of rendered content to a plurality of devices through a wired or wireless connection.
In accordance with yet another embodiment, a method for enhancing an audio signal is provided. An audio signal is received. The audio signal is converted to a digital representation if necessary and digitally filtered to enhance the perceptual characteristics of the audio signal. The enhancements include upmixing the signal to a multichannel representation and virtualizing the multichannel representation for rendering over a two-channel playback system. In another embodiment the characteristics of the audio signal are dynamically enhanced, in response to the energy level of the signal or transient detection. In yet another embodiment, the processing includes any combination of upmixing, virtualization, and dynamic enhancement. In a preferred embodiment, the method includes enabling the user to control the amount of audio enhancement interactively through the use of a user control integrated with the portable audio enhancement module.
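The method above can be read as a chain of optional processing stages followed by a user-controlled blend of the unprocessed and processed signals. The following is a minimal sketch, assuming a mono block of floating-point samples; the names `run_chain` and `half_gain` and the `amount` parameter are hypothetical and are not taken from the described module.

```python
from typing import Callable, List, Sequence

# A stage maps a block of samples to a processed block of equal length.
Stage = Callable[[List[float]], List[float]]


def run_chain(samples: Sequence[float], stages: Sequence[Stage], amount: float) -> List[float]:
    """Apply each stage in order, then blend dry and processed signals.

    amount is a hypothetical user control in [0, 1]:
    0.0 returns the unprocessed input, 1.0 the fully processed signal.
    """
    if not 0.0 <= amount <= 1.0:
        raise ValueError("amount must be between 0 and 1")
    dry = list(samples)
    wet = list(samples)
    for stage in stages:
        wet = stage(wet)
    if len(wet) != len(dry):
        raise ValueError("stages must preserve block length")
    return [(1.0 - amount) * d + amount * w for d, w in zip(dry, wet)]


def half_gain(block: List[float]) -> List[float]:
    """Placeholder stage standing in for a real enhancement."""
    return [0.5 * x for x in block]
```

Any combination of upmixing, virtualization, and dynamic enhancement could be expressed as entries in `stages`; a single `amount` value is only one possible way to expose the interactive control.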
These and other features and advantages of the present invention are described below with reference to the drawings.
Reference will now be made in detail to preferred embodiments of the invention. Examples of the preferred embodiments are illustrated in the accompanying drawings. While the invention will be described in conjunction with these preferred embodiments, it will be understood that it is not intended to limit the invention to such preferred embodiments. On the contrary, it is intended to cover alternatives, modifications, and equivalents as may be included within the spirit and scope of the invention as defined by the appended claims. In the following description, numerous specific details are set forth in order to provide a thorough understanding of the present invention. The present invention may be practiced without some or all of these specific details. In other instances, well known mechanisms have not been described in detail in order not to unnecessarily obscure the present invention.
It should be noted herein that throughout the various drawings like numerals refer to like parts. The various drawings illustrated and described herein are used to illustrate various features of the invention. To the extent that a particular feature is illustrated in one drawing and not another, except where otherwise indicated or where the structure inherently prohibits incorporation of the feature, it is to be understood that those features may be adapted to be included in the embodiments represented in the other figures, as if they were fully illustrated in those figures. Unless otherwise indicated, the drawings are not necessarily to scale. Any dimensions provided on the drawings are not intended to be limiting as to the scope of the invention but merely illustrative.
In accordance with one embodiment, a module that incorporates advanced audio enhancement effects is provided. These advanced effects cover a wide range including but not limited to 3D audio spatialization (virtualization) and transient enhancement. The device is configured for receiving an audio input signal, in analog or digital form, enhancing the received signal, and transmitting an output audio signal having the enhanced effects embedded in the signal. The output signal may be either digital or analog. Further, the output signal may be a mono signal, a stereo signal, a MIDI signal, or a multichannel signal (such as 2.1, 5.1, 7.1, etc.).
Advanced audio enhancements are now provided for use with Personal Computers, such as in conjunction with plug in audio sound cards. The present invention, in various embodiments, makes these audio enhancements available to portable media devices such as mp3 players in an audio enhancement module. These enhancements include but are not limited to 3D audio spatialization or virtualization effects and transient enhancement techniques, as will be explained in further detail below.
Preferably a module is provided for integration or connection to other electronic products in order to enable a system to include audio enhancement algorithms. In some embodiments, the audio enhancements include but are not limited to dramatic improvements to the recording and playback of MP3 music, enhancement to the listening experiences for movies, and improvements in general to the capabilities, performance and quality of audio and music creation. Preferably, the module is configured to enable the user to select from several surround audio enhancement effects and to control the amount of the effect to suit the user's preferences. Many of these audio enhancements are commercially available in computer sound cards provided by Creative Technologies Ltd. For example, the X-Fi soundcard provided by Creative Technologies Ltd. presently allows a user options including MultiSpeaker Surround 3D Surround, MultiSpeaker Surround 3D Headphone, MultiSpeaker Surround 3D Virtual, X-Fi Crystallizer, Dolby Digital decoder, and DTS-Interactive 5.1 Encoder.
In one embodiment, a module is provided that permits selection of the above mentioned types of audio enhancements through GPIO pin (level detect) or I2C bus, and Digital input/output. The audio enhancements preferably include but are not limited to the enhancement effects which are described in further detail below.
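As one illustration of level-detect selection, the logic levels on a few GPIO pins can be read as a binary index into a table of effects. The sketch below assumes three select pins and a hypothetical ordering of effects named in this description; the actual pin count and encoding are not specified here, and `MODES` and `select_mode` are illustrative names only.

```python
from typing import Sequence

# Hypothetical mode table; the ordering is an assumption for illustration.
MODES = (
    "bypass",
    "CMSS-3D Surround",
    "CMSS-3D Headphone",
    "CMSS-3D Virtual",
    "Crystallizer",
)


def select_mode(pin_levels: Sequence[bool]) -> str:
    """Decode GPIO levels (most significant pin first) into a mode name.

    Unassigned codes fall back to "bypass" so that an unexpected pin
    pattern never selects an undefined effect.
    """
    index = 0
    for level in pin_levels:
        index = (index << 1) | (1 if level else 0)
    return MODES[index] if index < len(MODES) else "bypass"
```

An I2C-controlled variant could carry the same index in a register write instead of on discrete pins.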
X-Fi CMSS®-3D enables a user to upgrade MP3 music and movies into surround sound with headphones or multichannel speakers. This is important because it enables a consumer to enjoy stereo MP3s and movies in surround sound created with X-Fi CMSS®-3D technology. While the present invention is intended to include the basic upmixing of stereo content, the audio enhancement module is preferably configured to remix the audio intelligently to match the speaker system (including headphones). The audio enhancement techniques that may be configured into the module include advanced techniques to extract specific audio elements so that music and movies sound more alive, i.e., in one embodiment, to add “punchiness”. The potential benefits when listening to or creating music are abundant. For example, static stereo music may be converted into surround sound suitable for playback over multichannel speakers. Although in one embodiment the enhanced listening experience is provided when listening to music tracks, the invention includes other forms of media content. For example, videos or movies may be enhanced to provide surround sound over headphones or speakers. Thus, an example surround sound processing technique suitable for integration into the module to enjoy movies is the X-Fi CMSS®-3D.
Other processing techniques provided in other embodiments enable up mixing, virtualization over output channels, transient enhancement, and decoding. These are generally described with the following examples:
1. X-Fi CMSS®-3D Surround: Provides a multichannel playback for stereo music or movies.
2. X-Fi CMSS®-3D Headphone: Provides a multichannel playback experience over headphones for all types of content.
3. X-Fi CMSS®-3D Virtual: Provides a multichannel playback experience over two loudspeakers for all types of content.
4. X-Fi Crystallizer™: Provides dynamic enhancements based on detection of transients and other energy level variations.
5. Dolby Digital decoder
6. DTS-Interactive 5.1 Encoder
Preferably, the module includes at least 3D audio virtualization technologies such as X-Fi CMSS®-3D Headphone and X-Fi CMSS®-3D Virtual. These virtual technologies are designed to reproduce a natural sounding multi-channel listening experience over headphones or two loudspeakers, with both multi-channel and two-channel source formats. By providing these technologies in a module that can be connected to a portable player such as an mp3 player or portable video player, these enhanced audio effects such as virtualization may be made available to consumers on the go.
In further detail, X-Fi Creative MultiSpeaker Surround 3D (CMSS-3D) is available for rendering over both headphones and speakers. Creative's CMSS®-3D Headphone technology helps the listener forget that he or she is wearing headphones, by delivering a compelling multi-channel listening experience, with any two-channel or multi-channel audio content. This is the result of the combination of three exclusive ingredients: HRTF filters, environmental early reflections, and ambience extraction. The benefits of X-Fi CMSS®-3D Headphone include timbre preservation and improved externalization, frontalization, and front-back discrimination. Further benefits include a natural sense of immersion, for both multi-channel and two-channel sources, as well as reduced listening fatigue. Creative's CMSS®-3D Headphone technology employs advanced signal processing algorithms to place listeners in a natural, fully immersive sound field. Through these processing techniques operating with the assistance of at least one processor in the module, the auditory awareness of wearing headphones vanishes and is replaced by a transparent listening experience and the sensation of “being there”.
Creative's CMSS®-3D Virtual technology provides a convincing multi-channel listening illusion using only two loudspeakers for a listener located at the “sweet spot”, with any two-channel or multi-channel audio content. It combines the following ingredients: HRTF filters, a cross-talk canceller, and ambience extraction. The benefits of X-Fi CMSS®-3D Virtual include timbre preservation, convincing side and rear virtual loudspeaker localization, and a natural sense of immersion, with both multi-channel and two-channel sources. Preferably the module is configured to provide calibration according to the placement of the loudspeakers with respect to the listener. That is, the user can control the amount of the virtualization effect to adjust for the loudspeaker positions.
Transient enhancement is provided in one embodiment by Creative's X-Fi Crystallizer. This is an intelligent, automated audio-restoration processor carefully designed to bring the full benefit of audio playback to 16-bit legacy audio content. Crystallizer selectively identifies significant transients in the original 16-bit audio playback stream and dynamically enhances these to compensate for the studio mastering compromises inherent in the limited dynamic range of CD audio. The end result of applying Crystallizer enhancement depends to some extent on the details of the content to which it is applied and on the user-specified degree of enhancement. In general, though, Crystallizer produces crisper high frequencies, punchier mid-range percussion (snare drums, congas) and note onsets, and stronger kick bass hits. This audio enhancement technique is thought to be especially beneficial to listeners of compressed music such as music encoded in the mp3, wma, or ATRAC formats. This configuration is believed to enhance MP3s and movies to the point of even sounding better to many listeners than content available from the original CD or DVD. It is a low-impact algorithm designed to improve the dynamic range of an audio stream by enhancing the natural transients. The transient enhancement provided by the Crystallizer algorithms can be very effective at sharpening an audio track (particularly compressed tracks, such as MP3 files), brightening sound effects, etc. Music benefits include enhanced low and high frequencies and improved dynamics. Movie benefits include adding realism to explosions, gun-shots and high-impact audio sequences.
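One common way to detect transients, offered here only as an illustrative sketch and not as the Crystallizer algorithm, is to compare a fast and a slow envelope follower and to boost samples while the fast envelope leads the slow one. The function name, the smoothing coefficients, and the `amount` control below are assumed values chosen for illustration.

```python
from typing import List, Sequence


def enhance_transients(
    samples: Sequence[float],
    amount: float = 0.5,
    fast: float = 0.5,
    slow: float = 0.01,
) -> List[float]:
    """Boost samples where a fast envelope rises above a slow one.

    fast and slow are one-pole smoothing coefficients in (0, 1];
    amount scales the extra gain applied around detected onsets.
    """
    env_fast = 0.0
    env_slow = 0.0
    out: List[float] = []
    for x in samples:
        level = abs(x)
        env_fast += fast * (level - env_fast)
        env_slow += slow * (level - env_slow)
        # Positive only while the fast envelope leads the slow one,
        # which happens around an onset and fades on steady signals.
        excess = max(0.0, env_fast - env_slow)
        gain = 1.0 + amount * excess / (env_fast + 1e-12)
        out.append(x * gain)
    return out
```

With these assumptions the gain never drops below 1 and never exceeds `1 + amount`, so `amount` behaves like a user-specified degree of enhancement.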
In this embodiment, the portable media player is preferably docked in the module 106. Further details as to the connections (ports) and a sample form factor are illustrated in
For example, where an analog signal is provided to the module, the module may be configured to enhance the audio with CMSS-3D surround (404). That is, a 5.1 audio signal may be generated, for example, from a two-channel input signal, its transients enhanced in Crystallizer portion 406, and the result encoded in the DTS 5.1 format in encoder 408. The signal may then be transmitted in this embodiment in digital form through the use of suitable cables to a 5.1 system 409.
Alternatively, for example for headphone use, the signal may be converted to a virtual surround signal in virtual headphone processing portion 410, enhanced in Crystallizer (transient enhancement) portion 412 and forwarded to a wireless transmitter 416 or alternatively a stereo line output 418. When the transmitter is placed into effect, the study room 420 may be equipped with a wireless receiver 210 that provides a line out connection or a headphone connection. Of course at this point, the line out connection may be connected to an amplifier for amplification through a conventional stereo system as one alternative. Though the input (from the mp3 player) is shown as analog, this is for illustrative purposes only. Input to the module 204 may be digital. It should be noted that in preferred embodiments the analog input signals are converted to digital for digital signal processing in the module. That is, the audio enhancement techniques described are preferably implemented through digital signal processing techniques.
Block 815 will preferably include cross-talk cancellation modules to preserve the virtualization effect. Crosstalk cancellation techniques are known to those of skill in the art and hence full details will not be provided herein. In similar fashion, the upmixed signal may be fed to headphone virtualization block 821 implementing technology such as Creative's CMSS-3D headphone technology for delivering a multichannel listening experience over two headphone channels. Crosstalk cancellation modules are not required in this path. Optionally, the signal is then fed to a Crystallizer (transient detection-enhancement) block 822 before output as an analog signal for delivery to a set of headphones.
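For orientation only, the textbook formulation of crosstalk cancellation for a symmetric two-loudspeaker setup inverts the 2x2 matrix of loudspeaker-to-ear transfer functions. The sketch below does this at a single frequency using complex gains; `h_same` (loudspeaker to near ear) and `h_cross` (loudspeaker to far ear) are hypothetical values, the function names are illustrative, and a practical canceller would apply the idea across the whole frequency range with regularization.

```python
from typing import Tuple

Matrix2 = Tuple[Tuple[complex, complex], Tuple[complex, complex]]


def canceller(h_same: complex, h_cross: complex) -> Matrix2:
    """Return the inverse of [[h_same, h_cross], [h_cross, h_same]]."""
    det = h_same * h_same - h_cross * h_cross
    if abs(det) < 1e-12:
        raise ValueError("acoustic paths are not invertible at this frequency")
    return ((h_same / det, -h_cross / det), (-h_cross / det, h_same / det))


def speaker_feeds(c: Matrix2, left: complex, right: complex) -> Tuple[complex, complex]:
    """Pre-filter the binaural pair to obtain the two loudspeaker feeds."""
    return (c[0][0] * left + c[0][1] * right, c[1][0] * left + c[1][1] * right)


def ears(h_same: complex, h_cross: complex, spk_l: complex, spk_r: complex) -> Tuple[complex, complex]:
    """Signals reaching the left and right ears from the two loudspeakers."""
    return (h_same * spk_l + h_cross * spk_r, h_cross * spk_l + h_same * spk_r)
```

Feeding the output of `speaker_feeds` through `ears` with the same path gains returns the original binaural pair, which is the sense in which the virtualization effect is preserved over loudspeakers. No such stage is needed for headphones, where each ear receives only its own channel.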
In one embodiment, each of the virtualization modules (i.e., for headphones and speakers) includes ambience extraction. With two-channel sources, the virtual surround loudspeakers are used to reproduce a natural enveloping ambience based on the ambient information already present in the recording. In another embodiment, a sound expansion processing technique is included in the module. For example, Creative's StereoXpand ambience extraction algorithm (also employed in CMSS-3D Surround and CMSS-3D Headphone) may be integrated into the module. In one embodiment, in the virtualization blocks (e.g., blocks 815, 821) a 3D surround algorithm identifies ambient sound components in the original recording (such as room reverberation or applause) to derive the surround signals and perform center-channel extraction. This provides many benefits including enhancing the natural sense of immersion and depth without introducing overwhelming or unnatural ambience; enlarging the “sweet spot” in the listening room (by anchoring center-panned sounds in the front center channel and limiting the leakage of localized sounds in the surround channels); and preserving the original frontal stereo image in balance, width and timbre. It does not introduce instability in the frontal stereo image or in the surrounding ambience. It does not introduce distortions in the surround channels, even with perceptually encoded sources such as MP3 or WMA.
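As a rough illustration of deriving center and surround signals from a two-channel source, the sketch below uses a simple passive sum/difference matrix. This is a named substitute and not the StereoXpand or CMSS-3D algorithm: the sum emphasizes center-panned content, while the difference emphasizes content that differs between channels, which is often ambient. The function name and the 0.5 scaling are assumptions.

```python
from typing import Dict, List, Sequence


def passive_upmix(left: Sequence[float], right: Sequence[float]) -> Dict[str, List[float]]:
    """Split stereo into front, center, and surround feeds.

    center   = (L + R) / 2, strongest for center-panned sounds
    surround = (L - R) / 2, zero for center-panned sounds
    """
    if len(left) != len(right):
        raise ValueError("channels must have equal length")
    center = [0.5 * (l + r) for l, r in zip(left, right)]
    diff = [0.5 * (l - r) for l, r in zip(left, right)]
    return {
        "front_left": list(left),
        "front_right": list(right),
        "center": center,
        "surround_left": diff,
        "surround_right": [-d for d in diff],
    }
```

A fixed matrix like this leaks localized sounds into the surround feeds, which is one reason a signal-adaptive extraction of the kind described above is preferred.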
Upmixing and the subsequent virtualization may be used to provide the perception of multiple speaker locations, for example including 5.1, 6.1, or 7.1 playback systems. The virtualization, such as provided by blocks 815 and 821, uses Head Related Transfer Function (HRTF) filters to provide immersive 3D audio rendering, timbre preservation, improved externalization, frontalization and front-back discrimination.
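Virtualization with HRTF filters can be sketched as convolving each virtual loudspeaker feed with a left-ear and a right-ear impulse response and summing the results into two output channels. The example below is a minimal time-domain version; the impulse responses passed in are placeholders rather than measured HRTFs, and the function names are hypothetical.

```python
from typing import Dict, List, Sequence, Tuple


def convolve(x: Sequence[float], h: Sequence[float]) -> List[float]:
    """Direct-form linear convolution of signal x with impulse response h."""
    if not x or not h:
        return []
    y = [0.0] * (len(x) + len(h) - 1)
    for i, xi in enumerate(x):
        for j, hj in enumerate(h):
            y[i + j] += xi * hj
    return y


def virtualize(
    feeds: Dict[str, Sequence[float]],
    hrirs: Dict[str, Tuple[Sequence[float], Sequence[float]]],
) -> Tuple[List[float], List[float]]:
    """Mix named speaker feeds down to two ear signals.

    hrirs maps each speaker name to (left-ear, right-ear) impulse responses.
    """
    rendered = []
    for name, sig in feeds.items():
        ir_l, ir_r = hrirs[name]
        rendered.append((convolve(sig, ir_l), convolve(sig, ir_r)))
    length = max((max(len(l), len(r)) for l, r in rendered), default=0)
    out_l = [0.0] * length
    out_r = [0.0] * length
    for l, r in rendered:
        for n, v in enumerate(l):
            out_l[n] += v
        for n, v in enumerate(r):
            out_r[n] += v
    return out_l, out_r
```

For example, a 5.1 upmix would supply one entry per virtual loudspeaker in `feeds`, each paired with the impulse responses for that loudspeaker's direction.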
Portable players have become quite prevalent. Users with MP3 players listen to MP3 music, podcasts, voice recordings, and news. Video and photo support is among the latest trends in portable players, for example enabling a user to play MPEG4 or DIVX video. These trends are enhanced through the embodiments of the current invention by allowing an in-home and preferably portable audio interface with advanced audio enhancements such as virtualization (X-Fi CMSS-3D Virtual or Headphone) and transient enhancement (Crystallizer) effects.
In addition, the modules of the various embodiments are preferably configured to up-mix stereo audio to multi-channel DTS Interactive format or other suitable formats. These can then be played in a home AV Receiver using a digital interface.
In accordance with another embodiment, wireless music is provided from the source portable media player. In one embodiment, convenience is provided by portable player docking. The docking can be used to transmit control and data signals to and from the module and to provide automatic charging for players such as an iPod and Zen Vision.
The module can be configured in another embodiment to support “multi-cast” wireless receivers with line out or headphone out. Hence, a single audio enhancement module may be configured to supply a plurality of rooms or locations, each with its own set of loudspeakers or headphones.
The USB port is available to communicate with the PC for downloading content to the portable media player or simultaneously for charging. Preferably the separate module, such as enclosed in a suitable enclosure, is powered by a DC adapter and is able to charge the portable media player (e.g., mp3 player) battery in normal mode.
The transient enhancements in various embodiments preferably include enhancing musical dynamics by emphasizing sharp percussive sounds and transients, thereby creating a punchier and more dynamic listening experience.
By using the modules described in various embodiments of the present invention, an enhanced user listening experience is provided. Many different types of music audio signals may be processed and transmitted to rendering speakers, systems, or headphones. For example, enhanced audio provided at the output of the module may range from two-channel analog to a high quality multi-channel digital signal. Further, transients may be detected and enhanced, with the amount of the transient enhancement processing under the control of the user during playback.
Example, non-limiting specifications of one embodiment of the module include an ADC resolution, in stereo, of 16-bit, 44.1 kHz, and a DAC resolution, for example at the wireless receiver in one embodiment, in stereo, of 16-bit, 44.1 kHz. Table 1 below identifies non-limiting examples of controls provided on the module in one embodiment. The rotary control in one embodiment provides the user with level adjustment of volume, virtualization, and transient enhancement.
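The stated converter format implies a fixed uncompressed data rate, which bounds what a wired or wireless link carrying the unencoded stereo signal must sustain. The arithmetic is sketched below; the function and constant names are hypothetical, and the figure excludes any framing or protocol overhead.

```python
def pcm_bit_rate(sample_rate_hz: int, bits_per_sample: int, channels: int) -> int:
    """Uncompressed PCM payload rate in bits per second."""
    return sample_rate_hz * bits_per_sample * channels


# 16-bit, 44.1 kHz stereo, as in the example specification.
STEREO_16_44K1 = pcm_bit_rate(44_100, 16, 2)  # 1_411_200 bit/s
```

That is 1,411,200 bits per second, or 176,400 bytes per second, for the stereo case.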
Although the foregoing invention has been described in some detail for purposes of clarity of understanding, it will be apparent that certain changes and modifications may be practiced within the scope of the appended claims. Accordingly, the present embodiments are to be considered as illustrative and not restrictive, and the invention is not to be limited to the details given herein, but may be modified within the scope and equivalents of the appended claims.
This application claims priority from provisional U.S. Patent Application Ser. No. 60/806,454, filed Jun. 30, 2006, titled “Audio Enhancement Module for Portable Media Player” the disclosure of which is incorporated by reference in its entirety. This application is related to and incorporates by reference for all purposes U.S. patent application Ser. No. 11/744,465, filed May 4, 2007, titled “METHOD FOR ENHANCING AUDIO SIGNALS” which claims priority from provisional U.S. Patent Application Ser. No. 60/746,625, filed May 5, 2006.
Number | Name | Date | Kind |
---|---|---|---|
7970144 | Avendano et al. | Jun 2011 | B1 |
20040049379 | Thumpudi et al. | Mar 2004 | A1 |
20060018486 | Neoran et al. | Jan 2006 | A1 |
20070055395 | Debettencourt et al. | Mar 2007 | A1 |
Number | Date | Country |
---|---|---|
2006-094367 | Apr 2006 | JP |
Number | Date | Country | |
---|---|---|---|
20080008324 A1 | Jan 2008 | US |
Number | Date | Country | |
---|---|---|---|
60806454 | Jun 2006 | US |