With almost all audio sounds received through a medium being somewhat compressed, there is an increasing need for enhancement for audio of all types such as MP3, streaming internet music, broadcast audio, audiobooks, cable programming audio, just about any kind of transmitted or broadcast audio. A main cause of the compression is reduced available bandwidth for both broadcast and internet (terrestrial and extra terrestrial) media. The future of this bandwidth is simple, there is not enough bandwidth to support the growing need and will only get worse.
The finite amount of bandwidth available, and the unavailability of much hope for expansion to meet the demand, has resulted in continuing narrowing of available bandwidths. As a result of increased demand, the quality of audio will also be lessened, this, in turn, leads to a loss in both harmonic and dynamic content. The only available solution is to add even more compression, or change the audio format to a compressed format (mp3, AAC, etc.) for a small footprint. More compression equals a much less audio quality for the end user. As far as anything with video goes, there is no way to compress video without a high degree of noticeable degradation, but degrading the audio is an acceptable practice.
In computer science and information theory, data compression, source coding, or bit-rate reduction involves encoding information using fewer bits than the original representation. Compression can be either lossy or lossless. Lossless compression reduces bits by identifying and eliminating statistical redundancy. No information is lost in lossless compression. Lossy compression reduces bits by identifying unnecessary information and removing it. The process of reducing the size of a data file is popularly referred to as data compression, although its formal name is source coding (coding done at the source of the data before it is stored or transmitted)1. 1 http://en.wikipedia.org/wiki/Audio_compression_(data)#Audio
Compression is useful because it helps reduce resource usage, such as data storage space or transmission capacity. Because compressed data must be decompressed to use, this extra processing imposes computational or other costs through decompression; this situation is far from being a free lunch. Data compression is subject to a space-time complexity trade-off. For instance, a compression scheme for video may require expensive hardware for the video to be decompressed fast enough to be viewed as it is being decompressed, and the option to decompress the video in full before watching it may be inconvenient or require additional storage. The design of data compression schemes involves trade-offs among various factors, including the degree of compression, the amount of distortion introduced (e.g., when using lossy data compression), and the computational resources required to compress and uncompress the data2. 2 See, n.1, above
There is a need for a process and system that can restore sound quality without the problems of the conventional systems and techniques.
With the inventive Max Sound Program (third version), a compressed audio can be processed and restored to much of its original quality by virtue of how the system works. The program is extremely dynamic in the way that it works and resynthesizes the missing content at a higher level than any other conventional process.
A process and system for enhancing and customizing sound comprises receiving an input audio sound and processing the input by one or more filter blocks to dynamically reshape the dynamic, phase and frequency content of the input audio sound. The filter block processed audio is combined in a sound mixer and tube harmonics are added to the output from the mixer. The tube harmonics added audio is fed to a highpass filter which sets a frequency limit of audio passing through this block. A low pass filter moves the audio in a plus or minus direction with a dynamic envelope control. A frequency divider is provided for shifting audio sound below a selected frequency down an octave. Output from this stage is edited by feeding a certain amount of each side of the audio to a corresponding opposite side. The frequency balance is adjusted by setting a band phase coherent equalizer for frequency adjustments. The original, unprocessed sound is then mixed with the processed audio. Gain is adjusted and the processed sound is outputted for use.
Details of the present invention will now be discussed by reference to the drawings.
Continuing with reference to
Next, Dynamic All-pass filter is identified by reference numeral 105. By way of background, an all-pass filter is a signal processing filter that passes all frequencies equally in gain, but changes the phase relationship between various frequencies. It does this by varying its phase shift as a function of frequency. Generally, the filter is described by the frequency at which the phase shift crosses 90° (i.e., when the input and output signals go into quadrature—when there is a quarter wavelength of delay between them). They are generally used to compensate for other undesired phase shifts that arise in the system, or for mixing with an unshifted version of the original to implement a notch comb filter. They may also be used to convert a mixed phase filter into a minimum phase filter with an equivalent magnitude response or an unstable filter into a stable filter with an equivalent magnitude response3. 3 http://en.wikipedia.org/wiki/All-pass_filter
According to an embodiment of the present invention All-pass Filter 105 is used to dynamically change the overall phase response of the audio from the previous block. The purpose of this is to minimize any undesired phase anomalies that might have occurred in the processed audio thus far. The following are controls for envelope follower in this section. Attack—amount of time it takes for the Envelope Follower to reach its predetermined level with a preferred range of 0.0 to 1.0. Release refers to that amount of time it takes before the Envelope Follower resets itself and has a preferred range of 0.0 to 1.0. Amount determines the amount of the Envelope Follower that is applied to this signal and has a range of 0 to 5 k. FC refers to filter cutoff of the lowpass filter used in this section and has a range of 100 to 20 k.
Envelope Follower 105 constantly measures an input signal and applies that same envelope to something else as a modulator or a simple shaper. By way of background, an envelope follower is an electronic circuit which produces a control signal proportional to the envelope of an input signal. Such a device is most commonly used for allowing a synthesizer to track the dynamics of a conventional instrument, such as a guitar or a piano4. 4 http://electronicmusic.wikia.com/wiki/Envelope_follower
Phase Release control 106 is different from the one in the DYN AP section. This setting selects the amount of release time for the entire DYN AP process before it begins to change again. This means that there is an adjustment range from an abrupt, or instant, change to a slow fade in type of change. This is necessary so that there are no undesirable phase anomalies introduced into the audio.
AP2 107 is an all-pass filter to further aid in correcting undesired phase anomalies in the audio processed thus far. There is a single control for the center frequency selection.
Inverter 108 inverts the phase of a particular filter bank by. In one embodiment, the phase is inverted by 180 degrees in a negative direction.
Gain 109 controls the final output gain of a single filter bank.
Following the processing above, the audio processed by the filter banks is fed to Summing Mixer (Master) 110, which mixes the audio from all four blocks into a single stereo stream. Preferably, this step includes a Master/Group volume control that allows for the overall, mixed volume adjustment from the Summing mixer.
Tube Simulation 120 is discussed next. By way of background, a Virtual Valve Amplifier (VVA) is software for simulating the sound of various valve amplifier designs. A VVA can be used to color the sound of a digital recording by adding “tube-warmth” in addition to adding subtle harmonics to enhance very old or muffled recordings. The algorithms behind a VVA are based on real vacuum tube circuits and non-linearities, mathematically simulating the large-signal transfer functions of various vacuum tubes and output transformers found in amplifier designs. A majority of this data was originally derived from extensive bench measurements on real vacuum tube amplifier circuits under varying operating conditions by engineers Craig Maier and Rick Carlson in the early 1990s. A VVA is a direct mathematical reconstruction of the same signal passing through a physical electron tube amplifier5. 5 http://en.wikipedia.org/wiki/Virtual_Valve_Amplifier
Tube Simulation 120 simulates the characteristics of the harmonics that occur when using analog tubes in an audio path. The Threshold controls the audio level at which the tube effect starts to work. Any audio below that level is passed unchanged until the amplitude reaches the assigned amount. A Bypass is provided so that audio can be sent to the next section unaffected.
Sparkle 130 shapes the harmonic content of the tube sound from the previous section and controls the amount to be added back into the processed audio before Tube Simulation 120. The controls and their preferred ranges and/or values are as follows:
HP FC—highpass filter cutoff frequency (range=4 to 10 k)
PRE TUBE BOOST—amount of boost as it enter the tube simulator (range=1 to 5)
TUBE THRESHOLD—amount of tube saturation (range=0.1 to 1.0)
POST BOOST—amount of boost after the tube simulator (range=0.1 to 2.0)
BALANCE—balance between input and tube sound mixed together (range=0.1 to 1.0)
GAIN—output gain for this section (range=−70 to 6)
MUTE—mutes this section (range=on/off).
Expander 140 takes the input audio and filters an amount determined by the cutoff (FC) that will be added back into the input audio, at the output of this section with envelope control of its level. The controls and their preferred ranges and/or values are as follows:
LP FC—lowpass frequency cutoff (range=300 to 8 k)
ENV REL—amount of time before the envelope releases (range=0.01 to .50)
ENV AMT—amount of the envelope applied to the processed signal (range=0.0 to 1.0)
BALANCE—balance between input and expander sound mixed together (range=0.0 to 1.0)
GAIN—output gain of this section (range=0.0 to 1.0)
MUTE—mute this section (range=on/off)
UMPH (Bass Expander) 150—This section is a frequency divider which takes everything from below a selected frequency and shifts it down an octave. This section has a gain for the amplitude of the octave down audio and a bypass. The lower range for this effect is fixed at 30 Hz while the upper range is adjustable, preferably from 50 Hz to 300 Hz.
Crossfade 160 feeds a small amount of Left side to Right side and Right side to Left side. According to an embodiment, there is an adjustable amount of 0.0 to 1.0 with 1.0 being a maximum of 3 dB.
EQ 170 includes three selectable EQ's here with Bypass capability. They are standard ISO centers for frequency (1 octave/10 band, ⅔ octave/15 band, ⅓ octave/31 band). Also, phase coherent allowing for extremely low phase change between frequencies.
Output Mixer 190 allows for adjustment of the WET/Dry mix and the final output gain amount 180.
Next, an exemplary embodiment of the present invention is discussed by reference to
Next, an exemplary embodiment of the present invention is discussed by reference to
Next, an exemplary embodiment of the present invention is discussed by reference to
Next, an exemplary embodiment of the present invention is discussed by reference to
Next, an exemplary embodiment of the present invention is discussed by reference to
Next, an exemplary embodiment of the present invention is discussed by reference to
Next, an exemplary embodiment of the present invention is discussed by reference to
Next, an exemplary embodiment of the present invention is discussed by reference to
Embodiments of the present invention relate to U.S. Provisional Application Ser. No. 61/804,136, filed Mar. 21, 2013, entitled “AUDIO PROGRAM”, the contents of which are incorporated by reference herein and which is a basis for a claim of priority.
Number | Date | Country | |
---|---|---|---|
61804136 | Mar 2013 | US |