This application claims priority to Taiwan Application Serial Number 108109395, filed Mar. 19, 2019, which is herein incorporated by reference.
The present Invention relates to an audio processing method and an audio processing system. More particularly, the present invention relates to an audio processing method and an audio processing system to allow the output audio signal to become broader and more spatial.
When people heard the sound produced by a sound source, the audio signal usually reaches the left and right ear of the person at two different times, thereby generating different volume. People's brains interpret these differences in time and volume, and produce an auditory scene. Stereo is one method for producing the auditory scene, which provides the audio signal to plural speakers through plural independent sound channels. These speakers are arranged in a symmetrical manner, so that the speakers may produce the auditory scene. In general, stereo is achieved by two sound channels.
The present invention provides an audio processing method and an audio processing system for optimizing the auditory scene of an audio signal.
One aspect of the invention is directed to an audio processing method. The audio processing method includes: providing an input audio signal, in which the input audio signal is a mono audio signal; performing a transformation process on the input audio signal to transform the input audio signal from a time domain to a frequency domain, thereby obtaining an amplitude spectrum and a phase spectrum corresponding to the input audio signal; providing a processing function group, in which the processing function group includes a panning angle curve, a left channel separation curve, and a right channel separation curve; performing a panning process on the amplitude spectrum, thereby obtaining a panning amplitude signal according to the panning angle curve; performing a first broader process on the phase spectrum, thereby obtaining a left channel separation phase signal according to the left channel separation curve; performing a second broader process on the phase spectrum, thereby obtaining a right channel separation phase signal according to the right channel separation curve; performing a first inverse transformation process on the panning amplitude signal and the left channel separation phase signal, thereby obtaining an optimized left channel output audio signal corresponding to the time domain; and performing a second inverse transformation process on the panning amplitude signal and the right channel separation phase signal, thereby obtaining an optimized right channel output audio signal corresponding to the time domain.
In accordance with one or more embodiments of the invention, the panning angle curve is a curve function with a horizontal axis representing a time and a vertical axis representing a panning angle.
In accordance with one or more embodiments of the invention, each of the left channel separation curve and the right channel separation curve is a curve function with a horizontal axis representing a positive spectrum frequency and a vertical axis representing a separation phase angle.
In accordance with one or more embodiments of the invention, the panning process is configured to calculate a panning curve according to the panning angle curve and to multiply the amplitude spectrum by the panning curve, thereby obtaining the panning amplitude signal.
In accordance with one or more embodiments of the invention, the first broader process is configured to add the phase spectrum to the left channel separation curve, thereby obtaining the left channel separation phase signal. The second broader process is configured to add the phase spectrum to the right channel separation curve, thereby obtaining the right channel separation phase signal.
In accordance with one or more embodiments of the invention, the panning process is configured to control a panning angle of each frame of the Input audio signal through the panning angle curve.
In accordance with one or more embodiments of the invention, the first broader process is configured to control a separation phase angle of each of different spectra in the phase spectrum within each frame of the input audio signal through the left channel separation curve. The second broader process is configured to control the separation phase angle of each of different spectra in the phase spectrum within each frame of the input audio signal through the right channel separation curve.
Another aspect of the invention is directed to an audio processing method. The audio processing method includes: providing an Input audio signal, in which the input audio signal includes a left channel audio signal and a right channel audio signal; performing a first transformation process on the left channel audio signal to transform the left channel audio signal from a time domain to a frequency domain, thereby obtaining a left channel amplitude spectrum and a left channel phase spectrum corresponding to the left channel audio signal; performing a second transformation process on the right channel audio signal to transform the right channel audio signal from the time domain to the frequency domain, thereby obtaining a right channel amplitude spectrum and a right channel phase spectrum corresponding to the right channel audio signal; providing a processing function group, in which the processing function group includes a panning angle curve, a left channel separation curve, and a right channel separation curve; performing a first panning process on the left channel amplitude spectrum, thereby obtaining a left channel panning amplitude signal according to the panning angle curve; performing a second panning process on the right channel amplitude spectrum, thereby obtaining a right channel panning amplitude signal according to the panning angle curve; performing a first broader process on the left channel phase spectrum, thereby obtaining a left channel separation phase signal according to the left channel separation curve; performing a second broader process on the right channel phase spectrum, thereby obtaining a right channel separation phase signal according to the right channel separation curve; performing a first inverse transformation process on the left channel panning amplitude signal and the left channel separation phase signal, thereby obtaining an optimized left channel output audio signal corresponding to the time domain; and performing a second inverse transformation process on the right channel panning amplitude signal and the right channel separation phase signal, thereby obtaining an optimized right channel output audio signal corresponding to the time domain.
In accordance with one or more embodiments of the invention, the panning angle curve is a curve function with a horizontal axis representing a time and a vertical axis representing a panning angle.
In accordance with one or more embodiments of the invention, each of the left channel separation curve and the right channel separation curve is a curve function with a horizontal axis representing a positive spectrum frequency and a vertical axis representing a separation phase angle.
In accordance with one or more embodiments of the Invention, the first panning process is configured to calculate a panning curve according to the panning angle curve and to multiply the left channel amplitude spectrum by the panning curve, thereby obtaining the left channel panning amplitude signal. The second panning process is configured to calculate the panning curve and to multiply the right channel amplitude spectrum by the panning curve, thereby obtaining the right channel panning amplitude signal.
In accordance with one or more embodiments of the invention, the first broader process is configured to add the left channel phase spectrum to the left channel separation curve, thereby obtaining the left channel separation phase signal. The second broader process is configured to add the right channel phase spectrum to the right channel separation curve, thereby obtaining the right channel separation phase signal.
In accordance with one or more embodiments of the invention, the first panning process is configured to control a panning angle of each frame of the left channel audio signal through the panning angle curve. The second panning process is configured to control the panning angle of each frame of the right channel audio signal through the panning angle curve.
In accordance with one or more embodiments of the invention, the first broader process is configured to control a separation phase angle of each of different spectra in the phase spectrum within each frame of the left channel audio signal through the left channel separation curve. The second broader process is configured to control the separation phase angle of each of different spectra in the phase spectrum within each frame of the right channel audio signal through the right channel separation curve.
Another aspect of the invention is directed to an audio processing system for processing an input audio signal, in which the input audio signal includes a left channel audio signal and a right channel audio signal. The audio processing system includes: a transformation circuitry, a processing function circuitry, a panning circuitry, a broader circuitry, an inverse transformation circuitry, and an audio signal output circuitry. The transformation circuitry is configured to perform a first transformation process and a second transformation process. The first transformation process is performed on the left channel audio signal to transform the left channel audio signal from a time domain to a frequency domain, thereby obtaining a left channel amplitude spectrum and a left channel phase spectrum corresponding to the left channel audio signal. The second transformation process is performed on the right channel audio signal to transform the right channel audio signal from the time domain to the frequency domain, thereby obtaining a right channel amplitude spectrum and a right channel phase spectrum corresponding to the right channel audio signal. The processing function circuitry is configured to provide a processing function group, in which the processing function group includes a panning angle curve, a left channel separation curve, and a right channel separation curve. The panning circuitry is configured to perform a first panning process and a second panning process. The first panning process is performed on the left channel amplitude spectrum, thereby obtaining a left channel panning amplitude signal according to the panning angle curve. The second panning process is performed on the right channel amplitude spectrum, thereby obtaining a right channel panning amplitude signal according to the panning angle curve. The broader circuitry Is configured to perform a first broader process and a second broader process. The first broader process is performed on the left channel phase spectrum, thereby obtaining a left channel separation phase signal according to the left channel separation curve. The second broader process is performed on the right channel phase spectrum, thereby obtaining a right channel separation phase signal according to the right channel separation curve. The inverse transformation circuitry is configured to perform a first inverse transformation process and a second inverse transformation process. The first inverse transformation process is performed on the left channel panning amplitude signal and the left channel separation phase signal, thereby obtaining an optimized left channel output audio signal corresponding to the time domain. The second inverse transformation process is performed on the right channel panning amplitude signal and the right channel separation phase signal, thereby obtaining an optimized right channel output audio signal corresponding to the time domain. The audio signal output circuitry is configured to output the optimized left channel output audio signal and the optimized right channel output audio signal.
In accordance with one or more embodiments of the invention, the panning angle curve is a curve function with a horizontal axis representing a time and a vertical axis representing a panning angle. Each of the left channel separation curve and the right channel separation curve is a curve function with a horizontal axis representing a positive spectrum frequency and a vertical axis representing a separation phase angle.
In accordance with one or more embodiments of the invention, the first panning process is configured to calculate a panning curve according to the panning angle curve and to multiply the left channel amplitude spectrum by the panning curve, thereby obtaining the left channel panning amplitude signal. The second panning process is configured to calculate the panning curve according to the panning angle curve and to multiply the right channel amplitude spectrum by the panning curve, thereby obtaining the right channel panning amplitude signal.
In accordance with one or more embodiments of the invention, the first broader process is configured to add the left channel phase spectrum to the left channel separation curve, thereby obtaining the left channel separation phase signal. The second broader process is configured to add the right channel phase spectrum to the right channel separation curve, thereby obtaining the right channel separation phase signal.
In accordance with one or more embodiments of the invention, the first panning process is configured to control a panning angle of each frame of the left channel audio signal through the panning angle curve. The second panning process is configured to control the panning angle of each frame of the right channel audio signal through the panning angle curve.
In accordance with one or more embodiments of the invention, the first broader process is configured to control a separation phase angle of each of different spectra in the phase spectrum within each frame of the left channel audio signal through the left channel separation curve. The second broader process is configured to control the separation phase angle of each of different spectra in the phase spectrum within each frame of the right channel audio signal through the right channel separation curve.
The invention can be more fully understood by reading the following detailed description of the embodiment, with reference made to the accompanying drawings as follows:
Specific embodiments of the present invention are further described in detail below with reference to the accompanying drawings, however, the embodiments described are not intended to limit the present invention and it is not intended for the description of operation to limit the order of implementation. Moreover, any device with equivalent functions that is produced from a structure formed by a recombination of elements shall fall within the scope of the present invention. The using of “first”, “second”, etc. In the specification should be understood for identify units or data described by the same terminology, but is not referred to particular order or sequence.
The transformation circuitry 110 of the audio processing system 100 is configured to perform a transformation process on the input audio signal (i.e., the mono audio signal), so as to transform the mono audio signal from a time domain to a frequency domain, thereby obtaining an amplitude spectrum and a phase spectrum corresponding to the mono audio signal. In the first embodiment of the present invention, the transformation circuitry 110 uses Fourier transform to transform the mono audio signal from the time domain to the frequency domain, but the present invention is not limited thereto.
The processing function circuitry 120 of the audio processing system 100 is configured to provide a processing function group. The processing function group includes a panning angle curve, a left channel separation curve, and a right channel separation curve.
θ=0.01×sin 70t (1)
θ represents the panning angle, and its unit is radians. t represents the time, and its unit is second.
ΔØL(s)=ØΔ cos(2πf1s)cos(2πf2s) (2)
The right channel separation curve Illustrated in
ΔØR(S)=−ØΔ cos(2πf1s)cos(2πf2s) (3)
ΔØ represents the separation phase angle, and its unit Is radians. s represents the positive spectrum frequency, and its unit is hertz (Hz). ØΔ represents the maximum separation phase angle. f1 and f2 are preset frequency values. For example, f1=700 Hz and f2=0.5 Hz. However, the values of f1 and f2 may be adjusted according to the user requirements. It may be seen from the above expressions of the left channel separation curve and the right channel separation curve that phases of the left channel separation curve and the right channel separation curve of embodiments of the present invention are opposite to each other, but the present invention Is not limited thereto. It is noted that, in embodiments of the present invention, the panning angle curve illustrated in
Regarding
θ represents the aforementioned panning angle.
In the first embodiment of the present invention, the panning process is configured to control a panning angle of each frame of the mono audio signal through the panning angle curve. In other words, the panning process performed by the panning circuitry 130 of the audio processing system 100 may cause the enunciation position of an output audio signal to be changed, thereby enhancing the position sensing of the output audio signal. It is worth mentioning that the first embodiment of the present invention may control the panning angle of each frame of the input audio signal through the panning angle curve, so that a corresponding output audio signal may be smoothly switched at the left channel and the right channel when the panning angle curve is a continuous curve function.
The broader circuitry 140 of the audio processing system 100 includes a left channel broader circuitry and a right channel broader circuitry. The left channel broader circuitry is configured to perform a first broader process on the phase spectrum, thereby obtaining a left channel separation phase signal according to the left channel separation curve. The right channel broader circuitry is configured to perform a second broader process on the phase spectrum, thereby obtaining a right channel separation phase signal according to the right channel separation curve. In the first embodiment of the present invention, the first broader process is configured to add the phase spectrum to the left channel separation curve, thereby obtaining the left channel separation phase signal. In the first embodiment of the present invention, the second broader process is configured to add the phase spectrum to the right channel separation curve, thereby obtaining the right channel separation phase signal.
In the first embodiment of the present invention, the first broader process is configured to correspondingly adjust the broad level of the left channel audio signal. In the first embodiment of the present Invention, the second broader process is configured to correspondingly adjust the broad level of the right channel audio signal. In the first embodiment of the present invention, the first broader process is configured to control a separation phase angle of each of different spectra in the phase spectrum within each frame of the input audio signal through the left channel separation curve. In the first embodiment of the present invention, the second broader process is configured to control the separation phase angle of each of different spectra in the phase spectrum within each frame of the input audio signal through the right channel separation curve. In other words, the first broader process and the second broader process performed by the broader circuitry 140 of the audio processing system 100 may cause the output audio signal to be more stereoscopic, thereby allowing the output audio signal to become more spatial.
The inverse transformation circuitry 150 of the audio processing system 100 is configured to perform a first inverse transformation process and a second inverse transformation process. The first inverse transformation process is performed on the panning amplitude signal and the left channel separation phase signal, thereby obtaining an optimized left channel output audio signal corresponding to the time domain. The second inverse transformation process is performed on the panning amplitude signal and the right channel separation phase signal, thereby obtaining an optimized right channel output audio signal corresponding to the time domain. In the first embodiment of the present invention, each of the aforementioned first inverse transformation process and the aforementioned second inverse transformation process is inverse-Fourier transform, but the present invention is not limited thereto.
The audio signal output circuitry 160 of the audio processing system 100 is configured to output the optimized left channel output audio signal and the optimized right channel output audio signal. In the first embodiment of the present invention, the audio signal output circuitry 160 is sound card, but the present invention is not limited thereto.
It may be seen from the first embodiment of the present invention that the audio processing system 100 is configured to process the mono audio signal, thereby converting the mono audio signal into a stereo audio signal. Further, the audio processing system 100 creates the sound level change of the left channel and the right channel through the panning angle curve, so as to cause the enunciation position of the output audio signal to be changed, thereby enhancing the position sensing of the input audio signal. The audio processing system 100 further creates the phase delay of the left channel and the right channel through the left channel separation curve and the right channel separation curve, so as to cause the output audio signal to be more stereoscopic, thereby allowing the output audio signal to become more spatial. Specifically, the audio processing system 100 may convert the mono audio signal into the stereo audio signal, and may make the stereo audio effect and the broad effect of the stereo audio signal to be more obvious.
The transformation circuitry 210 of the audio processing system 200 is configured to perform a first transformation process and a second transformation process. The first transformation process is performed on the left channel audio signal to transform the left channel audio signal from the time domain to the frequency domain, thereby obtaining a left channel amplitude spectrum and a left channel phase spectrum corresponding to the left channel audio signal. The second transformation process is performed on the right channel audio signal to transform the right channel audio signal from the time domain to the frequency domain, thereby obtaining a right channel amplitude spectrum and a right channel phase spectrum corresponding to the right channel audio signal. In the second embodiment of the present invention, each of the aforementioned first transformation process and the aforementioned second transformation process is Fourier transform, but the present invention is not limited thereto.
The processing function circuitry 220 of the audio processing system 200 is configured to provide the processing function group. The processing function group includes the panning angle curve (e.g., the panning angle curve as shown in
The panning circuitry 230 of the audio processing system 200 includes a left channel panning circuitry and a right channel panning circuitry. The left channel panning circuitry is configured to perform a first panning process on the left channel amplitude spectrum, thereby obtaining a left channel panning amplitude signal according to the panning angle curve. The right channel panning circuitry is configured to perform a second panning process on the right channel amplitude spectrum, thereby obtaining a right channel panning amplitude signal according to the panning angle curve. In the second embodiment of the present invention, the first panning process is configured to calculate a panning curve according to the panning angle curve and then multiply the left channel amplitude spectrum corresponding to the left channel audio signal by the panning curve, thereby obtaining the left channel panning amplitude signal. In the second embodiment of the present invention, the second panning process is configured to calculate the panning curve according to the panning angle curve and then multiply the right channel amplitude spectrum corresponding to the right channel audio signal by the panning curve, thereby obtaining the right channel panning amplitude signal.
In the second embodiment of the present invention, the first panning process is configured to control a panning angle of each frame of the left channel audio signal through the panning angle curve, and the second panning process is configured to control a panning angle of each frame of the right channel audio signal through the panning angle curve. In other words, the first panning process and the second panning process performed by the panning circuitry 230 of the audio processing system 200 may cause the enunciation position of an output audio signal to be changed, thereby enhancing the position sensing of the output audio signal. It is worth mentioning that the second embodiment of the present invention may control the panning angle of each frame of the input audio signal through the panning angle curve, so that a corresponding output audio signal may be smoothly switched at the left channel and the right channel when the panning angle curve is a continuous curve function.
The broader circuitry 240 of the audio processing system 200 includes a left channel broader circuitry and a right channel broader circuitry. The left channel broader circuitry is configured to perform a first broader process on the phase spectrum, thereby obtaining a left channel separation phase signal according to the left channel separation curve. The right channel broader circuitry is configured to perform a second broader process on the phase spectrum, thereby obtaining a right channel separation phase signal according to the right channel separation curve. In the second embodiment of the present invention, the first broader process is configured to add the left channel phase spectrum to the left channel separation curve, thereby obtaining the left channel separation phase signal. In the second embodiment of the present invention, the second broader process is configured to add the right channel phase spectrum to the right channel separation curve, thereby obtaining the right channel separation phase signal.
In the second embodiment of the present invention, the first broader process is configured to control a separation phase angle of each of different spectra in the phase spectrum within each frame of the input audio signal through the left channel separation curve. In the second embodiment of the present invention, the second broader process is configured to control the separation phase angle of each of different spectra in the phase spectrum within each frame of the input audio signal through the right channel separation curve. In other words, the first broader process and the second broader process performed by the broader circuitry 240 of the audio processing system 200 may cause the output audio signal to be more stereoscopic, thereby allowing the output audio signal to become more spatial.
The inverse transformation circuitry 250 of the audio processing system 200 is configured to perform a first Inverse transformation process and a second inverse transformation process. The first inverse transformation process is performed on the left channel panning amplitude signal and the left channel separation phase signal, thereby obtaining an optimized left channel output audio signal corresponding to the time domain. The second inverse transformation process is performed on the right channel panning amplitude signal and the right channel separation phase signal, thereby obtaining an optimized right channel output audio signal corresponding to the time domain. In the second embodiment of the present invention, each of the aforementioned first inverse transformation process and the aforementioned second inverse transformation process is inverse-Fourier transform, but the present invention is not limited thereto.
The audio signal output circuitry 260 of the audio processing system 200 is configured to output the optimized left channel output audio signal and the optimized right channel output audio signal. In the second embodiment of the present invention, the audio signal output circuitry 260 is sound card, but the present invention Is not limited thereto.
It may be seen from the second embodiment of the present invention that the audio processing system 200 is configured to process a stereo audio signal. Further, the audio processing system 200 creates the sound level change of the left channel and the right channel through the panning angle curve, so as to cause the enunciation position of the output audio signal to be changed, thereby enhancing the position sensing of the output audio signal. The audio processing system 100 further creates the phase delay of the left channel and the right channel through the left channel separation curve and the right channel separation curve, so as to cause the output audio signal to be more stereoscopic, thereby allowing the output audio signal to become more spatial. Specifically, the audio processing system 100 may convert the mono audio signal into the stereo audio signal, and may make the stereo audio effect and the broad effect of the stereo audio signal to be more obvious.
It is noted that, various Implementations of the audio processing systems and the audio processing methods described herein may be realized in digital electronic circuitry, integrated circuitry, specially designed ASICs (application specific integrated circuits), computer hardware, firmware, software, and/or combinations thereof. These various implementations may include implementation in one or more computer programs that are executable and/or Interpretable on a programmable system including at least one programmable processor, which can be special or general purpose, coupled to receive data and instructions from, and to transmit data and instructions to, a storage system, at least one input device, and at least one output device.
Although the present invention has been described in considerable detail with reference to certain embodiments thereof, other embodiments are possible. Therefore, the spirit and scope of the appended claims should not be limited to the description of the embodiments contained herein. It will be apparent to those skilled in the art that various modifications and variations can be made to the structure of the present invention without departing from the scope or spirit of the invention. In view of the foregoing, it is intended that the present invention cover modifications and variations of this invention provided they fall within the scope of the following claims.
Number | Date | Country | Kind |
---|---|---|---|
108109395 A | Mar 2019 | TW | national |
Number | Name | Date | Kind |
---|---|---|---|
5671287 | Gerzon | Sep 1997 | A |
6198035 | Saito | Mar 2001 | B1 |
9589571 | Wuebbolt | Mar 2017 | B2 |
20100169103 | Pulkki | Jul 2010 | A1 |
20150036849 | Thompson | Feb 2015 | A1 |
Number | Date | Country |
---|---|---|
2015035492 | Mar 2015 | WO |