1. Technical Field
The present invention relates to a sound effect applying apparatus and a sound effect applying program for providing input voices with effects.
2. Related Art
As a method for applying distortive feeling to instrumental sounds and human voices, there has been known the distortion technology distorts input sounds by clipping input waveforms.
Further, there is proposed a sound effect applying apparatus in Japanese Non-examined Patent Publication No. 2003-288095. Based on input control parameters, the sound effect applying apparatus controls individual magnitudes of harmonic components and nonharmonic components in a sound to be synthesized so as to control the breathiness magnitude.
While there is known the method of applying the distortion to input sounds as mentioned above, it is desired to apply realistic distortion more meaningful to sounds.
It is therefore an object of the present invention to provide a sound effect applying apparatus and a sound effect applying program capable of applying a realistic distortion effect to input voices.
To achieve the above-mentioned object, the sound effect applying apparatus according to the present invention comprises: an input part that frequency-analyzes an input signal of sound or voice for detecting a plurality of local peaks of harmonics contained in the input signal; a subharmonics provision part that adds a spectrum component of subharmonics between the detected local peaks so as to provide the input signal with a sound effect; and an output part that converts the input signal of a frequency domain containing the added spectrum component into an output signal of a time domain for generating the sound or voice provided with the sound effect.
In one form, the subharmonics provision part adds between the local peaks a variable spectrum component having a gain which varies irregularly. For example, the subharmonics provision part adds the variable spectrum component in the form of a mixture of a plurality of spectrum components which have the same frequency but which have phase differences irregularly varying with one another.
Preferably, the subharmonics provision part further changes the gain of the variable spectrum component in accordance with a gain of the input signal. For example, the subharmonics provision part increases the gain of the variable spectrum component as the gain of the input signal increases, and holds the gain of the variable spectrum component when the gain of the input signal exceeds a given level.
Preferably, the subharmonics provision part adjusts parameters of the variable spectrum component to be added in accordance with a pitch of the input signal, the parameters specifying at least one of a type, a frequency and a gain of the variable spectrum component.
In another form, the subharmonics provision part adds a plurality of spectrum components having different frequencies between one local peak and another local peak next to said one local peak.
Preferably, the subharmonics provision part changes the gain of the spectrum components in accordance with a gain of the input signal. For example, the subharmonics-provision part increases the gain of the spectrum components as the gain of the input signal increases, and holds the gain of the spectrum components when the gain of the input signal exceeds a given level.
Preferably, the subharmonics provision part adjusts parameters of the spectrum components to be added in accordance with a pitch of the input signal, the parameters specifying at least one of types, frequencies, gains and numbers of the spectrum components.
The sound effect applying program according to the present invention is executable by a computer to perform a method comprising the steps of: frequency-analyzing an input signal of sound or voice for detecting a plurality of local peaks of harmonics contained in the input signal; adding a spectrum component of subharmonics between the detected local peaks so as to provide the input signal with a sound effect; and converting the input signal of a frequency domain containing the added spectrum component into an output signal of a time domain for generating the sound or voice provided with the sound effect.
The sound effect applying apparatus and the sound effect applying program according to the present invention can provide input voices with a more realistic distortion effect by adding subharmonics to the frequency spectrum of the input signal.
Since there is provided a spectrum component having irregularly varying gains between input voice's local peaks, the input voice can be converted into an output voice of the voice quality having creak (squeaking) distortion. Since there is provided a plurality of spectrum components having different frequencies between input voice's local peaks, the input voice can be converted into an output voice of the voice quality having growl (howling) distortion.
The effect intensity can be adjusted by specifying parameters such as types, frequencies, and gains of a spectrum component to be provided, or the number of spectrum components.
The more naturalistic voice quality conversion can be provided by controlling parameters such as types, frequencies, and gains for a spectrum component to be provided, or the number of spectrum components in accordance with an input signal's gain or pitch.
In
Reference numeral 5 denotes subharmonics provision means that performs processes in a frequency domain to provide input voices with distortion effects. The sound effect applying apparatus according to the embodiment of the present invention is described to have two types of subharmonics provision sections depending on the types of effects to be provided, i.e., a first subharmonics provision section 6 and a second subharmonics provision section 7. The subharmonics provision means 5 can provide input voices with processes performed in either or both the first subharmonics provision section 6 and the second subharmonics provision section 7.
The first subharmonics provision section 6 provides input voice with a creak (squeaking) distortion effect. The first subharmonics provision section 6 supplies spectrum components having irregularly varying gains between local peak frequencies of the input voice's frequency spectrum. The first subharmonics provision section 6 supplies spectrum components having irregularly varying gains by supplying a plurality of spectrum components having irregularly varying phase differences at the same frequency.
The second subharmonics provision section 7 provides input voice with a growl (howl) distortion effect. The second subharmonics provision section 7 supplies a plurality of spectrum components at different frequencies between local peak frequencies.
A parameter specification section 8 supplies parameters that control spectrum components provided by the first subharmonics provision section 6 and the second subharmonics provision section 7. The parameter specification section 8 supplies the first subharmonics provision section 6 and the second subharmonics provision section 7 with parameters concerning a spectrum component to be added such as its type, its frequency position (deviation from the center frequency between harmonics frequencies), its gain, and the number of spectrum components to be provided. Controlling the parameters makes it possible to adjust the intensity of effects provided by the first and second subharmonics provision sections 6 and 7. The first subharmonics provision section 6, second subharmonics provision section 7 and parameter specification section 8 collectively constitute a subharmonics provision part that adds a spectrum component of subharmonics between the detected local peaks so as to provide the input signal with a sound effect.
Reference numeral 9 denotes an inverse Fourier transform section that transforms a frequency spectrum into a time domain. In this case, the frequency spectrum of the input signal is provided with a spectrum component between local peaks by the first subharmonics provision section 6 or the second subharmonics provision section 7. Reference numeral 10 denotes an overlap and addition resynthesis section that synthesizes respective frame-based signals transformed into time-domain signals by the inverse Fourier transform section 9. Reference numeral 10 denotes an output section that outputs a voice signal supplied from the overlap and addition resynthesis section 10. The parameter specification section 8, inverse Fourier transform section 9, overlap and addition resynthesis section 10 and output section 11 collectively constitute an output part that converts the input signal of a frequency domain containing the added spectrum component into an output signal of a time domain for generating the sound or voice provided with the sound effect.
The above-mentioned constituent elements can be implemented not only as individual processing sections, but also by computer's program processes.
The following describes a subharmonics provision process performed by the first subharmonics provision section 6.
A clear voice provides the spectrum indicated by a solid line 21 in
However, the creak voice quality causes peaks (indicated by broken lines) other than the peaks corresponding to the harmonic frequencies near frequency positions (between harmonic frequencies) indicated by reference numeral 23 in
The first subharmonics provision section 6 reproduces the above-mentioned phenomenon by means of a signal process in the frequency domain. Referring now to
a) shows an input spectrum, where f0 denotes a pitch frequency.
The spectrum components in (b) and (c) are found at the same frequency positions. However, the phases in (c) are modified irregularly. Consequently, adding the spectrum components in (b) and (c) together irregularly varies the gains at frequency positions 1.5 f0, 2.5 f0, and so on. Further, adding the input spectrums in (a) can yield a spectrum containing subharmonics with irregularly varying gains. The method of generating subharmonics may be based on not only controlling phases as mentioned above, but also directly controlling gains.
In this manner, it is possible to provide input voices with the effect of creak (squeaking) voice quality.
Further, the intensity of this effect can be adjusted by changing gains for the sine-wave spectrum components in (b) and (c).
While there has been described the method of adding two sine-wave spectrum components in (b) and (c), it may be preferable to add three or more sine-wave spectrum components.
Spectrum components to be provided are not limited to sine-wave ones. They may be shaped like a triangular wave or may be extracted from a specified frequency range of previously recorded actual voice waveforms. More diversified effects become available because a user can select spectrum components to be provided according to his or her preference. Further, it may be preferable to specify types of spectrum components to be provided according to frequencies.
In addition, the intensity of effects can be adjusted by specifying how much frequency positions for the spectrum components to be provided should be deviated from the center of harmonic frequencies (deviation amount specification). Alternatively, it may be preferable to randomly vary the deviation amount.
The following describes a subharmonics provision process performed by the second subharmonics provision section 7.
Like the case in
However, it can be understood that the growl voice quality causes a plurality of peaks (indicated by broken lines in
The second subharmonics provision section 7 simulates this phenomenon to provide a distortion effect causing the growl voice quality.
This embodiment adds sine wave components for the number of n (an integer greater than or equal to 2) frequencies as subharmonics corresponding to the ith local peak in the input spectrum.
Assuming that k is 0, 1, 2, . . . , or n−1, the following equation is used to find frequency fki for the kth sine wave component to be added.
fki=(i+1)×pitchsyn+(k+1)×(1/(n+1))×pitch, (1)
In this equation, pitchsyn represents a synthesized pitch and “pitch” represents the input pitch.
This equation can add new n sine wave components at equal frequency intervals between harmonic frequencies.
Instead of evenly arranging frequencies as formulated in the above-mentioned equation, it may be preferable to add n sine wave components at random frequency intervals.
In this manner, the second subharmonics provision section 7 adds a plurality of spectrum components between the peak frequencies in the input spectrum to convert an input voice into the growl (howl) voice quality.
A user can control the number of subharmonics (n) to be added according to his or her preference to adjust the effect to be provided.
The effect intensity can be adjusted by adjusting gains for sine-wave spectrum components to be added. The effect intensity can be further fine-tuned by individually changing gains for respective sine-wave spectrum components.
Furthermore, the effect intensity can be controlled by controlling the phases of sine-wave spectrum components to be added.
Spectrum components to be provided are not limited to sine-wave ones. They may be shaped like a triangular wave or may be extracted from previously recorded actual voice waveforms. More diversified effects become available because a user can select spectrum components to be provided according to his or her preference.
The above-mentioned embodiment has no consideration for the magnitude (gain) of input voice. However, it may be more effective to vary the effect intensity in accordance with the input voice magnitude. For example, increasing the sound volume generally increases the feeling of growl (howl). On the contrary, decreasing the sound volume generally decreases the feeling of growl (howl).
The following describes another embodiment of the sound effect applying apparatus according to the present invention so as to represent such natural feeling by controlling the above-mentioned parameters in accordance with input voice characteristics such as gains and pitches.
The following differences will be clearly understood in comparison between
The parameter adjustment section 12 controls parameters supplied from the parameter specification section 8 in accordance with characteristics such as input voice's pitches and gains and supplies these parameters to the first subharmonics provision section 6 or the second subharmonics provision section 7.
This makes it possible to use parameters corresponding to characteristics such as input voice's pitches and gains and provide natural effects.
This example concerns provision of a growling effect and shows a case of varying gains of subharmonics to be added in accordance with the curve as shown in
In this manner, the growl effect decreases when the sound volume is small, making it possible to simulate the naturalness.
The effect intensity can be adjusted by controlling (A) a gain for subharmonic at the beginning of applying the effect in
There has been described the example of applying the growling effect by means of the second subharmonics provision section 7. When providing the effect, the first subharmonics provision section 6 can similarly simulate the naturalness by controlling parameters.
While the above-mentioned embodiment adjusts subharmonics gains, it may be preferable to adjust the other parameters such as the number of subharmonics, for example.
While there has been described the example of controlling parameters in accordance with input voice gains, it may be preferable to adjust parameters in accordance with input voice pitches.
The present invention can be applied to not only voice signals, but also musical instrument sounds and the like.
Number | Date | Country | Kind |
---|---|---|---|
2004-186012 | Jun 2004 | JP | national |
Number | Name | Date | Kind |
---|---|---|---|
3787602 | Okudaira | Jan 1974 | A |
3913442 | Deutsch | Oct 1975 | A |
4957030 | Suzuki | Sep 1990 | A |
4991218 | Kramer | Feb 1991 | A |
5442130 | Kitayama et al. | Aug 1995 | A |
5526431 | Shioda | Jun 1996 | A |
5536902 | Serra et al. | Jul 1996 | A |
5763807 | Clynes | Jun 1998 | A |
5781636 | Tai | Jul 1998 | A |
5862232 | Shinbara et al. | Jan 1999 | A |
5930373 | Shashoua et al. | Jul 1999 | A |
5963907 | Matsumoto | Oct 1999 | A |
6134330 | De Poortere et al. | Oct 2000 | A |
6316710 | Lindemann | Nov 2001 | B1 |
6336092 | Gibson et al. | Jan 2002 | B1 |
6504935 | Jackson | Jan 2003 | B1 |
6591240 | Abe | Jul 2003 | B1 |
6704711 | Gustafsson et al. | Mar 2004 | B2 |
7003120 | Smith et al. | Feb 2006 | B1 |
7027980 | Ramabadran et al. | Apr 2006 | B2 |
7135636 | Kemmochi et al. | Nov 2006 | B2 |
7136493 | Coats et al. | Nov 2006 | B2 |
7248702 | Packard | Jul 2007 | B2 |
7257230 | Nagatani | Aug 2007 | B2 |
7342168 | Setoguchi | Mar 2008 | B2 |
7389231 | Yoshioka et al. | Jun 2008 | B2 |
20020061109 | Aarts | May 2002 | A1 |
20030044023 | Larsen | Mar 2003 | A1 |
20030055647 | Yoshioka et al. | Mar 2003 | A1 |
20030221542 | Kenmochi | Dec 2003 | A1 |
20040011191 | Larsen | Jan 2004 | A1 |
20050004691 | Edwards | Jan 2005 | A1 |
Number | Date | Country |
---|---|---|
03-101798 | Apr 1991 | JP |
8328587 | Dec 1996 | JP |
11-175070 | Jul 1999 | JP |
2003-288095 | Oct 2003 | JP |
Entry |
---|
Gauffin, J., Granqvist, S., Hammarberg, B., Heriegård, S., Håkansson, A. “Irregularities in the voice: some perceptual experiments using synthetic voices.” 1995. Proceedings of the International Congress of Phonetic Sciences, session 24.1. vol. 2. pp. 242-245. |
Cheng-Gia Tsai, “Auditory Grouping in the Perception of Roughness Induced by Subharmonics: Empirical Findings and a qualitative Model,” ISMA, XP-002400181, Dept. of Musicology, Humboldt University Berlin, Germany (Nara), (Apr. 3, 2004). |
X. Amatriain, et al., “Spectral Modeling for Higher-level Sound Transformations,” Proceedings of Mosart Workshop on Current Research Directions in Computer Music, XP-002400179 (Naples, Italy), (2001). |
A. Loscos, J. Bonada, “Emulating Rough and Growl Voice in Spectral Domain,” Proc. of the 7th Int. Conference on Digital Audio Effects, XP 002400180, Music Technology Group of the Institut Universitari Audiovisual (Barcelona, Spain), (Oct. 8, 2004). |
Japanese Patent Office, “Notice of Rejection”, Patent Application No. 2004-186012, Drafting Date: Mar. 4, 2010, 6 pages. |
Number | Date | Country | |
---|---|---|---|
20050288921 A1 | Dec 2005 | US |