The present invention relates to an audio signal processing technique.
Heretofore, there have been proposed techniques for imparting a vibrato component to an audio signal obtained by picking up a singing voice. For example, Japanese Patent Application Laid-open Publication No. HEI-7-325583 (corresponding to U.S. Pat. No. 5,536,902) (hereinafter referred to as “patent literature 1”) discloses a technique that imparts a desired audio signal with a sine wave adjusted in amplitude and cyclic period in accordance with a depth and velocity of a vibrato component extracted from an audio signal. Further, Japanese Patent Application Laid-open Publication No. 2002-73064 (hereinafter referred to as “patent literature 2”) discloses extracting a vibrato component from a singing voice and imparts a vibrato to an audio signal on the basis of the extracted vibrato component. Furthermore, “Vibrato Modeling For Synthesizing Vocal Voice Based On HMM”, by Yamada Tomohiko and four others, Study Report of Information Processing Society of Japan, May 21, 2009, Vol. 2009-MUS-80, No. 5 (hereinafter referred to as “nonparent literature 1”) discloses a technique for imparting a synthesized sound of a singing voice with a vibrato component approximated by a sine wave.
However, with the prior art techniques disclosed in patent literature 1 and non-patent literature 1, where a vibrato component is approximated by a simple sine wave, would present that problem that it is difficult to impart a natural vibrato component that is generally the same as that in an actual voice. The prior art techniques would also present a problem in imparting a variation component of other character elements than a pitch.
In view of the foregoing, it is an object of the present invention to generate a variation component that allows a character element of an audio signal to vary in an auditorily natural manner.
In order to accomplish the above-mentioned object, a first aspect of the present invention provides an improved audio processing apparatus, which comprises: a phase setting section which sets virtual phases in a time series of character values representing a character element of an audio signal; a unit wave extraction section which extracts, from the time series of character values, a plurality of unit waves demarcated in accordance with the virtual phases set by the phase setting section; and an information generation section which generates, for each of the unit waves extracted by the unit wave extraction section, unit information indicative of a character of the unit wave. In the audio processing apparatus of the present invention, a set of a plurality of unit information for individual time points (i.e., variation information) (each of the unit information is indicative of a character of a unit wave corresponding to one cyclic period of a time series of character values representing a character element of an audio signal) is generated as information indicative of variation of the character element of an audio signal. In this way, the present invention can generate an audio signal where the character element varies in an auditorily natural matter, as compared to the technique where variation of a tone pitch is approximated with a sine wave as disclosed in patent literature 1 and non-patent literature 1.
Note that the term “virtual phases” is used herein to refer to phases in a case where the time series of character values is assumed to represent a periodic waveform (e.g., sine wave). For example, the phase setting section sets virtual phases of individual extreme value points, included in the time series of character values, to predetermined values, and calculates a virtual phase of each individual time point located between the successive extreme value points by performing interpolation between the virtual phases of the extreme value points.
In a preferred implementation, the audio processing apparatus of the present invention further comprises a phase correction section which corrects the phases of the unit waves, extracted by the unit wave extraction section, so that the unit waves are brought into phase with each other, and the information generation section generates the unit information for each of the unit waves having been subjected to phase correction by the phase correction section. Because the unit waves extracted by the unit wave extraction section are adjusted or corrected to be in phase with each other (i.e., corrected so that the initial phases of the individual unit waves all become a zero phase), this preferred implementation can, for example, readily synthesize (add) a plurality of the unit information, as compared to a case where the unit waves indicated by the individual unit information differ in phase.
In a preferred implementation, the audio processing apparatus of the present invention further comprises a time adjustment section which compresses or expands each of the unit waves extracted by the unit wave extraction section, and wherein the information generation section generates the unit information for each of the unit waves having been subjected to compression or expansion by the time adjustment section. Because the unit waves extracted by the unit wave extraction section are adjusted to a predetermined length, this preferred implementation can, for example, readily synthesize (add) a plurality of the unit information, as compared to a case where the unit waves indicated by the individual unit information differ in time length.
In the aforementioned preferred implementation which includes the time adjustment section, the information generation section includes a first generation section which, for each of the unit waves, generates, as the unit information, velocity information indicative of a character value variation velocity in the time series of character values in accordance a degree of the compression or expansion by the time adjustment section. Because velocity information indicative of a variation velocity of the character element of the audio signal is generated as the unit information, this preferred implementation can advantageously generate a variation component having the variation velocity of the character element faithfully reflected therein. Further, because the velocity information is generated in accordance a degree of the compression or expansion by the time adjustment section, the preferred implementation can reduce a load involved in generation of the velocity information, as compared to a case where the velocity information is generated independently of the compression/expansion by the time adjustment section.
In a further preferred implementation, the information generation section includes a second generation section which, for each of the unit waves, generates, as the unit information, shape information indicative of a shape of a frequency spectrum of the unit wave. Because shape information indicative of a shape of a frequency spectrum of the unit wave extracted from the audio signal is generated as the unit information, this preferred implementation can advantageously generate a variation component having a variation shape of the character element faithfully reflected therein. Further, if the second generation section is constructed to generate, as the shape information, a series of coefficients within a predetermined low frequency region of the frequency spectrum of the unit wave (while ignoring a series of coefficients within a predetermined high frequency region of the frequency spectrum), the preferred implementation can also advantageously reduce a necessary capacity for storing the unit information.
According to a second aspect of the present invention, there is provided an improved audio signal processing apparatus, which comprises: a storage section which stores a set of a plurality of unit information indicative of respective characters of a plurality of unit waves extracted from a time series of character values, representing a character element of an audio signal, in accordance with virtual phases set in the time series, the unit information each including velocity information to be used for control to compress or expand a time length of a corresponding one of the unit waves, and shape information indicative of a shape of a frequency spectrum of the corresponding unit wave; a variation component generation section which generates a variation component, corresponding to the time series of character values, from the set of the unit information stored in said storage section; and a signal generation section which impart the variation component, generated by said variation component generation section, to a character element of an input audio signal. In the audio signal processing apparatus of the present invention thus arranged, a variation component is generated from a set of a plurality of the unit information extracted from the time series of character values of the audio signal, and an audio signal imparted with such a variation component is generated. Thus, the present invention can generate an audio signal where the character element varies in an auditorily natural matter, as compared to the technique where variation of a tone pitch is approximated with a sine wave as disclosed in patent literature 1 and non-patent literature 1.
The present invention may be constructed and implemented not only as the apparatus invention as discussed above but also as a method invention. Also, the present invention may be arranged and implemented as a software program for execution by a processor such as a computer or DSP, as well as a storage medium storing such a software program. The software program may be installed into a computer of a user by being stored in a computer-readable storage medium and then supplied to the user in the storage medium, or by being delivered to the computer via a communication network.
The following will describe embodiments of the present invention, but it should be appreciated that the present invention is not limited to the described embodiments and various modifications of the invention are possible without departing from the basic principles. The scope of the present invention is therefore to be determined solely by the appended claims.
For better understanding of the object and other features of the present invention, its preferred embodiments will be described hereinbelow in greater detail with reference to the accompanying drawings, in which:
As shown in
The arithmetic processing device 22 performs a plurality of functions (variation extraction section 30 and variation impartment section 40) for processing an audio signal, by executing the programs PG stored in the storage device 24. In an alternative, the plurality of functions of the arithmetic processing device 22 may be distributed on a plurality of integrated circuits, or a dedicated electronic circuit (DSP) may perform the plurality of functions.
The variation extraction section 30 generates variation information DV characterizing variation over time of a fundamental frequency f0 (namely, vibrato) of an audio signal XA and stores the thus generated variation information DV into the storage device 24. The variation impartment section 40 generates an audio signal XOUT by imparting a variation component of the fundamental frequency f0, indicated by the variation information DV generated by the variation extraction section 30, to an audio signal X
The phase setting section 34 of
First, the phase setting section 34 sequentially sets virtual phases θ(ti) for the individual time points ti, corresponding to individual extreme value points E of the frequency series F
Second, the phase setting section 34 calculates a virtual phase θ(ti) for each of the time points ti other than the extreme value points E in the frequency series FB, by performing interpolation between virtual phases θ(ti) (θ(ti)=θm) at extreme value points E located immediately before and after the time points ti in question. More specifically, the phase setting section 34 calculates a virtual phase θ(ti) for each of the time points ti located between the m-th extreme value point E and the (m+1)-th extreme value point E, by performing interpolation between the virtual phase θ(ti) (=θm) at the m-th extreme value point E and the virtual phase θ(ti) (=θm+1) at the (m+1)-th extreme value point E. Such interpolation between the virtual phases θ(ti) may be performed using any suitable one of the conventionally-known techniques (typically, the linear interpolation).
A virtual phase θ(ti) for each time point ti within a portion δs preceding the first extreme value point E of the frequency series F
Intervals between the successive extreme value points E vary in accordance with a variation velocity of the fundamental frequency f0 (i.e., vibrato velocity) of the audio signal X
The unit wave extraction section 36 of
Because the temporal variation rate (i.e., variation rate over time) of the virtual phases θ(ti) varies in accordance with the vibrato velocity of the audio signal X
The unit wave processing section 38 of
As shown in
As shown in
As further shown in
The first generation section 561 generates velocity information V(ti) from each of the unit wave WA having been processed by the phase correction section 52 or from each of the unit waves WO before processed by the phase correction section 52. The velocity information V(ti) is representative of an index value that functions as a measure of the vibrato velocity of the audio signal XA. More specifically, the first generation section 561 calculates, as the velocity information V(ti), a relative ratio between the number of samples n of the unit wave Wo at the time point ti and the number of samples N of the unit wave WB having been adjusted by the time adjustment section 54 (N/n), as shown in
The second generation section 562 of
As understood from the foregoing, the shape information S(ti) is representative of an index value characterizing the shape of the unit wave Wo of one cyclic period, corresponding to a given time point ti, of the frequency series FA. Namely, a unit wave WC generated by the inverse Fourier transform of the shape information S(ti) (although the unit wave WC is generally identical to the unit wave WB, it is indicated by a different reference character from the unit wave WB for convenience of description) has a waveform (different in shape from the unit wave Wo) having reflected therein the shape of the unit wave Wo, corresponding to the given time point ti, of the frequency series FA. For example, a maximum value of the coefficient values of the frequency spectrum Q indicated by the shape information S(ti) represents a vibrato depth (i.e., variation amplitude of the fundamental frequency f0) in the audio signal XA. The foregoing are the construction and behavior of the variation extraction section 30.
The variation impartment section 40 of
If a variable P(ti) indicative of the time point tF (hereinafter referred to as “degree of progression”) in the unit wave WC indicated by the shape information S(ti) is introduced, the frequency f(ti) is defined by Mathematical Expression (1) below.
f(ti)=IDFT{S(ti), P(ti)} (1)
The function “IDFT{S(ti), P(ti)}” represents a numerical value (fundamental frequency fO) at the time point tF, designated by the degree of progression P(ti), in the unit wave WC of a time region where the frequency spectrum Q indicated by the shape information S(ti) has been subjected to inverse Fourier transform. Thus, Mathematical Expression (1) above can be expressed by Mathematical Expression (2) below.
In Mathematical Expression (2) above, “S(ti)k” indicates a k-th coefficient value of the N coefficient values (i.e., coefficient values of the frequency spectrum Q) constituting the shape information S(ti), and “j” is an imaginary unit.
The degree of progression P(ti) in Mathematical Expressions (1) and (2) can be defined by Mathematical Expression (3) below.
P(ti)=mod {p(ti), N} (3)
The function mod {a, b} in Mathematical Expression (3) represents a remainder obtained by dividing a numerical value “a” by a numerical value “b” (a/b). Further, the variable “p(ti)” in Mathematical Expression (3) corresponds to an integrated value of velocity information V(ti) till a time point (ti−1) immediately before the time point ti and can be expressed by Mathematical Expression (4) below.
As understood from Mathematical Expression (4) above, the value of the variable “p(ti)” increases over time to exceed a predetermined value N. The reason why the variable p(ti) is divided by the predetermined value N is to allow the degree of progression P(ti) to fall at or below the predetermined value N in such a manner that a given time point tF within one unit wave WC (N samples) is designated.
For convenience of description, let it be assumed here that the unit wave WC (N samples) represented by the shape information S(ti) is a sine wave of one cyclic period and that the shape information S(ti) is the same for all of the time points ti (t1, t2, t3, . . . ) If the velocity information V(ti) for each of the time points ti is fixed to a value “1”, then the degree of progression P(ti) increases by one at each of the time points ti (like 0, 1, 2, 3, . . . ) from the time point t1 to the time point tN. Thus, of the variation component C, a frequency f(ti) at the time point ti is set at a numerical value of an i-th sample, indicated by the degree of progression P(ti), of the unit wave WC (N samples) represented by the shape information S(ti). Namely, the variation component C constitutes a sine wave having, as one cyclic period, a portion from the time point t1 to the time point tN as shown in (A) of
If the velocity information V(ti) for each of the time points ti is a value “2”, then the degree of progression P(ti) increases by two at each of the time points ti (like 0, 2, 4, 6, . . . ) from the time point t1 to the time point tN/2. Thus, of the variation component C, a frequency f(ti) at the time point ti is set at a numerical value of a 2i-th sample, indicated by the degree of progression P(ti), of the unit wave WC (N samples) represented by the shape information S(ti). Accordingly, the variation component C constitutes a sine wave having, as one cyclic period, a portion from the time point t1 to the time point tN/2 as shown in (B) of
The variation component generation section 42 of
The signal generation section 44 imparts the audio signal XB with the variation component C generated in accordance with the above-described procedure. More specifically, the signal generation section 44 adds the variation component C to the time series of fundamental frequencies extracted from the audio signal XB, and generates an audio signal XOUT having, as fundamental frequencies, a series of numerical values obtained by the addition. Of course, generation of the audio signal XOUT, having the variation component C reflected therein, may be performed using any suitable one of the conventionally-known techniques.
In the instant embodiment, as described above, unit information U(ti) (comprising shape information S(ti) and velocity information V(ti)), each indicative of a character of a unit wave WO and corresponding to one cyclic period of a frequency series FA of an audio signal XA, is sequentially generated every time point ti, and a variation component C is generated using each of the unit information U(ti). Thus, the above-described embodiment can generate an audio signal XOUT having a vibrato character of the audio signal XA faithfully and naturally reproduced therein, as compared to the disclosed techniques of patent literature 1 and non-patent literature 1 where a vibrato is approximated with a simple sine wave. More specifically, the above-described embodiment can generate a variation component C, having a vibrato waveform (including a vibrato depth) of the audio signal XA faithfully reflected therein, by applying individual shape information S(ti) of variation information DV, and it can generate a variation component C, having a vibrato velocity of the audio signal XA faithfully reflected therein, by applying individual velocity information V(ti) of the variation information DV.
Note that patent literature 2 (Japanese Patent Application Laid-open Publication No. 2002-73064) identified above discloses a technique for imparting a vibrato to a desired audio signal by use of pitch variation data indicative of a waveform of a vibrato imparted to an actual singing voice. However, with such a technique disclosed in patent literature 2, where vibrato components indicated by the individual pitch variation data differ in phase and time length, a result obtained, for example, by adding together a plurality of the pitch variation data may not become a periodic waveform (i.e., vibrato component). By contrast, the above-described embodiment generates shape information S(ti) after uniformalizing the phases and time lengths of individual unit waves WO extracted from a frequency series FA. Thus, unit waves WC indicated by new shape information S(ti) generated by adding together a plurality of shape information S(ti) present a periodic waveform having characteristics of the original (i.e., non-added-together) individual shape information S(ti) appropriately reflected therein. Namely, the above-described first embodiment, where the phase correction section 52 and time adjustment section 54 adjust unit waves Wo, can advantageously facilitate processing of the shape information S(ti) (i.e., modification of the variation component C). In view of the above-described behavior, there may be suitably employed a modified construction where the variation component generation section 42 adds together a plurality of shape information S(ti) extracted from different audio signals XA to thereby generate new shape information S(ti).
Further, assuming a case where a vibrato component to be imparted to an audio signal in accordance with the technique disclosed in patent literature 2 is changed in time length, and if pitch variation data indicative of a waveform of the vibrato component are merely compressed or expanded in the time axis direction, characteristics of the vibrato component would vary, and thus, complicated arithmetic operations would be required for adjusting the time lengths while suppressing variation of the vibrato component. By contrast, the above-described first embodiment, where unit information U(ti) (shape information S(ti) and velocity information V((ti)) is generated per unit wave Wo, can advantageously facilitate the compression/expansion of the variation component C as compared to the technique disclosed in patent literature 2. More specifically, the above-described embodiment can expand the variation component C, by using common or same shape information S(ti) for generation of frequencies f(ti) of a plurality of time points ti. For example, the above-described embodiment identifies, from shape information S(t1), frequencies f(ti) at individual time points ti from the time point t1 to the time point t4, identifies, from shape information S(t2), frequencies f(ti) at individual time points ti from the time point t5 to the time point t8, and so on. On the other hand, the above-described embodiment may also compress the variation component C by using the shape information S(ti) at predetermined intervals (i.e., while skipping a predetermined number of the shape information S(ti)). For example, every other shape information S(ti) may be used, in which case shape information S(t1) is used for identifying a frequency f(t1) of the time point t1, shape information S(t3) is used for identifying a frequency f(t2) of the time point t2 and shape information S(t5) is used for identifying a frequency f(t3) of the time point t3 (with shape information S(t2) and shape information S(t4) skipped).
The following describe a second embodiment of the present invention. In the following description, elements similar in function and construction to those in the first embodiment are indicated by the same reference numerals and characters as used for the first embodiment and will not be described here to avoid unnecessary duplication.
In the above-described first embodiment, all coefficient values of a frequency spectrum Q of a unit wave WB are generated as shape information S(ti). However, in the second embodiment, the second generation section 562 generates, as shape information S(ti), a series of a plurality NO (NO<N) of coefficient values within a predetermined low frequency region of a frequency spectrum Q of a unit wave WB. In the arithmetic operation of Mathematical Expression (2) above, the variation component generation section 42 sets the variable S(ti)k of Mathematical Expression (2) to a coefficient value contained in the shape information S(ti) as long as the variable k is within a range equal to and less than the value “NO” and below, but sets the variable S(ti)k of Mathematical Expression (2) to a predetermined value (such as zero) as long as the variable k is within a range exceeding the value “NO”.
The second embodiment can achieve the same advantageous results as the first embodiment. Because the character of the unit wave WB appears mainly in a low frequency region of the frequency spectrum Q, it is possible to prevent characteristics of the variation component C, generated by use of the shape information S(ti), from unduly differing from characteristics of the vibrato component of the audio signal XA, although coefficient values in a high frequency region of the frequency spectrum Q are not reflected in the shape information S(ti). Further, the second embodiment, where the number of coefficient values (NO) is smaller than that (N) in the first embodiment (NO<N), can advantageously reduce the capacity of the storage device 24 necessary for storage of individual shape information S(ti) (variation information DV).
The above-described embodiments of the present invention can be modified variously as exemplified below. Two or more of the modifications exemplified below may be combined as necessary.
(1) Modification 1:
Whereas the embodiments of the present invention have been described above as using the variation information DV, generated by the variation extraction section 30, for generation of the variation component C, the variation information DV may be used for generation of the variation component C after the variation information DV is processed by the variation component generation section 42. For example, it is preferable that the variation component generation section 42 synthesize (e.g., add together) a plurality of shape information S(ti) as set forth above. More specifically, the variation component generation section 42 may, for example, synthesize a plurality of shape information S(ti) generated from audio signals XA of different voice utterers (persons), or synthesize a plurality of shape information S(ti) generated for different time points ti from an audio signal XA of a same voice utterer (person). Further, the variation width (vibrato depth) of the variation component C can be increased or decreased if the individual coefficient values of the shape information S(ti) are adjusted (e.g., multiplied by predetermined values).
(2) Modification 2:
Whereas the embodiments of the present invention have been described above in relation to the case where audio signals XA and XB are supplied from the common or same signal supply device 12, audio signals XA and XB may be in any other desired relationship. For example, audio signals XA and audio signals XB may be obtained from different supply sources. Further, in a case where an audio signal XA is used as an audio signal XB, variation information DV generated from an audio signal XA may be imparted again to the audio signal XA (XB), for example, after the audio signal has been processed. Further, the audio signals XB, which are to be imparted with variation information DV, do not necessary need to exist independently. For example, an audio signal XOUT may be generated by a variation component C corresponding to variation information DV being applied to voice synthesis. In each of the above-described embodiments, as understood from the foregoing, the signal generation section 44 can be comprehended as being a component that generates an audio signal XOUT imparted with a variation component C corresponding to variation information DV and does not necessary need to have a function of synthesizing a variation component C and an audio signal XB that exist independently of each other.
(3) Modification 3:
Whereas each of the above-described embodiments is constructed to perform setting of a virtual phase θ(ti) and generation of unit information U(ti) (i.e., extraction of a unit wave Wo) for each of the time points ti of the fundamental frequency f0 constituting the frequency series FA, a modification of the audio processing apparatus 100 may be constructed to change as desired the period with which the fundamental frequency f0 is extracted from the audio signal XA, the period with which the virtual phase θ(ti) is set and the period with which the unit information U(ti) is generated. For example, extraction of the unit wave Wo and generation of the unit information U(ti) may be performed at intervals of a predetermined (plural) number of the time points ti.
(4) Modification 4:
Whereas each of the embodiments has been described in relation to the case where the time length adjustment is performed by the time adjustment section 54 after the phase correction by the phase correction section 52, the phase correction may be performed by the phase correction section 52 after the time length adjustment by the time adjustment section 54. Further, only one of the phase correction by the phase correction section 52 and time length adjustment by the time adjustment section 54 may be performed, or both of the phase correction by the phase correction section 52 and time length adjustment by the time adjustment section 54 may be dispensed with.
(5) Modification 5:
Whereas each of the embodiments has been described in relation to the audio processing apparatus 100 provided with both the variation extraction section 30 and the variation impartment section 40, a modification of the audio processing apparatus 100 may be provided with only one of the variation extraction section 30 and the variation impartment section 40. For example, there may be employed a modified construction where variation information DV is generated by one audio processing apparatus provided with the variation extraction section 30, and another audio processing apparatus provided with the variation impartment section 40 uses the variation information DV, generated by the one audio processing apparatus, to generate an audio signal XOUT. In such a case, the variation information DV is transferred from the one audio processing apparatus (provided with the variation extraction section 30) to the other audio processing apparatus (provided with the variation impartment section 40) via a portable recording or storage medium or a communication network.
(6) Modification 6:
Whereas each of the embodiments has been described above as generating both shape information S(ti) and velocity information V(ti), only one of such shape information S(ti) and velocity information V(ti) may be generated as variation information DV. For example, in the case where generation of velocity information V(ti) is dispensed with, variation information DV can be generated by the arithmetic operation of Mathematical Expression (2) being performed after the velocity information V(ti) in Mathematical Expression (4) is set at a predetermined value (e.g., one). In this way, it is possible to generate variation information DV that reflects therein a shape (e.g., vibrato depth) of a unit wave Wo of an audio signal XA but does not reflect therein a vibrato velocity of the audio signal XA. On the other hand, in the case where generation of shape information S(ti) is dispensed with, variation information DV can be generated by the arithmetic operation of Mathematical Expression (2) being performed after the shape information S(ti) is set at a predetermined wave (e.g., sine wave). In this way, it is possible to generate variation information DV that reflects therein a vibrato velocity of an audio signal XA but does not reflect therein a shape (vibrato depth) of a unit wave Wo of the audio signal XA.
(7) Modification 7:
Whereas each of the embodiments has been described above as extracting, from a frequency series FA, a unit wave Wo corresponding to a portion Θ centering at a virtual phase θ(ti), the method for extracting a unit wave Wo by use of a virtual phase θ(ti) may be modified as appropriate. For example, a portion corresponding to a portion Θ of a 2π width having a virtual phase θ(ti) as an end point (i.e., start or end point) may be extracted as a unit wave Wo from a frequency series FA.
(8) Modification 8:
Further, each of the embodiments is constructed in such a manner that a frequency series FA and frequency series FB are extracted from the audio signal XA. Alternatively, such a frequency series FA and frequency series FB may be extracted, by the phase setting section 34 and unit wave extraction section 36, from a storage medium having the frequency series FA and frequency series FB prestored therein. Namely, the character extraction section 32 may be omitted from the audio processing apparatus 100.
(9) Modification 9:
Whereas each of the embodiments has been described above as generating the variation information DV having reflected therein variation in fundamental frequency f0 of the audio signal XA, the type of a character element for which the variation information DV should be generated is not limited to the fundamental frequency f0. For example, a time series of sound volume levels (sound pressure levels) may be extracted, in place of the frequency series FA, every time point ti of the audio signal XA, so that information DV having reflected therein variation over time of a sound volume of the audio signal XA can be generated. Namely, the basic principles of the present invention may be applied in relation to any desired types of character elements that vary over time.
This application is based on, and claims priority to, JP PA 2009-276470 filed on 4 Dec. 2009. The disclosure of the priority application, in its entirety, including the drawings, claims, and the specification thereof, are incorporated herein by reference.
Number | Date | Country | Kind |
---|---|---|---|
2009-276470 | Dec 2009 | JP | national |
Number | Name | Date | Kind |
---|---|---|---|
5412152 | Kageyama et al. | May 1995 | A |
5536902 | Serra et al. | Jul 1996 | A |
6169241 | Shimizu | Jan 2001 | B1 |
6255576 | Suzuki et al. | Jul 2001 | B1 |
6965069 | Le-Faucheur et al. | Nov 2005 | B2 |
20030094090 | Tamura et al. | May 2003 | A1 |
Number | Date | Country |
---|---|---|
1 239 463 | Sep 2002 | EP |
1 239 463 | Sep 2002 | EP |
1 239 463 | Sep 2002 | EP |
1 742 200 | Jan 2007 | EP |
07-325583 | Dec 1995 | JP |
2002-073064 | Mar 2002 | JP |
Entry |
---|
Yamada, T. et al. (May 21, 2009). “Vibrato Modeling for HMM-based Singing Voice Synthesis,” IPSJ SIG Technical Report 2009(MUS-80-5):1-6. |
European Search Report mailed Mar. 24, 2011, for EP Application No. 10193423.0, nine pages. |
Number | Date | Country | |
---|---|---|---|
20110132179 A1 | Jun 2011 | US |