This application is based on and claims priority to Japanese Application No. 2006-198560, filed on Jul. 20, 2006, the disclosure of which is hereby incorporated herein by reference.
1. Field of the Invention
The present invention relates to a pitch conversion method and device, and in particular to a pitch conversion method and device for converting a pitch of an input signal into a desired (target) pitch in order to change e.g. a voice level or accent.
2. Description of the Related Art
Prior art examples [1] and [2] of the above-mentioned pitch conversion technology will now be described referring to
Prior Art Example [1] (PSOLA method):
In this pitch conversion technology, a pitch conversion is performed by overlapping and adding waveforms of an input signal per pitch cycle in conformity with a target pitch (namely, the input signal is eventually expanded or contracted in the direction of time axis), and is generally called a PSOLA (Pitch-Synchronous Overlap and Add) method (see e.g. patent document 1).
Namely, two waveforms W1 and W2 are firstly cut from the input signal “In” per pitch cycle T, and then window functions F1 and F2 are respectively applied to the cut waveforms W1 and W2 to adjust the amplitudes. In order to avoid discontinuity of waveforms at the boundary between an overlapped portion of the waveforms W1 and W2 by overlapping and adding which will be described later and the non-overlapped portion, the window functions F1 and F2 are set so that the sum of mutual contribution degrees may become “1” at the overlapped portion of the waveforms W1 and W2 as shown in
Then, two waveforms (not shown) whose amplitudes are adjusted by the window functions F1 and F2 are overlapped and added to obtain the output signal “Out”.
In such a prior art example [1], waveforms after the pitch conversion may be deformed since waveforms whose phases are different from each other are overlapped. This deformation is notable especially when a pitch conversion ratio (namely, an expansion and contraction ratio of the input signal in the direction of time axis) is large, which leads to a degradation of sound quality.
In order to deal with this problem, a prior art example [2] has been already proposed as described herebelow:
Prior Art Example [2]: Not shown
In this pitch conversion technology, a linear predictive analysis is firstly performed to the input signal, so that the signal is separated into an envelope signal (formant component) and a residual signal (harmonics component). Then, a pitch conversion is performed only to the residual signal in the same way as the above-mentioned prior art example [1], so that the residual signal after the pitch conversion has been performed and the original envelope signal are synthesized by using a linear predictive coefficient calculated from the input signal.
Thus, the pitch conversion can be performed without affecting the envelope signal, and the above-mentioned waveform deformation due to the pitch conversion can be reduced, so that a degradation of sound quality can be avoided (see e.g. patent document 2).
While in the above-mentioned prior art example [2] the pitch conversion can be performed without deteriorating the sound quality of the input signal compared with the above-mentioned prior art example [1], there is a problem that the linear predictive analysis and the signal separation/synthesis require processing of large data throughput (calculation amount or the like).
It is accordingly an object of the present invention to provide a pitch conversion method and device which can reduce data throughput while suppressing a degradation of sound quality due to a pitch conversion as much as possible.
[1] In order to achieve the above-mentioned object, a pitch conversion method (or device) according to one aspect of the present invention comprises: a degradation evaluation step of (or means) inputting an input signal pitch pattern per predetermined processing unit and a target pitch pattern for the input signal pitch pattern, and of calculating a degradation degree indicating how a waveform of the input signal degrades upon pitch conversion from the input signal pitch pattern to the target pitch pattern; and a pitch conversion step of (or means) performing the pitch conversion with predetermined data throughput depending on the degradation degree.
Namely, at a degradation evaluation step (or means), a degradation degree is calculated in advance of the execution of a pitch conversion, and at a pitch conversion step (or means), data throughput for performing the pitch conversion is switched over depending on the degradation degree.
Thus, when the degradation degree is small, the pitch conversion can be performed with small data throughput by using the pitch conversion technology shown in e.g. the above-mentioned prior art example [1] since a degradation of sound quality due to the pitch conversion does not occur. Also, only when a high-performance pitch conversion is required to be performed due to a large degradation degree, the pitch conversion can be performed by using the pitch conversion technology shown in e.g. the above-mentioned prior art example [2]. Therefore, it is possible to reduce a processing load (i.e. the entire data throughput).
[2] Also, in the above-mentioned [1], the degradation evaluation step (or means) may include an average pitch conversion amount calculation step of (or means) calculating an average pitch conversion amount by dividing a sum of pitch differences between the target pitch pattern and the input signal pitch pattern per predetermined cycle by a sum of pitches of the input signal pitch pattern per predetermined cycle, and a degradation degree calculation step of (or means) providing as the degradation degree to the pitch conversion step (or means) a value that is the average pitch conversion amount weighted by predetermined coefficients.
Namely, since this average pitch conversion amount is a value indicating how much pitch conversion is required to be performed for an input signal per predetermined processing unit (namely, how a waveform of an input signal can be deformed), the value can be used as the degradation degree.
[3] Also, in the above-mentioned [1], the pitch conversion step (or means) may include a first and second pitch conversion steps (or means) depending on a level of the degradation degree, accordingly the degradation evaluation step (or means) may also include the identical first and second pitch conversion steps (or means), and the degradation evaluation step (or means) may further include an average signal difference calculation step of (or means) calculating an average signal difference by dividing a sum of power differences between a first pitch conversion result obtained by converting a part of the input signal pitch pattern per predetermined processing unit and the target pitch pattern at the first pitch conversion step (or means) and a second pitch conversion result obtained by converting a part of the input signal pitch pattern per predetermined processing unit and the target pitch pattern at the second pitch conversion step (or means) per predetermined cycle by a sum of powers of the second pitch conversion result per predetermined cycle, and a degradation degree calculation step of (or means) providing as the degradation degree to the pitch conversion step (or means) a value that is the average signal difference weighted by predetermined coefficients.
Namely, the degradation evaluation step (or means) performs the pitch conversion to the part of the input signal pitch pattern per predetermined processing unit and the target pitch pattern in advance of the execution of the pitch conversion at a subsequent pitch conversion step (or means) respectively at the first pitch conversion step (or means) and the second pitch conversion step (or means) which are the same as the pitch conversion step (or means) included at the subsequent stage.
An average signal difference obtained based on the results of both pitch conversions mentioned above is a value indicating a difference closer to a difference between the results of the pitch conversions as respectively and actually performed at the first pitch conversion step (or means) and the second pitch conversion step (or means) included in the pitch conversion step (or means). When the average signal difference is small, it can be regarded that there is no difference between the pitch conversion results regardless of the size of data throughput (namely, the degradation of sound quality due to the pitch conversion does not occur regardless of the size of the data throughput). Therefore, the average signal difference can be used as the degradation degree.
[4] Also, in the above-mentioned [1], the degradation evaluation step (or means) may include a pitch pattern change degree calculation step of (or means) classifying changing trends of the input signal pitch pattern and the target pitch pattern respectively into any one of predetermined changing trends by calculating average pitches per predetermined time interval of the pitch pattern and by sequentially comparing the average pitches, and of determining a pitch pattern change degree to the target pitch pattern for the input signal pitch pattern based on a combination of both changing trends, and a degradation degree calculation step of (or means) providing as the degradation degree to the pitch conversion step (or means) a value that is the pitch pattern change degree weighted by predetermined coefficients.
Namely, since this pitch pattern change degree is a value obtained from a correlation between the change trend of the input signal pitch pattern and that of the target pitch pattern (namely, e.g. a value indicating whether or not the pitch of the input signal is required to be greatly changed), the value can be used as the degradation degree.
[5] Also, in the above-mentioned [2], the pitch conversion step (or means) may include a first and second pitch conversion steps (or means) depending on a level of the degradation degree, accordingly the degradation evaluation step (or means) may also include the identical first and second pitch conversion steps (or means), and the degradation evaluation step (or means) may further include an average signal difference calculation step of (or means) calculating an average signal difference by dividing a sum of power differences between a first pitch conversion result obtained by converting a part of the input signal pitch pattern per predetermined processing unit and the target pitch pattern at the first pitch conversion step (or means) and a second pitch conversion result obtained by converting a part of the input signal pitch pattern per predetermined processing unit and the target pitch pattern at the second pitch conversion step (or means) per predetermined cycle by a sum of powers of the second pitch conversion result per predetermined cycle, and the degradation degree calculation step (or means) may provide as the degradation degree to the pitch conversion step (or means) a sum of values that are the average pitch conversion amount and the average signal difference respectively weighted by predetermined coefficients.
[6] Also, in the above-mentioned [2], the degradation evaluation step (or means) may further include a pitch pattern change degree calculation step of (or means) classifying changing trends of the input signal pitch pattern and the target pitch pattern respectively into any one of predetermined changing trends by calculating average pitches per predetermined time interval of the pitch pattern and by sequentially comparing the average pitches, and of determining a pitch pattern change degree to the target pitch pattern for the input signal pitch pattern based on a combination of both changing trends, and the degradation degree calculation step (or means) may provide as the degradation degree to the pitch conversion step (or means) a sum of values that are the average pitch conversion amount and the pitch pattern change degree respectively weighted by predetermined coefficients.
[7] Also, in the above-mentioned [3], the degradation evaluation step (or means) may further include a pitch pattern change degree calculation step of (or means) classifying changing trends of the input signal pitch pattern and the target pitch pattern respectively into any one of predetermined changing trends by calculating average pitches per predetermined time interval of the pitch pattern and by sequentially comparing the average pitches, and of determining a pitch pattern change degree to the target pitch pattern for the input signal pitch pattern based on a combination of both changing trends, and the degradation degree calculation step (or means) may provide as the degradation degree to the pitch conversion step (or means) a sum of values that are the average signal difference and the pitch pattern change degree respectively weighted by predetermined coefficients.
[8] Also, in the above-mentioned [5], the degradation evaluation step (or means) may further include a pitch pattern change degree calculation step of (or means) classifying changing trends of the input signal pitch pattern and the target pitch pattern respectively into any one of predetermined changing trends by calculating average pitches per predetermined time interval of the pitch pattern and by sequentially comparing the average pitches, and of determining a pitch pattern change degree to the target pitch pattern for the input signal pitch pattern based on a combination of both changing trends, and the degradation degree calculation step (or means) may provide as the degradation degree to the pitch conversion step (or means) a sum of values that are the average pitch conversion amount, the average signal difference, and the pitch pattern change degree respectively weighted by predetermined coefficients.
As the above-mentioned [5]-[8], the combination of two or three of the average pitch conversion amount, the average signal difference, and the pitch pattern change degree described in the above-mentioned [2]-[4] can be used as the degradation degree.
[9] Also, a pitch conversion method (or device) according to one aspect of the present invention comprises: a degradation degree extraction step of (or means) inputting a voice state and a phonemic type of an input signal per predetermined processing unit, and extracting a degradation degree corresponding to the voice state and the phonemic type inputted from a database in which degradation degrees indicating how a waveform of the input signal degrades upon pitch conversion from an input signal pitch pattern to a target pitch pattern for the input signal pitch pattern are associated with all of combinations of voice states and phonemic types estimated to be recorded; and a pitch conversion step of (or means) performing the pitch conversion with predetermined data throughput depending on the degradation degree.
Namely, in this database, the combination of all of the voice states and the phonemic types estimated as the input signal are associated with the degradation degree to be recorded. Therefore, it is possible to accurately reduce the data throughput depending on the degradation of the sound quality which may actually occur.
[10] Also, a pitch conversion method (or device) according to one aspect of the present invention comprises: a degradation evaluation step of (or means) inputting an input signal pitch pattern per predetermined processing unit, a target pitch pattern for the input signal pitch pattern, and a voice state and a phonemic type of the input signal, and calculating a degradation degree indicating how a waveform of the input signal degrades upon pitch conversion from the input signal pitch pattern to the target pitch pattern; and a pitch conversion step of (or means) performing the pitch conversion with predetermined data throughput depending on the degradation degree.
Thus, the degradation degree can be calculated in consideration of both of the degradation degree based on the input signal pitch pattern and the target pitch pattern as described in the above-mentioned [1], and the degradation degree based on the voice state and the phonemic type of the input signal as described in the above-mentioned [9], thereby enabling the data throughput for the pitch conversion to be more accurately reduced while the degradation of sound quality is suppressed.
[11] Also, in the above-mentioned [10], the degradation evaluation step (or means) may include an average pitch conversion amount calculation step of (or means) calculating an average pitch conversion amount by dividing a sum of pitch differences between the target pitch pattern and the input signal pitch pattern per predetermined cycle by a sum of pitches of the input signal pitch pattern per predetermined cycle, a degradation degree extraction step of (or means) extracting a degradation degree corresponding to the voice state and the phonemic type inputted from a database in which the degradation degrees are associated with all of combinations of voice states and phonemic types estimated to be recorded, and a degradation degree calculation step of (or means) providing as the degradation degree to the pitch conversion step (or means) a sum of values that are the average pitch conversion amount and the extracted degradation degree respectively weighted by predetermined coefficients.
[12] Also, in the above-mentioned [10], the pitch conversion step (or means) may include a first and second pitch conversion steps (or means) depending on a level of the degradation degree, accordingly the degradation evaluation step (or means) may also include the identical first and second pitch conversion steps (or means), and the degradation evaluation step (or means) may further include an average signal difference calculation step of (or means) calculating an average signal difference by dividing a sum of power differences between a first pitch conversion result obtained by converting a part of the input signal pitch pattern per predetermined processing unit and the target pitch pattern at the first pitch conversion step (or means) and a second pitch conversion result obtained by converting a part of the input signal pitch pattern per predetermined processing unit and the target pitch pattern at the second pitch conversion step (or means) per predetermined cycle by a sum of powers of the second pitch conversion result per predetermined cycle, a degradation degree extraction step of (or means) extracting a degradation degree corresponding to the voice state and the phonemic type inputted from a database in which the degradation degrees are associated with all of combinations of voice states and phonemic types estimated to be recorded, and a degradation degree calculation step of (or means) providing as the degradation degree to the pitch conversion step (or means) a sum of values that are the average signal difference and the extracted degradation degree respectively weighted by predetermined coefficients.
[13] Also, in the above-mentioned [10], the degradation evaluation step (or means) may include a pitch pattern change degree calculation step of (or means) classifying changing trends of the input signal pitch pattern and the target pitch pattern respectively into any one of predetermined changing trends by calculating average pitches per predetermined time interval of the pitch pattern and by sequentially comparing the average pitches, and determining a pitch pattern change degree to the target pitch pattern for the input signal pitch pattern based on a combination of both changing trends, a degradation degree extraction step of (or means) extracting a degradation degree corresponding to the voice state and the phonemic type inputted from a database in which the degradation degrees are associated with all of combinations of voice states and phonemic types estimated to be recorded, and a degradation degree calculation step of (or means) providing as the degradation degree to the pitch conversion step (or means) a sum of values that are the pitch pattern change degree and the extracted degradation degree respectively weighted by predetermined coefficients.
[14] Also, in the above-mentioned [11], the pitch conversion step (or means) may include a first and second pitch conversion steps (or means) depending on a level of the degradation degree, accordingly the degradation evaluation step (or means) may also include the identical first and second pitch conversion steps (or means), and the degradation evaluation step (or means) may further include an average signal difference calculation step of (or means) calculating an average signal difference by dividing a sum of power differences between a first pitch conversion result obtained by converting a part of the input signal pitch pattern per predetermined processing unit and the target pitch pattern at the first pitch conversion step (or means) and a second pitch conversion result obtained by converting a part of the input signal pitch pattern per predetermined processing unit and the target pitch pattern at the second pitch conversion step (or means) per predetermined cycle by a sum of powers of the second pitch conversion result per predetermined cycle, and the degradation degree calculation step (or means) may provide as the degradation degree to the pitch conversion step (or means) a sum of values that are the average pitch conversion amount, the extracted degradation degree, and the average signal difference respectively weighted by predetermined coefficients.
[15] Also, in the above-mentioned [11], the degradation evaluation step (or means) may further include a pitch pattern change degree calculation step of (or means) classifying changing trends of the input signal pitch pattern and the target pitch pattern respectively into any one of predetermined changing trends by calculating average pitches per predetermined time interval of the pitch pattern and by sequentially comparing the average pitches, and determining a pitch pattern change degree to the target pitch pattern for the input signal pitch pattern based on a combination of both changing trends, and a degradation degree calculation step (or means) may provide as the degradation degree to the pitch conversion step (or means) a sum of values that are the average pitch conversion amount, the extracted degradation degree, and the pitch pattern change degree respectively weighted by predetermined coefficients.
[16] Also, in the above-mentioned [12], the degradation evaluation step (or means) may further include a pitch pattern change degree calculation step of (or means) classifying changing trends of the input signal pitch pattern and the target pitch pattern respectively into any one of predetermined changing trends by calculating average pitches per predetermined time interval of the pitch pattern and by sequentially comparing the average pitches, and determining a pitch pattern change degree to the target pitch pattern for the input signal pitch pattern based on a combination of both changing trends, and a degradation degree calculation step (or means) may provide as the degradation degree to the pitch conversion step (or means) a sum of values that are the average signal difference, the extracted degradation degree, and the pitch pattern change degree respectively weighted by predetermined coefficients.
[17] Also, in the above-mentioned [14], the degradation evaluation step (or means) may further include a pitch pattern change degree calculation step of (or means) classifying changing trends of the input signal pitch pattern and the target pitch pattern respectively into any one of predetermined changing trends by calculating average pitches per predetermined time interval of the pitch pattern and by sequentially comparing the average pitches, and determining a pitch pattern change degree to the target pitch pattern for the input signal pitch pattern based on a combination of both changing trends, and a degradation degree calculation step (or means) may provide as the degradation degree to the pitch conversion step (or means) a sum of values that are the average pitch conversion amount, the extracted degradation degree, the average signal difference, and the pitch pattern change degree respectively weighted by predetermined coefficients.
As the above-mentioned [11]-[17], the combination of two, three, or four of the average pitch conversion amount, the average signal difference, the pitch pattern change degree, and the degradation degree extracted at the degradation degree extraction step can be used as the degradation degree.
According to the present invention, the data throughput can be reduced while the degradation of the sound quality due to the pitch conversion can be suppressed as much as possible, thereby enabling a processing congestion of a device to which the present invention is applied and a delay of the pitch conversion due to the congestion to be prevented. Also, a long-lived device can be realized.
Also, it is made possible to easily calculate or extract the degradation degree, so that circuits within the device can be simplified.
The above and other objects and advantages of the invention will be apparent upon consideration of the following detailed description, taken in conjunction with the accompanying drawings, in which the reference numerals refer to like parts throughout and in which:
Embodiments [1]-[3] of a pitch conversion method and a device using the method according to the present invention will now be described in the following order by referring to
I. Embodiment [1]:
I.1. Arrangement (common to embodiments (1)-(7) of degradation evaluating portion):
I.2. Entire operation example (common to embodiments [2] and [3]):
II. 1. Arrangement:
II. 2. Operation example:
III. Embodiment [3]:
III.1. Arrangement (common to embodiments (8)-(14) of degradation evaluating portion):
III.2. Operation example:
III.3. Embodiments (8)-(14) of degradation evaluating portion:
I.1. Arrangement (Common to Embodiments (1)-(7) of Degradation Evaluating Portion):
A pitch conversion device 10 according to an embodiment [1] of the present invention shown in
The pitch mark PM is data indicating positions of pitch cycles (periods) within the input signal pitch pattern IPP and the target pitch pattern TPP. Also, a predetermined processing unit is a data unit of e.g. a predetermined number of pitch cycles (namely, a predetermined number of pitch marks PM), a single phoneme, a single voice fragment (assembly of a plurality of phonemes), a single sentence, or the like.
Also, the pitch converter 200 is composed of a pitch converter 310 (i.e. a low-performance pitch converter using the pitch conversion technology such as the above-mentioned prior art example [1]) which receives the input signal pitch pattern IPP, the target pitch pattern TPP, and the pitch mark PM to execute the pitch conversion with small data throughput, a pitch converter 320 (i.e. a high-performance pitch converter using a pitch conversion technology such as mentioned in the above-mentioned prior art example [2]) which executes the pitch conversion with large data throughput, and a switchover portion 400 which determines whether the pitch conversion should be performed either by the pitch converter 310 or 320 and switches over from one to the other.
Hereinafter, the operation of this embodiment will be described. An entire operation example will be firstly described referring to
It is to be noted that the following description of the entire operation example is similarly applied to the embodiments [2] and [3] which will be described later except the calculation or extraction of the degradation degree DGR (hereinafter, referred to as degradation evaluation).
I.2. Entire Operation (Common to Embodiments [2] and [3]):
As shown in
The switchover portion 400 compares the degradation degree DGR with a predetermined threshold “Th”. With the result determining that the degradation degree is less than the threshold “Th” (at step S3), the switchover portion 400 provides the input signal pitch pattern IPP, the pitch mark PM, and the target pitch pattern TPP to the pitch converter 310.
The pitch converter 310 having received the input signal pitch pattern IPP, the pitch mark PM, and the target pitch pattern TPP executes the pitch conversion (at step S4), and transmits the output signal Out1 after the pitch conversion to the subsequent stage (at step S5).
On the other hand, with the result determining that the degradation degree is equal to or more than the threshold “Th” at the above-mentioned step S3, the switchover portion 400 provides the input signal pitch pattern IPP, the pitch mark PM, and the target pitch pattern TPP to the pitch converter 320.
The pitch converter 320 having received the input signal pitch pattern IPP, the pitch mark PM, and the target pitch pattern TPP executes the pitch conversion (at step S6), and transmits the output signal Out2 after the pitch conversion to the subsequent stage (at step S7).
I.3. Embodiments (1)-(7) of Degradation Evaluating Portion:
I.3.A Embodiment (1) of Degradation Evaluating Portion:
I.3.A.a Arrangement:
The degradation evaluating portion 100 shown in
I..3.A.b Operation Example:
As shown in
As shown in
The degradation degree calculator 120 calculates the degradation degree DGR by the following Eq. (2) based on the average pitch conversion amount PC to be provided to the switchover portion 400 (at step S11).
Coefficients “a” and “b” in the above-mentioned function f1 have only to be preset by an operator or the like so that a switchover between the pitch converters 310 and 320 depending on the degradation degree DGR is optimally performed. The same applies to coefficients in functions used for embodiments of the degradation evaluating portion which will be described later.
I.3.B Embodiment (2) of Degradation Evaluating Portion:
I.3.B.a Arrangement:
The degradation evaluating portion 100 shown in
Also, the average signal difference calculator 130 includes the pitch converters 310 and 320 which are the same as the pitch converters 310 and 320 shown in
1.3.B.b Operation Example:
As shown in
Namely, the average signal difference calculator 130 inputs the input signal pitch pattern IPP, the pitch mark PM, and the target pitch pattern TPP for the pitch cycles “m” (smaller number than the pitch cycle number per processing unit) to be respectively provided to the pitch converters 310 and 320 (at step S20).
The pitch converters 310 and 320 respectively execute the pitch conversion, and provides the output signals Out1 and Out2 after the pitch conversion to the signal difference calculator 131 (at steps S21 and S22).
The signal difference calculator 131 having received the output signals Out1 and Out2 calculates the average signal difference DIF according to the following Eq. (3) to be provided to the degradation degree calculator 120 (at step S23).
Out1i and Out2i in Eq. (3) indicate pitch conversion results obtained by the pitch conversion to an input signal pitch and a target pitch at the position of the pitch cycle shown by a pitch mark PMi (see
The degradation degree calculator 120 calculates the degradation degree DGR by the following Eq. (4) based on the average signal difference DIF to be provided to the switchover portion 400 (at step S24).
I.3.C Embodiments (3) of Degradation Evaluating Portion:
I.3.C.a Arrangement:
The degradation evaluating portion 100 shown in
I.3.C.b Operation Example:
As shown in
Namely, the pitch pattern change degree calculator 140 receives the input signal pitch pattern IPP, the pitch mark PM, and the target pitch pattern TPP (at step S30), and calculates a change trend TND_I of the input signal pitch pattern IPP and a change trend TND_T of the target pitch pattern TPP (hereinafter, occasionally represented by a reference character TND) (at steps S31 and S32).
The pitch pattern change degree calculator 140 calculates average pitches AP1-AP3 (hereinafter, occasionally represented by a reference character AP) for three predetermined time intervals of the pitch pattern (e.g. time that is a pitch cycle divided into three, shown by the pitch mark PM), as shown in
If the average pitches AP1-AP3 of the input signal pitch pattern satisfy the relationship of AP1<AP2<AP3 (namely, a change trend that the average pitch AP gradually increases) for example, the pitch pattern change degree calculator 140 classifies the input signal pitch pattern change trend TND_I into a pitch pattern change trend TND1.
The pitch pattern change degree calculator 140 determines the pitch pattern change degree CHG from the combination of the input signal pitch pattern change trend TND_I and the target pitch pattern change trend TND_T by referring to the pitch pattern change degree calculating table TBL shown in
As shown, the pitch pattern change degree calculating table TBL is set so that as the difference between the input signal pitch pattern change trend TND_I and the target pitch pattern change degree TND_T becomes large, a larger value is obtained as the pitch pattern change degree CHG.
When the input signal pitch pattern change trend TND_I and the target pitch pattern change trend TND_T are respectively classified into a pitch pattern change trend TND3 (change trend in which the average pitch AP changes from up to down) and a pitch pattern change trend TND7 (change trend in which the average pitch AP changes from down to up) (namely, when the difference of the pitch pattern change trend TND is the largest) for example, the pitch pattern change degree calculator 140 determines the pitch pattern change degree CHG to be “4” (maximum value) by referring to the pitch pattern change degree calculating table TBL.
The degradation degree calculator 120 calculates the degradation degree DGR by the following Eq. (5) based on the pitch pattern change degree CHG to be provided to the switchover portion 400 (at step S34).
degradation degree DGR=f3(CHG) Eq. (5)
For the above-mentioned function f3, the same function as the function f1 or f2 described in the above-mentioned embodiment (1) or (2) of the degradation evaluating portion can be used.
I..3.D Embodiment (4) of Degradation Evaluating Portion:
The degradation evaluating portion 100 shown in
In operation, as shown in
The degradation degree calculator 120 calculates the degradation degree DGR by the following Eq. (6) based on the average pitch conversion amount PC and the average signal difference DIF to be provided to the switchover portion 400 (at step S40).
I.3.E Embodiment (5) of Degradation Evaluating Portion:
The degradation evaluating portion 100 shown in
In operation, as shown in
The degradation degree calculator 120 calculates the degradation degree DGR by the following Eq. (7) based on the average pitch conversion amount PC and the pitch pattern change degree CHG to be provided to the switchover portion 400 (at step S50).
I.3.F Embodiment (6) of Degradation Evaluating Portion:
The degradation evaluating portion 100 shown in
In operation, as shown in
The degradation degree calculator 120 calculates the degradation degree DGR by the following Eq. (8) based on the average signal difference DIF and the pitch pattern change degree CHG to be provided to the switchover portion 400 (at step S60).
I.3.G Embodiment (7) of Degradation Evaluating Portion:
The degradation evaluating portion 100 shown in
In operation, as shown in
The degradation degree calculator 120 calculates the degradation degree DGR by the following Eq. (9) based on the average pitch conversion amount PC, the average signal difference DIF, and the pitch pattern change degree CHG to be provided to the switchover portion 400 (at step S70).
II.1. Arrangement:
The pitch conversion device 10 according to the embodiment [2] of the present invention shown in
The sound state of the additional information INFO indicates a state such as “rise”, “fall, “transition”, and “steady” estimated as the input signal, and the phonemic type indicates a type such as vowels (“A”-“O”) and consonants (except vowels). The relationship between all of the combinations of the voice states and the phonemic types, and the degradation degree DGR (namely, degradation of sound quality which may actually occur) is preliminarily obtained by a simulation, an experiment, or the like to be recorded in the degradation rule database DB.
Hereinafter, the operation of this embodiment will be described. However, since operations except extraction of the degradation degree DGR in the degradation degree extractor 500 is common to that of the above-mentioned embodiment [1], only the operation of the degradation degree extractor 500 will now be described referring to
II.2. Operation Example:
As shown in
When the voice state and the phonemic type of the additional information INFO respectively indicate the “transition” state and the vowel “O” for example, the degradation degree extractor 500 extracts “10” for the degradation degree DGR from the degradation rule database DB.
III.1. Arrangement (Common to Embodiments (8)-(14) of Degradation Evaluating Portion):
The pitch conversion device 10 according to the embodiment [3] of the present invention shown in
III.2. Operation Example:
While the operation of this embodiment will be described hereinafter, only the embodiments (8)-(14) of the degradation evaluating portion 100 will now be described referring to
III.3. Embodiments (8)-(14) of Degradation Evaluating Portion:
III.3.A Embodiment (8) of Degradation Evaluating Portion:
In addition to the average pitch conversion amount calculator 110, the degradation degree extractor 500, and the degradation rule database DB which are the same as those of the above-mentioned embodiments [1] and [2], the degradation evaluating portion 100 shown in
In operation, as show in
The degradation degree calculator 120 calculates the degradation degree DGR by the following Eq. (10) based on the average pitch conversion amount PC and the degradation degree DGR to be provided to the switchover portion 400 (at step S80).
The coefficient α4 in the above-mentioned function f8 may be preset by an operator or the like so that the switchover between the pitch converters 310 and 320 depending on the degradation degree DGR is optimally performed in the same way as the above-mentioned embodiment [1]. The same applies to coefficients in functions used for embodiments of the degradation evaluating portion as will be described later.
III.3.B Embodiment (9) of Degradation Evaluating Portion:
In addition to the average signal difference calculator 130, the degradation degree extractor 500, and the degradation rule database DB which are the same as those of the above-mentioned embodiments [1] and [2], the degradation evaluating portion 100 shown in
In operation, as show in
The degradation degree calculator 120 calculates the degradation degree DGR by the following Eq. (11) based on the average signal difference DIF and the degradation degree DGR to be provided to the switchover portion 400 (at step S90).
III.3.C Embodiment (10) of Degradation Evaluating Portion:
In addition to the pitch pattern change degree calculator 140, the pitch pattern change degree calculating table TBL, the degradation degree extractor 500, and the degradation rule database DB which are the same as those in the above-mentioned embodiments [1] and [2], the degradation evaluating portion 100 shown in
In operation, as shown in
The degradation degree calculator 120 calculates the degradation degree DGR by the following Eq. (12) based on the pitch pattern change degree CHG and the degradation degree DGR to be provided to the switchover portion 400 (at step S100).
III.3.D Embodiment (11) of Degradation Evaluating Portion:
In addition to the above-mentioned embodiment (8) of the degradation evaluating portion, the degradation evaluating portion 100 shown in
In operation, as shown in
The degradation degree calculator 120 calculates the degradation degree DGR by the following Eq. (13) based on the average pitch conversion amount PC, the average signal difference DIF, and the degradation degree DGR to be provided to the switchover portion 400 (at step S110).
III.3.E Embodiment (12) of Degradation Evaluating Portion:
In addition to the above-mentioned embodiment (8) of the degradation evaluating portion, the degradation evaluating portion 100 shown in
In operation, as show in
The degradation degree calculator 120 calculates the degradation degree DGR by the following Eq. (14) based on the average pitch conversion amount PC, the pitch pattern change degree CHG, and the degradation degree DGR to be provided to the switchover portion 400 (at step S120).
III.3.F Embodiment (13) of Degradation Evaluating Portion:
In addition to the above-mentioned embodiment (9) of the degradation evaluating portion, the degradation evaluating portion 100 shown in
In operation, as shown in
The degradation degree calculator 120 calculates the degradation degree DGR by the following Eq. (15) based on the average signal difference DIF, the pitch pattern change degree CHG, and the degradation degree DGR to be provided to the switchover portion 400 (at step S130).
III.3.G Embodiment (14) of Degradation Evaluating Portion:
In addition to the above-mentioned embodiment (11) of the degradation evaluating portion, the degradation evaluating portion 100 shown in
In operation, as shown in
The degradation degree calculator 120 calculates the degradation degree DGR by the following Eq. (16) based on the average pitch conversion amount PC, the average signal difference DIF, the pitch pattern change degree CHG, and the degradation degree DGR to be provided to the switchover portion 400 (at step S140).
It is to be noted that the present invention is not limited by the above-mentioned embodiments, and it is obvious that various modifications may be made by one skilled in the art based on the recitation of the claims.
Number | Date | Country | Kind |
---|---|---|---|
2006-198560 | Jul 2006 | JP | national |
Number | Name | Date | Kind |
---|---|---|---|
20030158728 | Bi et al. | Aug 2003 | A1 |
Number | Date | Country |
---|---|---|
2-66600 | Mar 1990 | JP |
7-219597 | Aug 1995 | JP |
10-78791 | Mar 1998 | JP |
Number | Date | Country | |
---|---|---|---|
20080091417 A1 | Apr 2008 | US |