Claims
- 1. A speech interval detecting method comprising the steps of:calculating a frame power of an input signal data in unit of predetermined frame width at a predetermined time interval, and then holding a maximum value and a minimum value of the frame power within a past predetermined time period; deciding a threshold value for power changed according to the maximum value being held and difference between the maximum value and the minimum value; and comparing the threshold value with power of a current frame to decide whether or not the current frame belongs to a speech interval or a non-speech interval.
- 2. A speech interval detecting method set forth in claim 1, wherein, if the difference between the maximum value and the minimum value is less than a predetermined value, the threshold value is decided close to the maximum value rather than a case where the difference between the maximum value and the minimum value is more than the predetermined value.
- 3. A speech interval detecting device comprising:a power calculator (32) for calculating a frame power of an input signal data in unit of predetermined frame width at a predetermined time interval; an instantaneous power maximum value latch (33) for holding a maximum value of the frame power within a past predetermined time period; an instantaneous power minimum value latch (34) for holding a minimum value of the frame power within the past predetermined time period; a power threshold value decision portion (35) for deciding a threshold value for power changed according to the maximum value being held in the instantaneous power maximum value latch and difference between the maximum value and the minimum value being held in the instantaneous power minimum value latch; and a discriminator (36) for comparing the threshold value obtained by the power threshold value decision portion with power of a current frame to decide whether or not the current frame belongs to a speech interval or a non-speech interval.
- 4. A speech interval detecting device set forth in claim 3, wherein, if the difference between the maximum value and the minimum value is less than a predetermined value, the power threshold value decision portion (35) decides the threshold value close to the maximum value rather than a case where the difference between the maximum value and the minimum value is more than the predetermined value.
Priority Claims (2)
Number |
Date |
Country |
Kind |
9-112822 |
Apr 1997 |
JP |
|
9-112961 |
Apr 1997 |
JP |
|
RELATED APPLICATION
This application is a division of patent application Ser. No. 09/202,867 filed Dec. 22, 1998 in the name of Atsushi Imai et al., which is a 371 of PCT/JP98/01984 filed Apr. 30, 1998.
US Referenced Citations (4)
Number |
Name |
Date |
Kind |
4672669 |
DesBlache et al. |
Jun 1987 |
A |
4696039 |
Doddington |
Sep 1987 |
A |
4897832 |
Suzuki et al. |
Jan 1990 |
A |
6272459 |
Takahashi |
Aug 2001 |
B1 |
Foreign Referenced Citations (5)
Number |
Date |
Country |
P58-130395 |
Aug 1983 |
JP |
P61-272796 |
Dec 1986 |
JP |
H6-98398 |
Apr 1994 |
JP |
06-266380 |
Sep 1994 |
JP |
H8-294199 |
Nov 1996 |
JP |
Non-Patent Literature Citations (3)
Entry |
D-695: Development of Time-Lag Adaptive voice Speed Control Technology, by Hiroshi Tanaka, et al. and Hypermedia Research Center, Sanyo Electric Co., Ltd. (1995, p. 301) and English translation (p. 1-3). |
2-6-2: An Approach for Absorbing Extension in Time caused in Speech Speed Conversion (A Method of Absorbing Time Expansion on Voice Speed Conversion) by Ryou Ikezawa, et al. (NHK Science & Technical Research Laboratories) p. 331-332 and English translation (p. 1-4). |
D-694: Real Time Absorption Method for Extension in time caused in Speech Speed Conversion (A Method of absorption of temporal discrepancy caused by speech rate conversion), Atsushi imai, et al. and NHK Science and Technical Research Laboratories, p. 300 and English translation (pp. 1-3). |