Claims
- 1. A method of encoding an analog speech signal via speech analysis, said method comprising the steps of:
- providing an analog speech signal;
- digitizing the analog speech signal to provide a plurality of samples of digital speech data;
- arranging the plurality of digital speech data samples in successive frames of digital speech data, each frame containing a plurality of digital speech data samples;
- analyzing the frames of digital speech data utilizing a linear predictive coding technique to determine a set of linear predictive coding speech parameters for each frame defining the linear prediction polynomial;
- subjecting respective frames of linear predictive coding speech parameters defining the linear prediction polynomial to a root factoring procedure involving
- initially determining a first quadratic factor indicative of a root of the prediction polynomial for a first current frame of digital speech data by deflating the prediction polynomial to a reduced order polynomial,
- successively determining the next quadratic factor for the first current frame of digital speech data in a continuing sequence until the prediction polynomial is reduced to a remaining quadratic polynomial factor,
- sorting the respective quadratic factors in the order of increasing bandwidth of the roots indicated thereby, and
- extracting roots based upon the sequence of the order of increasing bandwidth such that roots are removed in the order of decreasing significance as speech formant candidates;
- continuing the root factoring procedure with subsequent successive frames of digital speech data by
- estimating a first quadratic factor indicative of a root of the prediction polynomial for the next successive current frame of digital speech data based upon the roots as extracted from the previous frame of digital speech data,
- determining the first quadratic factor beginning with the estimation thereof by deflating the prediction polynomial to a reduced order polynomial,
- successively determining the next quadratic factor for said next successive current frame of digital speech data by initially estimating said next quadratic factor for said next successive current frame of digital speech data based upon the roots as extracted from the previous frame of digital speech data, and thereafter determining the next quadratic factor for said next successive current frame of digital speech data beginning with the estimation thereof in a continuing sequence until the prediction polynomial is reduced to a remaining quadratic polynomial factor,
- sorting the respective quadratic factors for said next successive current frame of digital speech data in the order of increasing bandwidth of the roots indicated thereby, and
- extracting roots for said next successive current frame of digital speech data based upon the sequence of the order of increasing bandwidth;
- utilizing the extracted roots as speech formant candidates; and
- determining the speech formants from the extracted roots as speech formant condidates in representing the analog speech signal as a compressed encoded form of digital speech signals.
- 2. A method as set forth in claim 1, further including storing or transmitting the speech formants as determined from the speech formant candidates provided by the extracted roots as digital speech signals representative of the analog speech signal.
- 3. A method of encoding an analog speech signal via speech analysis, said method comprising the steps of:
- providing an analog speech signal;
- digitizing the analog speech signal to provide a plurality of samples of digital speech data;
- arranging the plurality of digital speech data samples in successive frames of digital speech data, each frame containing a plurality of digital speech data samples;
- analyzing the frames of digital speech data utilizing a linear predictive coding technique to determine a set of linear predictive coding speech parameters as digital speech data representative of reflection coefficients for each frame;
- converting said digital speech data representative of reflection coefficients for each frame to digital speech data representative of predictor coefficients;
- defining a linear prediction polynomial from each frame of digital speech data representative of predictor coefficients;
- subjecting respective frames of digital speech data representative of predictor coefficients defining the linear prediction polynomial to a root factoring procedure involving
- initially determining a first quadratic factor indicative of a root of the prediction polynomial for a first current frame of digital speech data by deflating the prediction polynomial to a reduced order polynomial,
- successively determining the next quadratic factor for the first current frame of digital speech data in a continuing sequence unitl the prediction polynomial is reduced to a remaining quadratic polynomial factor,
- sorting the respective quadratic factors in the order of increasing bandwidth of the roots indicated thereby, and
- extracting roots based upon the sequence of the order of increasing bandwidth such that roots are removed in the order of decreasing significance as speech formant candidates;
- continuing the root factoring procedure with subsequent successive frames of digital speech data by
- estimating a first quadratic factor indicative of a root of the prediction polynomial for the next successive current frame of digital speech data based upon the roots as extracted from the previous frame of digital speech data,
- determining the first quadratic factor beginning with the estimation thereof by deflating the prediction polynomial to a reduced order polynomial,
- successively determining the next quadratic factor for said next successive current frame of digital speech data by initially estimating said next quadratic factor for said next successive current frame of digital speech data based upon the roots as extracted from the previous frame of digital speech data, and thereafter determining the next quadratic factor for said next successive current frame of digital speech data beginning with the estimation thereof in a continuing sequence until the prediction polynomial is reduced to a remaining quadratic polynomial factor,
- sorting the respective quadratic factors for said next successive current frame of digital speech data in the order of increasing bandwidth of the roots indicated thereby, and
- extracting roots for said next successive current frame of digital speech data based upon the sequence of the order of increasing bandwidth;
- utilizing the extracted roots as speech formant candidates; and
- determining the speech formants from the extracted roots as speech formant candidates in representing the analog speech signal as a compressed encoded form of digital speech signals.
- 4. A method as set forth in claim 3, further including storing or transmitting the speech formants as determined from the speech formant candidates provided by the extracted roots as digital speech signals representative of the analog speech signal.
- 5. A method as set forth in claim 3, wherein the root of the first quadratic factor for the current frame of digital speech data is estimated as the same as the smallest bandwidth root as determined from the previous frame of digital speech data.
- 6. A method as set forth in claim 5, wherein the determination of the first quadratic factor and respective successive quadratic factors of the prediction polynomial includes
- deflating the prediction polynomial to a reduced order polynomial by successively iterating the prediction polynomial with coefficient values corresponding to the deflated polynomial being progressively incremented in magnitude for each iteration until convergence occurs when the coefficient values correspond to a quadratic factor of the prediction polynomial.
- 7. A method as set forth in claim 6, further including
- checking for convergence as a bounds on the sum of the absolute values of the step increments du and dv of the coefficient values of the quadratic factor in accordance with the following relationship:
- .vertline.du.vertline.+.vertline.dv.vertline..ltoreq..epsilon., where
- .epsilon.is a constant magnitude lying in the range of 10.sup.-2 to 10.sup.-6.
- 8. A method as set forth in claim 5, wherein the root of the next quadratic factor after said first quadratic factor for the current frame of digital speech data is estimated as the same as the second smallest bandwidth root as determined from the previous frame of digital speech data.
Parent Case Info
This is a continuation of application Ser. No. 743,189, filed June 10, 1985, abandoned Mar. 27, 1989.
US Referenced Citations (7)
Non-Patent Literature Citations (3)
| Entry |
| Stark, Introduction to Numerical Methods, MacMillan Publishing Co., NY, 1970, pp. 85-91 and 96-113. |
| Henrici, Elements of Numerical Analysis, John Wiley & Sons, 1964, pp. 110-115. |
| Markel et al, Linear Prediction of Speech, Springer-Verlag, Berlin Heidelberg, 1976, pp. 94-95. |
Continuations (1)
|
Number |
Date |
Country |
| Parent |
743189 |
Jun 1985 |
|