Claims
- 1. A method for producing comfort noise (CN) in a digital mobile terminal that uses a discontinuous transmission, comprising the steps of:in response to a speech pause, calculating random excitation spectral control (RESC) parameters; and transmitting the RESC parameters to a receiver together with predetermined ones of CN parameters.
- 2. A method as in claim 1, wherein the step of calculating RESC parameters includes a step of analyzing a residual signal in a speech coder.
- 3. A method as in claim 2, wherein the speech coder implements a LPC analysis technique, and wherein the step of analyzing is of lower degree than the LPC analysis technique.
- 4. A method as in claim 2, wherein the speech coder implements a LPC analysis technique of order greater than two, and wherein the step of analyzing is performed by first or second order LPC analysis.
- 5. A method as in claim 1, wherein the step of calculating RESC parameters includes steps of analyzing a residual signal in a speech coder to produce spectral parameters, and averaging the spectral parameters over a plurality of frames to provide RESC parameters.
- 6. A method as in claim 3, wherein the plurality of frames is equal to about 10 or greater.
- 7. A method as in claim 1, wherein the step of calculating RESC parameters includes steps of applying an LPC residual signal from a speech coder inverse filter to a RESC inverse filter HRESC(Z) to produce a spectrally controlled residual signal which generally has a flatter spectrum than the LPC residual signal.
- 8. A method as in claim 7, wherein the RESC inverse filter HRESC(Z) has the form of an all-zero filter described by: HRESC(z)=1-∑i=1R b(i)z-i,where b(i) represents filter coefficients, with i=1, . . . , R.
- 9. A method as in claim 7, and further comprising a step of determining an excitation gain from the spectrally flattened residual signal.
- 10. A method as in claim 1, wherein the step of shaping includes steps of:forming an excitation by generating a white noise excitation sequence; scaling the generated white noise sequence to produce a scaled noise sequence; and processing the scaled noise sequence in a RESC filter to produce an excitation having a desired spectral content.
- 11. A method as in claim 1, wherein the step of calculating RESC parameters include a step of:applying an LPC residual signal from a speech coder inverse filter to a RESC inverse filter HRESC(Z) to produce a spectrally controlled residual signal which generally has a flatter spectrum than the LPC residual signal, wherein the RESC inverse filter HRESC(Z) has the form of an all-zero filter described by: HRESC(z)=1-∑i=1R b(i)z-i,where b(i) represents filter coefficients, with i=1, . . . ,R; andwherein the step of shaping includes steps of, forming an excitation by generating a white noise excitation sequence; scaling the generated white noise sequence to produce a scaled noise sequence; and processing the scaled noise sequence in a RESC filter to produce an excitation having a desired spectral content; wherein the RESC filter performs an inverse operation to the RESC inverse filter and is of the form: 1/HRESC(z)=11-∑i=1R b(i)z-i.
- 12. A method as in claim 1, wherein RESC parameters rmean(i), i=1, . . . ,R define the filter coefficients b(i), i=1, . . . , R, are transmitted as part of the predetermined one of the CN parameters, and are used in the RESC filter to spectrally weight the excitation for the synthesis filter.
- 13. Apparatus for generating comfort noise (CN) in a system that uses a discontinuous transmission to a network, comprising:means in said digital mobile terminal that is responsive to a speech pause for calculating random excitation spectral control (RESC) parameters and for transmitting the RESC parameters together with predetermined ones of CN parameters to a receiver in said network.
- 14. Apparatus as in claim 13, wherein said calculating means analyses a residual signal in a speech coder.
- 15. Apparatus as in claim 14, wherein the speech coder implements a LPC analysis technique, and wherein the analysis is of lower degree than the LPC analysis technique.
- 16. Apparatus as in claim 14, wherein the speech coder implements a LPC analysis technique of order greater than two, and wherein the analysis is performed by first or second order LPC analysis.
- 17. Apparatus as in claim 13, wherein said calculating means analyses a residual signal in a speech coder to produce spectral parameters, and further comprising means for averaging the spectral parameters over a plurality of frames to provide RESC parameters.
- 18. Apparatus as in claim 17, wherein the plurality of frames is equal to about 10 or greater.
- 19. Apparatus as in claim 13, wherein said calculating means applies an LPC residual signal from a speech coder inverse filter to a RESC inverse filter HRESC(Z) to produce a spectrally controlled residual signal which generally has a flatter spectrum than the LPC residual signal.
- 20. Apparatus as in claim 19, wherein the RESC inverse filter HRESC(Z) has the form of an all-zero filter described by: HRESC(z)=1-∑i=1R b(i)z-i,where b(i) represents filter coefficients, with i=1. . . , R.
- 21. Apparatus as in claim 19, and further comprising means for determining an excitation gain from the spectrally flattened residual signal.
- 22. Apparatus as in claim 13, wherein said shaping means is comprised of:means for forming an excitation by generating a white noise excitation sequence; means for scaling the generated white noise sequence to produce a scaled noise sequence; and means for processing the scaled noise sequence in a RESC filter to produce an excitation having a desired spectral content.
- 23. Apparatus as in claim 13, wherein said calculating means is comprised of:means for applying an LPC residual signal from a speech coder inverse filter to a RESC inverse filter HRESC(z) to produce a spectrally controlled residual signal which generally has a flatter spectrum than the LPC residual signal, wherein the RESC inverse filter HRESC(Z) has the form of an all-zero filter described by: HRESC(z)=1-∑i=1R b(i)z-i,where b(i) represents filter coefficients, with i=1, . . . ,R; and wherein said shaping means is comprised of, means for forming an excitation by generating a white noise excitation sequence; means for scaling the generated white noise sequence to produce a scaled noise sequence; and means for processing the scaled noise sequence in a RESC filter to produce an excitation having a desired spectral content; wherein RESC filter performs an inverse operation to the RESC inverse filter and is of the form: 1/HRESC(z)=11-∑i=1R b(i)z-i.
- 24. Apparatus as in claim 23, wherein RESC parameters rmean(i), i=1, . . . ,R define the filter coefficients b(i), i=1, . . . , R, are transmitted as part of the predetermined ones of the CN parameters, and are used in the RESC filter to spectrally weight the excitation for the synthesis filter.
- 25. A method for producing comfort noise (CN) in a digital mobile terminal receiver that uses a discontinuous transmission, comprising the steps of:receiving random excitation spectral (RESC) parameters; and shaping the spectral content of an excitation using the received RESC parameters prior to applying the excitation to a synthesis filter.
- 26. A method as in claim 25, wherein the step of calculating RESC parameters includes a step of analyzing a residual signal in a speech coder.
- 27. A method as in claim 26, wherein the speech coder implements a LPC analysis technique, and wherein the step of analyzing is of lower degree than the LPC analysis technique.
- 28. A method as in claim 26, wherein the speech coder implements a LPC analysis technique of order greater than two, and wherein the step of analyzing is performed by first or second order LPC analysis.
- 29. A method as in claim 25, wherein the step of calculating RESC parameters includes steps of analyzing a residual signal in a speech coder to produce spectral parameters, and averaging the spectral parameters over a plurality of frames to provide RESC parameters.
- 30. A method as in claim 29, wherein the plurality of frames is equal to about 10 or greater.
- 31. A method as in claim 25, wherein the step of calculating RESC parameters includes steps of applying an LPC residual signal from a speech coder inverse filter to a RESC inverse filter HRESC(Z) to produce a spectrally controlled residual signal which generally has a flatter spectrum than the LPC residual signal.
- 32. A method as in claim 31, wherein the RESC inverse filter HRESC(Z) has the form of an all-zero filter described by: HRESC(z)=1-∑i=1R b(i)z-i,where b(i) represents filter coefficients, with i=1, . . . ,R.
- 33. A method as in claim 31, and further comprising a step of determining an excitation gain from the spectrally flattened residual signal.
- 34. A method as in claim 25, wherein the step of shaping includes steps of:forming an excitation by generating a white noise excitation sequence; scaling the generated white noise sequence to produce a scaled noise sequence; and processing the scaled noise sequence in a RESC filter to produce an excitation having a desired spectral content.
- 35. A method as in claims 25, wherein the step of calculating RESC parameters include a step of:applying an LPC residual signal from a speech coder inverse filter to a RESC inverse filter HRESC(Z) to produce a spectrally controlled residual signal which generally has a flatter spectrum than the LPC residual signal, wherein the RESC inverse filter HRESC(Z) has the form of an all-zero filter described by: HRESC(z)=1-∑i=1R b(i)z-i,where b(i) represents filter coefficients, with i=1, . . . ,R; and wherein the step of shaping includes steps of, forming an excitation by generating a white noise excitation sequence; scaling the generated white noise sequence to produce a scaled noise sequence; and processing the scaled noise sequence in a RESC filter to produce an excitation having a desired spectral content; wherein the RESC filter performs an inverse operation to the RESC inverse filter and is of the form: 1/HRESC(z)=11-∑i=1R b(i)z-i.
- 36. A method as in claim 35, wherein RESC parameters rmean(i), i=1, . . . ,R define the filter coefficients b(i), i=1, . . . , R, are transmitted as part of the predetermined one of the CN parameters, and are used in the RESC filter to spectrally weight the excitation for the synthesis filter.
- 37. Mobile terminal apparatus for generating comfort noise (CN) in a system that uses a discontinuous transmission to a network, comprising:means in said mobile terminal for shaping the spectral content of an excitation using received excitation spectral control (RESC) parameters prior to applying the excitation to a synthesis filter.
- 38. Apparatus as in claim 37, wherein said calculating means analyses a residual signal in a speech coder.
- 39. Apparatus as in claim 38, wherein the speech coder implements a LPC analysis technique, and wherein the analysis is of lower degree than the LPC analysis technique.
- 40. Apparatus as in claim 38, wherein the speech coder implements a LPC analysis technique of order greater than two, and wherein the analysis is performed by first or second order LPC analysis.
- 41. Apparatus as in claim 37, wherein said calculating means analyses a residual signal in a speech coder to produce spectral parameters, and further comprising means for averaging the spectral parameters over a plurality of frames to provide RESC parameters.
- 42. Apparatus as in claim 41, wherein the plurality of frames is equal to about 10 or greater.
- 43. Apparatus as in claim 37, wherein said calculating means applies an LPC residual signal from a speech coder inverse filter to a RESC inverse filter Hhd HESC(Z) to produce a spectrally controlled residual signal which generally has a flatter spectrum than the LPC residual signal.
- 44. Apparatus as in claim 43, wherein the RESC inverse filter HRESC(z) has the form of an all-zero filter described by: HRESC(z)=1-∑i=1R b(i)z-i,where b(i) represents filter coefficients, with i=1, . . . ,R.
- 45. Apparatus as in claim 43, and further comprising means for determining an excitation gain from the spectrally flattened residual signal.
- 46. Apparatus as in claim 37, wherein said shaping means is comprised of:means for forming an excitation by generating a white noise excitation sequence; means for scaling the generated white noise sequence to produce a scaled noise sequence; and means for processing the scaled noise sequence in a RESC filter to produce an excitation having a desired spectral content.
- 47. Apparatus as in claim 37, wherein said calculating means is comprised of:means for applying an LPC residual signal from a speech coder inverse filter to a RESC inverse filter HRESC(Z) to produce a spectrally controlled residual signal which generally has a flatter spectrum than the LPC residual signal, wherein the RESC inverse filter HRESC(Z) has the form of an all-zero filter described by: HRESC(z)=1-∑i=1R b(i)z-i,where b(i) represents filter coefficients, with i=1, . . . ,R; and wherein said shaping means is comprised of, means for forming an excitation by generating a white noise excitation sequence; means for scaling the generated white noise sequence to produce a scaled noise sequence; and means for processing the scaled noise sequence in a RESC filter to produce an excitation having a desired spectral content; wherein RESC filter performs an inverse operation to the RESC inverse filter and is of the form: 1/HRESC(z)=11-∑i=1R b(i)z-i.
- 48. Apparatus as in claim 47, wherein RESC parameters rmean(i), i=1, . . . ,R define the filter coefficients b(i), i=1, . . . , R, are transmitted as part of the predetermined ones of the CN parameters, and are used in the RESC filter to spectrally weight the excitation for the synthesis filter.
- 49. A method for producing comfort noise (CN) in a network element that uses a discontinuous transmission, comprising the steps of:receiving excitation spectral control (RESC) parameters; and shaping the spectral content of an excitation using the received RESC parameters prior to applying the excitation to a synthesis filter.
- 50. A method as in claim 49, wherein the step of calculating RESC parameters includes a step of analyzing a residual signal in a speech coder.
- 51. A method as in claim 50, wherein the speech coder implements a LPC analysis technique, and wherein the step of analyzing is of lower degree than the LPC analysis technique.
- 52. A method as in claim 50, wherein the speech coder implements a LPC analysis technique of order greater than two, and wherein the step of analyzing is performed by first or second order LPC analysis.
- 53. A method as in claim 49, wherein the step of calculating RESC parameters includes steps of analyzing a residual signal in a speech coder to produce spectral parameters, and averaging the spectral parameters over a plurality of frames to provide RESC parameters.
- 54. A method as in claim 53, wherein the plurality of frames is equal to about 10 or greater.
- 55. A method as in claim 49, wherein the step of calculating RESC parameters includes steps of applying an LPC residual signal from a speech coder inverse filter to a RESC inverse filter HRESC(Z) to produce a spectrally controlled residual signal which generally has a flatter spectrum than the LPC residual signal.
- 56. A method as in claim 55, wherein the RESC inverse filter HRESC(Z) has the form of an all-zero filter described by: HRESC(z)=1-∑i=1R b(i)z-i,where b(i) represents filter coefficients, with i=1, . . . ,R.
- 57. A method as in claim 55, and further comprising a step of determining an excitation gain from the spectrally flattened residual signal.
- 58. A method as in claim 49, wherein the step of shaping includes steps of:forming an excitation by generating a white noise excitation sequence; scaling the generated white noise sequence to produce a scaled noise sequence; and processing the scaled noise sequence in a RESC filter to produce an excitation having a desired spectral content.
- 59. A method as in claim 49, wherein the step of calculating RESC parameters include a step of:applying an LPC residual signal from a speech coder inverse filter to a RESC inverse filter HRESC(Z) to produce a spectrally controlled residual signal which generally has a flatter spectrum than the LPC residual signal, wherein the RESC inverse filter HRESC(Z) has the form of an all-zero filter described by: HRESC(z)=1-∑i=1Rb(i)z-i,where b(i) represents filter coefficients, with i=1, . . . ,R; and wherein the step of shaping includes steps of, forming an excitation by generating a white noise excitation sequence; scaling the generated white noise sequence to produce a scaled noise sequence; and processing the scaled noise sequence in a RESC filter to produce an excitation having a desired spectral content; wherein the RESC filter performs an inverse operation to the RESC inverse filter and is of the form: 1/HRESC(z)=11-∑i=1Rb(i)z-i.
- 60. A method as in claim 59, wherein RESC parameters rmean(i), i=1, . . . ,R define the filter coefficients b(i), i=1, . . . , R, are transmitted as part of the predetermined one of the CN parameters, and are used in the RESC filter to spectrally weight the excitation for the synthesis filter.
- 61. Apparatus for generating comfort noise (CN) in a system having a digital mobile terminal that uses a discontinuous transmission to a network, comprising:means in said network for shaping the spectral content of an excitation using received excitation spectral control (RESC) parameters prior to applying the excitation to a synthesis filter.
- 62. Apparatus as in claim 61, wherein said calculating means analyses a residual signal in a speech coder.
- 63. Apparatus as in claim 63, wherein the speech coder implements a LPC analysis technique, and wherein the analysis is of lower degree than the LPC analysis technique.
- 64. Apparatus as in claim 62, wherein the speech coder implements a LPC analysis technique of order greater than two, and wherein the analysis is performed by first or second order LPC analysis.
- 65. Apparatus as in claim 61, wherein said calculating means analyses a residual signal in a speech coder to produce spectral parameters, and further comprising means for averaging the spectral parameters over a plurality of frames to provide RESC parameters.
- 66. Apparatus as in claim 65, wherein the plurality of frames is equal to about 10 or greater.
- 67. Apparatus as in claim 61, wherein said calculating means applies an LPC residual signal from a speech coder inverse filter to a RESC inverse filter HRESC(Z) to produce a spectrally controlled residual signal which generally has a flatter spectrum than the LPC residual signal.
- 68. Apparatus as in claim 67, wherein the RESC inverse filter HRESC(z) has the form of an all-zero filter described by: HRESC(z)=1-∑i=1Rb(i)z-i,where b(i) represents filter coefficients, with i=1, . . . ,R.
- 69. Apparatus as in claim 67, and further comprising means for determining an excitation gain from the spectrally flattened residual signal.
- 70. Apparatus as in claim 61, wherein said shaping means is comprised of:means for forming an excitation by generating a white noise excitation sequence; means for scaling the generated white noise sequence to produce a scaled noise sequence; and means for processing the scaled noise sequence in a RESC filter to produce an excitation having a desired spectral content.
- 71. Apparatus as in claim 61, wherein said calculating means is comprised of:means for applying an LPC residual signal from a speech coder inverse filter to a RESC inverse filter HRESC(Z) to produce a spectrally controlled residual signal which generally has a flatter spectrum than the LPC residual signal, wherein the RESC inverse filter HRESC(Z) has the form of an all-zero filter described by: HRESC(z)=1-∑i=1Rb(i)z-i,where b(i) represents filter coefficients, with i=1, . . . ,R; and wherein said shaping means is comprised of, means for forming an excitation by generating a white noise excitation sequence; means for scaling the generated white noise sequence to produce a scaled noise sequence; and means for processing the scaled noise sequence in a RESC filter to produce an excitation having a desired spectral content; wherein RESC filter performs an inverse operation to the RESC inverse filter and is of the form: 1/HRESC(z)=11-∑i=1Rb(i)z-i.
- 72. Apparatus as in claim 71, wherein RESC parameters rmean(i), i=1, . . . ,R define the filter coefficients b(i), i=1, . . . , R, are transmitted as part of the predetermined ones of the CN parameters, and are used in the RESC filter to spectrally weight the excitation for the synthesis filter.
- 73. A method for producing comfort noise (CN) in a digital network element that uses a discontinuous transmission, comprising the steps of:in response to a speech pause, calculating random excitation spectral control (RESC) parameters; and transmitting the RESC parameters to a receiver together with predetermined ones of CN parameters.
- 74. A method as in claim 73, wherein the step of calculating RESC parameters includes a step of analyzing a residual signal in a speech coder.
- 75. A method as in claim 74, wherein the speech coder implements a LPC analysis technique, and wherein the step of analyzing is of lower degree than the LPC analysis technique.
- 76. A method as in claim 74, wherein the speech coder implements a LPC analysis technique of order greater than two, and wherein the step of analyzing is performed by first or second order LPC analysis.
- 77. A method as in claim 73, wherein the step of calculating RESC parameters includes steps of analyzing a residual signal in a speech coder to produce spectral parameters, and averaging the spectral parameters over a plurality of frames to provide RESC parameters.
- 78. A method as in claim 77, wherein the plurality of frames is equal to about 10 or greater.
- 79. A method as in claim 73, wherein the step of calculating RESC parameters includes steps of applying an LPC residual signal from a speech coder inverse filter to a RESC inverse filter HRESC(Z) to produce a spectrally controlled residual signal which generally has a flatter spectrum than the LPC residual signal.
- 80. A method as in claim 79, wherein the RESC inverse filter HRESC(z) has the form of an all-zero filter described by: HRESC(z)=1-∑i=1Rb(i)z-i,where b(i) represents filter coefficients, with i=1, . . . ,R.
- 81. A method as in claim 79, and further comprising a step of determining an excitation gain from the spectrally flattened residual signal.
- 82. A method as in claim 73, wherein the step of shaping includes steps of:forming an excitation by generating a white noise excitation sequence; scaling the generated white noise sequence to produce a scaled noise sequence; and processing the scaled noise sequence in a RESC filter to produce an excitation having a desired spectral content.
- 83. A method as in claim 73, wherein the step of calculating RESC parameters include a step of:applying an LPC residual signal from a speech coder inverse filter to a RESC inverse filter HRESC(z) to produce a spectrally controlled residual signal which generally has a flatter spectrum than the LPC residual signal, wherein the RESC inverse filter HRESC(z) has the form of an all-zero filter described by: HRESC(z)=1-∑i=1Rb(i)z-i,where b(i) represents filter coefficients, with i=1, . . . ,R; and wherein the step of shaping includes steps of, forming an excitation by generating a white noise excitation sequence; scaling the generated white noise sequence to produce a scaled noise sequence; and processing the scaled noise sequence in a RESC filter to produce an excitation having a desired spectral content; wherein the RESC filter performs an inverse operation to the RESC inverse filter and is of the form: 1/HRESC(z)=11-∑i=1Rb(i)z-i.
- 84. A method as in claim 83, wherein RESC parameters rmean(i), i=1, . . . ,R define the filter coefficients b(i), i=1, . . . , R, are transmitted as part of the CN parameters, and are used in the RESC filter to spectrally weight the excitation for the synthesis filter.
- 85. Apparatus for generating comfort noise (CN) in a system having a network element that uses a discontinuous transmission, comprising:means in said network element that is responsive to a speech pause for calculating random excitation spectral control (RESC) parameters and for transmitting the RESC parameters together with predetermined ones of CN parameters to a receiver in said network.
- 86. Apparatus as in claim 85, wherein said calculating means analyses a residual signal in a speech coder.
- 87. Apparatus as in claim 86, wherein the speech coder implements a LPC analysis technique, and wherein the analysis is of lower degree than the LPC analysis technique.
- 88. Apparatus as in claim 86, wherein the speech coder implements a LPC analysis technique of order greater than two, and wherein the analysis is performed by first or second order LPC analysis.
- 89. Apparatus as in claim 85, wherein said calculating means analyses a residual signal in a speech coder to produce spectral parameters, and further comprising means for averaging the spectral parameters over a plurality of frames to provide RESC parameters.
- 90. Apparatus as in claim 89, wherein the plurality of frames is equal to about 10 or greater.
- 91. Apparatus as in claim 85, wherein said calculating means applies an LPC residual signal from a speech coder inverse filter to a RESC inverse filter HRESC(z) to produce a spectrally controlled residual signal which generally has a flatter spectrum than the LPC residual signal.
- 92. Apparatus as in claim 91, wherein the RESC inverse filter HRESC(z) has the form of an all-zero filter described by: HRESC(z)=1-∑i=1Rb(i)z-i,where b(i) represents filter coefficients, with i=1, . . . ,R.
- 93. Apparatus as in claim 91, and further comprising means for determining an excitation gain from the spectrally flattened residual signal.
- 94. Apparatus as in claim 85, wherein said shaping means is comprised of:means for forming an excitation by generating a white noise excitation sequence; means for scaling the generated white noise sequence to produce a scaled noise sequence; and means for processing the scaled noise sequence in a RESC filter to produce an excitation having a desired spectral content.
- 95. Apparatus as in claim 85, wherein said calculating means is comprised of:means for applying an LPC residual signal from a speech coder inverse filter to a RESC inverse filter HRESC(z) to produce a spectrally controlled residual signal which generally has a flatter spectrum than the LPC residual signal, wherein the RESC inverse filter HRESC(z) has the form of an all-zero filter described by: HRESC(z)=1-∑i=1Rb(i)z-i,where b(i) represents filter coefficients, with i=1, . . . ,R; and wherein said shaping means is comprised of, means for forming an excitation by generating a white noise excitation sequence; means for scaling the generated white noise sequence to produce a scaled noise sequence; and means for processing the scaled noise sequence in a RESC filter to produce an excitation having a desired spectral content; wherein RESC filter performs an inverse operation to the RESC inverse filter and is of the form: 1/HRESC(z)=11-∑i=1Rb(i)z-i.
- 96. Apparatus as in claim 95, wherein RESC parameters rmean(i), i=1, . . . ,R define the filter coefficients b(i), i=1, . . . , R, are transmitted as part of the predetermined ones of the CN parameters, and are used in the RESC filter to spectrally weight the excitation for the synthesis filter.
- 97. A method for generating comfort noise (CN) in an element of a mobile communications network that uses a discontinuous transmission, comprising the steps of:in response to a speech pause, buffering a set of speech coding parameters; within an averaging period, replacing speech coding parameters of the set that are not representative of background noise with speech coding parameters that are representative of the background noise; and averaging the set of speech coding parameters.
- 98. A method as in claim 97, wherein the step of replacing includes the steps of:measuring distances of the speech coding parameters from one another between individual frames within the averaging period; identifying those speech coding parameters which have the largest distances to the other parameters within the averaging period; and if the distances exceed a predetermined threshold, replacing an identified speech coding parameter with a speech coding parameter which has a smallest measured distance to the other speech coding parameters within the averaging period.
- 99. A method as in claim 97, wherein the step of replacing includes the steps of:measuring distances of the speech coding parameters from one another between individual frames within the averaging period; identifying those speech coding parameters which have the largest distances to the other parameters within the averaging period; and if the distances exceed a predetermined threshold, replacing an identified speech coding parameter with a speech coding parameter having a median value.
- 100. A method as in claim 97, wherein the step of averaging includes a step of computing an average excitation gain gmean and average short term spectral coefficients fmean(i).
- 101. A method as in claim 97, wherein the step of replacing includes steps of:forming a set of buffered excitation gain values over the averaging period; ordering the set of buffered excitation gain values; and performing a median replacement operation in which those L excitation gain values differing the most from the median value, where the difference exceeds a predetermined threshold value, are replaced by the median value of the set.
- 102. A method as in claim 101, wherein a length N of the averaging period is an odd number, and wherein the median of the ordered set is the ((N+1)/2)th element of the set.
- 103. A method as in claim 97, and further comprising a step of:forming a set of buffered Line Spectral Pair (LSP) coefficients f(k), k=1, . . . ,M over the averaging period; and determining a spectral distance of the LSP coefficients fi(k) of the ith frame in the averaging period, to the LSP coefficients fj(k) of the jth frame in the averaging period.
- 104. A method as in claim 103, where the step of determining the spectral distance is accomplished in accordance with the expression Δ Rij=∑k=1M(fi(k)-fj(k))2,where M is the degree of the LPC model, and fi(k) is the kth LSP parameter of the ith frame in the averaging period.
- 105. A method as in claim 103, and further comprising a step of determining the spectral distance ΔSi of the LSP coefficients fi(k) of frame i to the LSP coefficients of all the other frames j=1, . . . ,N, i≠j, within the averaging period of length N.
- 106. A method as in claim 105, wherein the step of determining the spectral distance is accomplished by determining the sum of the spectral distances ΔRij in accordance with Δ Si=∑j=1,j≠iNΔ Rij,for all i=1, . . . ,N.
- 107. A method as in claim 105, and further comprising steps of:after the spectral distances ΔSi have been found for each of the LSP vectors fi within the averaging period, ordering the spectral distances according to their values; considering a vector fi with the smallest distance ΔSi within the averaging period i=1, 2,. . . ,N to be a median vector fmed of the averaging period having a distance denoted as ΔSmed; and performing a median replacement of P (0≦P≦N-1) LSP vectors fi with the median vector fmed.
- 108. A method as in claim 107, wherein the steps of identifying and replacing are performed independently for excitation gain values g and Line Spectral Pair (LSP) vectors fi.
- 109. A method as in claim 98, wherein the steps of identifying and replacing are combined together for excitation gain values g and Line Spectral Pair (LSP) vectors fi.
- 110. A method as in claim 109, comprising steps of:in response to determining that the speech coding parameters in an individual frame are to be replaced by median values of the parameters, replacing both the excitation gain value g and the LSP vector fi of that frame by the respective parameters of the frame containing the median parameters.
- 111. A method as in claim 110, and comprising initial steps of:determining a distance ΔTij between the parameters of the ith frame and the jth frame of the averaging period in accordance with the expression Δ Tij=∑k=1M(fi(k)-fj(k))2+w(gi-gj)2,where M is the degree of the LPC model, fi(k) is the kth LSP parameter of the ith frame of the averaging period, and gi is the excitation gain parameter of the ith frame.
- 112. A method as in claim 111, and further comprising a step of:determining a distance ΔSi of the speech coding parameters of frame i, for all i=1, . . . ,N, to the speech coding parameters of all the other frames j=1, . . . ,N, i≠j within the averaging period of length N, in accordance with Δ Si=∑j=1,j≠iNΔ Tij,for all i=1, . . . ,N.
- 113. A method as in claim 112, wherein after the distances ΔSi have been determined for each of the frames within the averaging period, further comprising steps of:ordering the distances according to their values; and considering a frame with the smallest distance ΔSi within the averaging period i=1,2, . . . ,N as a median frame, having distance ΔSmed, of the averaging period, the median frame having speech coder parameters gmed and fmed.
- 114. A method as in claim 113, and comprising a step of performing median replacement on the speech coding parameter frames within the averaging period i=1,2, . . . ,N wherein parameters gi and fi of L (0≦L≦N-1) frames are replaced by the parameters gmed and fmed of the median frame.
- 115. A method as in claim 113, wherein differences between each individual distance and the median distance are determined by dividing an individual distance by the median distance in accordance with ΔSi/ΔSmed.
- 116. A method as in claim 107, wherein differences between each individual distance and the median distance are determined by dividing an individual distance by the median distance in accordance with ΔSi/ΔSmed.
- 117. Apparatus for generating comfort noise (CN) in an element of a mobile communication network that uses a discontinuous transmission to a network, comprising:data processing means in network element that is responsive to a speech pause for buffering a set of speech coding parameters and, within an averaging period, for replacing speech coding parameters of the set that are not representative of background noise with speech coding parameters that are representative of the background noise, said data processing means averaging the set of speech coding parameters and transmitting the averaged set of speech coding parameters to the mobile terminal.
- 118. Apparatus as in claim 117, wherein said data processor replaces speech coding parameters of the set by ordering the set and measuring distances of the speech coding parameters from one another between individual frames within the averaging period, by identifying those speech coding parameters which have the largest distances to the other parameters within the averaging period; and, if the distances exceed a predetermined threshold, by replacing the identified speech coding parameters with a speech coding parameter which has a smallest measured distance to the other speech coding parameters within the averaging period.
- 119. Apparatus as in claim 117, wherein said data processor replaces speech coding parameters of the set by ordering the set and measuring distances of the speech coding parameters from one another between individual frames within the averaging period; by identifying those speech coding parameters which have the largest distances to the other parameters within the averaging period; and, if the distances exceed a predetermined threshold, by replacing an identified speech coding parameter with a speech coding parameter having a median value.
- 120. Apparatus as in claim 117, wherein said data processing means identifies and replaces speech coding parameters independently for excitation gain values g and Line Spectral Pair (LSP) vector fi.
- 121. Apparatus as in claim 117, wherein said data processing means identifies and replaces speech coding parameters together for excitation gain values g and Line Spectral Pair (LSP) vector fi.
RELATED APPLICATIONS
This application is a continuation application, based on U.S. application for Patent, Ser. No. 08/965,303, filed on Nov. 6, 1997, now U.S. Pat. No. 5,960,389, and Applicant claims priority thereof. Said application Ser. No. 08/965,303 claims the benefit of Provisional Application 60/031,047, filed Nov. 15, 1996 and Provisional Application 60/031,321 filed Nov. 19, 1996. The disclosures of the above cited applications are incorporated herein by reference in their entireties.
US Referenced Citations (9)
Foreign Referenced Citations (2)
Number |
Date |
Country |
WO 9628809 |
Sep 1996 |
WO |
WO 9634382 |
Oct 1996 |
WO |
Non-Patent Literature Citations (4)
Entry |
Paksoy, E. et al., “Variable Bit-Rate Celp Coding of Speech With Phonetic Classification (1)”, European Transactions On Telecommunications And Related Technologies, vol. 5, No. 5, 9/94, pp. 57-67. |
Southcott, C.B. et al., Voice Control Of The Pan-European Digital Mobile Radio System:, Communications Technology For The 1990's And Beyond, vol. 2, Nov. 27, 1989, pp. 1070-1074. |
“European Digital Cellular Telecommunications Systems (Phase 2); Comfort Noise Aspect For Full Rate Speech Traffic Channels (GSM 06.12)” European Telecommunication Standard, 9/94, pp. 1-10. |
“European Digital Cellular Telecommunications System; Half Rate Speech Part 5; Discontinuous Transmission (DTX) For Half Rate Speech Traffic Channels”, European Telecommunication Standard, vol. 300 581-5, pp. 1-3,5,7-16, Nov. 1, 1995. |
Provisional Applications (2)
|
Number |
Date |
Country |
|
60/031047 |
Nov 1996 |
US |
|
60/031321 |
Nov 1996 |
US |
Continuations (1)
|
Number |
Date |
Country |
Parent |
08/965303 |
Nov 1997 |
US |
Child |
09/371332 |
|
US |