MONITORING VOICE QUALITY IN COMMUNICATION NETWORKS

Information

  • Patent Application
  • 20080014883
  • Publication Number
    20080014883
  • Date Filed
    July 14, 2006
    18 years ago
  • Date Published
    January 17, 2008
    16 years ago
Abstract
A method for monitoring voice quality in a communication network. In a first embodiment, the method comprises the steps of: establishing a voice communication pathway (VCPW) between first and second telephone terminals; establishing speech cadence between first and second terminals, wherein the terminals alternate between opposing send and listen/receive modes such that when one of the terminals is sending a voice test message the other terminal is either listening for or receiving the voice test message and vice versa; and calculating a voice quality score for one or more received voice test messages. In a second embodiment, synchronous cadence is established between a plurality of telephone terminals and a voice recognition server (VRS).
Description

BRIEF DESCRIPTION OF THE DRAWINGS


FIG. 1 is a schematic that shows how cadence C is achieved following establishment of a voice communication pathway (VCPW) between first and second mobile terminals MT#1 and MT#1 according to the first embodiment of the invention.



FIG. 2 shows the situation where MT#1 and MT#2 do not establish cadence.



FIG. 3 shows how cadence is used to alternately send and receive voice test messages (VTMs) in synchrony between MT#1 and MT#2 according to the first embodiment of the invention.



FIG. 4 shows an interaction diagram illustrating the interaction across a communication network during cadence between MT#1 and MT#2 according to the first embodiment of the invention.



FIG. 5 is a schematic illustrating the remote monitoring of voice quality across a plurality of VCPWs according to the second embodiment of the invention.



FIG. 6 illustrates a variation of the second embodiment shown in FIG. 5.



FIG. 7 illustrates a non-limiting example of how cadence is achieved between a plurality of telephone terminals and a voice recognition server (VRS).



FIG. 8 shows another non-limiting example of how cadence is achieved between a plurality of telephone terminals and a VRS.





DETAILED DESCRIPTION OF THE INVENTION

This invention is directed to telecommunication systems and more particularly to a method and system for monitoring voice quality based on establishing speech cadence established over one or more voice communication pathways.


In a first embodiment, the invention is directed to a method for monitoring voice quality in a communication network, comprising the steps of: establishing a voice communication pathway (VCPW) between first and second telephone terminals; establishing speech cadence between first and second terminals, wherein the terminals switch between opposing send and listen/receive modes such that when one of the terminals is sending a voice test message over the VCPW the other terminal is either listening for or receiving the voice test message and vice versa; storing one or more received voice test messages in a suitable format (such as, but not limited to, WAV format) and/or calculating a voice quality score for one or more received voice test messages.


In a second embodiment, the invention is directed to a method for remotely monitoring voice quality across a plurality of voice communication pathways (VCPWs), comprising the steps of: establishing a plurality of voice communication pathways between a voice recognition server (VRS) and a plurality of telephone terminals (such as, but not limited to, a plurality of mobile terminals MT#1 . . . MT#N); establishing speech cadence between the VRS and the plurality of telephone terminals; converting one or more of the voice test messages received by the VRS into digital files such as, but not limited to, WAV files; forwarding one or more of the digital files to a remote computer for calculating a voice quality scores. It should be understood that the terms “received voice test message”, “received VTM”, “modified voice test message” and “received MVTM” are equivalent terms that refer to a received voice test message.


Any suitable method can be employed to calculate voice quality scores such as, but not limited to, the industry standard MOS equivalent score. A MOS equivalent score can be obtained by comparing the difference between the received VTM and the corresponding reference (i.e., original) voice test message (VTM) to produce an error signal, which can be normalized using known MOS score data for a given network to provide a MOS equivalent score as described, for example, in U.S. Pat. No. 6,330,428. Alternatively, voice quality scoring based on voice recognition in combination with a voice quality measurement application as described in U.S. Pat. No. 6,700,953; U.S. Pat. No. 6,700,953 is incorporated by reference herein in its entirety.


One or more of the voice quality scores can be directed to a remote computer. Alternatively, one or more modified voice test messages (MVTMs, i.e., received voice test messages) can be converted into any standard format such as, but not limited to, WAV format files. The WAV files can be directed to a remote computer for display or further processing to generate objective voice quality scores. For example, MVTMs can be converted to WAV files and directed to one or more remote computers via the Internet using, for example, http or https (combination of a normal http interaction over an encrypted secure socket layer (SSL) or transport layer security (TLS)). Thus, remote computers can perform voice quality (VQ) scoring in real time on any VCPW of interest. In addition, an operator is permitted to to listen to received VTMs by means of D/A conversion to subjectively monitor VCPWs of interest.


With respect to the first embodiment, speech cadence can be established between any suitable telephone terminals such as part number 250 in U.S. Pat. No. 6,330,428 (Lewis et al); U.S. Pat. No. 6,330,428 is incorporated herein by reference in its entirety. The voice terminal may comprise a network interface, standard terminal circuitry, a voice coder and a voice decoder as described in the U.S. Pat. No. 6,330,428 patent; such devices typically include an accurate clock. Voice messages can be sent, for example, over a voice communication pathway (VCPW) between a first mobile terminal (MT#1) and a second mobile terminal (MT#2).


Still referring to the first embodiment, speech cadence is established between MT#1 and MT#2; however, MT#1 and/or MT#2 can be replaced with any other telephony device such as a regular fixed telephone linked to a public switching telephone network (PSTN) or an integrated services digital network (ISDN). It should be understood that the examples used to describe the first embodiment are not intended to limit the setup at either end of the voice communication pathway (VCPW) under test. Either end of the voice communication pathway under test can take the form of a fixed end and/or a mobile end.


In accordance with the first embodiment of the invention, speech cadence is typically established over a voice communication pathway (VCPW) in one cycle of SEND and LISTEN/RECEIVE (“L/R”) as shown in FIG. 1. A first telephone terminal MT#1 is initially set to SEND mode and a second telephone terminal MT#2 at the other end of the VCPW under test is set in L/R (LISTEN/RECEIVE mode). Thus, the second terminal can be left in L/R to receive a voice test message (VTM) from the first terminal (see FIG. 3 and accompanying description below).



FIG. 1 shows how cadence C is achieved following establishment of a VCPW between first and second mobile terminals MT#1 and MT#1 according to the first embodiment of the invention. MT#1 and MT#2 achieve synchronized cadence within one complete cycle of L/R and SEND modes. In this example, MT#1 listened to MT#2's speech and adjusted its clock at 102 to be in synchronized cadence with MT#1. More specifically, MT#2 stretched its first L/R period 104 to synchronous time point Ts at 102, where Ts represents the end of MT#1's first SEND period 106. The subscript “S” in Ts represents synchrony between MT#1 and MT#2. By way of contrast, FIG. 2 shows the situation where MT#1 and MT#2 are not directed to establish cadence (represented by labels no-cadence, “NC” and lack of synchrony TNS).


Referring to FIG. 3, once cadence has been achieved voice test messages are sent and received alternately in synchrony between MT#1 and MT#2. In this illustration, speech cadence is continued between MT#1 and MT#2 as follows: at TS MT#1 is set to SEND mode at 120, and MT#2 is set to L/R mode at 140. At some point between TS and TS+m, MT#1 transmits a voice test message #1 (VTM#1) at 160 through a voice communication pathway (VCPW) in communication network 180, and MT#2 receives VTM#1 as modified voice test message #1 (MVTM#1) at 200. VTMs are original voice test messages and MVTMs are VTMs that have passed through a communication network under test or more particularly a voice communication pathway under test.


Still referring to FIG. 3, between TS+m and TS+2m, mobile terminals MT#1 and MT#2 are reset to L/R AND SEND modes at 220 and 240, respectively. MT#2 transmits a voice test message #2 (VTM#2) at 260 through communication network 180. MT#1 receives VTM#2 as modified voice test message #2 (MVTM#2) at 300.


Still referring to FIG. 3, between TS+2m and TS+3m, mobile terminals MT#1 and MT#2 are reset to SEND AND L/R modes at 320 and 340, respectively. MT#1 transmits a voice test message #3 (VTM#3) at 360 through communication network 180. MT#2 receives VTM#3 as modified voice test message #3 (MVTM#3) at 400.


Still referring to FIG. 3, between TS+3m and TS+4m, mobile terminals MT#1 and MT#2 are reset to L/R and SEND modes at 420 and 440, respectively. MT#2 transmits a voice test message #4 (VTM#4) at 460 through communication network 180. MT#1 receives VTM#4 as modified voice test message #4 (MVTM#4) at 500.


Still referring to FIG. 3, between TS+4m and TS+5m, mobile terminals MT#1 and MT#2 are reset to SEND and L/R modes at 520 and 540, respectively. MT#1 transmits a voice test message #5 (VTM#5) at 560 through communication network 180. MT#2 receives VTM#5 as modified voice test message #5 (MVTM#5) at 600.


Still referring to FIG. 3, between TS+5m and TS+6m, mobile terminals MT#1 and MT#2 are reset L/R and SEND modes at 620 and 640, respectively. MT#2 transmits a voice test message #6 (VTM#6) at 660 through communication network 180. MT#1 receives VTM#6 as modified voice test message #6 (MVTM#6) at 700.


The alternating synchronous cycles of SEND and L/R modes can continue to provide real time monitoring of VCPW 180. More specifically, if a voice quality test unit (VQTU, see FIG. 4 and description below) is operably coupled to MT#1 and/or MT#2 then voice quality scores can be generated upon receipt of each MVTM, i.e., a VQTU attached to MT#1 would be able to evaluate the voice quality of the communication pathway between MT#2 and MT#1. Conversely, a VQTU attached to MT#2 would be able to evaluate the voice quality of the communication pathway between MT#1 and MT#2.


Any suitable voice quality test unit (VQTU) (shown in FIG. 4 as members 880 and 900) can be used such as, but not limited to, that described in U.S. Pat. No. 6,330,428 in which an example of a voice quality test unit is described in terms of a voice quality performance evaluator (VQPE). U.S. Pat. No. 6,330,428 is incorporated herein by reference in its entirety.


Referring to FIG. 4, which is an interaction diagram 800 illustrating the interaction across a communication network 820 during cadence between a first mobile terminal (MT#1) at 840 and a second mobile terminal (MT#2) at 860. MT#1 and MT#2 are respectively adapted to receive and process instructions from voice quality test units VQTU#1 and VQTU#2. Hence, MT#1 and MT#2 are shown operably coupled to VQTU#1 at 880 and VQTU#2 at 900, respectively. VQTU#1 and/or VQTU#2 could take the form of a laptop computer or personal computer upon which software to set up speech cadence and application software to conduct voice quality evaluations is pre-loaded and in executable form.


Still referring to FIG. 4, VQTU#1 at 880 instructs MT#1 at 1000 to initialize to SEND (S) mode and send a first voice test message (VTM#1) to the second mobile terminal MT#2 at time T0. Upon sending VTM#1, VQTU#1 instructs MT#1 at 1020 to switch to L/R mode in anticipation of receiving VTM#2 (as MVTM#2) from MT#2.


Still referring to FIG. 4, at T0 or at some point prior to T0, the second voice quality test unit VQTU#2 instructs MT#2 to switch to L/R mode at 1040 to receive VTM#1, which is received as MVTM#1 (modified voice test message #1, i.e., VTM#1 after transmission through network 820). Upon receiving MVTM#1, VQTU#2 instructs MT#2 to switch to SEND (“S”) mode and send VTM#2 to MT#1 at 1060. Immediately after sending VTM#2, VQTU#2 instructs MT#2 to switch to L/R mode at 1080 in anticipation of receiving MVTM#3 at 1100. Upon receiving MVTM#3, VQTU#2 instructs MT#2 to switch to send mode at 1200 and transmit VTM#N at 1220. At any convenient juncture, VQTU#1 and/or VQTU#2 can compute voice quality scores based on received MVTMs (modified voice test messages) and direct such scores to remove remote displays and/or servers at 1240 and 1260, respectively, for display and/or statistical analysis.


Still referring to FIG. 4, VTM#2 is received as MVTM#2 by MT#1 at 1120. Upon receiving MVTM#2, VQTU#1 instructs MT#1 to switch to SEND (“S”) mode and send VTM#3 to MT#2 at 1140. Immediately after sending VTM#3, VQTU#1 instructs MT#1 to switch to L/R mode at 1160 in anticipation of receiving MVTM#N at 1180.


The invention is also amenable to objective voice quality scoring based on a voice recognition application in combination with a quality measurement application as described in U.S. Pat. No. 6,700,953; U.S. Pat. No. 6,700,953 is incorporated by reference herein in its entirety. Voice recognition application in combination with a quality measurement application can be used to obtain objective voice quality scoring upon receipt of each MVTM at one or both ends of the voice communication pathway channel under test. For example, MVTMs received during cadence at both ends of the voice communication pathway may be converted to a digital signal, and compared to corresponding reference phrases to generate objective voice quality scores, which can be directed to a remote server for display and/or further statistical processing.


Referring to FIG. 5, which speaks to the second embodiment of the invention, a method is provided for remotely monitoring voice quality across a plurality of VCPWs (represented as arrows 3000 in FIG. 5). A plurality of VCPWs 3000 is established between a voice recognition server (VRS) 3020 and a plurality of telephone terminals 3040, which are shown in FIG. 5 as a plurality of mobile terminals MT#1 . . . MT#N. It should be understood that the plurality of telephone terminals 3040 might be a mix of mobile terminals and fixed telephones.


The second embodiment lends itself to scalability. For example, the VRS 3020 can take the form of separate processors operating in parallel to handle greater numbers of incoming VTMs from greater and greater number of MTs 3040. For example, a VRS capable of communicating simultaneously with 100 MTs could be coupled to a further VRS capable of communicating simultaneously with another 100 MTs to provide VRS functionality able to communicate with up to 200 MTs.


One or more of the voice test messages received by the VRS are converted into WAV files, which are stored on a database 3060. The WAV files are accessible by one or more remote computers 3080 via the Internet 3100. Voice quality (VQ) algorithms are loaded on one or more of the remote computers to perform VQ scoring. The terms “plurality of VCPWs” and “plurality of communication lines” are regarded as equivalent terms.


In a variation of the second embodiment (see FIG. 6), one or more of the telephone terminals 3040 are operably linked to one or more voice quality test units VQTU(s) 3120 thereby allowing voice test messages received by the telephone terminals 3040 to be scored for voice quality.


Cadence between the VRS 3020 and the plurality of telephone terminals 3040 can be established in a variety of ways. In FIG. 7, the VRS runs a number of virtual clocks which allows the VRS to adjust itself and establish cadence with every MT operably linked to the VRS.


Cadence between the VRS 3020 and the plurality of telephone terminals 3040 can be established in a variety of ways. In FIG. 8, each MT individually adjusts its internal clock to achieve cadence with the VRS 3020.


The second embodiment of the invention provides a method for voice quality scoring (i.e., voice quality monitoring) in a communication network, comprising the steps of: establishing voice communication pathways between a plurality of telephone terminals and a voice recognition server (VRS); establishing speech cadence between the plurality of telephone terminals and the VRS, wherein voice test messages are alternately sent and received in synchrony between the plurality of telephone terminals and the VRS; and calculating a voice quality score for one or more received voice test messages.


Moreover, the invention is particularly suited for providing an objective measurement, in real-time, of voice quality transmitted over a voice communication pathway (i.e. a voice communication pathway channel across one or more telecom communication networks). The applicant is unaware of any other system or method that provides objective voice quality scores in real-time in both directions along a voice communication pathway.


It is to be understood that the present invention is not limited to the embodiments described above, but encompasses any and all embodiments within the scope of the following claims.

Claims
  • 1. A method for monitoring voice quality in a communication network, comprising: establishing a voice communication pathway between first and second telephone terminals;establishing speech cadence between first and second terminals, wherein the terminals switch between opposing send and listen/receive modes such that when one of the terminals is sending a voice test message the other terminal is either listening for or receiving the voice test message and vice versa; andcalculating a voice quality score for one or more received voice test messages.
  • 2. The method according to claim 2, wherein the step of calculating a voice quality score for one or more received voice test messages uses a voice recognition application in combination with a quality measurement application, wherein at least one of the received voice test messages is converted to a digital signal, wherein the digital signal is compared to a corresponding reference phrase to generate an objective voice quality score.
  • 3. The method according to claim 1, wherein the step of calculating a voice quality score for one or more received voice test messages is preceded by the steps of: identifying a corresponding voice reference message for one or more received voice test messages, andsynchronizing at least one received voice test message with corresponding reference voice messages to provide one or more pairs of aligned test-reference messages.
  • 4. The method according to claim 3, wherein the voice communication pathway spans a public cellular network (PCN) linked a public switched network (PSTN), and wherein the first terminal is a mobile terminal in communication with the PCN, and the second terminal is a phone operably coupled to the PSTN.
  • 5. The method according to claim 3, wherein the communication network comprises a public cellular network (PCN) linked an integrated services digital network (ISDN), wherein the first terminal is a mobile terminal in communication with the PCN, and the second terminal is a phone operably coupled to the ISDN.
  • 6. The method according to claim 1 further comprising the step of directing one or more voice quality scores to a remote computer.
  • 7. The method according to claim 1, further comprising the step of converting one or more received test messages into WAV format files, and directing the WAV files to a remote computer.
  • 8. A method for monitoring voice quality in a communication network, comprising: establishing voice communication pathways between a plurality of telephone terminals and a voice recognition server (VRS);establishing speech cadence between the plurality of telephone terminals and the VRS, wherein voice test messages are alternately sent and received in synchrony between the plurality of telephone terminals and the VRS; andcalculating a voice quality score for one or more received voice test messages.
  • 9. The method for monitoring voice quality in a communication network according to claim 8, wherein the VRS is capable of operating in parallel with at least one additional VRS unit.
  • 10. The method for monitoring voice quality in a communication network according to claim 8, wherein one or more received voice test messages are converted to one or more WAV format files, wherein voice quality scores are calculated on at least one of the WAV format files.
  • 11. The method for monitoring voice quality in a communication network according to claim 10, wherein the WAV files are sent over the Internet to remote computers for voice quality scoring.