Exemplary embodiments relate to an apparatus and method for transmitting and receiving voice data in a wireless communication system.
A speech is compressed by a speech codec (encoder) of a transmission terminal, various types of headers such as real-time transport protocol (RTP), user datagram protocol (UDP), and Internet protocol (IP) headers are attached to the compressed speech, and the header-attached compressed speech is transmitted through a communication modem such as a long-term evolution (LTE) modem. A reception terminal sequentially removes the RTP, UDP, and IP headers and checks whether there is a lost speech frame. A speech frame restored without any loss is decompressed by a speech codec (decoder) from a compressed state, converted into a pulse coded modulation (PCM) signal, and delivered to a speaker.
When a loss in information is discovered during the restoration of speech frames, error concealment for reducing damages in sound quality is carried out by using information about previous frames. If a call is not smooth because speech frames were lost during transmission due to congestion of a transmission path, a speech bit-rate may be adjusted by transmitting a codec mode request (CMR) message to a counterpart terminal such that the bit-rate is temporarily decreased when there is a congestion status and is gradually increased as the congestion status is solved.
According to conventional voice compression techniques such as adaptive multi-rate (AMR) and adaptive multi-rate wideband (AMR-WB) techniques, a voice bandwidth is fixed to a narrowband or a wideband, this indicates that an analog voice signal is converted into a digital signal at 8,000 or 16,000 samples/s and compressed, and this conversion speed does not change during a call. An AMR codec may compress a voice signal digitized at 8,000 samples/s to eight types of bit-rates of 4.75 to 12.2 Kbps and process a voice signal of a band of 300 to 3,400 Hz. An AMR-WB codec may compress a voice signal digitized at 16,000 samples/s to nine types of bit-rates of 6.6 to 23.85 Kbps and process a voice signal of a band of 50 to 7,000 Hz.
Recently, with respect to call quality of AMR-WB voice codecs commercialized in a voice over LTE (VoLTE) service, most listeners think that call quality is improved, but some listeners do not prefer call quality of a high-frequency voice and tend to think that the quality of a conventional AMR voice codec was better. This indicates that a high-frequency voice component may or may not be preferred according to a listener or background noise compressed along with a voice.
Provided are an operating method and apparatus of a transmission terminal and a reception terminal for transmitting/receiving a voice signal of which a sampling rate and/or a bit-rate are mutually adjusted between terminals during a call in consideration of the fact that optimal call quality can be achieved by using another sampling rate according to circumstances even at the same voice bit-rate.
Provided are a method and apparatus for efficiently negotiating, between two terminals, a bit-rate and/or a sampling rate of a voice codec supporting bit-rates of a wider range than the prior art and sampling rates of a wider range than the prior art.
According to an aspect of an exemplary embodiment, an operating method of a transmission terminal for transmitting a voice signal includes: generating sampling- and bit-rate request information including sampling information for determining a sampling rate of a voice signal and bit-rate information for determining a bit-rate of the voice signal and transmitting the generated sampling- and bit-rate request information to a reception terminal; receiving, from the reception terminal, combination determination information by at least one combination of sampling rates determined based on the sampling information and bit-rates determined based on the bit-rate information; and compressing the voice signal according to the received combination determination information and transmitting the compressed voice signal to the reception terminal.
According to an aspect of another exemplary embodiment, an operating apparatus of a transmission terminal for transmitting a voice signal includes: a request information generation unit configured to generate sampling- and bit-rate request information including sampling information for determining a sampling rate of a voice signal and bit-rate information for determining a bit-rate of the voice signal; a control unit configured to control the generated sampling- and bit-rate request information to be transmitted to a reception terminal; an interface unit configured to transmit the sampling- and bit-rate request information to the reception terminal under control of the control unit; and a voice compression unit configured to, if the interface unit receives, from the reception terminal, combination determination information by at least one combination of sampling rates determined based on the sampling information and bit-rates determined based on the bit-rate information, compress the voice signal according to the received combination determination information, wherein the interface unit transmits the compressed voice signal to the reception terminal under control of the control unit.
According to an aspect of another exemplary embodiment, an operating method of a reception terminal for receiving a voice signal includes: receiving, from a transmission terminal, sampling- and bit-rate request information including sampling information for determining a sampling rate of a voice signal and bit-rate information for determining a bit-rate of the voice signal; determining at least one combination of sampling rates determined based on the sampling information and bit-rates determined based on the bit-rate information, according to the sampling- and bit-rate request information; and transmitting the determined combination determination information to the transmission terminal.
According to an aspect of another embodiment, an operating apparatus of a reception terminal for receiving a voice signal includes: an interface unit configured to receive, from a transmission terminal, sampling- and bit-rate request information including sampling information for determining a sampling rate of a voice signal and bit-rate information for determining a bit-rate of the voice signal; a combination determination unit configured to determine at least one combination of sampling rates determined based on the sampling information and bit-rates determined based on the bit-rate information, according to the sampling- and bit-rate request information; and a control unit configured to control the determined combination determination information to be transmitted to the transmission terminal, wherein the interface unit transmits the combination determination information to the transmission terminal under control of the control unit.
According to exemplary embodiments, in a voice over Internet protocol (VoIP) system using a voice codec such as enhanced voice services (EVS) capable of compressing a voice at a plurality of sampling rates and bit-rates, a sampling rate and a bit-rate to be used for a service in the voice codec supporting the plurality of sampling rates and bit-rates may be appropriately negotiated, and a compression scheme of a counterpart terminal may be dynamically adjusted according to a taste of a recipient, voice content, and background noise.
Referring to
In general, examples of types of voice codecs according to bands are as shown in Table 1.
According to Table 1, a voice codec of the NB corresponding to 100 to 3,500 Hz has a sampling rate of 8,000 samples/s and selectively uses, as a bit-rate, one of 7.2, 8, 9.6, 13.2, 16.4, 24.4, 32, 48, 64, 96, and 128. In addition, a voice codec of the WB corresponding to 50 to 7,000 Hz has a sampling rate of 16,000 samples/s and selectively uses one of the 11 bit-rates described above. In addition, a voice codec of the SWB corresponding to 50 to 16,000 Hz has a sampling rate of 32,000 samples/s and selectively uses one of the 11 bit-rates described above. In addition, a voice codec of the FB corresponding to 50 to 20,000 Hz has a sampling rate of 48,000 samples/s and selectively uses one of the 11 bit-rates described above. That is, the voice codecs according to bands selectively use various bit-rates but respectively use fixed sampling rates. Compared with this, the exemplary embodiment allows not only bit-rates but also sampling rates to be selectively used according to types of voice codecs as described below.
According to the voice compression techniques, the number of bit-rates usable during a call is 8, 9, or less, and a difference between a minimum value and a maximum value is not that large as 10-20 Kbps or less. For example, 8 bit-rates between 4.75 Kbps and 12.2 Kbps may be used in the AMR technique, and 9 bit-rates between 6.6 Kbps and 23.85 Kbps may be used in the AMR-WB technique.
According to an exemplary embodiment, the transmission terminal may generate list information of, for example, 8,000, 16,000, 32,000, and 48,000 samples/s corresponding to types of sampling rates as sampling list information corresponding to sampling and/or bit-rate request information. In addition, the transmission terminal may generate list information of, for example, 7.2, 8, 9.6, 13.2, 16.4, 24.4, 32, 48, 64, 96, and 128 as bit-rate list information.
The sampling list information included in the sampling- and bit-rate request information may include sampling identification information corresponding to each of sampling types, and the bit-rate list information may include bit-rate identification information corresponding to each bit-rate type.
Table 2 illustrates identification information according to sampling rates.
According to Table 2, identification information Id is allocated to each of sampling rates.
Table 3 illustrates identification information according to bit-rates.
According to Table 3, identification information Id is allocated to each bit-rate.
The transmission terminal may insert the generated sampling- and bit-rate request information into a payload header. That is, the sampling list information and the bit-rate list information corresponding to the sampling- and bit-rate request information may be added to the payload header.
The payload header includes, together with the sampling list information and the bit-rate list information, sampling type confirmation information of a sampling type being currently used, bit-rate type confirmation information of a bit-rate type being currently used, and the like.
A function of each field of the payload header of
That is, as the bit-rate list information, list information of bit-rate types, for example, 7.2, 8, 9.6, 13.2, 16.4, 24.4, 32, 48, 64, 96, and 128 and respective pieces of identification information 0000 to 1010 corresponding to the list information are included. In addition, as the sampling rate list information, list information of, for example, 8,000, 16,000, 32,000, and 48,000 and respective pieces of identification information 00 to 11 corresponding to the list information are included. In addition, as the bit-rate type confirmation information, list information of a bit-rate used for a current frame and identification information corresponding to the list information are included. In addition, as the sampling type confirmation information, information about a sampling rate used for the current frame and identification information corresponding to the information are included.
The transmission terminal may generate a voice over Internet protocol (VoIP) packet in which a real-time transport protocol (RTP) header, a user datagram protocol (UDP) header, and an Internet protocol (IP) header are sequentially added in addition to a payload header having the sampling- and bit-rate request information, and transmit the generated VoIP packet to the reception terminal.
Referring back to
When receiving sampling- and bit-rate list information including sampling rate list information and bit-rate list information from the transmission terminal, the reception terminal selects EVS from the sampling rate list information and determines any one sampling rate (for example, 32,000 samples/s) as shown in
This combination determination information corresponds to combination information of sampling rate types and bit-rate types determined by using at least one of background noise, audio quality, audio tone of an audio signal including the voice signal.
Therefore, the reception terminal determines proper combination determination information for a voice signal in consideration of background noise, audio quality, a tone of a caller, and the like of a voice signal provided from a counterpart terminal and transmits the determined combination determination information to the counterpart terminal (transmission terminal), and the counterpart terminal (transmission terminal) receives the combination determination information.
After operation S102, the transmission terminal may compress a voice signal according to the received combination determination information and transmit the compressed voice signal to the reception terminal in operation S104. For example, when receiving {(0, 0), (1, 1), (1, 2), (2, 3), (2, 4), (2, 5)} shown in
Referring to
The request information generation unit 200 may insert the generated sampling- and bit-rate request information into a payload header. That is, the sampling list information and the bit-rate list information corresponding to the sampling- and bit-rate request information may be added to the payload header. The payload header includes, together with the sampling list information and the bit-rate list information, sampling type confirmation information of a sampling type being currently used, bit-rate type confirmation information of a bit-rate type being currently used, and the like.
The request information generation unit 200 may generate a VoIP packet in which at least one of an RTP header, a UDP header, and an IP header are added in addition to the payload header having the sampling- and bit-rate request information. As shown in
The control unit 210 may control the sampling- and bit-rate request information including the sampling rate list information and the bit-rate list information to be transmitted to a reception terminal.
The interface unit 230 may transmit the sampling- and bit-rate request information to the reception terminal under control of the control unit 210. Thereafter, the interface unit 230 may receive, from the reception terminal, combination determination information by at least one combination of sampling types selected from the sampling list information and bit-rate types selected from the bit-rate list information.
For example, as shown in
The voice compression unit 240 may compress the voice signal according to the received combination determination information. For example, when receiving {(0, 0), (1, 1), (1, 2), (2, 3), (2, 4), (2, 5)} shown in
The voice compression unit 240 may transmit the compressed voice signal to the reception terminal under control of the control unit 210.
Referring to
The sampling- and bit-rate request information may be received by being inserted into a payload header. That is, the sampling list information and the bit-rate list information corresponding to the sampling- and bit-rate request information may be added to the payload header, and the payload header includes, together with the sampling list information and the bit-rate list information, sampling type confirmation information of a sampling type being currently used, bit-rate type confirmation information of a bit-rate type being currently used, and the like.
The payload header having the sampling- and bit-rate request information may be received in a form of a VoIP packet in which at least one of an RTP header, a UDP header, and an IP header are added. As shown in
After operation S300, the reception terminal may determine at least one combination of sampling rates determined based on the sampling information and bit-rates determined based on the bit-rate information in operation S302. Combination determination information may be determined by at least one combination of sampling types selected from the sampling information and bit-rate types selected from the bit-rate information or by at least one combination of a range of sampling rates determined within the range of the sampling rate range information and a range of bit-rates determined within the range of the bit-rate range information. The sampling rate range information may be differently determined in a sending direction and a receiving direction, and the bit-rate range information may be differently determined in the sending direction and the receiving direction.
The reception terminal may separate the payload header, the RTP header, the UDP header, and the IP header from the received VoIP packet, extract the sampling list information and the bit-rate list information from the separated payload header, and determine the at least one combination by using the extracted sampling list information and bit-rate list information.
As shown in
The reception terminal may determine combination information of sampling rate types and bit-rate types by using at least one of background noise, audio quality, audio tone of an audio signal including the voice signal.
As shown in
After operation S302, the reception terminal may transmit the determined combination determination information to the transmission terminal in operation S304. Thereafter, when the transmission terminal transmits a voice signal compressed according to the combination determination information, the reception terminal may receive the compressed voice signal and restore and output a voice signal.
Referring to
The payload header may be received in a form of a VoIP packet in which at least one of an RTP header, a UDP header, and an IP header are added. As shown in
The control unit 410 may control the combination determination unit 420 to determine a combination of a sampling rate type and a bit rate type, according to the reception of the sampling- and bit-rate request information.
The combination determination unit 420 may determine at least one combination of sampling types determined from the sampling list information and bit-rate types determined from the bit-rate list information, according to the sampling- and bit-rate request information.
The combination determination unit 420 may separate the payload header, the RTP header, the UDP header, and the IP header from the received VoIP packet, extract the sampling list information and the bit-rate list information from the separated payload header, and determine the at least one combination by using the extracted sampling list information and bit-rate list information.
As shown in
The combination determination unit 420 may determine at least one combination of sampling rate types and bit-rate types by using at least one of background noise, audio quality, audio tone of an audio signal including the voice signal.
As shown in
The control unit 410 may control the determined combination determination information to be transmitted to the transmission terminal. According to this, the interface unit 400 may transmit the combination determination information to the transmission terminal.
Such a negotiation method between a transmission terminal and a reception terminal provides maximum flexibility to the transmission terminal in the selection of a bit rate and a sampling rate to be used for a call but has several important restrictions.
First, bit rates used for a call service are influenced by a fare system and are usually assigned to a set of adjacent bit rates such as 13.2, 16.4, and 24.4 Kbps rather than separated to, for example, 7.2, 24.4, and 48 Kbps. That is, bit rates may be assigned in a specific range unit, and a bit rate may be adjusted within this range and used according to a transmission condition. In this situation, rather than negotiating individual bit rates one by one, it may be efficient that a range of a minimum bit rate and a maximum bit rate to be used for a call is negotiated.
Herein, the br-send and br-recv messages indicate mutually opposite directions in view of the transmission terminal and the reception terminal. In order for a call negotiation to gradually converge according to message exchanges, a bit-rate range of a br-send message transmitted by the reception terminal should be a subset of a bit-rate range offered in a br-recv message transmitted by the transmission terminal. In addition, a bit-rate range of a br-recv message transmitted by the reception terminal should be a subset of a bit-rate range offered in a br-send message transmitted by the transmission terminal.
Another restriction of the basic negotiation method is a problem that, when a sampling rate has been negotiated but a transmission terminal needs to adjust the sampling rate due to an unpredicted characteristic of an input signal or background noise, the adjustment should be renegotiated by exchanging messages with a counterpart terminal. If a characteristic of an input signal or background noise is changed at a high speed, even when the transmission terminal continuously tries renegotiation, the transmission terminal may not meet a characteristic of a currently inputted signal. Even in this case, it may be efficient that a range of a minimum sampling rate and a maximum sampling rate to be used for a call is negotiated at once in a call negotiation step.
Table 5 includes the definition of these messages. bw, bw-send and bw-recv may be used for negotiation of individual sampling rates such as an NB, a WB, an SWB, and an FB besides sampling rates in a range unit.
In
In
In
In
The methods according to exemplary embodiments may be implemented as hardware or software or as a combination of hardware and software. When the methods are implemented as software, a computer-readable storage medium for storing one or more programs (software modules) may be provided. The one or more programs stored in the computer-readable storage medium are configured for execution by one or more processors in an electronic device. The one or more programs include instructions of commanding the electronic device to execute the methods according to embodiments.
These programs (software modules or software) may be stored in a nonvolatile memory including random access memory (RAM) and flash memory, read-only memory (ROM), electrically erasable programmable ROM (EEPROM), a magnetic disc storage device, compact disc-ROM (CD-ROM), a digital versatile disc (DVD), another type of optical storage device, or a magnetic cassette. Alternatively, the programs may be stored in a memory including some or all thereof. In addition, each of the memories described above may be plural in number.
Alternatively, the programs may be stored in an attachable storage device which can access the electronic device through a communication network including the Internet, an intranet, a local area network (LAN), a wide LAN (WLAN), and a storage area network (SAN), taken alone or in combination. This storage device may access the electronic device through an external port. Alternatively, a separate storage device on the communication network may access a portable electronic device.
Number | Date | Country | Kind |
---|---|---|---|
10-2014-0153191 | Nov 2014 | KR | national |
This application is a Continuation of U.S. application Ser. No. 16/569,157, filed on Sep. 12, 2019, which is a Continuation of U.S. application Ser. No. 15/305,944, filed on Mar. 20, 2017, which issued as U.S. Pat. No. 10,431,234 and is a National Stage of International Application No. PCT/KR2015/006330 filed on Jun. 22, 2015, which claims priority to Korean Patent Application No. 10-2014-0153191, filed on Nov. 5, 2014, in the Korean Intellectual Property Office, the entire disclosures of which are incorporated by reference herein for all purposes.
Number | Name | Date | Kind |
---|---|---|---|
4589130 | Galand | May 1986 | A |
4700361 | Todd | Oct 1987 | A |
5961589 | Hansen | Oct 1999 | A |
6011846 | Rabipour et al. | Jan 2000 | A |
6122618 | Park | Sep 2000 | A |
6384759 | Snyder | May 2002 | B2 |
6424636 | Seazholtz et al. | Jul 2002 | B1 |
6704705 | Kabal | Mar 2004 | B1 |
6704706 | Wu et al. | Mar 2004 | B2 |
6944221 | Keesman | Sep 2005 | B1 |
7088276 | Wegener | Aug 2006 | B1 |
7953604 | Mehrotra et al. | May 2011 | B2 |
8224659 | Lee | Jul 2012 | B2 |
8315861 | Miseki | Nov 2012 | B2 |
8589151 | Chamberlain | Nov 2013 | B2 |
8630864 | Ragot et al. | Jan 2014 | B2 |
8683013 | Major et al. | Mar 2014 | B2 |
8781078 | Fertner | Jul 2014 | B2 |
8942243 | Wänstedt et al. | Jan 2015 | B2 |
11705137 | Lee et al. | Jul 2023 | B2 |
20020001340 | Shenoi | Jan 2002 | A1 |
20020172193 | Preston et al. | Nov 2002 | A1 |
20040128125 | Makinen et al. | Jul 2004 | A1 |
20040243400 | Klinke | Dec 2004 | A1 |
20050034168 | Beveridge | Feb 2005 | A1 |
20050226426 | Oomen | Oct 2005 | A1 |
20060262831 | Kline et al. | Nov 2006 | A1 |
20070053446 | Spilo | Mar 2007 | A1 |
20070277078 | Yamanashi et al. | Nov 2007 | A1 |
20080232456 | Terashima et al. | Sep 2008 | A1 |
20090180518 | Ishii et al. | Jul 2009 | A1 |
20090234645 | Bruhn | Sep 2009 | A1 |
20090286472 | Eidenschink | Nov 2009 | A1 |
20110191111 | Chu et al. | Aug 2011 | A1 |
20120010880 | Nagel | Jan 2012 | A1 |
20120106451 | Enström et al. | May 2012 | A1 |
20120185600 | Belling et al. | Jul 2012 | A1 |
20120250762 | Kaye et al. | Oct 2012 | A1 |
20120323583 | Miyasaka et al. | Dec 2012 | A1 |
20130041673 | Nagel et al. | Feb 2013 | A1 |
20130132099 | Oshikiri et al. | May 2013 | A1 |
20130173274 | Kuntz | Jul 2013 | A1 |
20130230176 | Virette et al. | Sep 2013 | A1 |
20130268265 | Jeong et al. | Oct 2013 | A1 |
20130317811 | Grancharov et al. | Nov 2013 | A1 |
20140105041 | Swaminathan et al. | Apr 2014 | A1 |
20140297271 | Geiser | Oct 2014 | A1 |
20140330415 | Ramo et al. | Oct 2014 | A1 |
20150127775 | Munagala et al. | May 2015 | A1 |
20150149121 | Ahmed | May 2015 | A1 |
20160012825 | Kjoerling | Jan 2016 | A1 |
20160027446 | Purnhagen | Jan 2016 | A1 |
Number | Date | Country |
---|---|---|
101263554 | Sep 2008 | CN |
101743725 | Jun 2010 | CN |
102150204 | Aug 2011 | CN |
102985968 | Mar 2013 | CN |
103477650 | Dec 2013 | CN |
2004309686 | Nov 2004 | JP |
2008139781 | Jun 2008 | JP |
WO2008108379 | Sep 2008 | JP |
2008233436 | Oct 2008 | JP |
2008108379 | Sep 2008 | WO |
2010117326 | Oct 2010 | WO |
Entry |
---|
International Search Report and Written Opinion (PCT/ISA/210 & PCT/ISA/237) dated Oct. 27, 2015, issued by the International Search Authority in counterpart International Application No. PCT/KR2015/006330. |
Jeremy Colandairaj et al., “A Co-Design Solution for Wireless Feedback Control”, Proceedings of the 2007 IEEE International Conference on Networking, Sensing and Control, London, UK, Apr. 15, 2007-Apr. 17, 2007, pp. 404-409, XP031178344, (6 pages total). |
Notice of Allowance issued in prior U.S. Appl. No. 15/305,944 dated May 21, 2019. |
Final Office Action issued in prior U.S. Appl. No. 15/305,944 dated Mar. 5, 2019. |
Communication dated Jan. 16, 2020 issued by the Indonesian Intellectual Property Office in counterpart Indonesian Application No. P00201703240. |
Communication dated Apr. 3, 2020 issued by the State Intellectual Property Office of P.R. China in counterpart Chinese Application No. 201580072483.9. |
Communication dated Oct. 5, 2020 issued by the Korean Intellectual Property Office in counterpart Korean Application No. 10-2014-0153191. |
Communication dated Aug. 12, 2020 issued by the Vietnam Intellectual Property Office in counterpart Vietnam Application No. 1-2017-01853. |
Communication dated Oct. 23, 2017 by the European Patent Office in counterpart European Patent Application No. 15783098.5. |
Nokia et al., “AMR-WB extension for high audio quality”, TSG-SA WG4#24 meeting, Nov. 11, 2002-Nov. 15, 2002, Redmond, USA, SA4-020658, pp. 1-5, XP050637712, (5 pages total). |
Non-Final Office Action issued in prior U.S. Appl. No. 15/305,944 dated Jul. 26, 2018. |
Notice of Allowance issued in parent U.S. Appl. No. 16/569,157 dated Mar. 3, 2021. |
Communication dated Jan. 4, 2022 issued by the Korean Intellectual Property Office in counterpart Korean Application No. 10-2021-0147160. |
“Feasibility study on EVS audio bandwidth”, Telefon AB LM Ericsson, 3GPP TSG-SA WG4 Meeting #52, Jan. 24, 2009, 5 total pages. |
Office Action dated Sep. 1, 2023, issued by China National Intellectual Property Administration in Chinese Patent Application No. 202110762746.6. |
Notice of Allowance dated Nov. 5, 2023, issued by China National Intellectual Property Administration in Chinese Patent Application No. 202110762752.1. |
Number | Date | Country | |
---|---|---|---|
20210295856 A1 | Sep 2021 | US |
Number | Date | Country | |
---|---|---|---|
Parent | 16569157 | Sep 2019 | US |
Child | 17338818 | US | |
Parent | 15305944 | US | |
Child | 16569157 | US |