The present disclosure relates generally to communication systems, and more particularly, to concepts and techniques for reducing audio artifacts.
Peer-to-peer networks are commonly used for connecting wireless nodes via adhoc connections. These networks differ from the traditional client-server model where communications are usually with a central server. A peer-to-peer network has only equal peer nodes that communicate directly with one another. Such networks are useful for many purposes. A peer-to-peer network may be used, for example, as a consumer electronic wire replacement system for short range or indoor applications. These networks are sometimes referred to as Wireless Personal Area Networks (WPAN) and are useful for efficiently transferring video, audio, voice, text, and other media between wireless nodes over a short distance. A WPAN may provide connectivity for nodes in a home or a small office or may be used to provide connectivity for nodes carried by a person. In a typical scenario, a WPAN may provide connectivity for nodes within a range on the order of tens of meters.
In the applications requiring any use of audio signals (e.g., cell phone, headset), many audio codecs fail to remove the presence of audible quantization noise when the audio signal level is close to zero. One way to mitigate this problem is to use a non-uniform quantizer model.
Depending on hardware limitations, however, such non-linear quantizers may not have sufficient resolution for low-amplitude signals to suppress audible quantization noise. This problem is particularly significant when the codec employs noise-shaping techniques (e.g., sigma-delta modulation) using lower-than-adequate over-sampling ratios (OSR). The quantization noise resulting from low OSRs is particularly audible to a listener in the absence of adequate loudness levels in the signal of interest.
Consequently, there exists a need for a technique to attenuate audio artifacts in communication nodes.
According to an aspect of the disclosure, an apparatus for processing signals includes a receiver configured to receive an audio signal having a plurality of audio artifacts, and an audio circuit configured to reduce the audio artifacts during at least a portion of a time period as a function of an energy level of the audio signal during that time period.
According to another aspect of the disclosure, a method for processing signals includes receiving an audio signal having a plurality of audio artifacts, and reducing the audio artifacts during at least a portion of a time period as a function of an energy level of the audio signal during that time period.
According to a further aspect of the disclosure, an apparatus for processing signals includes means for receiving an audio signal having a plurality of audio artifacts, and means for reducing the audio artifacts during at least a portion of a time period as a function of an energy level of the audio signal during that time period.
According to yet a further aspect of the disclosure, a computer program product for processing signals includes computer-readable medium comprising instructions executable to receive an audio signal having a plurality of audio artifacts, and reduce the audio artifacts during at least a portion of a time period as a function of an energy level of the audio signal during that time period.
According to another aspect of the disclosure, a headset includes a receiver configured to receive an audio signal having a plurality of audio artifacts, an audio circuit configured to reduce the audio artifacts during at least a portion of a time period as a function of an energy level of the audio signal during that time period, and a transducer configured to provide an audible output based on the audio signal processed by the audio circuit.
According to yet another aspect of the disclosure, a watch includes a receiver configured to receive an audio signal having a plurality of audio artifacts, an audio circuit configured to reduce the audio artifacts during at least a portion of a time period as a function of an energy level of the audio signal during that time period, and a display configured to provide a visual output based on the audio signal processed by the audio circuit.
According to yet a further aspect of the disclosure, a medical monitor includes a receiver configured to receive an audio signal having a plurality of audio artifacts, wherein the audio signal is generated by a sensor, and an audio circuit configured to reduce the audio artifacts during at least a portion of a time period as a function of an energy level of the audio signal during that time period.
It is understood that other aspects of the invention will become readily apparent to those skilled in the art from the following detailed description, wherein it is shown and described only various aspects of the invention by way of illustration. As will be realized, the invention is capable of other and different aspects and its several details are capable of modification in various other respects, all without departing from the scope of the invention. Accordingly, the drawings and detailed description are to be regarded as illustrative in nature and not as restrictive.
These and other sample aspects of the disclosure will be described in the detailed description and the appended claims that follow, and in the accompanying drawings, wherein:
In accordance with common practice the various features illustrated in the drawings may be simplified for clarity. Thus, the drawings may not depict all of the components of a given apparatus (e.g., node) or method. In addition, like reference numerals may be used to denote like features throughout the specification and figures.
Various aspects of the disclosure are described below. It should be apparent that the teachings herein may be embodied in a wide variety of forms and that any specific structure, function, or both being disclosed herein are merely representative. Based on the teachings herein one skilled in the art should appreciate that an aspect disclosed herein may be implemented independently of any other aspects and that two or more of these aspects may be combined in various ways. For example, an apparatus may be implemented or a method may be practiced using any number of the aspects set forth herein. In addition, such an apparatus may be implemented or such a method may be practiced using other structure, functionality, or structure and functionality in addition to or other than one or more of the aspects set forth herein. An aspect may comprise one or more elements of a claim.
Several aspects of a receiver will now be presented. The receiver may be part of a mobile or fixed node, such as a phone (e.g., cellular phone), a personal digital assistant (PDA), an entertainment device (e.g., a music or video device), a headset (e.g., headphones, an earpiece, etc.), a microphone, a medical sensing device (e.g., a biometric sensor, a heart rate monitor, a pedometer, an EKG device, a smart bandage, etc.), a user I/O device (e.g., a watch, a remote control, a light switch, a keyboard, a mouse, etc.), a medical monitor that may receive data from the medical sensing device, an environment sensing device (e.g., a tire pressure monitor), a computer, a point-of-sale device, an entertainment device, a hearing aid, a set-top box, or any other suitable device. The node may include various components in addition to the receiver. By way of example, a wireless headset may include a transducer configured to provide an audio output to a user, a wireless watch may include a user interface configured to provide an indication to a user, and a wireless sensing device may include a sensor configured to provide an audio output to a user.
The receiver may also be part of an access device (e.g., a Wi-Fi access point) that provides backhaul services to other nodes. Such an access device may provide, by way of example, connectivity to another network (e.g., a wide area network such as the Internet or a cellular network) via a wired or wireless communication link.
In many of the applications described above, the receiver may be part of a node that transmits as well as receives. Such a node would therefore require a transmitter, which may be a separate component or integrated with the receiver into a single component known as a “transceiver.” As those skilled in the art will readily appreciate, the various concepts described throughout this disclosure are applicable to any suitable receiver function, regardless of whether the receiver is a stand-alone node, integrated into a transceiver, or part of a node in a wireless communications system.
In the following detailed description, various aspects of a receiver will be described for reducing or removing signal artifacts (e.g., audio artifacts) from received signals. Some aspects of the receiver will be described in the context of a WPAN supporting Ultra-Wideband (UWB), but as those skilled in the art will readily appreciate that the various aspects presented throughout this disclosure are likewise applicable to receivers for other radio technologies including Bluetooth, WiMax, and Wi-Fi, just to name a few. These aspects may also be extended to wired technologies including, by way of example, cable modem, Digital Subscriber Line, (DSL), Ethernet, and any other suitable communications technology.
An example of a UWB WPAN with wireless nodes that may benefit by incorporating various aspects of a receiver presented throughout this disclosure is shown in
The WPAN 100 is shown with a laptop computer 102 in communication with various other wireless nodes 104. In this example, the computer 102 may receive digital photos from a digital camera 104A, send documents to a printer 104B for printing, communicate with a smart band-aid 104C, synch-up with e-mail on a Personal Digital Assistant (PDA) 104D, transfer music files to a digital audio player (e.g., MP3 player) 104E, back up data and files to a mass storage device 104F, set the time on a watch 104G, and receive data from a sensing device 104H (e.g., a medical device such as a biometric sensor, a heart rate monitor, a pedometer, an EKG device, etc.). Also shown is a headset 106 (e.g., headphones, earpiece, etc.) that receives audio from the digital audio player 104E.
In one configuration of the WPAN 100, the computer 102 provides an access point to a Wide Area Network (WAN) (i.e., a wireless network covering a regional, nationwide, or even a global region). One common example of a WAN is the Internet. Another example of a WAN is a cellular network that supports CDMA2000, a telecommunications standard that uses Code Division Multiple Access (CDMA) to send voice, data, and signaling between mobile subscribers. Another example of a WWAN is a cellular network that provides broadband Internet access to mobile subscribers, such as Evolution-Data Optimized (EV-DO) or Ultra Mobile Broadband (UMB), both of which are part of the CDMA2000 family of air interface standards. Alternatively, or in addition to, the computer 102 may have a UWB connection to an Ethernet modem, or some other interface to a Local Area Network (LAN) (i.e., a network generally covering tens to a few hundred meters in homes, offices buildings, coffee shops, transportation hubs, hotels, etc.).
Various aspects of a receiver will now be presented with reference to
The receiver 200 is shown with a wireless interface 202 that implements the physical (PHY) layer and the Medium Access Control (MAC) layer. The PHY layer implements all the physical and electrical specifications to interface the receiver to the wireless medium. More specifically, the PHY layer is responsible for demodulating an RF carrier to recover an audio signal, as well as providing other processing functions such as analog/digital conversion, timing and frequency estimation, channel estimation, forward error correction (e.g., Turbo decoding), etc. The MAC layer manages the audio content that is across the PHY layer making it possible for several nodes to communicate with the receiver 200. The implementation of the wireless interface 202 is well within the capabilities of one skilled in the art, and therefore, will not be described any further.
The audio signal recovered by the wireless interface 202 may be encoded according to a given audio file format or streaming audio format. In such a case, an audio decoder 204 may be used to reconstruct the audio signal from the encoded transmission recovered by the wireless interface 202. In one example of a receiver, the audio decoder 204 may be configured to reconstruct an audio signal encoded with a backward adaptive gain ranged algorithm; however, the audio decoder 204 may be configured to handle other encoding schemes. Those skilled in the art will be readily able to implement the appropriate audio decoder 204 for any particular application. The audio decoder 204 may be a stand-alone component as shown in
The decoded audio signal may be provided to a gain controller 210 for volume control. The gain controller 210 may be any type of device (e.g., an amplifier) that is capable of controlling the power or amplitude of the audio signal. A gain control input to the gain controller 210 allows the volume of the audio signal to be adjusted.
The output from the gain controller 210 may be provided to an attenuator 216. The attenuator 216 may be any electronic device that is capable of reducing the amplitude or power of a signal without appreciably distorting its waveform (e.g., a multiplier). In a manner to be described in greater detail later, the attenuator 216 is used to attenuate the audio signal during audio silence, which tends to reduce audio artifacts.
The output from the attenuator 216 may be provided to an upsampler 218. The upsampler 218 is configured to sample the audio signal at a frequency much greater than the Nyquist frequency (two times the bandwidth of the audio signal). This process tends to reduce aliasing, which might otherwise distort the audio signal. The sampling frequency may be fixed or adaptively controlled for performance optimization.
The upsampled audio signal may be provided to a noise shaping filter, such as a sigma delta modulator (SDM) 220. The SDM 220 reduces quantization noise in the audio band by distributing it over a larger spectrum. The distribution of the quantization noise may be shaped with reduced noise at low frequencies and increased noise at higher frequencies, where it can be filtered.
The output from the SDM 220 may be provided to a Digital-to-Analog Converter (DAC) 222. The DAC converts the audio signal into an analog signal and provides the audio power amplifier functionality to drive a load 224 (e.g., a speaker).
As discussed earlier, an attenuator 216 is used to attenuate the audio signal during periods of audio silence. In one configuration of a receiver, this may be accomplished with a Signal Energy Detector (SED) 206, a threshold generator 212, a scaler generator 214 and a comparator 208.
The comparator 208, in addition to receiving the signal outputted from the decoder 204, receives a feedback signal that has already been processed downstream by the gain controller 210, the upsampler 218, and the SDM 220. The comparator 208 compares the decoded signal with the feed back signal and generates a quality metric signal based on the difference between the two signals. The comparator 208 then transmits the quality metric signal to the threshold generator 212.
The threshold generator 212 receives both the gain control signal and the quality metric signal, and, based on the two signals, generates an energy threshold signal representing a level of energy below which the received signal is designated as “low energy” or “silence” (e.g., audio silence). The threshold generator 212 then transmits the energy threshold signal to the scaler generator 214.
The SED 206 detects and measures an energy level of the signal received from the decoder 204. The SED 206 may be any device capable of sensing signal power or amplitude levels at high resolution. Based on the detected signal energy levels, the SED 206 generates an energy level signal, and transmits the energy level signal to the scaler generator 214.
The scaler generator is configured to receive the energy threshold signal generated by the threshold generator 214 and the energy level signal generated by the SED 206. Based on these two signals, the scaler generator 214 maps quantization noise of the received signal during long-periods of “silence” or “low energy” to zero. This is achieved by using an estimate of, for example, sound pressure levels from previous samples, as well as short “look ahead” to ramp up quickly once the received signal attains appreciable energy. To estimate the energy levels from previous levels, the scaler generator 214 computes an average energy value (Epast) of the received signal over previous N samples. To perform the short “look ahead,” the scaler generator 214 is configured to “look ahead” M samples, and compute the average energy value over the M samples (Efuture).
The scaler generated by the scaler generator 214 has a value that is based on the comparison between the threshold signal value and the average energy values Epast and Efuture. The scaler value may be either a zero (0) or a one (1). The scaler is varied between 0 and 1 via ramping up from 0 to 1 or ramping down from 1 to 0. For example, if either of the average energy values Epast and Efuture are less than the threshold signal value, the scaler generator 214 will maintain the scaler value at 0, if it was previously at 0, or ramp down the scaler value from 1 to 0, if it was previously at 1. Similarly, if either of the average energy values Epast and Efuture is equal to or greater than the threshold signal value, the scaler generator 214 will maintain the scaler value at 1, if it was previously at 1, or ramp up the scaler value from 0 to 1, if it was previously at 0. The duration of the ramp up and the ramp down may be determined based on the desired quality of attenuation of the received signal. For example, the ramp up duration may be significantly shorter than the ramp down duration in order to ensure minimal loss in the received signal content.
In addition to adjusting the scaler value based on the average energy levels Epast and Efuture, the scaler generator 214 may adjust the scaler value depending on the temporal location of the received signal in a predetermined time period. For example, the scaler generator 214 may ramp down the scaler value from 1 to 0 at the beginning of the time period and ramp up the scaler value from 0 to 1 at the end of the time period.
Once the scaler generator 214 generates the appropriate scaler, it is configured to transmit the scaler to the attenuator 216. As mentioned before, the attenuator 216 attenuates the received signal based on the scaler value. For example, if the scaler value is 0, indicating low average energy level (e.g., audio silence), the received signal is attenuated to null, ensuring that no signal artifacts are transmitted through the receiver. Conversely, if the scaler value is 1, indicating a sufficient average energy level, the received signal is transmitted through the attenuator 216 without being attenuated.
An example of a signal artifact attenuation process will now be presented with reference to the flow chart illustrated in
In block 304, an energy level of the received signal is measured, and the process proceeds to block 306. For example, as shown in
In block 306, a quality metric of the received signal is measured, and the process proceeds to block 308. For example, as shown in
In block 308, a threshold signal is generated, and the process proceeds to block 310. For example, in
In block 310, an average energy level is computed, and the process proceeds to block 312. For example, in
In block 312, a determination is made as to whether the computed average energy level value is less than the threshold signal value. If so, the process proceeds to block 314. If not, the process proceeds to block 316.
In block 314, a scaler value is set to 0, and the process proceeds to block 318. At block 316, the scaler value is set to 1, and the process likewise proceeds to block 318.
In block 318, the received signal is attenuated based on the scaler value, and the process proceeds to block 320 where a determination is made as to whether the receiver 200 is powered off. If the receiver 200 is not powered off, the process returns to block 302. Otherwise, the process ends.
The components described herein may be implemented in a variety of ways. For example, an apparatus may be represented as a series of interrelated functional blocks that may represent functions implemented by, for example, one or more integrated circuits (e.g., an ASIC) or may be implemented in some other manner as taught herein. As discussed herein, an integrated circuit may include a processor, software, other components, or some combination thereof. Such an apparatus may include one or more modules that may perform one or more of the functions described above with regard to various figures.
As noted above, in some aspects these components may be implemented via appropriate processor components. These processor components may in some aspects be implemented, at least in part, using structure as taught herein. In some aspects a processor may be adapted to implement a portion or all of the functionality of one or more of these components.
As noted above, an apparatus may comprise one or more integrated circuits. For example, in some aspects a single integrated circuit may implement the functionality of one or more of the illustrated components, while in other aspects more than one integrated circuit may implement the functionality of one or more of the illustrated components.
In addition, the components and functions described herein may be implemented using any suitable means. Such means also may be implemented, at least in part, using corresponding structure as taught herein. For example, the components described above may be implemented in an “ASIC” and also may correspond to similarly designated “means for” functionality. Thus, in some aspects one or more of such means may be implemented using one or more of processor components, integrated circuits, or other suitable structure as taught herein.
Also, it should be understood that any reference to an element herein using a designation such as “first,” “second,” and so forth does not generally limit the quantity or order of those elements. Rather, these designations may be used herein as a convenient method of distinguishing between two or more elements or instances of an element. Thus, a reference to first and second elements does not mean that only two elements may be employed there or that the first element must precede the second element in some manner. Also, unless stated otherwise a set of elements may comprise one or more elements. In addition, terminology of the form “at least one of: A, B, or C” used in the description or the claims means “A or B or C or any combination thereof.”
Those of skill in the art would understand that information and signals may be represented using any of a variety of different technologies and techniques. For example, data, instructions, commands, information, signals, bits, symbols, and chips that may be referenced throughout the above description may be represented by voltages, currents, electromagnetic waves, magnetic fields or particles, optical fields or particles, or any combination thereof.
Those of skill would further appreciate that any of the various illustrative logical blocks, modules, processors, means, circuits, and algorithm steps described in connection with the aspects disclosed herein may be implemented as electronic hardware (e.g., a digital implementation, an analog implementation, or a combination of the two, which may be designed using source coding or some other technique), various forms of program or design code incorporating instructions (which may be referred to herein, for convenience, as “software” or a “software module”), or combinations of both. To clearly illustrate this interchangeability of hardware and software, various illustrative components, blocks, modules, circuits, and steps have been described above generally in terms of their functionality. Whether such functionality is implemented as hardware or software depends upon the particular application and design constraints imposed on the overall system. Skilled artisans may implement the described functionality in varying ways for each particular application, but such implementation decisions should not be interpreted as causing a departure from the scope of the present disclosure.
The various illustrative logical blocks, modules, and circuits described in connection with the aspects disclosed herein may be implemented within or performed by an integrated circuit (“IC”), an access terminal, or an access point. The IC may comprise a general purpose processor, a digital signal processor (DSP), an application specific integrated circuit (ASIC), a field programmable gate array (FPGA) or other programmable logic device, discrete gate or transistor logic, discrete hardware components, electrical components, optical components, mechanical components, or any combination thereof designed to perform the functions described herein, and may execute codes or instructions that reside within the IC, outside of the IC, or both. A general purpose processor may be a microprocessor, but in the alternative, the processor may be any conventional processor, controller, microcontroller, or state machine. A processor may also be implemented as a combination of computing devices, e.g., a combination of a DSP and a microprocessor, a plurality of microprocessors, one or more microprocessors in conjunction with a DSP core, or any other such configuration.
It is understood that any specific order or hierarchy of steps in any disclosed process is an example of a sample approach. Based upon design preferences, it is understood that the specific order or hierarchy of steps in the processes may be rearranged while remaining within the scope of the present disclosure. The accompanying method claims present elements of the various steps in a sample order, and are not meant to be limited to the specific order or hierarchy presented.
The steps of a method or algorithm described in connection with the aspects disclosed herein may be embodied directly in hardware, in a software module executed by a processor, or in a combination of the two. A software module (e.g., including executable instructions and related data) and other data may reside in a data memory such as RAM memory, flash memory, ROM memory, EPROM memory, EEPROM memory, registers, a hard disk, a removable disk, a CD-ROM, or any other form of computer-readable storage medium known in the art. A sample storage medium may be coupled to a machine such as, for example, a computer/processor (which may be referred to herein, for convenience, as a “processor”) such the processor can read information (e.g., code) from and write information to the storage medium. A sample storage medium may be integral to the processor. The processor and the storage medium may reside in an ASIC. The ASIC may reside in user equipment. In the alternative, the processor and the storage medium may reside as discrete components in user equipment. Moreover, in some aspects any suitable computer-program product may comprise a computer-readable medium comprising codes (e.g., executable by at least one computer) relating to one or more of the aspects of the disclosure. In some aspects a computer program product may comprise packaging materials.
The previous description is provided to enable any person skilled in the art to practice the various aspects described herein. Various modifications to these aspects will be readily apparent to those skilled in the art, and the generic principles defined herein may be applied to other aspects. Thus, the claims are not intended to be limited to the aspects shown herein, but is to be accorded the full scope consistent with the language claims, wherein reference to an element in the singular is not intended to mean “one and only one” unless specifically so stated, but rather “one or more.” All structural and functional equivalents to the elements of the various aspects described throughout this disclosure that are known or later come to be known to those of ordinary skill in the art are expressly incorporated herein by reference and are intended to be encompassed by the claims. Moreover, nothing disclosed herein is intended to be dedicated to the public regardless of whether such disclosure is explicitly recited in the claims. No claim element is to be construed under the provisions of 35 U.S.C. § 112, sixth paragraph, unless the element is expressly recited using the phrase “means for” or, in the case of a method claim, the element is recited using the phrase “step for.”
The present Application for patent claims priority to Provisional Application No. 61/083,454 entitled “METHOD AND APPARATUS FOR REDUCING AUDIO ARTIFACTS” filed Jul. 24, 2008, and assigned to the assignee hereof and hereby expressly incorporated by reference herein.
Number | Date | Country | |
---|---|---|---|
61083454 | Jul 2008 | US |