1. Technical Field
The present disclosure relates to an arithmetic-logic unit (ALU) for digital signal processor (DSP), specifically for processing audio signals.
2. Description of the Related Art
With reference to
The content of the discarded, less significant m−1 bit is not normally accessible and cannot be used by subsequent instructions.
In digital processing of signals by a DSP, the truncation or rounding performed in the ALU at each multiplication operation generates a distortion of the signal. The filtering, and more generally signal processing algorithms, generally entail a significant number of multiplications and truncations of the signal. Specifically, an FIR filter entails a number of multiplications proportional to the number of coefficients, while an IIR filter entails a theoretically infinite number of multiplications.
In the latter case, as well as distortion, the non-linearity introduced by the truncation may give rise to limit cycles and to the generation of spur tones even in the absence of variation of the input signal.
By adding an appropriate noise (known as “dither”) before performing truncation, a “linearization” of the truncation may be achieved, thereby avoiding distortion of the signal. This is counterbalanced by an increase in the correlated basic noise which however, especially in the case of acoustic signals, proves less psycho-acoustically disturbing to the ear than the distortion. For IIR filters, in addition, the linearity permits the elimination of limit cycles.
Since the entire word length of the result of multiplication is not normally available for subsequent processing, the solutions generally adopted entail the use of algorithms which increase the precision of the operations, if possible making use of the longer word length of the accumulator, or of the DSP, which has a considerably longer word length than that of the signal. This way the truncation remains confined to the less significant bit at negligible levels in relation to the signal dynamics. At the end of the processing, since the word size of the signal must be restored, an explicit addition of the dither and truncation of the final result of processing is performed.
For example, a typical sizing using this solution is that in which an input and output signal of 24 effective bits corresponds to the use of a 32 bit, fixed-point DSP. Considering, for example, assigning 3 bits to internal dynamics, the input signal will initially be multiplied by 2̂5 leaving 5 less significant bits to contain the distortions caused by truncation. At the end of 32 bit processing, these 5 less significant bits will be discarded from the final result, after having added an appropriate dither, dividing the signal by 2̂5 (in this example the dither has a 5 or 6 bit word, depending on whether its Probability Density Function or “PDF” is rectangular or triangular.
An embodiment seeks to address the problems caused by non-linearity of the truncation or rounding operation intrinsic to every multiplication operation performed in an ALU, but without the need to use algorithms which increase the precision of the operations and/or the use of DSP with word lengths significantly longer than that of the signal.
In an embodiment, an arithmetic-logic unit to process audio signals comprises: a multiplier configured to receive in input a first and a second signal and to supply in output a third signal which is the result of the multiplication of the first and second signals; a dither generator configured to generate a dither signal; an adder coupled to an output of the multiplier and configured to sum the third signal and the dither signal and to supply in output a fourth signal; and a bit reducer coupled to an output of the adder and configured to truncate or round the fourth signal. In an embodiment, the arithmetic-logic unit has a set word width, and the dither generator is configured to generate the dither signal with a word width equal or greater by one unit, to a number of bits which the third signal has in excess of the set word width. In an embodiment, the adder is configured to align the dither signal to a least significant bit of the third signal. In an embodiment, the dither signal has a rectangular, uniform distribution. In an embodiment, the dither signal has a triangular distribution. In an embodiment, the bit reducer supplies in output a fifth signal having a word width equal to a set word width of the arithmetic-logic unit.
In an embodiment, a digital signal processor to process audio signals comprises: a multiplier configured to receive in input a first and a second signal and to supply in output a third signal which is the result of the multiplication of the first and second signals; a dither generator configured to generate a dither signal; an adder coupled to an output of the multiplier and configured to sum the third signal and the dither signal and supply in output a fourth signal; and a bit reducer coupled to an output of the adder and configured to truncate or round the fourth signal. In an embodiment, the processor is configured to execute a portion of code containing at least one multiplication instruction and execution of the multiplication instruction includes causing the adder to sum the third signal and the dither signal. In an embodiment, execution of the multiplication instruction includes causing the bit reducer to truncate or round the fourth signal.
In an embodiment, a method of processing digital signals using a digital signal processor comprises: multiplying, using the digital signal processor, a first digital signal and a second digital signal to produce a product; adding, using the digital signal processor, a dither signal to the product to produce a sum having a number of bits; and truncating or rounding, using the digital signal processor, the sum to produce an output having a number of bits less than the number of bits of the sum. In an embodiment, adding the dither signal to the product is automatically performed when the multiplication operation is performed. In an embodiment, the dither signal is a null signal.
In an embodiment, an arithmetic-logic unit comprises: a multiplier configured to multiple a first and a second signal to produce a product; a dither generator configured to generate a dither signal; an adder configured to add the product and the dither signal to produce a sum having a number of bits; and a bit reducer configured to reduce a number of least significant bits of the sum to produce an output having a number of bits less than the number of bits of the sum. In an embodiment, the bit reducer is configured to produce the output by truncating the sum. In an embodiment, the bit reducer is configured to produce the output by rounding the sum. In an embodiment, the arithmetic-logic unit has a fixed word width, and the dither generator is configured to generate the dither signal with a word width equal or greater by one unit, to a number of bits which the product has in excess of the fixed word width. In an embodiment, the adder is configured to align the dither signal to a least significant bit of the product. In an embodiment, the dither signal has a rectangular distribution. In an embodiment, the bit reducer supplies in output a signal having a word width equal to a fixed word width of the arithmetic-logic unit.
In an embodiment, a system to process digital audio signals comprises: a multiplier configured to multiple first and second signals to produce a product; a dither generator configured to generate a dither signal; an adder configured to add the product and the dither signal to produce a sum having a number of bits; and a bit reducer configured to reduce a number of least significant bits in the sum. In an embodiment, the system comprises at least one processor configured to execute a portion of code containing at least one multiplication instruction, wherein execution of the multiplication instruction includes causing the adder to add the product and the dither signal. In an embodiment, execution of the multiplication instruction further includes causing the bit reducer to reduce the number of least significant bits in the sum.
In an embodiment, a method of processing digital signals using a digital signal processor comprises: multiplying, using the digital signal processor, a first digital signal and a second digital signal to produce a product; adding, using the digital signal processor, a dither signal to the product to produce a sum having a number of bits; and reducing, using the digital signal processor, a number of least significant bits in the sum to produce an output having a number of bits less than the number of bits of the sum. In an embodiment, the reducing comprises truncating the sum. In an embodiment, the reducing comprises rounding the sum. In an embodiment, the dither signal is a null signal.
In an embodiment, a tangible computer-readable medium's contents cause at least one digital signal processor to perform a method, the method comprising: multiplying a first digital signal and a second digital signal to produce a product; adding a dither signal to the product to produce a sum having a number of bits; and reducing a number of least significant bits in the sum to produce an output having a number of bits less than the number of bits of the sum. In an embodiment, the reducing comprises rounding the sum. In an embodiment, the adding comprises aligning the dither signal with a least significant bit of the product.
Further characteristics and advantages of embodiments of arithmetic logic units according to the disclosure will be more evident from the description below, made by way of a non-limiting example, of some preferred embodiments with reference to the attached figures, wherein:
In the following description, numerous specific details are given to provide a thorough understanding of embodiments. The embodiments can be practiced without one or more of the specific details, or with other methods, components, materials, etc. In other instances, well-known structures, materials, or operations are not shown or described in detail to avoid obscuring aspects of the embodiments.
Reference throughout this specification to “one embodiment” or “an embodiment” means that a particular feature, structure, or characteristic described in connection with the embodiment is included in at least one embodiment. Thus, the appearances of the phrases “in one embodiment” “according to an embodiment” or “in an embodiment” and similar phrases in various places throughout this specification are not necessarily all referring to the same embodiment. Furthermore, the particular features, structures, or characteristics may be combined in any suitable manner in one or more embodiments.
The headings provided herein are for convenience only and do not interpret the scope or meaning of the embodiments.
With reference to
As recalled above, dither signal is taken to mean a noise signal with a suitable distribution, which is voluntarily added to digital signals so as to minimize the distortion introduced by truncation or rounding in the case in which the said signals are re-quantized.
According to one embodiment, given a set word width for the arithmetic-logic unit, which may be a pre-set word width, the dither signal has a word width equal, or greater by one unit, to the number of bits which the third signal M, result of the multiplication, has in excess of said set word length.
In the example embodiment of an ALU 200 shown in
According to one embodiment, the dither signal D has a rectangular, uniform distribution.
In another embodiment, the dither signal D has a triangular distribution, that is derived from the convolution of two rectangular, uniform dithers.
The bit reducer 16 is configured to supply a fifth signal in output, MR, having a word width equal to that set for the arithmetic-logic unit.
As a result, a suitable dither D is implicitly and automatically added to the result of the multiplication inside the ALU itself, before the truncation or rounding. In other words, in the ALU according to an embodiment, the summing operation of the dither and the subsequent truncation or rounding of the word is automatically performed at the same time as the performance of the multiplication. No register or accumulator is therefore needed to access the n+m−1 bit result of the multiplication, precisely because the dithering and truncation operations are considered a single operation: the dither is added solely for the sake of the subsequent truncation and the useful data is that at n bit available after truncation.
In yet other terms, in an embodiment a programmer who programs the digital signal processor code has no need to compile a special instruction to use the dithering function. In fact, the multiplication instruction automatically also produces the summation operation of the result of the multiplication and the dither signal.
Advantageously, every multiplication instruction may also automatically comprise a truncation or rounding operation of the sum signal.
The arithmetic-logic unit (ALU) according to an embodiment makes it possible to use in the ALU a word length slightly greater than that corresponding to the performance required, specifically for an audio signal, with a significant saving in the number of circuit elements (and therefore silicon area) and consumption, at the same performance.
In addition, in an embodiment the linearization of the signal obtained makes it possible to create limit cycle-free type IIR filters without the need to add a dither to the input signal.
It is worth noting that, advantageously, in an embodiment the use of a traditional ALU may be simulated, for example if a deterministic result is needed, by merely attributing the dither signal an identically null code, in other words, composed of zeroes, and without making any circuit modifications.
The proposed embodiment, which permits a reduction of consumption and area without relinquishing high level performance, is particularly useful for battery powered applications such as mobile phones, MP3 players and PDA.
In
With reference to
The graph in
To achieve this performance with a traditional ALU a word of at least 28 bits would be needed and therefore the entire ALU and registers would need to be 28 or more bits.
A person skilled in the art may make modifications, adaptations and replacements of elements with others functionally equivalent to the embodiments described above so as to satisfy contingent requirements while remaining within the scope of protection of the following claims. Each of the characteristics described as pertaining to a possible embodiment may be realized independently of the other embodiments described.
The foregoing detailed description has set forth various embodiments of the devices and/or processes via the use of block diagrams and examples. Insofar as such block diagrams and examples contain one or more functions and/or operations, it will be understood by those skilled in the art that each function and/or operation within such block diagrams, flowcharts, or examples can be implemented, individually and/or collectively, by a wide range of hardware, software, firmware, or virtually any combination thereof. In one embodiment, the present subject matter may be implemented via Application Specific Integrated Circuits (ASICs). In one embodiment, the present subject matter may be implemented via one or more digital signal processors executing, for example, instructions stored on one or more memories. However, those skilled in the art will recognize that the embodiments disclosed herein, in whole or in part, can be equivalently implemented in standard integrated circuits, as one or more computer programs executed by one or more computers (e.g., as one or more programs running on one or more computer systems), as one or more programs executed by on one or more controllers (e.g., microcontrollers) as one or more programs executed by one or more processors (e.g., microprocessors), as firmware, or as virtually any combination thereof, and that designing the circuitry and/or writing the code for the software and or firmware would be well within the skill of one of ordinary skill in the art in light of the teachings of this disclosure.
When logic is implemented as software and stored in memory, logic or information can be stored on any computer-readable medium for use by or in connection with any processor-related system or method. In the context of this disclosure, a memory is a computer-readable medium that is an electronic, magnetic, optical, or other physical device or means that contains or stores a computer and/or processor program. Logic and/or the information can be embodied in any computer-readable medium for use by or in connection with an instruction execution system, apparatus, or device, such as a computer-based system, processor-containing system, or other system that can fetch the instructions from the instruction execution system, apparatus, or device and execute the instructions associated with logic and/or information.
In the context of this specification, a “computer-readable medium” can be any element that can store the program associated with logic and/or information for use by or in connection with the instruction execution system, apparatus, and/or device. The computer-readable medium can be, for example, but is not limited to, an electronic, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus or device. More specific examples (a non-exhaustive list) of the computer readable medium would include the following: a portable computer diskette (magnetic, compact flash card, secure digital, or the like), a random access memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM, EEPROM, or Flash memory), a portable compact disc read-only memory (CDROM), digital tape. Note that the computer-readable medium could even be paper or another suitable medium upon which the program associated with logic and/or information is printed, as the program can be electronically captured, via for instance optical scanning of the paper or other medium, then compiled, interpreted or otherwise processed in a suitable manner if necessary, and then stored in memory.
The various embodiments described above can be combined to provide further embodiments. All of the U.S. patents, U.S. patent application publications, U.S. patent application, foreign patents, foreign patent application and non-patent publications referred to in this specification and/or listed in the Application Data Sheet are incorporated herein by reference, in their entirety. Aspects of the embodiments can be modified, if necessary to employ concepts of the various patents, application and publications to provide yet further embodiments.
These and other changes can be made to the embodiments in light of the above-detailed description. In general, in the following claims, the terms used should not be construed to limit the claims to the specific embodiments disclosed in the specification and the claims, but should be construed to include all possible embodiments along with the full scope of equivalents to which such claims are entitled. Accordingly, the claims are not limited by the disclosure.
Number | Date | Country | Kind |
---|---|---|---|
BS2008A000185 | Oct 2008 | IT | national |