Modern processing devices such as computers, servers, mobile telephones, and storage devices are often required to perform various types of data transformation. For example, data compression is a fundamental operation in a wide variety of communication and storage systems. As is well known, data compression techniques may be lossless or lossy, and may be linear or non-linear.
Examples of lossless data compression techniques include Lempel-Ziv (LZ) compression algorithms such as LZ77 and LZ78, described in J. Ziv and A. Lempel, “A Universal Algorithm for Sequential Data Compression,” IEEE Transactions on Information Theory, 23(3), pp. 337-343, May 1977, and J. Ziv and A. Lempel, “Compression of Individual Sequences via Variable-Rate Coding,” IEEE Transactions on Information Theory, 24(5), pp. 530-536, September 1978, respectively. Another example is the Burrows-Wheeler transform, described in M. Burrows and D. Wheeler, “A block sorting lossless data compression algorithm,” Technical Report 124, Digital Equipment Corporation, 1994.
Lossy data compression techniques are generally data specific, and include, for example, perceptual audio coding (PAC) or MP3 algorithms for audio data, JPEG algorithms for image data, and MPEG algorithms for video data.
These and many other commonly-used data compression techniques are non-linear, which can create a problem in that subsequent signal processing operations performed on decompressed data are very often linear operations. As a result, data compressed using non-linear compression usually must be decompressed to recover the original data before being subject to linear signal processing operations. Typically, most signal processing operations are computation and power intensive, and therefore any reduction in the amount of data subject to signal processing will help reduce both signal processing complexity and the associated power dissipation.
Accordingly, a need exists for an improved lossless, linear data compression technique that can allow compressed data to be subject to linear signal processing operations. Such an arrangement could significantly reduce the complexity of signal processing operations while providing additional benefits in the form of reduced power dissipation, improved signal integrity and better bandwidth utilization.
One or more illustrative embodiments of the present invention meet the above-identified need by providing improved lossless, linear data compression techniques utilizing difference-based data transformation.
In one embodiment, coding circuitry comprises a difference-based encoder having a plurality of processing stages, with the difference-based encoder being configured to generate respective orders of difference from a sequence of data samples, and to output encoded data determined based on at least a selected one of the orders of difference.
By way of example, the difference-based encoder may be configured to process a set of data samples in order to compute a set of orders of difference, to determine for each of the data samples in the set of data samples a least number of bits needed to represent that data sample, to determine for each of the orders of difference in the set of orders of difference a number of bits needed to represent that order of difference, based on the least numbers of bits needed to represent the respective data samples, and to output encoded data identifying a particular one of the orders of difference selected based on the numbers of bits needed to represent the orders of difference.
Coding circuitry in one or more embodiments may additionally or alternatively comprise a difference-based decoder having a plurality of processing stages, with the difference-based decoder being configured to process encoded data comprising selected ones of a plurality of orders of difference and to reconstruct a sequence of data samples based on the selected orders of difference.
By way of example, coding circuitry in a given embodiment may comprise an encoder-decoder (“codec”) that incorporates both a difference-based encoder and a difference-based decoder, so as to allow a corresponding processing device to support bidirectional coded communication with one or more other devices.
Coding circuitry in a given embodiment can be implemented in the form of one or more integrated circuits. For example, such coding circuitry comprising one or both of a difference-based encoder and a difference-based decoder may be implemented in a processor integrated circuit of a communication device, or in a system-on-chip (SOC) integrated circuit comprising read channel circuitry of a storage device.
Illustrative embodiments of the invention will be described herein with reference to exemplary communication systems, storage systems and associated coding circuitry and difference-based transformation algorithms. It should be understood, however, that other embodiments can be implemented using a wide variety of other types and arrangements of systems, circuitry and algorithms.
The difference-based encoding implemented using encoder 104 in the present embodiment may be configured to provide, for example, lossless, linear data compression in a manner to be described in greater detail below. Such an arrangement allows linear signal processing operations to be applied to the compressed data in transmitter 106 and receiver 110, thereby reducing signal processing complexity and power consumption in the system 100, while also providing additional benefits such as improved signal integrity and better bandwidth utilization.
The communication system 100 may be implemented using multiple computers, servers, mobile telephones, storage devices or other processing devices as illustrated in
The processor integrated circuit 210 is coupled to the memory 212 and the network interface 214, and further comprises coding circuitry 220 and other circuitry 222. The other circuitry 222 may comprise, for example, a central processing unit (CPU), an arithmetic logic unit (ALU), a digital signal processor (DSP), internal processor memory, as well as other types of internal processor circuitry, in any combination.
The coding circuitry 220 further comprises a codec 225 configured to perform both difference-based encoding and decoding operations. Thus, codec 225 may be viewed as comprising both the difference-based encoder 104 and the difference-based decoder 112, such that the processing device 200-1 can perform encoding operations to generate compressed data for delivery to other processing devices over the network 108, and can also perform decoding operations on compressed data delivered to it from other processing devices over the network 108. As will be described in greater detail below, such encoding and decoding operations associated with bidirectional communication between the devices 200 may be performed in the present embodiment using a lossless, linear difference-based coding algorithm, but other algorithms can be used in other embodiments.
It should be appreciated that a given one of the processing devices 200 can therefore be both a source of data for encoding and a destination for decoded data. Thus, a given processing device 200 may incorporate elements 102, 104, 106, 110, 112 and 114 of
The term “coding circuitry” as used herein is intended to be broadly construed, so as to encompass at least one of a difference-based encoder and a difference-based decoder, and may also encompass related portions of a transmitter and a receiver. For example, portions of the transmitter 106 and receiver 110 that are utilized in performing operations relating to data compression or data decompression as described herein may be considered part of the coding circuitry of a given processing device of system 100. Such coding circuitry need not comprise a codec 225 as illustrated in
The particular configuration of network-based communication system 100 as shown in
Also, the coding circuitry 220 comprising codec 225 can be implemented in numerous other types of systems. Another embodiment of such a system, in the form of a storage system 300 comprising a data storage device that incorporates coding circuitry 220 comprising codec 225, is illustrated in
The storage system 300 as shown in
It should also be noted that the broad term “coding circuitry” as used herein is intended to be construed so as to encompass processor circuitry that implements encoding or decoding operations at least in part in the form of software that is executed in a processor. For example, at least a portion of a difference-based transformation algorithm for data compression or decompression as disclosed herein may be implemented in the form of software that is stored in a memory such as memory 212 or an internal memory of processor 210 in the
A given such memory that stores software code for execution by a corresponding processor is an example of what is more generally referred to herein as a computer-readable medium or other type of computer program product having computer program code embodied therein, and may comprise, for example, electronic memory such as random access memory (RAM) or read-only memory (ROM), magnetic memory, optical memory, or other types of storage devices in any combination. The processor may comprise a microprocessor, CPU, ASIC, FPGA or other type of processing device, as well as portions or combinations of such devices. Although not expressly shown in
As indicated above, embodiments of the invention may be implemented in the form of integrated circuits. In a given such integrated circuit implementation, identical die are typically formed in a repeated pattern on a surface of a semiconductor wafer. Each die includes coding circuitry as described herein, and may include other structures or circuits. The individual die are cut or diced from the wafer, then packaged as an integrated circuit. One skilled in the art would know how to dice wafers and package die to produce integrated circuits. Integrated circuits so manufactured are considered embodiments of the invention.
Difference-based encoding and decoding operations performed in illustrative embodiments by encoder 104 and decoder 112, respectively, will now be described in greater detail with reference to
Assume initially that the data to be encoded comprises a data sequence obtained by sampling a pure sinusoid A sin(2π ft+φ) at a rate fs. The sampled sequence takes the form
A sin(2πfkδ+φ),k≧0
where δ=1/fs is the sampling interval, φ is the phase shift in radians and k is the index of the sample in the sequence. We now define Δ1(k) as the first order of difference between the (k+1)-th and k-th samples of this sequence as
Δ1(k)=A sin(2πf(k+1)δ+φ)−A sin(2πfkδ+φ)
The first order of difference is also referred to herein as first order differenced data. Substituting
and performing further simplification yields
More generally, an n-th order of difference between successive samples of this sequence, also referred to herein as n-th order differenced data, can be written as
The magnitude of Δn(k) depends on the quantities
We now examine the contribution of each term to the magnitude of the n-th order of difference Δn(k) as n increases.
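The simplification of the first order of difference can be carried out with the sum-to-product identity for sines. The derivation below is a reconstruction from the definitions given in this section, offered as an aid to the reader; it is not necessarily the exact form in which Equations (1) and (2) were originally written. It assumes δ=1/fs.

```latex
\begin{aligned}
\Delta_1(k) &= A\sin\bigl(2\pi f (k+1)\delta + \varphi\bigr) - A\sin\bigl(2\pi f k\delta + \varphi\bigr) \\
            &= 2A\,\sin(\pi f\delta)\,\cos\bigl(2\pi f k\delta + \varphi + \pi f\delta\bigr) \\
            &= 2A\,\sin(\pi f\delta)\,\sin\bigl(2\pi f k\delta + \varphi + \pi f\delta + \tfrac{\pi}{2}\bigr), \\[4pt]
\Delta_n(k) &= A\,\bigl(2\sin(\pi f\delta)\bigr)^{n}\,
               \sin\Bigl(2\pi f k\delta + \varphi + n\bigl(\pi f\delta + \tfrac{\pi}{2}\bigr)\Bigr).
\end{aligned}
```

Each differencing step therefore scales the amplitude by 2 sin(πf/fs) and advances the phase by πf/fs+π/2. For 0≦f≦fs/2 the scale factor is below one when f<fs/6, equal to one when f=fs/6, and above one when f>fs/6.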
The set of all possible values that the frequency f can assume falls into three intervals:
However, the range of values that contains information is confined to the interval
when the signal is sampled at the Nyquist rate. The first term
decreases in magnitude as n increases when f∈I1, remains constant when f∈I2, and increases in magnitude when f∈I3.
The second term
shifts the phase of the original signal by an amount
and in the process, allows the magnitude of this term to decrease or increase in comparison to the value prior to this phase shift. However, the range of values this term assumes is confined to the interval [−1,1]. Under conditions where
for some α∈ℤ+, the phase of the original samples can be preserved by choosing n=4α. Under these circumstances, Equation (2) can be rewritten as
where Δ0(k) is the original sampled sequence, also referred to herein as zero order differenced data.
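The dependence of the magnitude on the ratio f/fs can be checked numerically. The following Python sketch is illustrative only: the function names, the default phase and the sample count are assumptions introduced here. The predicted gain (2 sin(πf/fs))^n follows from applying the sum-to-product identity to the definition of Δ1(k), and is not quoted from the equations of this description.

```python
import math


def nth_difference(samples, n):
    """Apply the forward difference d[k] = s[k+1] - s[k] a total of n times."""
    out = list(samples)
    for _ in range(n):
        out = [b - a for a, b in zip(out, out[1:])]
    return out


def peak_after_differencing(f_over_fs, n, amplitude=1.0, phase=0.3, count=4096):
    """Largest magnitude seen in the n-th order difference of a sampled sinusoid."""
    samples = [
        amplitude * math.sin(2 * math.pi * f_over_fs * k + phase)
        for k in range(count)
    ]
    return max(abs(v) for v in nth_difference(samples, n))


def predicted_gain(f_over_fs, n):
    """Amplitude scale factor after n differencing steps: (2 sin(pi f / fs))^n."""
    return (2 * math.sin(math.pi * f_over_fs)) ** n


if __name__ == "__main__":
    for ratio in (0.05, 1 / 6, 0.3):
        print(ratio, peak_after_differencing(ratio, 3), predicted_gain(ratio, 3))
```

Under these assumptions the measured peak shrinks with n for f/fs below 1/6, stays near the original amplitude at f/fs=1/6, and grows above it.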
More generally, for any signal that can be expressed as a superposition of sinusoids of various frequencies given by
and sampled at a rate fs>2fmax, the n-th order of difference Δn(k) can be written using Equation (2) as
In this case, the magnitude of the successive differences decreases for those terms that have
remains the same for those that have
and increases in magnitude for those that have
We will now describe an embodiment of a successive difference architecture for the encoder 104 and a technique for determining the number of bits needed to represent the differenced samples. This will be followed by description of an embodiment of the transmitter 106 which assembles and transmits the data in an optimal format. The functionality of the receiver 110 and decoder 112 will then be described.
Consider an implementation of data source 102 that generates a data sample xk at each of a plurality of discrete time steps k (k≧0). At any time step k, an encoder of order n (n≦k) examines a set of n+1 consecutive samples {xk−n, xk−n+1, . . . , xk} and computes n+1 orders of difference
D(k)={Δ0(k),Δ1(k−1), . . . , Δn(k−n)}. (6)
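For a given processing stage α of an n-stage encoder, where 0≦α≦n, the order of difference for α≧1 is computed from the outputs of the preceding stage as
Δα(k−α)=Δα−1(k−α+1)−Δα−1(k−α). (7)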
For α=0, Δ0(k) represents the original sample xk.
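As an illustration, the set D(k) of Equation (6) can be computed in software by differencing a window of n+1 consecutive samples repeatedly and keeping the newest value at each level. This is a minimal sketch; the function name `orders_of_difference` is hypothetical, and a hardware encoder would use cascaded stages rather than recomputing the whole window.

```python
def orders_of_difference(window):
    """Given [x_{k-n}, ..., x_k] (oldest first), return
    [Δ0(k), Δ1(k-1), ..., Δn(k-n)], i.e. the set D(k)."""
    level = list(window)
    result = []
    while level:
        # The newest entry of the j-th differenced sequence is Δj(k-j).
        result.append(level[-1])
        # Difference adjacent entries to move to the next order.
        level = [b - a for a, b in zip(level, level[1:])]
    return result


if __name__ == "__main__":
    # Samples of k^2: the second difference is constant, the third is zero.
    print(orders_of_difference([1, 4, 9, 16]))
```

For the window [1, 4, 9, 16] the successive levels are [3, 5, 7], [2, 2] and [0], so the result is [16, 7, 2, 0].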
We now define the function F(m,x) that processes an m-bit sample x and returns the least number of bits needed to represent x. For example, if x can be represented as
then, computation of F(m,x) may involve the following steps:
1. Obtain the set {Bm−1, Bm−2, . . . , B0} where
where Π and Σ are logical AND and OR operators respectively.
2. Examine the Bq values to determine the value of F(m,x)
Now, if each xk is represented using m bits, a higher order of difference Δn(k) can be represented using at least 1 bit and at most m+n bits. Therefore, for each element in the set D(k), we use the function F(m,x) to compute the number of bits needed to represent each order of difference as
G(k)={gk0,gk1, . . . , gkn} (13)
where
gkn=F(m+n,Δn(k−n)) (14)
In the set G(k) there exists at least one value a such that gka≦gkj for 0≦j≦n. If there exists more than one value of gkj that is the least among the values in G(k), the value corresponding to the least possible j is chosen. When a gka value is chosen, the tuple {gka, a, Δa(k−a)} is the desired encoded data.
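The per-sample selection just described can be sketched as follows. The helper `least_bits` stands in for F(m,x) under the assumption that samples are signed two's-complement integers; it ignores the input width m because Python integers are unbounded, and its XOR-based formulation is an assumption of this sketch rather than the Bq procedure referred to above. All function names are hypothetical.

```python
def least_bits(x):
    """Least number of two's-complement bits able to hold the signed integer x.

    x ^ (x >> 1) has a 1 wherever adjacent bits of x differ, so its highest
    set bit marks the last position that is not a copy of the sign bit.
    """
    return (x ^ (x >> 1)).bit_length() + 1


def orders_of_difference(window):
    """Return [Δ0(k), Δ1(k-1), ..., Δn(k-n)] for a window given oldest first."""
    level = list(window)
    result = []
    while level:
        result.append(level[-1])
        level = [b - a for a, b in zip(level, level[1:])]
    return result


def select_order(window):
    """Return the tuple (g, a, Δa(k-a)) with the smallest bit count g.

    min() returns the first minimal index, which gives the lowest order a
    when several orders tie.
    """
    d = orders_of_difference(window)
    g = [least_bits(v) for v in d]
    a = min(range(len(g)), key=lambda j: g[j])
    return g[a], a, d[a]


if __name__ == "__main__":
    print(select_order([1, 4, 9, 16]))
    print(select_order([5, 5, 5]))
```

For the window [1, 4, 9, 16] the orders of difference are [16, 7, 2, 0] with bit counts [6, 4, 3, 1], so the third order is selected. For [5, 5, 5] the first and second orders tie at one bit and the lower order is chosen.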
The encoded data {gka, a, Δa(k−a)} generated by the encoder 104 may not be in an optimal format for transmission over the network 108. When the order of difference indicator a and the number of bits gka needed to represent each sample vary often, the savings on the raw data achieved by performing a higher order of difference will be offset by the additional bits needed to represent a and gka for each sample. This issue can be addressed by grouping transformed data in blocks for transmission such that the transformed data for all samples within a given block have the same values for a and gka, respectively.
The transmitter 106 performs this grouping function in the present embodiment, by examining the parameters of a block of encoded data of length λ with the objective of determining the least number of bits and order of difference needed to represent the entire block of data.
Since the encoder 104 creates the set G(k) at any step k, this information can be used to determine the number of bits needed to store the transformed data for each order of difference. To accomplish this, at time step k, the transmitter 106 examines the pair of sets G(k) and G(k−1) and creates the quantity H(k) as
H(k)={hk0,hk1, . . . , hkn} (15)
where
hkn=max(gkn,gk−1n) (16)
When H(k) is computed in this manner for an entire block of length λ, from time steps k+1 to k+λ, the set H(k+λ) will have information on the least number of bits needed to represent each order of difference for the entire block of data. By this rationale, there would exist some hk+λα∈H(k+λ) for some α (0≦α≦n) such that hk+λα≦hk+λj for all j (0≦j≦n). In other words, the desired order of difference for the entire block of data is α and each differenced sample in the block can be represented using hk+λα bits since only the hk+λα least significant bits of the differenced sample need to be retained.
Having determined α and hk+λα, the block of original data comprising raw samples {xk+1,xk+2, . . . , xk+λ} needs to be converted into the block of data {Δα(k+1−α),Δα(k+2−α), . . . , Δα(k+λ−α)}. This may be accomplished by implementing an additional encoder stage in transmitter 106 for order n and extracting data of order α for the entire block of length λ. In addition, only the least significant hk+λα bits of the data are chosen for transmission, since the more significant bits are obtained by sign extension of the hk+λα-th bit position.
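A software sketch of this block-level selection is given below. It reads Equation (16) as a running maximum of the per-order bit counts over the block, which is an interpretation made for this example. It also buffers every order of difference for the block, whereas the arrangement described above regenerates the chosen order with an additional encoder stage. The names `encode_block`, `history` and `block` are hypothetical, and `history` is assumed to hold the n samples immediately preceding the block.

```python
def least_bits(x):
    """Least number of two's-complement bits able to hold the signed integer x."""
    return (x ^ (x >> 1)).bit_length() + 1


def orders_of_difference(window):
    """Return [Δ0(k), Δ1(k-1), ..., Δn(k-n)] for a window given oldest first."""
    level = list(window)
    result = []
    while level:
        result.append(level[-1])
        level = [b - a for a, b in zip(level, level[1:])]
    return result


def encode_block(history, block, n):
    """Pick one order of difference and one bit width for a whole block.

    Returns (alpha, width, payload), where payload holds the `width` least
    significant bits of each order-alpha difference as a non-negative field.
    """
    if len(history) != n:
        raise ValueError("history must hold exactly n samples")
    data = list(history) + list(block)
    h = [0] * (n + 1)          # running maximum of bit counts per order
    per_step = []              # buffered orders of difference (sketch only)
    for k in range(n, len(data)):
        d = orders_of_difference(data[k - n:k + 1])
        per_step.append(d)
        h = [max(hj, least_bits(dj)) for hj, dj in zip(h, d)]
    alpha = min(range(n + 1), key=lambda j: h[j])   # lowest order wins ties
    width = h[alpha]
    mask = (1 << width) - 1
    payload = [d[alpha] & mask for d in per_step]
    return alpha, width, payload


if __name__ == "__main__":
    print(encode_block([0, 1], [2, 3, 4, 5], n=2))
    print(encode_block([10], [7, 4, 1], n=1))
```

For a linear ramp the second order of difference is zero throughout, so the block needs one bit per sample at order two. For the falling sequence 10, 7, 4, 1 with n=1, the first order difference is −3 throughout and is sent as the three-bit field 5.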
Providing an additional encoder stage in the transmitter 106 decreases the complexity of seeking the best possible value of α and hk+λα from O(nλ) to O(n). This reduction in computational complexity is significant as the block length λ can range anywhere from a few samples to thousands of samples. Moreover, since D(k+1) is evaluated by the encoder 104 at step k+1, this value can be passed on to the transmitter 106 to initialize the differencing component in the transmitter at step k+λ+1. Knowledge of the state allows the encoder 104 to produce Δα(k+1−α) in one additional time step at k+λ+2, since α and hk+λα are known at the end of k+λ steps for the entire block of data of length λ. Thus the total latency in encoding a block of data of length λ is n+λ+2 steps.
The data emerging from the transmitter 106 may be configured to include a header comprising the tuple {λ,α,hk+λα} which provides information about the block of data that is to follow. This is followed by the transformed data sequence {Δα(k+1−α),Δα(k+2−α), . . . , Δα(k+λ−α)}. If the number of steps needed to transmit the header is two or more, the two-step latency introduced by the transmitter can be completely masked.
The entire block of data received from the transmitter 106 is processed by the receiver 110 from which it creates the tuple {α,Δα(k−α)}. Since the order of difference is kept constant for the entire block of data, the main function of the receiver 110 is to keep track of the order of difference and number of bits, as each new block is encountered. Much of the work of regenerating the raw samples is performed by the decoder 112.
The decoder 112 in the present embodiment utilizes state variables. More particularly, at any step k, the decoder maintains a set of state variables given by the following set of orders of difference:
D(k)={Δ0(k−1),Δ1(k−2), . . . , Δn(k−n−1)} (17)
It takes the decoder n steps to create the state variables. On every successive time step k (k≧n) the decoder receives an ordered pair {α,Δα(k−α)} from the receiver 110, where 0≦α≦n. If α=0, then Δ0(k) represents the original sample xk. More generally, for any value of α (α≧0), the original sample is obtained by adding the received difference to the stored lower orders of difference:
xk=Δα(k−α)+Δα−1(k−α)+ . . . +Δ1(k−2)+Δ0(k−1) (18)
Additionally, for α<j<n, the decoder computes
Δj(k−j)=Δj-1(k−j+1)−Δj-1(k−j) (19)
and uses these values to compute D(k+1). The decoder state variables allow the computation of the original xk at any step k in the decoding process when {α,Δα(k−α)} is known.
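The decoder state update can be sketched in software as follows. This is a minimal illustration under stated assumptions: the class and method names are hypothetical, the state is seeded from n raw samples, and only orders 0 through n−1 are kept as state, which is sufficient to decode any received order α with 0≦α≦n. The downward loop rebuilds the original sample by repeatedly inverting the differencing relation of Equation (19), and the upward loop applies Equation (19) directly to refresh the higher orders.

```python
def orders_of_difference(window):
    """Return [Δ0, Δ1, ...] taken at the newest end of a window (oldest first)."""
    level = list(window)
    result = []
    while level:
        result.append(level[-1])
        level = [b - a for a, b in zip(level, level[1:])]
    return result


class DifferenceDecoder:
    """Keeps state[j] = Δj(k-1-j) for 0 <= j < n and decodes one pair per step."""

    def __init__(self, raw_samples):
        self.n = len(raw_samples)
        self.state = orders_of_difference(raw_samples)

    def step(self, alpha, delta):
        """Consume the pair (alpha, Δalpha(k-alpha)) and return the sample x_k."""
        if not 0 <= alpha <= self.n:
            raise ValueError("order of difference out of range")
        new = [0] * (self.n + 1)
        new[alpha] = delta
        # Walk down to order 0: Δj(k-j) = Δ(j+1)(k-j-1) + Δj(k-j-1).
        for j in range(alpha - 1, -1, -1):
            new[j] = new[j + 1] + self.state[j]
        # Refresh the orders above alpha: Δj(k-j) = Δ(j-1)(k-j+1) - Δ(j-1)(k-j).
        for j in range(alpha + 1, self.n):
            new[j] = new[j - 1] - self.state[j - 1]
        self.state = new[:self.n]
        return new[0]


if __name__ == "__main__":
    data = [3, 1, 4, 1, 5, 9, 2, 6]
    n = 2
    decoder = DifferenceDecoder(data[:n])
    decoded = []
    for k in range(n, len(data)):
        d = orders_of_difference(data[k - n:k + 1])
        alpha = k % (n + 1)          # vary the order to exercise every path
        decoded.append(decoder.step(alpha, d[alpha]))
    print(decoded == data[n:])
```

The usage example changes the order of difference at every step only to exercise all branches; with block-based transmission the order would stay fixed within a block.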
It should be noted that the diagram in
In another possible embodiment of the processing stage 400 of
The computational complexity of determining Δn(k) is O(n). More specifically, Δn(k) can be computed with n adders and n dual-register storage elements with a latency of n delay cycles and a path delay of n adders. Note that Δn(k) is computed from Δn−1(k+1) and Δn−1(k), both of which are evaluated at an earlier step in the computation. This results in the set of values D(k) and the corresponding number of bits associated with each element of this set G(k) being readily available at any step k as described earlier.
The decoder 112 of order n similarly comprises n processing stages, each configured in substantially the same manner. Examples of such decoder processing stages are shown in
Assume that for a given stage α (1≦α≦n) of the n stages of the decoder 112 at time step k, the multiplexers 502-1, 502-2 and 502-3 are denoted AMUX1(α,k), AMUX2(α,k) and RMUX(α,k), the adder 504 is denoted ADD(α,k) and register 506 is denoted REG(α,k). When the decoder receives the tuple {a,Δa(k−a)} at time step k, these elements hold the following values:
AMUX1(α,k)=−RMUX(α,k)when α≧a (20)
AMUX1(α,k)=RMUX(α,k)when α<a (21)
AMUX2(α,k)=RMUX(α+1,k)when α<a (22)
AMUX2(α,k)=Δa(k−a)when α=a (23)
AMUX2(α,k)=ADD(α−1,k)when α>a (24)
RMUX(α,k)=RMUX(α+1,k)when α<a (25)
RMUX(α,k)=Δa(k−a)when α=a (26)
RMUX(α,k)=ADD(α−1,k)when α>a (27)
ADD(α,k)=AMUX1(α,k)+AMUX2(α,k) (28)
REG(α,k)=RMUX(α,k−1) (29)
The original sample xk emerges at RMUX(0,k) at time step k. The processing stages shown in
In
As another example, the lower multiplexer in stage 500-4 in each of
In the decoder 600, all higher order differences are automatically computed and stored for processing the next set of data samples. The registers 506-1, 506-2, 506-3 and 506-4 in respective stages 500-1, 500-2, 500-3 and 500-4 store the same values in each of
In
In
In
In
These figures illustrate both the simplicity of the decoder architecture and the flexibility in handling varying orders of data differences between successive data samples.
Additional details will now be provided regarding encoding and decoding using difference-based transformation in illustrative embodiments.
It is assumed that up to α orders of difference are computed on the source data sequence as described in Equation (6). The value of α chosen can be based either on heuristics or on information about the bandwidth of the underlying signal in relation to the sampling frequency. When a heuristic approach is used, α can be derived from experiments on data representative of that to be processed by the system. When the highest frequency component f and the sampling frequency fs are known, no more than α orders of difference need to be implemented, where α is the solution of the equation
and where γ is the reduction in the number of bits needed. In other words, when data compression is possible on an original sequence of m-bit samples, then after higher order differencing, only m−γ bits will be necessary. For example, solving this equation for γ=1 will reduce the size of the original samples by one bit when the necessary conditions for convergence are satisfied.
The encoder functionality in such an embodiment may comprise the following steps:
Step 1: Allow n+1 samples to accumulate in a buffer to compute up to n orders of difference.
Step 2: Compute the desired order of difference as outlined in Equations (6) and (7). An exemplary hardware implementation may comprise cascaded differencing stages of the type illustrated in
Step 3: Compute the metric Bq as outlined in Equations (9) and (10) for each order of difference including zero order (n=0 for no transformation). In addition, we keep track of the least number of bits needed for each order of difference. Given a sequence {Bm−1,Bm−2, . . . , B0} we compute a metric B(m) as
B(m)=Bm OR B(m−1)when m>0 (31)
B(m)=Bm when m=0 (32)
where m denotes the index of the sequence and n the order of difference. The value of B(m) computed for the entire length of the data sequence is used to predict the number of bits needed to encode the entire block as outlined in Equations (11) and (12). This approach allows rapid computation of the number of bits needed at any stage. When using software, the bit computation can be done using logical operations instead of arithmetic or bit shift operations, which can speed up computation time.
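The idea of accumulating a per-bit flag across the block with a logical OR can be illustrated in software. The sketch below is an assumption-laden stand-in, not the Bq procedure of the equations referred to above: it flags every bit position at which a two's-complement sample differs from its next higher bit, ORs those flags over the block, and reads the required width from the highest flag set. It uses one shift per sample, so it illustrates only the OR accumulation. The name `block_bits` is hypothetical.

```python
def block_bits(values):
    """Bits needed to hold every signed integer in `values` in two's complement."""
    accumulated = 0
    for x in values:
        # A set bit marks a position where x differs from its next higher bit,
        # i.e. a position that cannot be recovered by sign extension.
        accumulated |= x ^ (x >> 1)
    # One extra bit is always needed for the sign.
    return accumulated.bit_length() + 1


if __name__ == "__main__":
    print(block_bits([5, -3, 0]))
    print(block_bits([-8, 7]))
```

Because the flags are combined with OR, the result equals the largest per-sample bit count in the block without any comparison or maximum operation inside the loop.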
In situations where space for storing the order of difference is limited, one could instead compute B(m) for the entire block for each order of difference and make note of which order of difference gives the least number of bits. Then the entire sequence of differences can be regenerated for this specific order of difference and passed to the transmitter.
Step 4: Compute the number of bits needed B(m) for each order of difference n until the entire data sequence is processed. Each order of difference n will therefore have a metric B(m) associated with it. This metric is used to compute the number of bits needed to represent the data as outlined in Equations (11) and (12). The order of difference that results in the least number of bits for the entire block is chosen as the desired order of difference. Note that when the number of bits needed for order n (n>0) is more than the number of bits used to represent the original sequence (n=0), the original samples are passed as is. We denote the optimal number of bits needed for the sequence as mopt.
Practical considerations may require a block of data to be broken down into sub-blocks to ensure that optimal numbers of bits are used for packing each sub-block of data. For example, if a transformed data sequence requires 10 bits for the first 1000 symbols, 15 bits for the next 100 symbols and 10 bits for the next 2000 symbols, it may make sense to split this block into three sub-blocks rather than send the entire block of 3100 symbols using 15 bits.
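The payload sizes in this example can be compared directly. The short sketch below counts payload bits only; header and footer overhead for the extra sub-blocks is not modeled, and the function name `payload_bits` is hypothetical.

```python
def payload_bits(sub_blocks):
    """Total payload size in bits for a list of (symbol_count, bits_per_symbol)."""
    return sum(count * width for count, width in sub_blocks)


# One block of 3100 symbols sent at the worst-case width of 15 bits.
single = payload_bits([(3100, 15)])

# The same symbols split into three sub-blocks with their own widths.
split = payload_bits([(1000, 10), (100, 15), (2000, 10)])

if __name__ == "__main__":
    print(single, split, single - split)
```

The single block needs 46,500 payload bits and the three sub-blocks need 31,500, so splitting is worthwhile as long as the added header and footer fields cost fewer than 15,000 bits.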
Step 5: Extract the least significant mopt bits for the chosen order of difference irrespective of whether the differenced data is a positive or a negative quantity. We refer to this data as the payload which is sent to the transmitter.
The transmitter arranges the encoded data for transmission in the manner previously described. In one or more embodiments, the transmitted data may comprise a header, a payload and a footer. The header comprises information that is used by the receiver and decoder to obtain the original sequence. The payload comprises the transformed data sequence obtained from Steps 1 through 5 outlined above. The footer has information about sequence termination, continuation or otherwise.
As a more particular example, the transmitted data in one embodiment may be structured using at least a subset of the fields described below:
Header 1: Indicates the number of raw samples sent as is. This is used to initialize the state variables in the decoder.
Header 2: Indicates the number of bits used to represent raw samples.
Header 3: The raw samples themselves, as many as are needed for the order of difference used.
Header 4: Number of samples in the payload.
Header 5: Indicates the number of bits used to represent each transformed data sample in the sequence mopt.
Header 6: Indicates the number of bits used to represent transformed samples (mopt).
Payload 1: As many transformed samples as created from Step 5 of the encoder.
Footer 1: Indicates whether the current payload has terminated or has been extended.
If the current sequence has not been terminated, then information regarding any changes to the order of difference or number of bits used to encode the data may be included here. This section can also have more complex information that describes the characteristics of the sequence of data that follows the footer. For example, a certain value encoded into Footer 1 might indicate that the data following the footer will comprise a sequence of a particular number of raw samples using a certain number of bits, followed by a particular number of first order differenced samples using another number of bits, followed by a particular number of second order difference samples using yet another number of bits. This example is used for illustration only and the information in Footer 1 can be used to describe a wide range of arrangements of encoded data.
Footer 2: A non-zero value specifies the number of additional samples that are embedded as payload following the footer. The additional samples may either be raw data samples, or differenced samples using Steps 1 through 5 above. This additional sequence is a continuation of the original sequence of data, but the number of bits used to encode the data and the order of differencing may have changed.
Footer 3: Specifies the order of difference for the data that is about to follow.
Footer 4: Specifies the number of bits for the data that is about to follow.
Payload 2: Continuation of the current sequence using either raw data or data compiled using Steps 1 through 5 above.
Footer 1: See Footer 1 above.
It is to be appreciated that the particular fields and processing operations described above and elsewhere herein are considered examples only, and other embodiments can use different types and arrangements of fields and processing operations. The particular fields and processing operations listed above and elsewhere herein should therefore be viewed as optional rather than as requirements of any embodiment of the invention.
Exemplary operations performed by the receiver 110 in an illustrative embodiment may include the following:
Process header information: When a new block of data is received, the receiver passes the raw data samples in the header as state variables that will be used by the decoder.
Processing of Payload: The payload data is sign extended based on the number of bits specified in the header. Once the entire payload is processed by the receiver, the processed payload data comprises m-bit wide data, the same as the width of the data created by the encoder. For the block of data that is processed, the order of difference associated with that block of data and the sign extended data are passed on to the decoder.
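The truncation performed by the encoder and the sign extension performed by the receiver are inverses whenever the chosen width is large enough for the value. The following sketch assumes two's-complement fields held as non-negative integers; the function names `truncate` and `sign_extend` are hypothetical.

```python
def truncate(value, width):
    """Keep the `width` least significant bits of a signed integer."""
    return value & ((1 << width) - 1)


def sign_extend(field, width):
    """Interpret a `width`-bit field as a signed two's-complement integer."""
    sign_bit = 1 << (width - 1)
    # Flipping the sign bit and subtracting its weight maps fields with the
    # sign bit set onto the negative range and leaves the others unchanged.
    return (field ^ sign_bit) - sign_bit


if __name__ == "__main__":
    width = 3
    for value in range(-4, 4):
        field = truncate(value, width)
        print(value, format(field, "03b"), sign_extend(field, width))
```

With a width of three bits, every value from −4 to 3 survives the round trip; for example −3 is sent as the field 101 and recovered as −3.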
Processing of footer and additional payload: Any additional data in the footer is processed in the same manner until the entire block of data has been processed. Once this is done, the output of the receiver would comprise sets of sequences where each sequence would include a sub-sequence of encoded data and the corresponding order of difference used to encode the data.
Exemplary operations performed by the decoder 112 in an illustrative embodiment may include the following:
Step 1: Load state variables. For every new block of data that is received, the decoder receives the initial state variables given by the raw samples stored in the header. This information is used to initialize the registers so that computation of Equation (18) is possible.
Step 2: Retrieve original samples of the sequence. Use the header information to determine the raw samples as outlined in Equations (18) and (19). The functional implementation of the decoder circuitry comprising stage 500 is described above using Equations (20) through (29).
Step 3: Data with multiple payloads. For the second payload in the sequence, the only change would be the order of difference. The order of difference information helps to determine which stage of the decoder gets the data that is fed by the receiver after sign extension. Also, information is provided on what the settings should be when they are applied to the multiplexers as outlined in Equations (20) through (29). When the order of difference changes, the data received from the transmitter is fed to the decoder at a different stage but the original sample is extracted from the same point, namely RMUX (0,k) where k is the total number of stages in the decoder. The differences are thus processed until all data for the block has been processed, including one header and a variable number of footers and payloads.
The above-described illustrative embodiments of encoding and decoding processes using difference-based data transformation may be configured to provide lossless, linear data compression in which compressed data can be directly subject to linear signal processing operations. Such arrangements can reduce the complexity of signal processing operations while providing additional benefits in the form of reduced power dissipation, improved signal integrity and better bandwidth utilization. The amount of data compression provided in a given embodiment may be comparable to that of existing lossless data compression techniques, but system performance is considerably improved.
Again, it should be emphasized that the embodiments of the invention as described herein are intended to be illustrative only. For example, other embodiments of the invention can be implemented using a wide variety of other types of systems, circuitry and associated difference-based data transformation algorithms, than those included in the embodiments described herein. These and numerous other alternative embodiments within the scope of the following claims will be readily apparent to those skilled in the art.