Data encoding and decoding using Slepian-Wolf coded nested quantization to achieve Wyner-Ziv coding

Information

  • Patent Application
  • 20060197690
  • Publication Number
    20060197690
  • Date Filed
    March 22, 2005
    19 years ago
  • Date Published
    September 07, 2006
    18 years ago
Abstract
A system and method for realizing a Wyner-Ziv encoder may involve the following steps: (a) apply nested quantization to input data from an information source in order to generate intermediate data; and (b) encode the intermediate data using an asymmetric Slepian-Wolf encoder in order to generate compressed output data representing the input data. Similarly, a Wyner-Ziv decoder may be realized by: (1) applying an asymmetric Slepian-Wolf decoder to compressed input data using side information to generate intermediate values, and (b) jointly decoding the intermediate values using the side information to generate decompressed output data.
Description
FIELD OF THE INVENTION

The present invention relates to the field of information encoding/decoding, and more particularly to a system and method for realizing a Wyner-Ziv code using nested quantization and Slepian Wolf coding.


DESCRIPTION OF THE RELATED ART

In 1976, Wyner and Ziv [1] established a theorem regarding the best possible source coding performance given distortion under the assumption that the decoder has access to side information. Unfortunately, codes realizing or approaching this best possible performance have not heretofore been demonstrated. Thus, it would be greatly desirable to be able to design codes (especially practical codes) realizing or approaching this best possible performance, and, to deploy such codes for use in encoders and decoders.


SUMMARY

In one set of embodiments, a system and method for generating compressed output data may involve:

    • (a) receiving input data from an information source;
    • (b) applying nested quantization to the input data in order to generate intermediate data;
    • (c) encoding the intermediate data using an asymmetric Slepian-Wolf encoder in order to generate compressed output data representing the input data; and
    • (d) performing at least one of storing the compressed output data, and, transferring the compressed output data.


      The values of the input data may be interpreted as vectors in an n-dimensional space, where n is greater than or equal to one.


The information source may be a continuous source or a discrete source. A discrete source generates values in a finite set. A continuous source generates values in a continuum.


The operations (b) and (c) may be arranged so as to realize the encoder portion of a Wyner-Ziv code.


The compressed output data may be stored in a memory medium for future decompression. Alternatively, the compressed output data may be transferred to a decoder for more immediate decompression.


The process of applying nested quantization to the input data may include: quantizing values of the input data with respect to a fine lattice to determine corresponding points of the fine lattice; and computing indices identifying cosets of a coarse lattice in the fine lattice corresponding to the fine lattice points. The intermediate data include said indices. The coarse lattice is a sublattice of the fine lattice.


In any given dimension, some choices for the fine lattice and coarse lattice may lead to better performance than others. However, the principles of the present invention may be practiced with non-optimal choices for the fine lattice and coarse lattice as well as with optimal choices.


In another set of embodiments, a system and method for recovering information from compressed input data may involve:

    • (a) receiving compressed input data, wherein the compressed input data is a compressed representation of a block of samples of a first source X;
    • (b) receiving a block of samples of a second source Y;
    • (c) applying an asymmetric Slepian-Wolf decoder to the compressed input data using the block of samples of the second source Y, wherein said applying generates a block of intermediate values;
    • (d) performing joint decoding on each intermediate value and a corresponding sample of the block of second source samples to obtain a corresponding decompressed output value.


      The operations (c) and (d) may be arranged so as to realize the decoder portion of a Wyner-Ziv code.


The joint decoding may involve determining an estimate of a centroid of a function restricted to a region of space corresponding to the intermediate value. The function may be the conditional probability density function of the first source X given said corresponding sample of the second source block. The centroid estimate may be (or may determine) the decompressed output value.


The region of space is a union of cells (e.g., Voronoi cells) corresponding to a coset of a coarse lattice in a fine lattice, wherein the coset is identified by the intermediate value.


In yet another set of embodiments, a system and method for computing a table representing a nested quantization decoder may involve:

    • (a) computing a realization z of a first random vector;
    • (b) computing a realization y of a second random vector;
    • (c) adding z and y to determine a realization x of a source vector;
    • (d) quantizing the realization x to a point in a fine lattice;
    • (e) computing an index J identifying a coset of a coarse lattice in the fine lattice based on the fine lattice point;
    • (f) adding the realization x to a cumulative sum corresponding to the index J and the realization y;
    • (g) incrementing a count value corresponding to the index J and the realization y;
    • (h) repeating operations (a) through (g) a number of times;
    • (i) dividing the cumulative sums by their corresponding count values to obtain resultant values; and
    • (j) storing the resultant values in a memory.


In one set of embodiments, a system for generating compressed output data may include a memory and a processor. The memory is configured to store data and program instructions. The processor is configured to read and execute the program instructions from the memory. In response to execution of the program instructions, the processor is operable to: (a) receive input data from an information source; (b) apply nested quantization to the input data in order to generate intermediate data; (c) encode the intermediate data using an asymmetric Slepian-Wolf encoder in order to generate compressed output data representing the input data; and (d) perform at least one of: storing the compressed output data; and transferring the compressed output data.


In another set of embodiments, a system for decoding compressed data may include a memory and processor. The memory is configured to store data and program instructions. The processor is configured to read and execute the program instructions from the memory. In response to execution of the program instructions, the processor is operable to: (a) receive compressed input data, wherein the compressed input data is a compressed representation of a block of samples of a first source X; (b) receive a block of samples of a second source Y; (c) apply an asymmetric Slepian-Wolf decoder to the compressed input data using the block of samples of the second source Y, wherein said applying generates a block of intermediate values; (d) perform joint decoding on each intermediate value and a corresponding sample of the block of second source samples to obtain a corresponding decompressed output value, wherein said performing joint decoding includes determining an estimate of a centroid of a function restricted to a region of space corresponding to the intermediate value, wherein said estimate determines the decompressed output value. The function is the conditional probability density function of the first source X given said corresponding sample of the second source block.


In yet another set of embodiments, a system for computing a table representing a nested quantization decoder may include a memory and processor. The memory is configured to store data and program instructions. The processor is configured to read and execute the program instructions from the memory. In response to execution of the program instructions, the processor is operable to: (a) computing a realization z of a first random vector; (b) computing a realization y of a second random vector; (c) adding z and y to determine a realization x of a source vector; (d) quantizing the realization x to a point in a fine lattice; (e) computing an index J identifying a coset of a coarse lattice in the fine lattice based on the fine lattice point; (f) adding the realization x to a cumulative sum corresponding to the index J and the realization y; (g) incrementing a count value corresponding to the index J and the realization y; (h) repeating operations (a) through (g) a number of times; (i) dividing the cumulative sums by their corresponding count values to obtain resultant values; and (j) storing the resultant values in a memory medium.


We propose a practical scheme that we refer to as Slepian-Wolf coded nested quantization (SWC-NQ) for Wyner-Ziv coding that deals with source coding with side information under a fidelity criterion. The scheme utilizes nested lattice quantization with a fine lattice for quantization and a coarse lattice for channel coding. In addition, at low dimensions (or block sizes), an additional Slepian-Wolf coding stage is added to compensate for the weakness of the coarse lattice channel code. The role of Slepian-Wolf coding in SWC-NQ is to exploit the correlation between the quantized source and the side information for further compression and to make the overall channel code stronger.


The applications of this proposed scheme are very broad; it can be used in any application that involves lossy compression (e.g., of speech data, audio data, image data, video data, graphic data, or, any combination thereof).


We show that SWC-NQ achieves the same performance of classic entropy-constrained lattice quantization. For example, 1-D/2-D SWC-NQ performs 1.53/1.36 dB away from the Wyner-Ziv rate distortion (R-D) function of the quadratic Gaussian source at high rate assuming ideal Slepian-Wolf coding. In other words, the scheme may be optimal in terms of compression performance, at least in some embodiments. We also demonstrate means of achieving efficient Slepian-Wolf compression via multi-level LDPC codes.




BRIEF DESCRIPTION OF THE DRAWINGS

A better understanding of the present invention can be obtained when the following detailed description of the preferred embodiment is considered in conjunction with the following drawings, in which:



FIG. 1A illustrates one embodiment of a computer system that may be used for implementing various of the method embodiments described herein;



FIG. 1B illustrates one embodiment of a communication system including two computers coupled through a computer network;



FIG. 2 is a block diagram for one embodiment of a computer system that may be used for implementing various of the method embodiments described herein;



FIG. 3A illustrates one embodiment of a sensor system as a possible application of the inventive principles described herein;



FIG. 3B illustrates one embodiment of a video transmission as another possible application of the inventive principles described herein;



FIG. 3C illustrates a system that compressed source information and stored the compressed information in a memory medium for later retrieval and decompression;



FIG. 4 illustrates one embodiment of a method for encoding data;



FIG. 5 illustrates one embodiment of a method for decoding data using side information;



FIG. 6 illustrates one embodiment of a method for computing a table that represents an nested quantization decoder.



FIG. 7 illustrate an example of a fine lattice, coarse lattice, coset leader vector v and region R(v) in dimension n=2;



FIG. 8 illustrate a simplified nested quantization ender and decoder;



FIG. 9 shows δ2(R) with different V2's using nested A2 lattices (i.e., hexagonal lattices) in dimension n=2;



FIG. 10 show {overscore (D)}2 (R) as the convex hull of δ2(R) with different V2;



FIG. 11 shows the granular and boundary components of distortion with different V2's;



FIG. 12 plots {overscore (D)}n(R) for n=1, 2, 4, 8 and 24 with Γz2=0.01;



FIG. 13 shows the lower bound of D(R) with different V2's in the 1-D case;


FIGS. 14(a) and (b) plot the optimal V*2 (scaled by σZ) as a function of R for the 1-D (n=1) and 2-D (n=2) cases;



FIG. 15 shows the improvement gained by using the optimal (non-linear) estimator at low rates, for n=2 and σZ2=0.01;



FIG. 16 illustrates one embodiment of a multi-layer Slepian Wolf coding scheme;



FIG. 17 shows results based on 1-D nested lattice quantization both with and without Slepian Wolf coding (SWC); and



FIG. 18 shows results based on 2-D nested lattice quantization both with and without Slepian Wolf coding (SWC).




While the invention is susceptible to various modifications and alternative forms, specific embodiments thereof are shown by way of example in the drawings and are herein described in detail. It should be understood, however, that the drawings and detailed description thereto are not intended to limit the invention to the particular form disclosed, but on the contrary, the intention is to cover all modifications, equivalents and alternatives falling within the spirit and scope of the present invention as defined by the appended claims.


DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS
Incorporation by Reference

The following patent application documents are hereby incorporated by reference in their entirety as though fully and completely set forth herein:

  • U.S. Provisional Application Ser. No. 60/657,520, titled “Multi-Source Data Encoding, Transmission and Decoding”, filed Mar. 1, 2005, whose inventors are Vladimir M. Stankovic, Angelos D. Liveris, Zixiang Xiong, Costas N. Georghiades, Zhixin Liu, Samuel S. Cheng, and Qian Xu, including Appendices A through H;
  • U.S. patent application Ser. No. 11/069,935, titled “Multi-Source Data Encoding, Transmission and Decoding Using Slepian-Wolf Codes Based On Channel Code Partitioning”, filed Mar. 1, 2005, whose inventors are Vladimir M. Stankovic, Angelos D. Liveris, Zixiang Xiong, and Costas N. Georghiades, including Appendices A through H; and
  • U.S. patent application Ser. No. 11/068,737, titled “Data Encoding and Decoding Using Slepian-Wolf Coded Nested Quantization to Achieve Wyner-Ziv Coding”, filed Mar. 1, 2005, whose inventors are Zhixin Liu, Samuel S. Cheng, Angelos D. Liveris, and Zixiang Xiong, including Appendices A through H.


The following publications are referred to herein and are incorporated by reference in their entirety as though fully and completely set forth herein:

  • [1] A. Wyner and J. Ziv, “The rate-distortion function for source coding with side information at the decoder,” IEEE Trans. Inform. Theory, vol. IT-22, pp. 1-10, January 1976.
  • [2] A. Wyner, “The rate-distortion function for source coding with side information at the decoder-II: general sources,” Inform. Contr., vol. 38, pp. 60-80, 1978.
  • [3] S. Servetto, “Lattice quantization with side information,” in Proceedings of the IEEE Data Compression Conference, DCC2000, Snowbird, Utah, March 2000.
  • [4] X. Wang and M. Orchard, “Design of trellis codes for source coding with side information at the decoder,” in Proc. DCC'01, Snowbird, Utah, March 2001.
  • [5] P. Mitran and J. Bajcsy, “Coding for the Wyner-Ziv problem with turbo-like codes,” in Proc. ISIT'02, Lausanne, Switzerland, June 2002.
  • [6] R. Zhang A. Aaron and B. Girod, “Wyner-Ziv coding of motion video,” in Proc. 36th Asilomar Conf., Pacific Grove, Calif., November 2002.
  • [7] S. Pradhan and K. Ramchandran, “Distributed source coding using syndromes (DISCUS): Design and construction,” IEEE Trans. Inform. Theory, vol. 49, pp. 626-643, March 2003.
  • [8] D. Rebollo-Monedero, R. Zhang, and B. Girod, “Design of optimal quantizers for distributed source coding,” in Proc. DCC'03, Snowbird, UT, March 2003.
  • [9] J. Chou, S. Pradhan, and K. Ramchandran, “Turbo and trellis-based constructions for source coding with side information,” in Proc. DCC'03, Snowbird, UT, March 2003.
  • [10] A. Liveris, Z. Xiong and C. Georghiades, “Nested convolutional/turbo codes for the binary Wyner-Ziv problem,” in Proc. ICIP'03, Barcelona, Spain, September 2003.
  • [11] Z. Xiong, A. Liveris, S. Cheng, and Z. Liu, “Nested quantization and Slepian-Wolf coding: A Wyner-Ziv coding paradigm for i.i.d. sources,” in Proc. IEEE Workshop on Statistical Signal Processing, St. Louis, Mo., September 2003.
  • [12] Y. Yang, S. Cheng, Z. Xiong, and W. Zhao, “Wyner-Ziv coding based on TCQ and LDPC codes,” in Proc. 37th Asilomar Conf., Pacific Grove, Calif., November 2003.
  • [13] J. H. Conway and Neil J. A. Sloane, “Sphere Packings, Lattices and Groups”, Springer, N.Y., 1998.
  • [14] G. Ungerboeck, “Channel coding with multilevel/phase signals,” IEEE Trans. Inform. Theory, vol. 28, pp. 55-67, January 1982.
  • [15] M. Marcellin and T. Fischer, “Trellis coded quantization of memoryless and Gaussian-Markov sources,” IEEE Communications, vol. 38, pp. 82-93, January 1990.
  • [16] R. Zamir and S. Shamai, “Nested linear/lattice codes for Wyner-Ziv encoding,” in Proc. IEEE Information Theory Workshop, Killarney, Ireland, June 1998, pp. 92-93.
  • [17] J. Conway, E. Rains, and N. Sloane, “On the existence of similar sublattices,” Canadian J. Math., vol. 51, pp. 1300-1306, 1999.
  • [18] R. Zamir, S. Shamai, and U. Erez, “Nested linear/lattice codes for structured multiterminal binning,” IEEE Trans. Inform. Theory, vol. 48, pp. 1250-1276, June 2002.
  • [19] M. V. Eyuboglu and G. D. Forney, Jr., “Lattice and trellis quantization with lattice- and trellis-bounded codebooks—high-rate theory for memoryless sources,” IEEE Trans. Information Theory, vol. 39, pp. 46-59, January 1993.
  • [20] Robert G. Gallager, Low Density Parity Check Codes, MIT Press, 1963, ISBN: 0262571773.
  • [21] D. MacKay, “Good error-correcting codes based on very sparse matrices,” IEEE Trans. Information Theory, vol. 45, pp. 399-431, March 1999.
  • [22] D. MacKay and R. Neal, “Near shannon limit performance of low density parity check codes,” Electron. Lett., vol. 33, pp. 457458, March 1997.
  • [23] D. Rebollo-Monedero, A. Aaron, and B. Girod, “Transforms for high-rate distributed source coding,” in Proc. 37th Asilomar Conf., Pacific Grove, CA, November 2003.
  • [24] D. Slepian and J. K. Wolf, “Noiseless coding of correlated information sources,” IEEE Trans. Inform. Theory, vol. 19, pp. 471-480, July 1973.
  • [25] R. Zamir, “The rate loss in the Wyner-Ziv problem,” IEEE Trans. Inform. Theory, vol. 42, pp. 2073-2084, November 1996.
  • [26] V. Tarokh, A. Vardy, and K. Zeger, “Universal bound on the performance of lattice codes,” IEEE Trans. Inform. Theory, vol. 45, pp. 670-681, March 1999.
  • [27] Lori A. Dalton, “Analysis of 1-D nested lattice quantization and Slepian-Wolf coding for Wyner-Ziv coding of i.i.d. sources,” May 2002, Technical report, Texas A&M University.
  • [28] G. D. Forney Jr., “Coset codes-Part II: Binary lattices and related codes,” IEEE Trans. Inform. Theory, vol. 34, pp. 1152-1187, 1988.
  • [29] A. Liveris, Z. Xiong and C. Georghiades, “Compression of binary sources with side information at the decoder using LDPC codes,” IEEE Communications Letters, vol. 6, pp. 440-442, October 2002.
  • [30] T. M. Cover and J. A. Thomas, Elements of Information Theory, Wiley Interscience, 1991.
  • [31] R. G. Gallager, Information Theory and Reliable Communication, New York: Wiley, 1968.


    Terminology


The following is a glossary of terms used in the present application:


Memory Medium—Any of various types of memory devices, storage devices, or combinations thereof. The term “memory medium” is intended to include: CD-ROM, any of various kinds of magnetic disk (such as floppy disk or hard disk), any of various kinds of magnetic tape, optical storage, and bubble memory; any of various kinds of read only memory (ROM); any of various kinds of random access memory (RAM) such as DRAM, DDR RAM, SRAM, EDO RAM, Rambus RAM, etc.


Carrier Medium—a memory medium as described above, or, a communication medium on which signals are conveyed, e.g., signals such as electrical, electromagnetic, acoustic, optical signals.


Programmable Hardware Element—includes various types of programmable hardware, reconfigurable hardware, programmable logic, or field-programmable devices (FPDs), such as one or more FPGAs (Field Programmable Gate Arrays), or one or more PLDs (Programmable Logic Devices), or other types of programmable hardware. A programmable hardware element may also be referred to as “reconfigurable logic”.


Program—the term “program” is intended to have the full breadth of its ordinary meaning. The term “program” includes 1) a software program which may be stored in a memory and is executable by a processor or 2) a hardware configuration program useable for configuring a programmable hardware element.


Software Program—the term “software program” is intended to have the full breadth of its ordinary meaning, and includes any type of program instructions, code, script and/or data, or combinations thereof, that may be stored in a memory medium and executed by a processor. Exemplary software programs include programs written in text-based programming languages, such as C, C++, Pascal, Fortran, Cobol, Java, assembly language, etc.; graphical programs (programs written in graphical programming languages); assembly language programs; programs that have been compiled to machine language; scripts; and other types of executable software. A software program may comprise two or more components that interoperate.


Hardware Configuration Program—a program, e.g., a netlist or bit file, that can be used to program or configure a programmable hardware element.


Computer System—any of various types of computing or processing systems, including a personal computer system (PC), mainframe computer system, workstation, network appliance, Internet appliance, personal digital assistant (PDA), television system, grid computing system, or other device or combinations of devices. In general, the term “computer system” can be broadly defined to encompass any device (or combination of devices) having at least one processor that executes instructions from a memory medium.



FIG. 1A—Computer System



FIG. 1A illustrates a computer system 82, according to one set of embodiments, operable to execute a set of programs. The programs may be configured to implement any or all of the method embodiments described herein. The computer system 82 may include one or more processors, memory media, and one or more interface devices. The computer system 82 may also include input and output devices. The memory media may include various well known systems and devices configured for the storage of data and computer programs. For example, the memory media may store one or more programs which are executable to perform the methods (or some subset of the methods) described herein. The memory medium may also store operating system software, as well as other software for operation of the computer system. In various embodiments, the computer system 82 may be a personal computer, a notebook computer, a workstation, a server, a router, a computer implemented on a card, etc.



FIG. 1B—Computer Network



FIG. 1B illustrates a communication system including a first computer system 82 and a second computer system 90, according to one set of embodiments. The first computer system 82 couples to the second computer system 90 through a network 84 (or, more generally, any of various known communication mechanisms). The first and second computer systems may each be any of various types, as desired. The network 84 can also be any of various types, including a LAN (local area network), WAN (wide area network), the Internet, or an Intranet.


Each of the computer systems may be configured with programs implementing any or all of the method embodiments described herein. In one embodiment, the first and second computer systems are each configured with software for encoding and decoding data as described variously herein.


It is noted that computer system 82 and computer system 90 may be configured according to any of various system architectures.



FIG. 2—Computer System Block Diagram



FIG. 2 is a block diagram representing one embodiment of computer system 82 and/or computer system 90.


The computer system may include at least one central processing unit CPU 160 which is coupled to a host bus 162. The CPU 160 may be any of various types, including, but not limited to, an x86 processor, a PowerPC processor, a CPU from the SPARC family of RISC processors, as well as others. A memory medium, typically comprising RAM, and referred to as main memory 166, is coupled to the host bus 162 by means of memory controller 164. The main memory 166 may store programs operable to implement encoding and/or decoding according to any (or all) of the various embodiments described herein. The main memory may also store operating system software, as well as other software for operation of the computer system.


The host bus 162 couples to an expansion or input/output bus 170 through a bus controller 168 or bus bridge logic. The expansion bus 170 may be the PCI (Peripheral Component Interconnect) expansion bus, although other bus types can be used. The expansion bus 170 includes slots for various devices such as a video card 180, a hard drive 182, a CD-ROM drive (not shown) and a network interface 122. The network interface 122 (e.g., an Ethernet card) may be used to communicate with other computers through the network 84.


In one embodiment, a device 190 may also be connected to the computer. The device 190 may include an embedded processor and memory. The device 190 may also or instead comprise a programmable hardware element (such as an FPGA). The computer system may be operable to transfer a program to the device 190 for execution of the program on the device 190. The program may be configured to implement any or all of the encoding or decoding method embodiments described herein.


In some embodiments, the computer system 82 may include input devices such as a mouse and keyboard and output devices such a display and speakers.



FIGS. 3A, 3B & 3C—Exemplary Systems


Various embodiments of the present invention may be directed to sensor systems, wireless or wired transmission systems, or, any other type of information processing or distribution system utilizing the coding principles described herein.


For example, as FIG. 3A shows, a sensor system may include a first sensor (or set of sensors) and a second sensor (or set of sensors). The first sensor may provide signals to a transmitter 306. The sensors may be configured to sense any desired physical quantity or set of physical quantities such as time, temperature, energy, velocity, flow rate, displacement, length, mass, voltage, electrical current, charge, pressure, etc. The transmitter 306 may receive the signals, digitize the signals, encode the signals according the inventive principles described herein, and transmit the resulting compressed data to a receiver 308 using any of various known communication mechanism (e.g., a computer network). The receiver 308 receives the compressed data from the transmitter as well as side information from a second sensor. The receiver 308 decodes the compressed data, according to the inventive principles described herein, using the side information, and thereby, generates decompressed output data. The decompressed output data may be used as desired, e.g., displayed to a user, forwarded for analysis and/or storage, etc.


As another example, a first video source may generate video signals as shown in FIG. 3B. A transmitter 316 receives the video signal, encodes the video signals according the inventive principles described herein, and transmits the resulting compressed data to a receiver 318 using any of various known communication mechanism (e.g., a computer network). The receiver 318 receives the compressed data from the transmitter as well as side information from a second sensor. The receiver 318 decoders the compressed data, according to the inventive principles described herein, using the side information.


As yet another embodiment, a encoder 326 may receive signals from a first source and encode the source signals according to the inventive principles described herein, and store the resulting compressed data onto a memory medium 327. At some later time, an encoder 328 may read the compressed data from the memory medium 327 and decode the compressed data according to the inventive principles described herein.


It is noted that embodiments of the present invention can be used for a plethora of applications and is not limited to the above applications. In other words, applications discussed in the present description are exemplary only, and the present invention may be used in any of various types of systems. Thus, the system and method of the present invention is operable to be used in any of various types of applications, including audio applications, video applications, multimedia applications, any application where physical measurements are gathered, etc.



FIG. 4 illustrates one embodiment of a method for decoding data. In step 405, input data is received from an information source.


In step 410, nested quantization as described herein is applied to the input data in order to generate intermediate data.


In step 420, the intermediate data is encoded using an asymmetric Slepian-Wolf encoder as described herein, in order to generate compressed output data representing the input data.


The nested quantization and asymmetric Slepian-Wolf encoder may be configured so that the combination of steps 410 and 420 realizes the encoder portion of a Wyner-Ziv code.


In step 425, the compressed output data may be stored and/or transferred. In one embodiment, the compressed output data may be stored onto a memory medium for decompression at some time in the future. In another embodiment, the compressed output data may be transferred, e.g., to a decoder device.


The information source may be a continuous source or a discrete source. A discrete source generates values in a finite set. A continuous source generates values in a continuum. The values of the input data may be interpreted as vectors in an n-dimensional space, where n is greater than or equal to one.


The process of applying nested quantization to the input data may include: quantizing values of the input data with respect to a fine lattice to determine corresponding points of the fine lattice; and computing indices identifying cosets of a coarse lattice in the fine lattice corresponding to the fine lattice points. The intermediate data include said indices. The coarse lattice is a sublattice of the fine lattice.


In any given dimension, some choices for the fine lattice and coarse lattice may lead to better performance than others. However, the principles of the present invention may be practiced with non-optimal choices for the fine lattice and coarse lattice as well as with optimal choices.


In various embodiments, the information source may be a source of audio information, a source of video information, a source of image information, a source of text information, a source of information derived from physical measurements (e.g., by a set of one or more physical sensors), or, any combination thereof.


As discussed in reference [29], one way to do asymmetric Slepian-Wolf encoding is by means of syndrome forming, which involves a modification of classical channel encoding. This type of Slepian-Wolf encoding is used to generate the simulation results described in this paper. However, the general method of Slepian-Wolf coded nested quantization disclosed in this paper can also be performed with other forms of Slepian-Wolf encoders.


In some embodiments, the asymmetric Slepian-Wolf encoder may be a low density parity check syndrome former or a turbo syndrome former.


In one embodiment, the asymmetric Slepian-Wolf encoder may be configured as a multi-layered encoder as described herein.


An encoder system may be configured to implement any embodiment of the method illustrated and described above in connection with FIG. 4. The encoder system may include one or more processors or programmable hardware elements, and/or, dedicated circuitry such as application specific integrated circuits. In one embodiment, the encoder system includes a processor (e.g., a microprocessor) and memory. The memory is configured to store program instructions and data. The processor is configured to read and execute the program instructions from the memory to implement any embodiment of the method illustrated and described above in connection with FIG. 4.


Furthermore, a computer-readable memory medium may be configured to store program instructions which are executable by one or more processors to implement any embodiment of the method illustrated and described above in connection with FIG. 4.



FIG. 5 illustrates one embodiment of a method for decoding data. In step 510, compressed input data is received. The compressed input data is a compression representation of a block of samples of a first source X. In step 512, a block of samples of a second source Y is received. Steps 510 and 512 need not be performed in any particular order. In one embodiment, steps 510 and 512 may be performed in parallel, or, at least in a time overlapping fashion. The first source X and the second source Y may be statistically correlated.


In step 514, an asymmetric Slepian-Wolf decoder as described herein is applied to the compressed input data using the block of samples of the second source Y. This application of the asymmetric Slepian-Wolf decoder generates a block of intermediate values.


In step 516, joint decoding is performed on each intermediate value and a corresponding sample of the block of second source samples to obtain a corresponding decompressed output value. The joint decoding may include determining an estimate of a centroid of a function restricted to a region of space corresponding to the intermediate value. The function may be the conditional probability density function of the first source X given said corresponding sample of the second source block. The centroid estimate may be (or may determine) the decompressed output value. The resulting block of decompressed output values may be used in any of various ways as desired. For example, the block of decompressed output values may be displayed to a user, forwarded for analysis and/or storage, transmitted through a network to one or more other destinations, etc.


The steps 514 and 516 may be configured so as to realize the decoder portion of a Wyner-Ziv code.


The region of space is a union of cells (e.g., Voronoi cells) corresponding to a coset of a coarse lattice in a fine lattice, wherein the coset is identified by the intermediate value.


The centroid estimate may be determined by reading the centroid estimate from a table stored in a memory medium using said corresponding sample of the second source block and the intermediate value as addresses. The table may be computed in at a central code design facility, and, then deployed to a decoder system through any of various known means for data distribution. The table may be stored in a memory medium of the decoder system. The decoder system may accessing the table to determine the centroid estimate in real time.


In one alternative embodiment, the centroid estimate may be determined by performing a Monte Carlo iterative simulation at decode time.


The intermediate values generated in step 514 may specify cosets of a coarse lattice in a fine lattice. The coarse lattice may be a sublattice of the fine lattice.


The asymmetric Slepian-Wolf decoder may be a multi-layered decoder. Furthermore, the asymmetric Slepian-Wolf decoder may be a low density parity check decoder or a turbo decoder.


A decoder system may be configured to implement any embodiment of the method illustrated and described above in connection with FIG. 5. The decoder system may include one or more processors or programmable hardware elements, and/or, dedicated circuitry such as application specific integrated circuits. In one embodiment, the decoder system includes a processor (e.g., a microprocessor) and memory. The memory is configured to store program instructions and data. The processor is configured to read and execute the program instructions from the memory to implement any embodiment of the method illustrated and described above in connection with FIG. 5.


Furthermore, a computer-readable memory medium may be configured to store program instructions which are executable by one or more processors to implement any embodiment of the method illustrated and described above in connection with FIG. 5.



FIG. 6 illustrates one embodiment of a method for computing a table representing a nested quantization decoder by Monte Carlo simulation. The method may be implemented by executing program instructions on a computer system (or a set of interconnected computer systems). The program instructions may be stored on any of various known computer-readable memory media.


In step 610, the computer system may compute a realization z of a first random vector (the auxiliary vector), e.g., using one or more random number generators. In step 615, the computer system may compute a realization y of a second random vector (the side information), e.g., using one or more random number generators. Steps 610 and 615 need not be performed in any particular order.


In step 620, the computer system may add the realization y and the realization z to determine a realization x of a source vector.


In step 625, the computer system may quantize the realization x to a point p in a fine lattice as described herein.


In step 630, the computer system may compute an index J identifying a coset of a coarse lattice in the fine lattice based on the fine lattice point p. The coarse lattice is a sublattice of the fine lattice.


The computer system may maintain a set of cumulative sums, i.e., one cumulative sum for each possible pair in the Cartesian product (CPR) of the set of possible indices and the set of possible realizations of the second random vector (the side information). The cumulative sums may be initialized to zero. Furthermore, the computer system may maintain a set of count values, i.e., one count value for each possible pair in the Cartesian product CPR.


In step 635, the computer system may add the realization x to a cumulative sum corresponding to the index J and the realization y. In step 640, the computer system may increment a count value corresponding to the index J and the realization y. Steps 635 and 640 need not be performed in any particular order.


The computer system may repeat steps 610 through 640 a number of times as indicated in step 645. In one embodiment, the number of repetitions may be determined by input provided by a user.


In step 650, the computer system may divide the cumulative sums by their corresponding count values to obtain resultant values. The resultant values may be interpreted as being the centroid estimates described above in connection with FIG. 5.


In step 655, the computer system may store the resultant values as a table in a memory associated with the computer system, e.g., onto hard disk.


The table may be distributed (e.g., with decoding software configured according to any of the various method embodiments described herein) to decoder systems by any of various means. In one embodiment, the table may be downloaded to decoder systems over a network such as the Internet. In another embodiment, the table may be stored on a computer-readable memory media (such as CD-ROM, magnetic disk, magnetic tape, compact flash cards, etc.) and the memory media may be provided (e.g., sold) to users of decoder systems for loading onto their respective computer systems.


In one embodiment, system for computing a table representing a nested quantization decoder may be configured with a processor and memory. The memory is configured to store program instructions and data. The processor is configured to read and execute the program instructions from the memory to implement any embodiment of the method illustrated and described above in connection with FIG. 6.


The Wyner-Ziv coding problem deals with source coding with side information under a fidelity criterion. The rate-distortion function for this setup, R*(D), is given by [1]:
R*(D)=minp(ux)minf:AU×AYAX^[I(U;X)-I(U;Y)],(1)

where the source X (with an alphabet AX), the side information Y (with an alphabet AY) and the auxiliary random variable U (with an alphabet AU) form a Markov chain as Y←→Xƒ→U, with the distortion constraint E[d(X, f(U,Y),Y)]≦D. The function I(·) denotes the Shannon mutual information as defined in [3]. The function p(u|x) is the conditional probability of U given X. The function f represents the mapping from the possible auxiliary variable and side information to a reconstructed value of X.


Although the theoretical limits for the rate-distortion function have been known for some time [1], [2], practical approaches to binary Wyner-Ziv coding and continuous Wyner-Ziv coding have not appeared until recently [3], [4], [5], [6], [7], [8], [9], [10], [11], [12]. A common context of interest for continuous Wyner-Ziv coding is code design for the quadratic Gaussian case, where the correlation between the source X and the side information Y is modeled as an additive white Gaussian noise (AWGN) channel as X=Y+Z, Z˜N(0, σZ2), with a mean-squared error (MSE) measure. For this case, one can first consider lattice codes [13] or trellis-based codes [14], [15] that have been used for both source and channel coding in the past, and focus on finding good nesting codes among them. Following Zamir et al's nested lattice coding scheme [16], Servetto [3] proposed explicit nested lattice constructions based on similar sublattices [17] with the assumption of high correlation. Research on trellis-based nested codes as a way of realizing high-dimensional nested lattice codes has just started recently [7]. For example, in DISCUS [7], two source codes (scalar quantization and trellis coded quantization—TCQ) and two channel codes (scalar coset code and trellis-based coset code [14]) are used in source-channel coding for the Wyner-Ziv problem, resulting in four combinations. One of them (scalar quantization with scalar coset code) is nested scalar quantization and another one (TCQ with trellis-based coset code, also suggested in [4]) can effectively be considered as nested TCQ.


Zamir et al. [18], [16] first outlined some theoretical constructions using a pair of nested linear/lattice codes for binary/Gaussian sources, where the fine code in the nested pair plays the role of source coding while the coarse code does channel coding. They also proved that, for the quadratic Gaussian case, the Wyner-Ziv rate-distortion (R-D) function is asymptotically achievable using nested lattice codes, with the assumption that the lattice is ideally sphere-packed as the lattice dimension goes to infinity.


The performance of a nested lattice quantizer can approach the Wyner-Ziv limit at high rate when high-dimensional lattices are used, because both the granular gain and boundary gain reach their ultimate values [19] when the dimension n→∞. Nevertheless, lattice coding and code design with high dimensionality are difficult in practice.


For a nested lattice quantizer using low-to-moderate dimensional lattices, a pragmatic approach to boost the overall performance is to increase the boundary gain with a second stage of binning, without increasing the dimensionality of the lattices. Suppose a second stage of binning is applied without introducing extra overload probability Pol, and the binning scheme partitions the support region of fine lattice (actually the Voronoi region of the coarse lattice for nested lattice quantizer) into m cosets. Thus the volume of the support region decreases by a factor of N/m while the overload probability stays fixed, where N is the nesting ratio. From the definition of boundary gain [19], the boundary gain increases without changing the dimension of the lattices. Since various possible boundary gains are realizable using the second-stage of binning as discussed above, there is only maximally 1.53 dB (decibels) of granular gain left unexploited by the quantizer. Thus the second stage of binning allows us to show the theoretical performance limits at high rates with low-to-moderate dimensional source codes.


In this paper, we introduce a new framework for the continuous Wyner-Ziv coding of independent and identically distributed (i.i.d.) sources based a combination of Slepian-Wolf coding (SWC) and nested quantization (NQ). In this framework, which we refer to as SWC-NQ, the role of Slepian-Wolf coding, as a second-stage of binning which increases the boundary gain of source coding, is to exploit the correlation between the quantized source and the side information for further compression and by making the overall channel code stronger. SWC-NQ connects network information theory with the rich areas of (a) lattice source code designs (e.g., [13]) and (b) channel code designs (e.g., LDPC codes [20], [21] and [22]), making it feasible to devise codes that can approach the Wyner-Ziv rate-distortion function. LDPC is an acronym for “low density parity check”.


For the quadratic continuous case, we establish the high-rate performance of SWC-NQ with low-to-moderate dimensional nested quantization and ideal SWC. We show that SWC-NQ achieves the same performance of classic entropy-constrained lattice quantization as if the side information were also available at the encoder. For example, 1-D/2-D SWC-NQ performs 1.53/1.36 dB away from the Wyner-Ziv R-D function of the quadratic continuous source at high rate assuming ideal SWC.


A recent work, [23], starts with non-uniform quantization with index reuse and Slepian-Wolf coding and shows the same high-rate theoretical performance as ours when the quantizer becomes an almost uniform one without index reuse. This agrees with our assertion that at high rates, the nested quantizer asymptotically becomes a non-nested regular one so that strong channel coding is guaranteed.


We also implement 1-D and 2-D nested lattice quantizers in the rate range of 1-7 bits per sample. Although our analysis shows that nesting does not help at high rate, experiments using nested lattice quantizers together with irregular LDPC codes for SWC obtain performances close to the corresponding limits at low rates. Our work thus shows that SWC-NQ provides an efficient scheme for practical Wyner-Ziv coding with low-dimensional lattice quantizers at low rates.


Although the theoretical analyses are taken under the assumption of high rate, the rate-distortion performance at low rate is still consistent with the one at high rate, i.e., SWC-NQ achieves the same performance of classic entropy coded quantization (ECQ) as if the side information were also available at the encoder even at low rate, when a non-linear estimator is applied at the decoder. This non-linear estimator, as we present in this paper, is the optimal one in the sense of the MSE measurement. At high rates, the non-linear estimator reduces to the linear one analyzed in this paper.


We note that the non-linear estimation in the decoder can yield significant gains for low rates and for high rates it cannot help noticeably. This is confirmed by the agreement of the high rate analysis results in this paper, which assume that the linear estimation is used, with the high rate simulation results, for which the non-linear estimation method is always used.


The following is a list of some of the contents of this paper:

    • 1. A theoretical analysis and simulation for low-to-moderate dimensional nested lattice quantization at high rates. The rate-distortion function for general continuous sources with arbitrary probability density function (PDF) and MSE measurement, and a theoretical lower bound of rate-distortion function for the quadratic Gaussian case, are presented.
    • 2. An analysis of the granular and boundary gains of the source coding component of nested lattice quantization. This analysis explains the phenomenon of an increasing gap of the rate-distortion function of nested lattice quantization at low-to-moderate dimension, with respect to the Wyner-Ziv limit, as we observe in the simulation.
    • 3. A new Wyner-Ziv coding framework using nested lattice quantization and Slepian-Wolf coding, which we refer to as SWC-NQ, is introduced. The SWC-NQ rate-distortion function for general continuous sources with arbitrary PDF and MSE measurement is presented, and is in agreement with the performance of entropy-constrained lattice quantization as if the side information were available at the encoder.
    • 4. A non-linear estimator for the decoder corresponding to the nested quantizer is presented, and is proved to be optimal in sense of MSE measurement. This estimator helps to improve the performance of SWC-NQ at low rates, and is consistent with the analytical performance at high rates.
    • 5. Examples of practical code design using a 1-D (scalar) lattice and 2-D (hexagonal) lattice, and multi-layer irregular LDPC codes, are given in this paper.


      Some Background on Wyner-Ziv Coding


In this section, we briefly review the basic concepts and milestone theorems of Wyner-Ziv coding. Wyner and Ziv [1], [2] present the limit of rate-distortion performance for lossy coding with side information, for both Gaussian and general sources.


The problem of rate distortion with side information at the decoder asks the question of how many bits are needed to encode X under the constraint that E[d (X,{circumflex over (X)})]≦D, assuming the side information Y is available at the decoder but not at the encoder. This problem generalizes the setup of [24] in that coding of X is lossy with respect to a fidelity criterion rather than lossless. For both discrete and continuous alphabets of AX and general distortion metrics d(·), Wyner and Ziv [1] gave the rate-distortion function R*WZ(D) for this problem as R*WZ(D)=inf I(X;Z|Y), where the infimum is taken over all random variables Z such that Y→X→Z is a Markov chain and there exists a function {circumflex over (X)}═X(Z,Y) satisfying E[d(X,{circumflex over (X)})]≦D. According to [1],
RWZ*(D)RXY(D)=inf{X^AX:E[d(X,X^)]D}I(X;X^Y).

This means that usually there is a rate loss in the Wyner-Ziv problem. Zamir quantified this loss in [25]. In particular, Zamir showed a rate loss of less than 0.22 bit for a binary source with Hamming distance, and a rate loss of less than 0.5 bit/sample for continuous sources with MSE distortion.


Note that when D=0, the Wyner-Ziv problem degenerates to the Slepian-Wolf problem with R*WZ(0)=RX/Y(0)=H(X|Y). Another special case of the Wyner-Ziv problem is the quadratic Gaussian case when X and Y are zero mean and stationary Gaussian memoryless sources and the distortion metric is MSE. Let Xi denote the ith component of X, and Yi denotes the ith component of Y, i=1, 2, . . . , n. Let the covariance matrix of (Xi,Yi) be
cov(Xi,Yi)=[σX2ρσXσYρσXσYσY2]

with |ρ|<1 for all n, then
RWZ*(D)=RXY(D)=12log+[σX2(1-ρ2)D],

where log+ x=max{0, log x}. This case is of special interest in practice because many image and video sources can be modeled as jointly Gaussian (after mean subtraction) and Wyner-Ziv coding suffers no rate loss.


Lattices and Nested Lattices


In this section, we review the idea of lattice and nested lattices and introduce notation that will be used in our discussion.


For a set of n basis vectors {g1, . . . ,gn} in Rn, an unbounded n-dimensional (n-D) lattice Λ is defined by

Λ={l=Gi:iεZn}  (2)

and its generator matrix

G=[g1|g2| . . . |gn].

R donates the set of real numbers. Rn denotes n-dimensional Euclidean space. Z denotes the set of integers. Zn denotes the Cartesian product of n copies of Z, i.e., the set of n-vectors whose components are integers.


The nearest neighbor quantizer QΛ(x) associated with Λ is given by
QΛ(x)=argminlΛx-l.(3)

The notation “arg min” denotes the value of the argument (in this case l) where the minimum is achieved. Expression (3) is augmented with a set of “tie breaking” rules to decide the result in cases where two or more points of the lattice Λ achieve the minimum distance to vector x. Any of various sets of tie breaking rules may be used. For example, in dimension one (i.e., n=1) with lattice Λ being the integers, points of the form k+(½) with be equidistant to k and k+1. One possible tie-breaking rule would be to map such points up to k+1. In one set of embodiments, the nearest neighbor quantizer defined by (3) and a set of tie breaking rules has the property:

QΛ(x+l)=QΛ(x)+l, ∀lεΛ.


The basic Voronoi cell of Λ, which specifies the shape of the nearest-neighbor decoding region, is

K={x:QΛ(x)=0}.  (4)

Associated with the Voronoi cell K are several important quantities: the cell volume V, the second moment σ2 and the normalized second moment G(Λ), defined by
V=Kx,(5)σ2=1nVKx2x,(6)G(Λ)=σ2V2/n,(7)

respectively. The minimum of G(Λ) over all lattices in Rn is denoted as Gn. By [13],

Gn≧1/(2πe), ∀n  (8) Gn1/(2π),n(8)limnGn=1/(2πⅇ)(9)

The notation “∀” is to be read as “for all”. The constant e is Euler's constant.


A pair of n-D lattices (Λ12) with corresponding generator matrices G1 and G2 is nested, if there exists an n×n integer matrix P such that

G2=G1×P and
det{P}>1,

where det{P} denotes the determinant of the matrix P. In this case V2/V1 is called the nesting ratio, and Λ1 and Λ2 are called the fine lattice and coarse lattice, respectively.


For a pair (Λ12) of nested lattices, the points in the set Λ12≡{Λ1∩K2} are called the coset leaders of Λ2 relative to Λ1, where K2 is the basic Voronoi cell of Λ2. The notation “A≡B” means that A is being defined by expression B, or vice versa. For each vεΛ12 the set of shifted lattice points

C(v)≡{v+l, ∀lεΛ2}

is called a coset of Λ2 relative to AΛ1. The jth point of C(v) is denoted as cj(v). Then

C(0)={cj(0),∀jεZ}=Λ2  (10)

and
C(0)={cj(0),jZ}=Λ2,and(10)vΛ1/Λ2C(v)=Λ1.(11)

Since

cj(v)εΛ1,∀jεZ,  (12)

we further define

Rj(v)={x:QΛ1(x)=cj(v)}

as the Voronoi region associated with cj(v) in Λ1, and R(v)=∪j=−∞Rj(v). Then
vΛ1/Λ2R0(v)=K2,and(13)j=-vΛ1/Λ2Rj(v)=vΛ1/Λ2R(v)=n.(14)



FIG. 7 illustrates examples of v, C(v) and R(v). The fine lattice points are at the centers of the small hexagons. The coarse lattice points are at the centers of the large hexagons. R(v) is the union of the shaded hexagons. The coset C(v) is the set composed of the centers of the shaded hexagons. The fine lattice and coarse lattice may be generated by
G1=[2103]andG2=[51333],

respectively, and related by
P=[2-113].

Nested Lattice Quantization


Throughout this paper, we use the correlation model of X=Y+Z, where X, Y and Z are random vectors in Rn. X is the source to be coded, Y is the side information, and Z is the noise. Y and Z are independent. In this section we discuss the performance of nested lattice quantization for general sources where Y and Z are arbitrarily distributed with zero means, as well as for the quadratic Gaussian case where Yi˜N(0,σY2) and Zi˜N(0,σZ2), i=1, 2, . . . , n, are Gaussian. For both cases, the mean squared error (MSE) is used as the distortion measurement.


Zamir et al.'s nested lattice quantization scheme [18], [16] works as follows: Let the pseudo-random vector U (also referred to herein as the “dither”), known to both the quantizer encoder and the decoder, be uniformly distributed over the basic Voronoi cell K1 of the fine lattice Λ1. For a given target average distortion D, denote α√{square root over (1−=D/σZ2)} as the estimation coefficient. Given the realizations of the source, the side information and the dither as x, y and u, respectively, then according to [18], the nested quantizer encoder quantizes αx+u to the nearest point xQΛ1=QΛ1(αx+u) in Λ1, computes xQΛ1−QΛ2 (xQΛ1) which is the coset shift of xQΛ1 with respect to Λ2, and transmits the index corresponding to this coset shift.


The nested quantizer decoder receives the index, generates xQΛ1−QΛ2(xQΛ1) from the index, forms

w=xQΛ1−QΛ2(xQΛ1)−u−αy

and reconstructs x as {circumflex over (x)}=y+α(w−QΛ2(w)) using linear combination and dithering in estimation.


It is shown in [18] that the Wyner-Ziv R-D function DWZ(R)=σX/Y22−2R is achievable with infinite dimensional nested lattice quantization for quadratic Gaussian case. In this paper, we analyze the high-rate performance of low-dimensional nested lattice quantization, which is of more practical interest as high-dimensional nested lattice quantization is too complex to implement, for both general and Gaussian sources.


Our analysis is based on the high-resolution assumption, which means 1) V1 is small enough so that the PDF of X,f(x), is approximately constant over each Voronoi cell of Λ1 and 2) dithering can be ignored. With the high-rate assumption, α≈1 and the encoder/decoder described above simplifies as follows:

    • The encoder quantizes x to xQΛ1=QΛ1(x), computes v=xQΛ1−QΛ2(xQΛ1), and transmits an index corresponding to the coset leader v.
    • Upon receiving v, the decoder forms w=v−y and reconstructs x as {circumflex over (x)}v=y+w−QΛ2(w)=v+QΛ2(y−v).


      This simplified nested lattice quantization scheme for high rate is shown in FIG. 8 and was also used in [3].


      A. High Rate Performance for General Sources with Arbitrary Distribution


      Theorem 4.1: If a pair of n-D nested lattices (Λ12) with nesting ratio N=V2/V1 is used for nested lattice quantization, the distortion per dimension in Wyner-Ziv coding of X with side information Y at high rate is
      Dn=G(Λ1)V12/n+1nEZ[QΛ2(Z)2].(15)

      The notation ∥a∥ denotes the length (or norm) of the vector a.


      Proof: Since
      n=j=-vΛ1/Λ2Rj(v),(16)

      the average distortion for a given realization of the side information Y=y is
      D(y)=Rnf(xy)x-x^v2x=vΛ1/Λ2j=-xRj(v)f(xy)x-x^v2x=vΛ1/Λ2j=-xRj(v)f(xy)x-cj(v)+cj(v)-x^v2x=vΛ1/Λ2j=-xRj(v)f(xy)[x-cj(v)2+cj(v)-x^v2+2<x-cj(v),cj(v)-x^v>]x(a)vΛ1/Λ2j=-[f(cj(v)y)xRj(v)x-cj(v)2x+xRj(v)f(xy)cj(v)-x^v2x]=(b)vΛ1/Λ2j=-[f(cj(v)y)nG(Λ1)V11+(2/n)+xRj(v)f(xy)QΛ2(cj(v))-QΛ2(y-cj(v)+QΛ2(cj(v))2x](c)nG(Λ1)V12n+j=-vΛ1/Λ2xRj(v)f(xy)QΛ2(x-y)2x=nG(Λ1)V12n+xRnf(xy)QΛ2(x-y)2x,(17)

      where (a) comes from the high rate assumption and
      xRj(v)<x-cj(v),cj(v)-x^v>x=0.(18)

      The latter is due to the fact that x−cj(v) is odd spherical symmetric for xεRj(v) and both cj(v) and {circumflex over (x)}v are fixed for xεRj(v) with given v and y. (b) is due to cj(v)=QΛ1(x) for xεRj(v), and

      {circumflex over (x)}v=cj(v)−QΛ2(cj(v))+QΛ2(y−cj(v)+QΛ2(cj(v)));  (19)

      and (c) is due to

      QΛ2(a+QΛ2(b))=QΛ2(a)+QΛ2(b), ∀a,bεRn  (20)

      and the high resolution assumption.


Therefore, the average distortion per dimension over all realizations of Y is
Dn=1nEY[D(y)]=G(Λ1)V12/n+1nxyf(x,y)QΛ2(x-y)2xy=G(Λ1)V12/n+1nyf(y)zf(z)QΛ2(z)2zy=G(Λ1)V12/n+1nEZ[QΛ2(z)2].(21)

Remarks: There are several interesting facts about this rate-distortion function.


1) For a fixed pair of the nested lattices (Λ12), Dn only depends on Z, i.e., the correlation between X and Y. Dn is independent of the marginal distribution of X (or Y).


2) The first term, G(Λ1)V12/n, in the expression for Dn is due to lattice quantization in source coding. It is determined by the geometric structure and the Voronoi cell volume V1 of lattice Λ1. The second term, 1/nEZ[∥QΛ2(z)∥2], is the loss due to nesting (or the channel coding component of the nested lattice code). The second term depends on Voronoi cell volume V2 and the distribution of Z. From another point of view, the first term is the granular component MSEg with respect to the granular lattice Λ1, and the second term is the overload component MSEol with respect to the lattice Λ2 of the nested quantizer. MSEg=G(Λ1)V12/n is the same as the granular MSE for non-nested lattice quantizer [19].


Corollary 4.1: For the quadratic case, Dn→DWZX/Y22−2R as n→∞.


Proof Since the nested lattice quantizer is a fixed-rate quantizer with the rate
R=1nlog(V2V1),then(21)canberewrittenasDn=G(Λ1)V22/n2-2R+1nEZ[QΛ2(z)2].(22)

For the quadratic Gaussian case, according to equation (3.14) of [18],
1nlog(V2)12log(2πeσZ2)(23)

when n is sufficiently large, where σZ2X/Y2 is the variance of the AWGN Z. Then
G(Λ1)V22/n2-2R12πe2πeσZ22-2R=σZ22-2R=DWZ.(24)

At the same time, according to equation (3.12) of [18], Pe=Pr{Z∉K2}<ε, with any ε>0 and sufficiently large n, hence 1/nEZ[∥QΛ2(z)∥2]→0 as n→∞. Consequently, the performance becomes
Dn=G(Λ1)V22/n2-2R+1nEZ[QΛ2(z)2]->σXY22-2R=DWZ(25)

as n→∞, for the quadratic Gaussian case. This limit agrees with the statement in [18], which claims that the nested lattice quantization can achieve the Wyner-Ziv limit asymptotically as the dimensionality goes to infinity.


B. A Lower Bound of the Performance for Quadratic Case


The source-coding-loss in (21) has an explicit form, while the channel-coding loss is not so directly expressed. Among all the possible patterns of the additive channels, AWGN is of most interest. In such case Z is a Gaussian variable with zero mean and variance σZ2X/Y2. From Theorem 4.1, we obtain a lower bound of the high-rate R-D performance of low-dimensional nested lattice quantizers for Wyner-Ziv coding, when Z is Gaussian, stated as the following corollary.


Corollary 4.2: For X=Y+Z, Y˜N(0,σY2) and Z˜N(0,σZ2), the R-D performance of Wyner-Ziv coding for X with side information Y using n-D nested lattice quantizers is lower-bounded at high rate by
Dn(R)D_n(R)minV2>0δn(R),(26)

where
δn(R)GnV22/n2-2R+1nj=1((2j-1)γnV22/n)u(j2V22/nΓ(n2+1)2/n2πσZ2),(27)


γn is the n-D Hermite's constant [13], [26], and u(t) is defined in [26] as
u(t){-t(1+t1!+t22!++tn2-1(n2-1)!)ifniseven-t(1+t1/2(1/2)!+t3/2(3/2)!++tn2-1(n2-1)!)ifnisodd}.(28)


Specifically, when n=1, the best possible high rate performance is
D1(R)=minV2>0{G1V222-2R+V22j=0(2j+1)Q(V2σZ(j+12))},where(29)Q(t)=12π-r2/2τ.(30)


Proof: 1) Rate Computation: Note that the nested lattice quantizer is a fixed rate quantizer with rate
R=1nlog2(V2V1).


2) Distortion computation: Define
L2=12minl,lΛ2,lll-l,and(31)Pz(L)=Pr(Z>L).(32)


For the 1-D (scalar) case, PZ can be expressed in terms of the Q function and EZ[∥QΛ2(z)∥2] simplifies to [27]
Ez[QΛ2(z)2]=V22j=0(2j+1)Q(V2σz(j+12)).(33)


For the n-D (with n>1) case, note that [28]

L22=γ(Λ2)V2)2/n,  (34)

and

QΛ2(z)∥2≧∥z∥2−∥z−QΛ2(z)∥2≧∥z∥2−L22,  (35)

where γ(Λ2) is the Hermite's constant of lattice Λ2 [13], [26].


Then we get
EZ[QΛ2(z)2]=j=1((j-1)L2<zjL2f(z)QΛ2(z)2zj=1((j-1)L2<zjL2f(z)(z2-L22)zj=1((j-1)L2<zjL2f(z)((j-1)2L22-L22)z=j=1((j-1)2L22-L22)[PZ((j-1)L2)-PZ(jL2)]=j=1((2j-1)L22)PZ(jL2)=j=1[(2j-1)γ(Λ2)V22n]Pe(j2V22nΓ(n2+1)2n2πσZ2),(36)

where
Γ(t)=0ut-1e-uu

is Euler's gamma function, and Pe(·) is defined in [26] as the symbol error probability under maximum likelihood decoding while transmitting the lattice points over an AWGN channel. A lower bound of Pe(·) was also given in [26] as Pe(t)≧u(t)


Then Theorem 4.1 and (36) give
DnδnGnV12/n+1nj=1((2j-1)γnV22n)u(j2V22nΓ(n2+1)2n2πσZ2).(37)

Using
R=1nlog2(V2V1),

we eliminate V1 in Dn and obtain a lower bound on Dn(R) as
Dn(R)D_n(R)minV2>0δn(R),where(38)δn(R)=GnV22/n2-2R+1nj=1((2j-1)γnV22/n)u(j2V22nΓ(n2+1)2n2πσZ2).(39)
FIG. 9 shows δ2(R) with different V2's using nested Λ2 lattices (i.e., hexagonal lattices) in 2-D with σZ2=0.01. The lower bound {overscore (D)}2(R) is the lower convex hull of all operational R-D points with different V2, as shown in FIG. 10. We observe from FIG. 10 that the gap from {overscore (D)}n(R) to DWZ(R) in dBs keeps increasing as the rate increases with σZ2=0.01. This increasing gap comes from the fact that, the granular MSE component
MSEgGnV12/n=112γg(V2N)2/n=112γgV22/n2-2R

is away from the benchmark 2−2R with an increasing gap as V2 increases, where
γg112Gn

is the granular gain [19] of lattice Λ1, and
R=1nlogN,

N is the nesting ratio, as shown in FIG. 11.



FIG. 12 plots {overscore (D)}n(R) for n=1, 2, 4, 8 and 24 with σZ2=0.01. We see that {overscore (D)}n(R) gets closer and closer to the Wyner-Ziv R-D function DWZ(R)=σX/Y22−2R as n goes to infinity.


C. Discussion of the Correlation-Asymptotical Property


As to the asymptotical property of the nested-lattice quantization for Wyner-Ziv coding, we have the following statement. Here asymptotical means that the correlation
ρE[XY]E[X2]E[Y2]

between the source X and the side information Y goes to 1 asymptotically. If we fix σY2 then the asymptotical performance is the one when σZ2→0.


Corollary 4.3: The distortion of the nested lattice quantization maintains a constant gap (in dB) to the Wyner-Ziv bound for all 0<σZ2<1.


Proof. Denote s=V22/n,
t=j2Γ(n2+1)2/n2πσZ2s,andA=Γ(n2+1)2/n.

From Corollary 4.1, we get
δ=Gns2-2R+1nj=1((2j-1)γns)u(t).(40)

Fix rate R and dimensionality n (without loss of generality, assume n is even), and minimize δ with respect to s,
δs=Gn2-2R+1nj=1((2j-1)γn(t))+1nj=1((2j-1)γntu(t)t)where(41)u(t)t=--t(1+t1!++tn2-1(n2-1)!)+-t(1+t1!++tn2-2(n2-2)!)=--ttn2-1(n2-1)!then(42)δs=Gn2-2R+1nj=1((2j-1)γnu(t))-1nj=1((2j-1)γnt-tt(n/2)-1(n2-1)!).(43)

Set
δs=0,

and denote the optimal s as s0, and denote the corresponding t as
t0=j2A2πσZ2s0,

we get
Gn2-2R+1nj=1((2j-1)γnu(t0))=1nj=1((2j-1)γnt0-t0t(n/2)-1(n2-1)!),hence(44)δ*=Gns02-2R+1nj=1((2j-1)γns0u(t0))=1nj=1((2j-1)γns0t0-t0t0(n/2)-1(n2-1)!)=1nj=1((2j-1)γn2πσZ2j2At02-t0t0(n/2)-1(n2-1)!).(45)

From (45) one can see that the optimal t0 only depends on the rate R and the dimensionality n. The optimal t0 stays unchanged with different σZ2, thus the optimized distortion δ* is a linear function of σZ2, denoted as D=δ*=B(R,n)σZ2. Since the Wyner-Ziv bound is DWZZ22−2R, the gap (in terms of dB) from the practical optimized distortion D to Wyner-Ziv bound DWZ with fixed R and n is
ΔD=10log10DDWZ=10log10B(R,n)2-2R(46)

which stays constant for all σZ2<1.


This result verifies our simulation results which show that the distortion of the nested lattice quantizer does NOT approach the Wyner-Ziv bound as the correlation between the source and the side information goes to 1 asymptotically.


Slepian-Wolf Coded Nested Lattice Quantization (SWC-NQ)


In this section, we evaluate the boundary gain of the source coding component of nested lattice quantization. Motivated by this evaluation, we introduce SWC-NQ and analyze its performance.


A. Motivation of SWC-NQ


From Theorem 4.1, the distortion per dimension of the nested lattice quantizer is Dn=MSEg+MSEol, where MSEg=G(Λ1)V12/n is the granular component of the distortion, characterized by the granular gain
γg(Λ1)=1/12G(Λ1),MSEol=1nEz[QΛ2(z)2]

is the overload component of the distortion, characterized by the boundary gain γb2). The boundary gain γb2) is defined in [19] as follows. Suppose that Λ2 is the boundary (coarse) lattice with its Voronoi region as the n-dimensional support region, and it has the same overload probability as a cubic support region of size αM centered at the origin. The boundary gain is then defined as the ratio of the normalized volume (αM)2 of the cubic support region to the normalized volume V22/n, as
γb(Λ2)=(αM)2V22/n.Since(47)MSEg=G(Λ1)V12/n=112γg(Λ1)V22/nN-2/n=112γg(Λ1)αMγb(Λ2)N-2/n(48)

If the nesting ratio N stays constant (i.e., the codebook size N stays constant), then MSEg will be reduced by a factor of γb2), without affecting MSEol because the overload probability stays unchanged.


To increase the boundary gain γb2), a second-stage of binning can be applied to the quantization indices. The essence of binning is a channel code which partitions the support region into several cosets. Assuming the channel code is strong enough so that there is no extra overload probability introduced (i.e., it is lossless coding for the indices without decoding error), and the channel code partitions the support region K2 into m cosets, with the set composed of the coset leaders denoted as S, then #(S)=m and S is the support region for the quantization indices and hence the support region for the nested quantization, with Vol(S)=(m/N)V2<V2. Then the effective volume of the support region decreases by a factor of the coset size after the second stage of binning, and therefore, the boundary gain γb2) increases. The notation “#(A)” denotes the cardinality of (i.e., the number of elements in) the set A.


We thus propose a framework for Wyner-Ziv coding of i.i.d. sources based on SWC-NQ, which involves nested quantization (NQ) and Slepian-Wolf coding (SWC). The SWC operates as the second binning scheme. Despite the fact that there is almost no correlation among the nested quantization indices that identify the coset leaders vεΛ12 of the pair of nested lattices (Λ1, Λ2), there still remains correlation between v and the side information Y. Ideal SWC can be used to compress v to the rate of R=H(v|Y). State-of-the-art channel codes, such as LDPC codes, can be used to approach the Slepian-Wolf limit H(v|Y) [29]. The role of SWC in SWC-NQ is to exploit the correlation between v and Y for further compression.


B. Uniform High Rate Performance


Let's evaluate the high rate performance for the quadratic Gaussian case first. For this case, a lower bound for the high-rate performance of SWC-NQ with a pair of arbitrary nested lattices (Λ12) is given as
Dn(R)G(Λ1)2(2/n)h(X,Λ2)σXY22-2R+1nj=1((2j-1)γnV22/n)u(j2V22/nΓ(n2+1)2/n2πσZ2)where(49)h(X,Λ2)-xRnf_(x)log2[i=-f_(x+ci(0)σXY)]x,(50)

{overscore (f)}(·) is the PDF of an n-D i.i.d. Gaussian source with 0 mean and unit variance, u(t) is defined in (28), and ci(0) is defined above (in the section entitled “Lattices and Nested Lattices”), as the lattice points of Λ2.


Proof The proof to this lower bound is provided later.


For example, the lower bounds of D(R) for the 1-D case with different V2 are plotted in FIG. 13.



FIG. 13 gives us a hint that, intuitively, the best R-D function of SWC-NQ is the R-D function as if the side information were also available at the encoder, and maintains a constant gap of 2πeGn from the Wyner-Ziv limit in dB. Here the best means that, for a given distortion D, the minimal achievable rate R over all possible V2, or equivalently, the minimal achievable distortion D over all possible V2 for a given rate R. This claim is stated and proved as follows. Let's start with the following lemma and then prove the main theorem.


Lemma 5.1: For nested lattice quantization, denote W≡QΛ1(X), and V≡W−QΛ2 (W). At high rate, H(V/Y)≈H(W|Y).


Proof: The proof is provided later.


Theorem 5.2: The optimal R-D performance of SWC-NQ for general sources using low-dimensional nested lattices for Wyner-Ziv coding at high rate is
Dn*(R)minV2D(R)=Gn2(2/n)h(XY)2-2R.(51)

Proof 1) When V2→√, QΛ2(QΛ1(X))=0 and QΛ2(Z)=0, then
nR=H(VY)=H(QΛ1(X)-QΛ2(QΛ1(X))Y)=H(QΛ1(X)Y)=h(XY)-log(V1)(52)

and Dn(R)=GnV12/n. Combine R and Dn through V1 and we get the R-D function as

Dn(R)V2→∞=Gn2(2/n)h(X/Y)2−2R  (53)

Since
Dn*(R)minV2D(R)Dn(R)V2->,thenDn*(R)Gn2(2/n)h(X|Y)2-2R.(54)

then

Dn*(R)≦Gn2(2/n)h(X/Y)2−2R).  (54)

2) Denote w≡QΛ1(x), and S1≡{(x,{circumflex over (x)}):E[d(x,{circumflex over (x)})]≦D}. The rate of Wyner-Ziv coding with respect to a given distortion D is [1]
nR*(D)=minp(v),p(x^v,y),(x,x^)S1I(X;VY)=(a)minp(v),p(x^v,y),(x,x^)S1H(VY)(b)minp(v),p(x^v,y),(x,x^)S1H(WY),(55)

where (a) comes from H(V|X,Y)=0 and (b) comes from Lemma 5.1.


Define S2≡{(x,{circumflex over (x)}):E[d(x,w)]≦D}. From Theorem 4.1,
E[d(x,x^)]=GnV12/n+1nEZ[QΛ2(z)2]=E[d(x,w)]+1nEZ[QΛ2(z)2]E[d(x,w)].(56)


Then ∀(x,{circumflex over (x)})εS1,

D≧E[d(x,{circumflex over (x)})]≧E[d(x,w)],  (57)

it means that (x,{circumflex over (x)})εS2.


Then S1S2, and
nR*(D)minp(v),p(x^v,y),(x,x^)S1H(WY)minp(v),p(x^v,y),(x,x^)S2H(WY).(58)

Since H(W|Y)=h(X|Y)−log(V1) and E[d(x,{circumflex over (x)})|Y]=GnV12/n, R*(D) can be calculated using Lagrangian method, as
nR*(D)minp(v),p(x^v,y),(x,x^)S2H(WY)=h(XY)-n2log(DGn).(59)

Then

D*(R)≧Gn2(2/n)h(X/Y)2−2R  (60)

From (54) and (60), it is proved that, at high rate, the best R-D function of SWC-NQ using low-dimensional lattices is

D*(R)=Gn2(2/n)h(X/Y)2−2R.  (61)

Corollary 5.4: The optimal R-D performance of quadratic Gaussian SWC-NQ using low-dimensional nested lattices at high rate is

D*n(R)=2πeGnσX/Y2−2R.  (62)

We thus conclude that at high rates, SWC-NQ performs the same as the traditional entropy-constrained lattice quantization with the side information available at both the encoder and decoder. Specifically, the R-D functions with 1-D (scalar) lattice and 2-D (hexagonal) lattice are 1.53 dB and 1.36 dB away from the Wyner-Ziv bound, respectively.


Remarks: We found that for finite rate R and small n (e.g., n=1 and 2), the optimal V2, denoted as V*2, that minimizes the distortion Dn(R) is also finite. FIGS. 14(a) and (b) plot the optimal V*2 (scaled by σZ) as a function of R for the 1-D (n=1) and 2-D (n=2) case. We see that as R goes to infinity, V*2 also goes to infinity. We also observe that for fixed R and n, Dn(R) stays roughly constant for V2>V*2.


Code Design and Simulation Results


In this section, the optimal decoder for nested quantizer at low rate is introduced, and the issue of code design is also discussed, along with simulation results.


A. The Optimal Decoder for Nested Quantizer at Low Rate


The optimal estimator for the decoder corresponding to the nested quantizer should minimize the distortion between X and the reconstructed {circumflex over (X)}. If mean squared error (MSE) is used as the distortion measure, {circumflex over (x)} will be E[X|X, j,y], where j is the received bin index corresponding to the coset leader v=xQΛ1−QΛ2 (xQΛ1) Let Y and Z be independent zero mean Gaussian random variables with variances σY2 and σZ2, then we have X|y˜N(y,σZ2).


When n=1, the optimal decoder for nested quantizer can be derived directly from E[X|j,y] as
x^(j,y)=12πσZ2n=-jq+nQ(j+1)q+nQxexp(-x-y22σZ2)x,(63)

where q and Q are the uniform intervals of the two nested lattices used by the scalar quantizer, with Q/q=N, where N is the nesting ratio. At high rates, the rate distortion performance using this non-linear estimation matches our analysis in (29); at low rate, such estimation method helps to boost the performance.


When n>1, the optimal decoder for the nested quantizer is stated as follows.


Theorem 6.3: The optimal decoder for the nested quantizer in the sense of MSE is
x^=E[xy,j]=R(v)xf(xy)xR(v)f(xy)x.Proof:(64)x^=E[xy,j]=Rnxf(xy,j)x=Rnxf(x,jy)P(jy)x=Rnxf(x,jy)xP(jy)=Rnxf(xy)P(jx,y)xR(v)f(xy)x.(65)

Note thatj, x, y form a Markov chain as y←→x←→j, then and we get
P(jx,y)=P(jx)={01ifxR(v)ifxR(v)},(66)x^=Rnxf(x|y)P(j|x)xR(v)f(x|y)x=R(v)xf(x|y)xR(v)f(x|y)x.(67)


Estimation at the decoder plays an important role for low-rate implementation. We thus apply an optimal non-linear estimator at the decoder at low rates in our simulations.


Corollary 6.5: The optimal estimator stated in Theorem 6.3 degenerates to the linear one {circumflex over (x)}=v+QΛ2(y−v) at high rates as we discussed above in the section entitled “Nested Lattice Quantization” and in the section entitled “Slepian-Wolf Coded Nested Lattice Quantization”.


Proof: At high rate,
x^=R(v)xf(x|y)xR(v)f(x|y)x=i=-Ri(v)xf(x|y)xi=-Ri(v)f(x|y)x(a)i=-ci(v)Ri(v)f(x|y)xi=-Ri(v)f(x|y)x(b)v+QΛ2(y-v)K(v+QΛ2(y-v))f(x|y)xK(v+QΛ2(y-v))f(x|y)x=v+QΛ2(y-v),(68)

which is the linear estimator, discussed above in the section entitled “Nested Lattice Quantization” and in the section entitled “Slepian-Wolf Coded Nested Lattice Quantization”, for high rate performance. Steps (a) and (b) of (68) come from the high rate assumption.


Since the non-linear estimation is a definite integral of a simple function over a disconnected region which includes many isolated Voronoi cells, we choose the Monte Carlo method to do this integral. In one simulation, for each scaling factor, there are totally 104×104=108 pairs of {x,y} to be simulated, and for each pair of {x,y}, there are 104 samples to calculate this definite integral.



FIG. 15 shows the improvement gained by using the optimal (non-linear) estimator at low rates, for n=2 and σZ2=0.01.


B. Code Design of LDPC Codes


Let J (0≦J≦N−1) denote the index of the coset leader v. The index J is coded using Slepian-Wolf codes with Y as the side information. Instead of coding J as a whole, we code J bit-by-bit using multi-layer Slepian-Wolf coding as follows.


Assume J=(BmBm-1 . . . B2B1)2, where Bm is the most significant bit (MSB) of J, and B1 is the least significant bit (LSB). A block of the indices may be collected. The first B1 (i.e., a block of the first bits from the block of indices) is encoded at rate R1=H(B1/Y) using a Slepian-Wolf code designed under the assumption that the corresponding decoder has only Y as side information; then the second bit B2 (i.e., a block of the second bits from the block of indices) is encoded at rate R2=H(B2|Y,B1) using a Slepian-Wolf code designed under the assumption that the corresponding decoder has only Y and B1 as side information; . . . ; finally, the last bit Bm (i.e., a block of the last bits from the block of indices) is encoded at rate Rm=H(Bm|Y, B1, B2, . . . , Bm-1) with a Slepian-Wolf code designed under the assumption that the corresponding decoder has side information {Y, B1, B2′, . . . , Bm-1}. Hence the total rate of the Slepian-Wolf code is H(J|Y)=H(v|Y).


Practically, strong channel codes such as LDPC or Turbo codes are applied as Slepian-Wolf codes. The first step in designing is to determine the rate of the channel code to be used. Since Rn is equivalent to the amount of syndromes to be sent per bit, the channel code rate is 1−Rn. Thus the optimum rate at the nth layer that achieves Slepian-Wolf bound is 1−H(Bn|Y, B1, . . . , Bn-1). This multi-layer Slepian-Wolf coding scheme is shown in FIG. 16.


As shown in FIG. 16, one embodiment of an SWC-NQ encoder includes a nested lattice quantization unit 1610 and a set of Slepian-Wolf encoder SWE1, SWE2, . . . , SWEm. The nested quantization unit 1610 operates on a value of the input source X and generates the bits B1, B2, . . . , Bm-1, Bm of the index J as described above. The nested quantization unit does this operation repeatedly on successive values of the input source, and thus, generates a stream of indices. Each of the Slepian-Wolf encoders SWEn, n=1, 2, . . . , m, collects a block of the B, bits from the stream of indices and encodes this block, thereby generating an encoded block Tn. The encoded blocks T1, T2, . . . , Tm are sent to an SWC-NQ decoder.


As shown, one embodiment of the SWC-NQ decoder includes a set of Slepian-Wolf decoders SWD1, SWD2, . . . , SWDm and a nested quantization decoder 1620. Each Slepian-Wolf decoder SWDn, n=1, 2, . . . , m, decodes the compressed block T, to recover the corresponding block of Bn bits. As noted above, decoder SWD, uses side information {Y, B1,B2, . . . ,Bn-1}. The nested quantization decoder 1620 operates on the blocks generated by the decoders using a block of the Y values, as described above, to compute a block of estimated values of the source.


C. Simulation Results


We carry out 1-D nested lattice quantizer design for different sources with 106 samples of X in each case. For σY2=1 and σZ2=0.01, FIG. 17 shows results with nested lattice quantization alone and SWC-NQ. The former exhibits a 3.95-9.60 dB gap from DWZ(R) for R in the range from 1.0 to 7.0 bits/sample (b/s), which agree with the high rate lower bound of Theorem 1. At high rate, we observe that the gap between our results with ideal SWC (i.e., rate computed as H(J|Y) in the simulation) and DWZ(R) is indeed 1.53 dB. With practical SWC based on irregular LDPC codes of length 106 bits, this gap is 1.66-1.80 dB for R in the range from 0.93 to 5.00 b/s.


For 2-D nested lattice quantization, we use the A2 hexagonal lattices again with σY2=1 and σZ2=0.01. FIG. 18 shows results with nested lattice quantization alone and SWC-NQ. At high rate, the former case exhibits a 4.06-8.48 dB gap from DWZ(R) for R=1.40-5.00 b/s, again in agreement with the high rate lower bound of Theorem 1. We observe that the gap between our results with ideal SWC (measured in the simulation) and DWZ(R) is 1.36 dB. With practical SWC based on irregular LDPC codes (of length 106 bits), this gap is 1.67-1.72 dB for R=0.95-2.45 b/s.


We thus see that using optimal estimation as described herein, our simulation results with either 1-D or 2-D nested quantization (and practical Slepian-Wolf coding) are almost a constant gap away from the Wyner-Ziv limit for a wide range of rates.


In this paper, the high-rate R-D performance of the nested lattice quantization for the Wyner-Ziv coding is analyzed, with low dimensional lattice codes. The performance is away from the Wyner-Ziv bound with each specific lattice code, and exhibits an increasing gap from the Wyner-Ziv bound as the rate increases. The reason for the increase of the gap mainly comes from the fact that the granular component of the distortion is an increasing function of the rate. Therefore the Slepian-Wolf coding, as a second-layer binning scheme, is applied to the quantization indices for further compression. This Slepian-Wolf coded nested lattice quantization (SWC-NQ) performs at a constant gap from the Wyner-Ziv bound at high rates, and the constant gap is the same as the one from ECVQ (entropy coded vector quantization) to the ideal R-D function of source coding without the side information. Moreover, a non-linear estimator for the decoder is introduced, and proved to be optimal in the sense of the MSE measurement. This non-linear estimator helps at low-rates, and degrades to the linear one which is assumed in the theoretical analyses in this paper. Simulation results for 1-D and 2-D cases are in agreement with the theoretical analysis.


Proof of Lower Bound (49)


Proof to establish the lower bound for the performance of quadratic Gaussian SWC-NQ.


1) Rate Computation:


The rate for SWC-NQ is:
R=1nH(v|Y).(69)

Since at high rate,
P(v|Y)=j=-xRj(v)fX|Y(x)x=j=-xR0(v)fX|Y(x+cj(0))xj=-fX|Y(v+cj(0))V1g(v)V1,whereg(x)j=-fX|Y(x+cj(0)),andX|YN(0,σX|Y2).(70)


Then the achievable rate of SWC-NQ is
nR=H(v|Y)=-vΛ1/Λ2P(v|Y)log2[P(v|Y)]-vΛ1/Λ2P(v|Y)log2[g(v)V1]=-vΛ1/Λ2j=-xR0(v)fX|Y(x+cj(0))xlog2g(v)-log2V1-j=-vΛ1/Λ2xR0(v)fX|Y(x+cj(0))log2g(x)x-log2V1=(a)-xRnfX|Y(x)log2g(x)x-log2V1

where (a) comes from the periodic property of g(·), i.e., g(x−l)=g(x),∀lεΛ2. Thus the achievable rate of SWC-NQ is

nR═H(v|Y)=h′(X,Λ2)+log2 σX/Yn−log2V1.  (71)

2) Distortion Computation: From Theorem 4.1, the average distortion of nested lattice quantization over all realizations of (X,Y) is
Dn=G(Λ1)V12/n+1nEZ[QΛ2(z)2]G(Λ1)V12/n+1nj=1((2j-1)γnV22/n)u(j2V22/nΓ(n2+1)2/n2πσZ2).(72)

Because SWC is lossless, the distortion of SWC-NQ is also Dn. Combining Dn and R through V1, we obtain the R-D performance of SWC-NQ with a pair of n-D nested lattices (Λ12) as
Dn(R)G(Λ1)2(2/n)h(X,Λ2)σX|Y22-2R+1nj=1((2j-1)γnV22/n)u(j2V22/nΓ(n2+1)2/n2πσZ2).(73)

Proof of Lemma 5.1


This proof closely follows the remark 3) of [1] page 3, with some slight modifications. Let
δminwx^d(w,x^)|Y>0.

Here δ is actually the minimum of the distance between two lattice points of Λ2. Thus if (x,{circumflex over (x)})εS1, λ≡Pr{W≠{circumflex over (X)}}<E[d(W,{circumflex over (X)})]/θ<(E[d(W,{circumflex over (X)})]+E[d(X,{circumflex over (X)})])/θ  (74)


where (a) comes from the triangle inequality. From Theorem 1,

D=E[d(X,{circumflex over (X)})]=MSEg+MSEol,

where MSEg=E[d(W,X)] is the granular component and MSEol is the overload component, then

λ≦2D/δ.  (75)


Now since {circumflex over (X)} is a function of V, Y, Fano's inequality [30], [31] implies that
H(W|V,Y)-λlogλ-(1-λ)log(1-λ)+λlog(W)ɛ(λ),sothat(76)H(V|Y)I(W;V|Y)=H(W|Y)-H(W|V,Y)H(W|Y)-ɛ(2Dδ).(77)

Meanwhile, from data processing rule, we have H(V|Y)≦H(W|Y). At high rate, D→0, and
ɛ(2Dδ)->0.
Thus at high rate, H(V|Y) H(W|Y).


This claim is also verified intuitively by FIG. 13, where the slant part of each curve which corresponds to the R-D performance with a fixed V2, or δ, approximately maintains a constant slope.


It is noted that any or all of the method embodiments described herein may be implemented in terms of program instructions executable by one or more processors. The program instructions (or subsets thereof) may be stored and/or transmitted on any of various carrier media. Furthermore, the data generated by any or all of the method embodiments described herein may be stored and/or transmitted on any of various carrier media.


Although the embodiments above have been described in considerable detail, numerous variations and modifications will become apparent to those skilled in the art once the above disclosure is fully appreciated. It is intended that the following claims be interpreted to embrace all such variations and modifications.

Claims
  • 1. A method for generating compressed output data, the method comprising: receiving input data from an information source; applying nested quantization to the input data in order to generate intermediate data; encoding the intermediate data using an asymmetric Slepian-Wolf encoder in order to generate compressed output data representing the input data; performing at least one of: storing the compressed output data; and transferring the compressed output data.
  • 2. The method of claim 1, wherein said applying nested quantization to the input data includes: quantizing values of the input data with respect to a fine lattice to determine corresponding points of the fine lattice; and computing indices identifying cosets of a coarse lattice in the fine lattice corresponding to the fine lattice points, wherein the intermediate data comprises said indices, wherein the coarse lattice is a sublattice of the fine lattice.
  • 3. The method of claim 1, wherein said transferring the compressed output data comprises transferring the compressed output data to a decode unit.
  • 4. The method of claim 1, wherein the information source is a source of audio information source, a source of video information, a source of image information, a source of text information, or a source of information generated by one or more sensor devices.
  • 5. The method of claim 1, wherein the information source is a continuous source.
  • 6. The method of claim 1, wherein the information source is a discrete source.
  • 7. The method of claim 1, wherein the values of the input data are vectors in an n-dimensional space, wherein n is greater than or equal to one.
  • 8. The method of claim 2, wherein the fine lattice is a hexagonal lattice.
  • 9. The method of claim 1, wherein the asymmetric Slepian-Wolf encoder is a low density parity check encoder or a turbo encoder.
  • 10. The method of claim 1, wherein the asymmetric Slepian-Wolf encoder is a multi-layered encoder.
  • 11. A method comprising: (a) receiving compressed input data, wherein the compressed input data is a compressed representation of a block of samples of a first source X; (b) receiving a block of samples of a second source Y; (c) applying an asymmetric Slepian-Wolf decoder to the compressed input data using the block of samples of the second source Y, wherein said applying generates a block of intermediate values; (d) performing joint decoding on each intermediate value and a corresponding sample of the block of second source samples to obtain a corresponding decompressed output value, wherein said performing joint decoding includes determining an estimate of a centroid of a function restricted to a region of space corresponding to the intermediate value, wherein said estimate determines the decompressed output value.
  • 12. The method of claim 11, wherein the function is the conditional probability density function of the first source X given said corresponding sample of the second source block.
  • 13. The method of claim 11, wherein the region of space is a union of cells corresponding to a coset of a coarse lattice in a fine lattice, wherein the coset is identified by the intermediate value.
  • 14. The method of claim 11, wherein said determining the centroid estimate is performed by reading the centroid estimate from a table stored in memory using said corresponding sample of the second source block and the intermediate value as addresses.
  • 15. The method of claim 11, wherein said determining the centroid estimate comprises performing a Monte Carlo iterative simulation.
  • 16. The method of claim 11, wherein the intermediate values specify cosets of a coarse lattice in a fine lattice, wherein the coarse lattice is a sublattice of the fine lattice.
  • 17. The method of claim 11, wherein (a) and (b) are performed in parallel.
  • 18. The method of claim 11, wherein said asymmetric Slepian-Wolf decoder is a multi-layered decoder.
  • 19. The method of claim 11, wherein said asymmetric Slepian-Wolf decoder is a low density parity check decoder or a turbo decoder.
  • 20. A method for computing a table representing a nested quantization decoder, the method comprising: (a) computing a realization z of a first random vector; (b) computing a realization y of a second random vector; (c) adding z and y to determine a realization x of a source vector; (d) quantizing the realization x to a point in a fine lattice; (e) computing an index J identifying a coset of a coarse lattice in the fine lattice based on the fine lattice point; (f) adding the realization x to a cumulative sum corresponding to the index J and the realization y; (g) incrementing a count value corresponding to the index J and the realization y; (h) repeating operations (a) through (g) a number of times; (i) dividing the cumulative sums by their corresponding count values to obtain resultant values; (j) storing the resultant values in a memory medium.
  • 21. A system for generating compressed output data, the system comprising: a memory configured to store data and program instructions; and a processor configured to read and execute the program instructions from the memory, wherein in response to execution of the program instructions, the processor is operable to: receive input data from an information source; apply nested quantization to the input data in order to generate intermediate data; encode the intermediate data using an asymmetric Slepian-Wolf encoder in order to generate compressed output data representing the input data; and perform at least one of: storing the compressed output data; and transferring the compressed output data.
  • 22. A system for decoding compressed data, the system comprising: a memory configured to store data and program instructions; and a processor configured to read and execute the program instructions from the memory, wherein in response to execution of the program instructions, the processor is operable to: (a) receive compressed input data, wherein the compressed input data is a compressed representation of a block of samples of a first source X; (b) receive a block of samples of a second source Y; (c) apply an asymmetric Slepian-Wolf decoder to the compressed input data using the block of samples of the second source Y, wherein said applying generates a block of intermediate values; (d) perform joint decoding on each intermediate value and a corresponding sample of the block of second source samples to obtain a corresponding decompressed output value, wherein said performing joint decoding includes determining an estimate of a centroid of a function restricted to a region of space corresponding to the intermediate value, wherein said estimate determines the decompressed output value.
  • 23. The system of claim 22, wherein the function is the conditional probability density function of the first source X given said corresponding sample of the second source block.
  • 24. A system for computing a table representing a nested quantization decoder, the system comprising: a memory configured to store data and program instructions; and a processor configured to read and execute the program instructions from the memory, wherein in response to execution of the program instructions, the processor is operable to: (a) computing a realization z of a first random vector; (b) computing a realization y of a second random vector; (c) adding z and y to determine a realization x of a source vector; (d) quantizing the realization x to a point in a fine lattice; (e) computing an index J identifying a coset of a coarse lattice in the fine lattice based on the fine lattice point; (f) adding the realization x to a cumulative sum corresponding to the index J and the realization y; (g) incrementing a count value corresponding to the index J and the realization y; (h) repeating operations (a) through (g) a number of times; (i) dividing the cumulative sums by their corresponding count values to obtain resultant values; and (j) storing the resultant values in a memory medium.
  • 25. A computer-readable memory medium configured to store program instructions, wherein the program instructions are executable to implement: receiving input data from an information source; applying nested quantization to the input data in order to generate intermediate data; encoding the intermediate data using an asymmetric Slepian-Wolf encoder in order to generate compressed output data representing the input data; performing at least one of: storing the compressed output data; and transferring the compressed output data.
  • 26. A computer-readable memory medium configured to store program instructions, wherein the program instructions are executable to implement: (a) receiving compressed input data, wherein the compressed input data is a compressed representation of a block of samples of a first source X; (b) receiving a block of samples of a second source Y; (c) applying an asymmetric Slepian-Wolf decoder to the compressed input data using the block of samples of the second source Y, wherein said applying generates a block of intermediate values; (d) performing joint decoding on each intermediate value and a corresponding sample of the block of second source samples to obtain a corresponding decompressed output value, wherein said performing joint decoding includes determining an estimate of a centroid of a function restricted to a region of space corresponding to the intermediate value, wherein said estimate determines the decompressed output value.
  • 27. The computer-readable memory medium of claim 26, wherein the function is the conditional probability density function of the first source X given said corresponding sample of the second source block.
  • 28. A computer-readable memory medium configured to store program instructions, wherein the program instructions are executable to implement: (a) computing a realization z of a first random vector; (b) computing a realization y of a second random vector; (c) adding z and y to determine a realization x of a source vector; (d) quantizing the realization x to a point in a fine lattice; (e) computing an index J identifying a coset of a coarse lattice in the fine lattice based on the fine lattice point; (f) adding the realization x to a cumulative sum corresponding to the index J and the realization y; (g) incrementing a count value corresponding to the index J and the realization y; (h) repeating operations (a) through (g) a number of times; (i) dividing the cumulative sums by their corresponding count values to obtain resultant values; (j) storing the resultant values in a memory medium.
PRIORITY DATA AND CONTINUATION DATA

This application claims the benefit of U.S. Provisional Application No. 60/657,520, filed on Mar. 1, 2005, entitled “Multi-Source Data Encoding, Transmission and Decoding”, invented by Vladimir M. Stankovic, Angelos D. Liveris, Zixiang Xiong, Costas N. Georghiades, Zhixin Liu and Samuel S. Cheng, including Appendices A-H. This application is a continuation in part of U.S. patent application Ser. No. 11/068,737, filed on Mar. 1, 2005, entitled “Data Encoding and Decoding Using Slepian-Wolf Coded Nested Quantization to Achieve Wyner-Ziv Coding”, invented by Zhixin Liu, Samuel S. Cheng, Angelos D. Liveris and Zixiang Xiong, including Appendices A-H.

Provisional Applications (1)
Number Date Country
60657520 Mar 2005 US
Continuation in Parts (1)
Number Date Country
Parent 11068737 Mar 2005 US
Child 11086778 Mar 2005 US