1. Field of the Invention
This invention relates to image and video displays, more particularly flat panel displays used as still image and/or video monitors, and methods of generating and driving image and video data onto such display devices.
2. Prior Art
Flat panel displays such as plasma, liquid crystal display (LCD), and light-emitting-diode (LED) displays generally use a pixel addressing scheme in which the pixels are addressed individually through column and row select signals. In general, for M by N pixels—or picture elements—arranged as M rows and N columns, we will have M row select lines and N data lines (see
Video and still images are generally converted to compressed forms for storage and transmission, such as MPEG2, MPEG4, JPEG2000 etc. formats and systems. Image compression methods are based on orthogonal function decomposition of the data, data redundancy, and certain sensitivity characteristics of the human eye to spatial and temporal features. Common image compression schemes involve the use of Direct Cosine Transform as in JPEG or motion JPEG, or Discrete Walsh Transform. In addition, video compression may involve skipping certain frames and using forward or backward frame estimation, skipping color information, or chroma subsampling in a luminance-chrominance (YCrCb) representation of the image etc. A video decoder is used to convert the spatially and temporally compressed image information to row and column pixel information in the color (RGB) representation to produce the image information, which will be for example at 6 Mbits per frame as in VGA resolution displays. However, from an information content point of view, much of this video information is actually spatially redundant as the image had originally been processed to a compressed form, or it has information content which the human eye is not sensitive to. All these techniques pertain to the display system's components in the software or digital processing domain, and the structure of the actual optical display comprised of M×N pixels is not affected by any of the techniques used for the video format, other than the number of pixels and frame rate.
Prior art in the field does not address image compression and decompression techniques directly. Data is generally made available on a pixel-by-pixel basis, with which the video system displays at a certain refresh rate. Image and/or video compression is generally applied to the transmission, storage and image reconditioning of data for the display (as in U.S. Pat. No. 6,477,279). Multiple line addressing in passive matrix displays is also an established technique (as in Lueder, E., “Liquid Crystal Displays—Addressing Schemes and Electro-Optical Effects”, John Wiley & Sons 2001, pp. 176-194, or U.S. Pat. No. 6,111,560,). Time-domain Walsh function based orthogonal waveforms are applied to column and rows such that crossing points in the row and columns will generate shades of gray through amplitude modulation as desired. This is in contrast to employing two-dimensional orthogonal basis function expansions used in video and image compression.
The present invention may have various modifications and alternative forms from the specific embodiments depicted in the drawings. These drawings do not limit the invention to the specific embodiments disclosed. The invention covers all modifications, improvements and alternative implementations which are claimed below.
The invention is a display method and system which constructs an image and/or video through successively displaying image components or summations of image components at a high frame rate. The image construction uses image compression to calculate orthogonal image coefficients, and drive these coefficients as video signals to pixel arrays in time domain through the use of time-dependent spatial masking of image information within a pixel array. The purpose of the invention is to enable content driven optimization of frame rate and/or video data rate for minimizing power consumption. In each frame, the source image to be driven is first grouped together to a certain size consisting of nx×ny pixels. For example, we can divide the image into rectangular groupings of 4×4 or 8×8 pixels, 4×1, 8×1, or any other arbitrary group size. 1×1 grouping case corresponds to conventional pixel-by-pixel driving, and offers no compression benefit. The grouping size is limited by the frame rate, which in turn is limited by the switching speed of the pixels and driver components described herein and the image compression ratio. Each image grouping, or macro-pixel as will be referred from here on, is then decomposed into components proportional to certain orthogonal image basis functions. These image functions are implemented through masking the row select and column data signals of the pixels so that the desired spatial profile of the orthogonal image basis functions are achieved. The image basis functions are shown in
Any image can be decomposed into orthogonal components, whose coefficients are found by integrating the image data with the basis functions shown in
The invention is based on the inverse transform of EQ. 1, i.e. that an image f(x,y) can be constructed as a summation of image components Duv*wuv(x,y).
The summation of the image components is performed in time domain through successively displaying patterns corresponding to the basis functions wuv with a light strength proportional to coefficients Duv and a certain subframe duration τsf. Further, we transform into a basis function set w* from w, as described below, such that the image components are positive for all x,y. The human eye would integrate the image patterns in time, and perceive a single image corresponding to f(x,y). If the pixel electronics have a capacitor to which the pixel image data is stored, it can also be used in integrating the image pattern along with the viewer. In this case, the image is updated with each pattern, and not re-written. Since the capacitor voltage is not reset at each step, a smaller amount of charge needs to be added to the capacitor at each subframe—this will result in lowering the power consumption of the data drivers. In pulse-width-modulation (PWM) based implementations, the ‘on’ time of selected pixels conforming to a wuv pattern is common. In essence, a single PWM generator is used for the whole group of pixels.
In orthogonal function implementations used in conventional Discrete Walsh Transform compression techniques, the basis functions wuv(x,y) take on values of +1 or −1, thereby they can satisfy orthogonality properties, in which the integration over the macro-pixel region of the cross product of two different basis functions is zero. i.e.
for (u,v) equal to (u′,v′), and zero when the indices do not match. In U.S. Patent Application Publication No. 2010/0007804, an image construction based video display system is described, which uses orthogonal Walsh function based the current application, an extension of these techniques are made for application to fine-arrays of pixels, with which individual row and column control are possible, and a spatial light modulator is therefore not necessary. When the basis functions are mapped to +1 or 0 instead of +1 or −1, as in U.S. Patent Application Publication No. 2010/0007804, this creates a non-zero integration value of the cross product of two different basis functions over the macro-pixel area. Such functions, because of their non-orthogonal nature, can not be used in deconstructing the image into components, hence the original orthogonal basis functions having values of +1 or −1 are used in determining image coefficients Duv using EQ. 1. In performing an image construction using EQ. 2 in which coefficients Duv are computed using orthogonal basis functions, each component of the image, given by the function Duv*wuv will have both positive and negative values throughout the macro-pixel, for u,v components other than 0,0. When we restrict the image components to be non-negative, through the use of basis functions in the +1, 0 domain, we are introducing averaging artifacts. Displaying an image component Duv*w*uv(x,y) will create an average value of 0.5×Duv for u,v other than 0,0. The 0,0 image component D00*w*00(x,y) is equal to the sum of the image over the macro-pixel, and is effectively the image averaged out over the macro-pixel area.
Since each image component having u,v indices other than 0,0 will now contribute ½th of the Duv value to the macro-pixel average, we should really be displaying the 0,0 image component with a strength equal to
In general, D00 is greater than or equal to the sum of the rest of the image components derived using the +1 and 0 mapping. Hence, subtracting out each of these non-zero integration components from D00 will be greater than or equal to zero. Consider for example the D01 component. Denote wuv as the original Walsh function having the values of +1 and −1. Using the new basis functions w*=(w+1)/2, substituting wuv which can take on values of 0 and 1 instead of −1 and +1, w*uv will transform the image construction equation EQ. 2 to
To reproduce the image correctly, the component value when the basis function is equal to all 1's (w00) has to be corrected with the summation over all Duv except for the 00 component as in the second term of EQ. 3. Note that if a subset of basis functions are used as in lossy compression/construction, the summation will need to span only the Duv coefficients that are used. The updated D00 coefficient is used in the image construction instead of the original value, since now the total sum of the average of the image components will equal the original D00 value. D00 may run negative in certain cases, which will cause artifacts. This can be treated in a lossy construction manner through hard limiting the number of dominant components to be displayed, or reducing the high frequency content in a more graceful manner, in essence spatially low pass filtering the image. Such artifacts can also be eliminated by reducing the pixel-grouping size for the region of interest. For example, transforming the 8×8 pixel region into four 4×4 block regions and implementing the algorithm at the reduced pixel group size level. Since the correction amount applied to the D00 coefficient needs to be bounded by the D00 value, having a smaller number of components in the image construction will result in this bound to satisfied with a higher spatial frequency bandwidth than a larger macro-pixel case.
The image coefficients Duv can have positive or negative values for all components having higher order than the 00 component. In implementing the display component, the value of Duv*w*uv(x,y) can only be positive. In the case of ‘negative’ Duv, the image component is generated using the absolute value of Duv and the inverse of the basis function pattern w*uv(x,y). The inverse pattern is defined by interchanging the 0 values with +1 values in the w*uv(x,y) pattern, i.e., inverting or reversing the switch pattern for that orthogonal basis function.
A block diagram showing the whole system is in
For each frame, the video image is constructed through
A subframe mask can be generated by selecting multiple row and columns spanning a macro-pixel. Assume a 4×4 pixel array forming the macro-pixel. The basis functions of
To arrive at a single frame of the intended image, each image component in a subframe is displayed successively. An observer's eye will integrate the displayed image components to visually perceive the intended image, which is the sum of all displayed image components. The Duv coefficients calculated in EQ. 1 assume equal subframe durations. The subframe duration can be made varying with the uv index, in which case the particular Duv will need to be normalized with the subframe time τuv. Such a scheme may be used to relax the data driver's speed and precision requirements. The subframe image integration can also be partially performed in pixel structures which can retain the image data, as in active matrix pixels. In this case, instead of resetting the image information at each subframe, the corresponding signal stored in a capacitor is updated at each subframe. This is explained below.
A lossy compression based decomposition allows one to neglect higher spatial frequency component coefficients Duv. These are generally components which have high order oblique spatial frequencies, which the human eye has reduced sensitivity to. Taking the example of 4×4 pixel grouping, which will have 16 image components with coefficients from D00, D01, D02, D03, D10, D11, etc. up to D33, and transformed basis functions w*00 through w*33, and the inverses of these functions (except for the inverse of w*00 which is a blank image), the original image will be exactly reconstructed if we use all 16 components, assuming the corrected D00 coefficient remains non-negative. However, in a general moving video case, the oblique spatial components may be neglected to some extent. A display system which uses only horizontal and vertical image components can be satisfactory in some cases. To improve image accuracy, the dominant of the diagonal spatial frequency basis functions such as w*11, w*22, and or w*33 having coefficients D11, D22 and/or D33 can also be added. The oblique components such as w*12, w*13, w*23 etc. may also be neglected if the picture quality is deemed satisfactory by applying a threshold below which we will neglect the component. In image and video compression techniques like JPEG and MPEG2 intra frame compression, the sequence of spatial frequency components are in a ‘zig-zag’ order, which allows for an ‘EOB’ (end-of-block) signal to denote that remaining coefficients in the sequence are negligible. The sequence goes as w*00, w*01, w*10, w*20, w*11, w*02, w*03, w*12, w*21, w*30, w*40, etc. until an EOB is sent. Components before the EOB may also have negligible coefficient value. The video source coding can therefore have a variable sequence length, to which the display system will match. If none of the components are non-negligible, we would resort to lossless operation on the macro-pixel. Note also that different macro-pixels can have different levels of compression depending on the source video at the same time. Such a case can occur for example in a computer monitor, where during operation, regions of the screen may have stagnant images, but require a high accuracy such as a window showing a text and high resolution imagery, or portions having a fast moving image in which we need a high frame rate for motion compensation, but not necessarily need a lossless image reproduction scheme. By masking out different macro-pixel regions where we can skip certain image components, or updating the macro-pixel image less frequently, the image accuracy and power can be optimized. We can decide on which macro-pixel to run which accuracy mode by calculating the Duv coefficients and comparing them to the component coefficients in the earlier image frames. A fast moving image vs. slow moving or stagnant image, and an accurate image vs. a lossy compressed image can be differentiated thus.
In active matrix displays, in which the pixel circuitry may have a capacitor to hold the Duv coefficient value, we may partition the dominant components over several subframes. This is so that the capacitor charge does not change as much when we reset the value. For example, in transitioning from the w*00 component to the w*01 component, the capacitor voltage on half the pixels in a macro-pixel will be reset to zero, and the capacitor voltages on the remaining half of the pixels will be set to the D01 coefficient value. This requires the column data drivers to charge and/or discharge up to the full capacitor voltage within a subframe duration, which costs power. Instead, the previous subframe data can be retained until the end of the frame, with the provision that it is normalized with the number of subframes the data will remain on the capacitor. To illustrate this, assume we have a lossless construction over 16 subframes, each subframe with equal duration. The time integrated voltage over the frame is given by EQ. 3. In this equation, the components Duv*w*uv are assumed to be ON for one subframe duration, and the capacitors are reset to the next component voltage when the subframe duration ends. Instead, a portion of each previous component can be retained on the capacitor. The w*00 component duration will then be 16 subframes, hence its value will be normalized by 16. Assume the second subframe is the w*01D01 component. This component will last for 15 subframes. This macropixel capacitors will be recharged such that the voltage at the second subframe is equivalent to D00w*00/16+D01w*01/15. The process repeats for each component, which will be normalized with the number of remaining subframes till the end of the frame. The last component to be displayed, w*33D33 will only be effective for one subframe, so it's value is not normalized. The net effect will be that at the end of the frame, we have the same integrated image information as EQ. 3.
Taking the example of a VGA resolution display operating at 30 frames per second, and a 4×4 pixel grouping to define the macro-pixels, the display device to satisfy VGA resolution employing this invention will use
By using a pixel addressing mask pattern, the number of pixels which is addressed uniquely is reduced from 768000 (for three colors) by a factor of 16 down to 48000 (for three colors) for the VGA resolution display. There are 16000 macro-pixels in the display. The raw image data rate which the pixel drivers depends on the level of image compression desired. For a lossless image reconstruction, there are 16 image components per macro-pixel per color. Consider an 8 bit color system. If each component coefficient Duv is described with 8 bit accuracy, we would need a 184 Mbps data rate. This corresponds to 16 components×8 bits=128 bits per macro-pixel per color per frame. In reality, only the D00 component needs to have the full 8 bit accuracy, while the higher order components can have less accuracy. The higher order components will in general be limited in amplitude by a factor of 0.5 to the lower order component. Hence, the first order coefficients D01 and D10 can be described with a 7 bit precision, the second order coefficient D02, D20, D11 can be described with a 6 bit precision and so on. We would therefore not need more than 80 bits per macro-pixel per color per frame, which optimizes the data rate down to 120 Mbps. The video data driver precision need not satisfy the full 8-bit resolution throughout the frame, and can be made to have a dynamic resolution by turning off unnecessary components when not needed. Define arbitrarily three compression levels for clarification purposes—lossless compression, medium and high level compression. In actual implementation these definitions may have different forms based on the desired image quality. Assume that in a medium compression level, we cut off oblique spatial frequency components such as w*12D12, w*13D13, w*23D23 etc. but not w*11D11, w*22D22, w*33D33. Then we are working with 10 components in total. These components would require a total of 60 bits per macro-pixel per color per frame. The total data rate is reduced to 86 Mbps. Define the high compression level as an operation mode in which we neglect D11, D22, D33. Then we would use 46 bits per macro-pixel per color per frame. The total data rate is then 66 Mbps. The row and column select pattern needs to be updated 16 times each frame for the lossless compression case, 10 times each frame for the medium level compression case, and 7 times each frame for the high level compression case. For 30 frames per second, displaying 7 subframes requires 210 patterns to be generated per second, or 4.7 msec per subframe. Using 10 components, we would need to generate 300 patterns per second, or 3.3 msec per subframe. For lossless image reproduction, a total of 16 subframes are needed, which equals 480 patterns per second, requiring 2 msec per subframe. These values provide a settling time bound for the data drivers.
In a particular embodiment of the invention, a LED based active-matrix display system is considered, though the invention is not so limited. The display system consists of:
The pixels are grouped in 4×4 arrays, thus each red, green and blue LED defines a macro-pixel, thereby 48000 macro-pixels exist for three colors. The macro-pixels for different colors can be selected at the same time since the column video data is coming from different digital-analog converters. A fast enough digital-analog converter can service all pixels, or a larger number of digital-analog converters can be employed to relax the speed and driving requirements if necessary.
In the image processor 130, the image is divided into macro-pixel arrays for processing. For each macro-pixel, the image decomposition algorithm determines the coefficients corresponding to each orthogonal basis function for each color to be used. The decomposition coefficients Duv, where u and v run from 0 through 3 are calculated. These coefficients are summations of 16 pixel values comprising the macro-pixel according to the corresponding masking patterns wuv. The number of decomposition coefficients to be used can be selected from one to sixteen, in increasing resolution. The full set of sixteen coefficients is used when lossless reconstruction of the image is necessary. This mode is determined when all Duv coefficients are greater in magnitude from a threshold value. Portions of the display can also have different compression levels during operation, which the image processor can decide depending on the decomposition coefficient value it calculates. The row and column select block 120 scans and selects the macro-pixel to be operated on. Masking pattern generator 140 is a secondary switch network which drives the patterns related to the Duv coefficient to be displayed through a counter based logic, or a look-up table. The patterns are shown in
The display is scanned at each frame starting with the w*00D00 component of macro-pixels. The row and column select signal mask generated by 140 is all 1's in this case, meaning 4 rows and 4 columns are all selected. The necessary voltage signal is loaded to the video data memory, which can be a single capacitor for a macro-pixel array, and the macro-pixel scan proceeds to the next array. The subframe scan ends upon visiting all 48000 macro-pixels. The next subframe will load the w*01D01 component to each macro-pixel. In this case, the mask generator 140 will generate the required signals for loading the pattern w01 to the 4×4 pixel array. It can also load the inverse of the pattern if the Duv coefficient is negative. The signal masks can change for each macro-pixel in the scan, as there is no restriction as to which image coefficient is to be loaded during the scan. One macro-pixel can be loaded with a particular Duv with a masking pattern of wuv, while the next macro-pixel in the scan can be loaded with a different component having a different masking pattern, since for one macro-pixel, a particular Duv term may be negligible and eliminated from displaying, while for another macro-pixel it may be non-negligible. Each macro-pixel can have a different effective frame rate. While the subframe update rate is common, since each frame may be composed of a different number of subframes. A macro-pixel can also have its frame rate changed by the image processor when the nature of the video content changes. This can happen as shown in
A similar embodiment with an LCD based active-matrix display is also possible. In this case, since the pixel switching speeds may be considerably slower than that of a LED based display, subframe durations are longer. The maximum possible number of subframes that can be squeezed in a frame will be limited. In such a case, one may resort to driving modes in which a certain subset of w*uvDuv components are displayed in a frame, and the remaining components are displayed in an alternate frame such that the picture will have minimum loss of fidelity. In such a case the Duv coefficients will need to be normalized appropriately.
In certain LED based arrays (see U.S. Provisional Patent Application No. 60/975,772 filed Sep. 27, 2007), or MEMS based digital micromirror device (U.S. Pat. No. 5,452,024 filed Sep. 19, 1995), light elements can only be in ON or OFF states. The desired light value can be determined through pulse width modulation, or through bitplane modulation. In such an embodiment, pixels can be addressed as a group of macro-pixels, having a common ON time duration, but the data is AND'ed with the known basis function patterns of 1's and 0's. The number of subframes is again equal to the number of components that is used, or the maximum number of components pertaining to the macro-pixel size.
This application claims the benefit of U.S. Provisional Patent Application No. 61/157,698 filed Mar. 5, 2009.
Number | Name | Date | Kind |
---|---|---|---|
5452024 | Sampsell | Sep 1995 | A |
5508716 | Prince et al. | Apr 1996 | A |
5537492 | Nakajima et al. | Jul 1996 | A |
5696524 | Hoshino et al. | Dec 1997 | A |
6111560 | May | Aug 2000 | A |
6229583 | Yasunishi | May 2001 | B1 |
6477279 | Go | Nov 2002 | B2 |
6535195 | Nelson | Mar 2003 | B1 |
6850219 | Aoyama et al. | Feb 2005 | B2 |
7623560 | El-Ghoroury et al. | Nov 2009 | B2 |
7767479 | El-Ghoroury et al. | Aug 2010 | B2 |
7829902 | El-Ghoroury et al. | Nov 2010 | B2 |
20020075217 | Hoshino | Jun 2002 | A1 |
20050128172 | Hoshino | Jun 2005 | A1 |
20060098879 | Kim et al. | May 2006 | A1 |
20070035706 | Margulis | Feb 2007 | A1 |
20070075923 | Beuker et al. | Apr 2007 | A1 |
20080018624 | Cernasov | Jan 2008 | A1 |
20080137990 | Ward | Jun 2008 | A1 |
20090086170 | El-Ghoroury et al. | Apr 2009 | A1 |
20090278998 | El-Ghoroury et al. | Nov 2009 | A1 |
20100003777 | El-Ghoroury et al. | Jan 2010 | A1 |
20100007804 | Guncer | Jan 2010 | A1 |
20100066921 | El-Ghoroury et al. | Mar 2010 | A1 |
20100220042 | El-Ghoroury et al. | Sep 2010 | A1 |
Number | Date | Country |
---|---|---|
1322442 | Nov 2001 | CN |
1348301 | May 2002 | CN |
1666241 | Sep 2005 | CN |
0577258 | Jan 1994 | EP |
0720141 | Jul 1996 | EP |
51-056118 | May 1976 | JP |
2001-350454 | Dec 2001 | JP |
2005-532588 | Oct 2005 | JP |
WO-2004006219 | Jan 2004 | WO |
Entry |
---|
“Notice of Allowance Dated Nov. 28, 2012, Korean Patent Application No. 10-2011-7002953”, (Nov. 28, 2012). |
“Office Action Dated Aug. 20, 2012, European Patent Application No. 10710122.2” (Aug. 20, 2012). |
Lueder, Ernst , “Liquid Crystal Displays, Addressing Schemes and Electro-Optical Effects”, John Wiley & Sons Ltd., (2001), pp. 176-194. |
Poynton, Charles , “Digital Video and HDTV Algorithms and Interfaces”, Morgan Kaufmann Publishers, an Imprint of Elsevier Science, (2003), pp. 447-454. |
Shirai, T. , et al., “RGB-LED Backlights for LCD-TVs with 0D, 1D, and 2D Adaptive Dimming”, 2006 SID International Symposium, Society for Information Display, SD 06 Digest, 44.4, (2006), pp. 1520-1523. |
“International Search Report and Written Opinion of the International Searching Authority Dated Jun. 10, 2010”, International Application No. PCT/US2010/026325. |
“International Preliminary Examination Report of the International Preliminary Examination Authority Dated Apr. 14, 2011, International Application No. PCT/US2009/050175”. |
“International Search Report Report and Written Opinion of the International Searching Authority Dated Sep. 9, 2009”, International Application No. PCT/US2009/050175. |
“Office Action Dated May 31, 2012, Korean Patent Application No. 10-2011-7002953” (May 31, 2012). |
“Office Action Dated Apr. 1, 2013, Korean Patent Application No. 10-2011-7023107”, (Apr. 1, 2013). |
“Office Action Dated Apr. 16, 2013; Japanese Patent Application No. 2011-517637”, (Apr. 16, 2013). |
“Office Action Dated Apr. 18, 2013; European Patent Application No. 10710122.2”, (Apr. 18, 2013). |
“Office Action Dated Mar. 4, 2013; European Patent Application No. 09790247.2”, (Mar. 4, 2013). |
“Office Action Dated Mar. 5, 2013, Japanese Patent Application No. 2011-553131”, (Mar. 5, 2013), |
“Office Action Dated Apr. 22, 2013; Chinese Patent Application No. 200980134961.9”, (Apr. 22, 2013). |
Notice of Allowance Dated Oct. 1, 2013; Japanese Patent Application No. 2011-517637, (Oct. 1, 2013). |
Office Action Dated Oct. 7, 2013; U.S. Appl. No. 12/499,560, (Oct. 7, 2013). |
Office Action Dated Sep. 4, 2013; Chinese Patent Application No. 201080019853.X, (Sep. 4, 2013). |
Office Action Dated Dec. 2, 2013; Korean Patent Application No. 10-2011-7023107, (Dec. 2, 2013). |
Office Action Dated Nov. 25, 2013; European Patent Application No. 09790247.2, (Nov. 25, 2013). |
Number | Date | Country | |
---|---|---|---|
20100225679 A1 | Sep 2010 | US |
Number | Date | Country | |
---|---|---|---|
61157698 | Mar 2009 | US |