The present invention relates to forming a color image having a desired resolution from a panchromatic image and a color image having less than the desired resolution.
Video cameras and digital still cameras generally employ a single image sensor with a color filter array to record a scene. This approach begins with a sparsely populated single-channel image in which the color information is encoded by the color filter array pattern. Subsequent interpolation of the neighboring pixel values permits the reconstruction of a complete three-channel, full-color image. One popular approach is to either directly detect or synthesize a luminance color channel, e.g. “green”, and then to generate a full-resolution luminance image as an initial step. This luminance channel is then used in a variety of ways to interpolate the remaining color channels. A simple bilinear interpolation approach is disclosed in U.S. Pat. No. 5,506,619 (Adams et al.) and U.S. Pat. No. 6,654,492 (Sasai). Adaptive approaches using luminance gradients and laplacians are also taught in U.S. Pat. No. 5,506,619 as well as U.S. Pat. No. 5,629,734 (Hamilton et al.). U.S. Patent Application Publication No. 2002/0186309 (Keshet et al.) reveals using bilateral filtering of the luminance channel in a different kind of adaptive interpolation. Finally, U.S. Patent Application Publication No. 2003/0053684 (Acharya) describes using a bank of median filters on the luminance channel in yet another adaptive interpolation method.
Under low-light imaging situations, it is advantageous to have one or more of the pixels in the color filter array unfiltered, i.e. white or panchromatic in spectral sensitivity. These panchromatic pixels have the highest light sensitivity capability of the capture system. Employing panchromatic pixels represents a tradeoff in the capture system between light sensitivity and color spatial resolution. To this end, many four-color color filter array systems have been described. U.S. Pat. No. 6,529,239 (Dyck et al.) teaches a green-cyan-yellow-white pattern that is arranged as a 2×2 block that is tessellated over the surface of the sensor. U.S. Pat. No. 6,757,012 (Hubina et al.) discloses both a red-green-blue-white pattern and a yellow-cyan-magenta-white pattern. In both cases, the colors are arranged in a 2×2 block that is tessellated over the surface of the imager. The difficulty with such systems is that only one-quarter of the pixels in the color filter array have highest light sensitivity, thus limiting the overall low-light performance of the capture device.
To address the need of having more pixels with highest light sensitivity in the color filter array, U.S. Patent Application Publication No. 2003/0210332 (Frame) describes a pixel array with most of the pixels being unfiltered. Relatively few pixels are devoted to capturing color information from the scene producing a system with low color spatial resolution capability. Additionally, Frame teaches using simple linear interpolation techniques that are not responsive to or protective of high frequency color spatial details in the image.
It is an object of the present invention to produce a digital color image having the desired resolution from a digital image having panchromatic and color pixels.
This object is achieved by a method for forming a digital color image of a desired resolution, comprising:
(a) providing a panchromatic image of a scene having a first resolution at least equal to the desired resolution and a first color image having at least two different color photoresponses, the first color image having a lower resolution than the desired resolution; and
(b) using the color pixel values from the first color image and the panchromatic pixel values to provide additional color pixels and combining the additional color pixels with the first color image to produce the digital color image having the desired resolution.
It is a feature of the present invention that images can be captured under low-light conditions with a sensor having panchromatic and color pixels and processing produces the desired resolution in a digital color image produced from the panchromatic and colored pixels.
The present invention makes use of a color filter array with an appropriate composition of panchromatic and color pixels in order to permit the above method to provide both improved low-light sensitivity and improved color spatial resolution fidelity. The above method preserves and enhances panchromatic and color spatial details and produce a full-color, full-resolution image.
In the following description, a preferred embodiment of the present invention will be described in terms that would ordinarily be implemented as a software program. Those skilled in the art will readily recognize that the equivalent of such software can also be constructed in hardware. Because image manipulation algorithms and systems are well known, the present description will be directed in particular to algorithms and systems forming part of, or cooperating more directly with, the system and method in accordance with the present invention. Other aspects of such algorithms and systems, and hardware or software for producing and otherwise processing the image signals involved therewith, not specifically shown or described herein, can be selected from such systems, algorithms, components and elements known in the art. Given the system as described according to the invention in the following materials, software not specifically shown, suggested or described herein that is useful for implementation of the invention is conventional and within the ordinary skill in such arts.
Still further, as used herein, the computer program can be stored in a computer readable storage medium, which can include, for example; magnetic storage media such as a magnetic disk (such as a hard drive or a floppy disk) or magnetic tape; optical storage media such as an optical disc, optical tape, or machine readable bar code; solid state electronic storage devices such as random access memory (RAM), or read only memory (ROM); or any other physical device or medium employed to store a computer program.
Before describing the present invention, it facilitates understanding to note that the present invention is preferably utilized on any well-known computer system, such as a personal computer. Consequently, the computer system will not be discussed in detail herein. It is also instructive to note that the images are either directly input into the computer system (for example by a digital camera) or digitized before input into the computer system (for example by scanning an original, such as a silver halide film).
Referring to
A compact disk-read only memory (CD-ROM) 124, which typically includes software programs, is inserted into the microprocessor based unit for providing a way of inputting the software programs and other information to the microprocessor based unit 112. In addition, a floppy disk 126 can also include a software program, and is inserted into the microprocessor-based unit 112 for inputting the software program. The compact disk-read only memory (CD-ROM) 124 or the floppy disk 126 can alternatively be inserted into externally located disk drive unit 122 which is connected to the microprocessor-based unit 112. Still further, the microprocessor-based unit 112 can be programmed, as is well known in the art, for storing the software program internally. The microprocessor-based unit 112 can also have a network connection 127, such as a telephone line, to an external network, such as a local area network or the Internet. A printer 128 can also be connected to the microprocessor-based unit 112 for printing a hardcopy of the output from the computer system 110.
Images can also be displayed on the display 114 via a personal computer card (PC card) 130, such as, as it was formerly known, a PCMCIA card (based on the specifications of the Personal Computer Memory Card International Association) which contains digitized images electronically embodied in the PC card 130. The PC card 130 is ultimately inserted into the microprocessor based unit 112 for permitting visual display of the image on the display 114. Alternatively, the PC card 130 can be inserted into an externally located PC card reader 132 connected to the microprocessor-based unit 112. Images can also be input via the compact disk 124, the floppy disk 126, or the network connection 127. Any images stored in the PC card 130, the floppy disk 126 or the compact disk 124, or input through the network connection 127, can have been obtained from a variety of sources, such as a digital camera (not shown) or a scanner (not shown). Images can also be input directly from a digital camera 134 via a camera docking port 136 connected to the microprocessor-based unit 112 or directly from the digital camera 134 via a cable connection 138 to the microprocessor-based unit 112 or via a wireless connection 140 to the microprocessor-based unit 112.
In accordance with the invention, the algorithm can be stored in any of the storage devices heretofore mentioned and applied to images in order to interpolate sparsely populated images.
In
X
5=(P1+P2+P3+P7+P8+P9)/6
Alternate weighting to the pixel value in this approach are also well known to those skilled in the art. As an example,
X
5=(P1+2P2+P3+P7+2P8+P9)/8
B
5
=|P
1
−P
9|
V
5
=|P
2
−P
8|
S
5
=|P
3
−P
7|
BX
5=(P1+P9)/2
VX
5=(P2+P8)/2
SX
5=(P3+P7)/2
In
P
C=(4P5−P1−P3−P7−P9)/4
Again, in
R
L=(R1+R3+R7+R9)/4
Again, in
R
F
=R
L
+kP
C
where the scale factor k is nominally one (1), but can be any value from minus infinity to plus infinity. For different colors, such as green and blue, similar computations will be performed. The operations within block 206 (
In
C
R1
=R
1
−P
1
C
R3
=R
3
−P
3
C
R7
=R
7
−P
7
C
R9
=R
9
−P
9
The values CR1, CR3, CR7, and CR9 are the resulting color differences as illustrated in
Returning to
C
R5=(CR1+CR3+CR7+CR9)/4
Returning to
R=C
R
+P
G=C
G
+P
B=C
B
+P
The operations within block 206 (
In
G
S5
=|P
3
−P
7|
G
B5
=|P
1
−P
9|
L
S5=|2P5−P3−P7|
L
B5=|2P5−P1−P9|
S
5
=aG
S5
+bL
S5
B
5
=aG
B5
+bL
B5
GS5 is a slash gradient and GB5 is a backslash gradient for pixel P5. LS5 is a slash laplacian and LB5 is a backslash laplacian for pixel P5. The coefficients a and b are used to tune how much of each gradient and laplacian component goes into the final classifier computation. Typical values for a and b are a=1, b=0 for a gradient-only classifier, a=0, b=1 for a laplacian-only classifier, and a=1, b=1 for a combined gradient-and-laplacian classifier. Another example uses directional median filters. Again referring to
MS5=median (P3, P5, P7)
MB5=median (P1, P5, P9)
S
5
=|M
S5
−P
5|
B
5
=|M
B5
−P
5|
MS5 is the statistical median of the three panchromatic values P3, P5, and P7. MB5 is the statistical median of the three panchromatic values P1, P5, and P9. The third example uses sigma filtering which is a subclass of bilateral filtering. In this case, we compute four classifiers d1, d3, d7, and d9, which correspond to pixels R1, R3, R7, and R9:
d
1
=|P
1
−P
5|
d
3
=|P
3
−P
5|
d
7
=|P
7
−P
5|
d
9
=|P
9
−P
5|
In
c1=1 if d1<t, otherwise c1=0
c3=1 if d3<t, otherwise c3=0
c7=1 if d7<t, otherwise c7=0
c9=1 if d9<t, otherwise c9=0
The threshold value, t, is a function of the inherent noisiness of the image capture device. Classically, this noise is modeled as a Gaussian (normal) distribution with an associated mean and standard deviation. The value t is typically set to a value between 1 and 3 times the standard deviation of this noise model.
In
R
S5=(R3+R7)/2+k(2P5−P3−P7)/2
R
B5=(R1+R9)/2+k(2P5−P1−P9)/2
The scale factor k is nominally one (1), but can be any value from minus infinity to plus infinity. If the panchromatic classification decision is slash, then the color value R5 for pixel P5 is computed as RS5. Otherwise, it is computed as RB5. In the case of the sigma filter a single prediction value responsive to c1, c3, c7, and c9 is computed:
R
5={(c1R1+c3R3+c7R7+c9R9)+k[(c1+c3+c7+c9)P5−c1P1−
c
3
P
3
−c
7
P
7
−c
9
P
9]}/(c1+c3+c7+c9)
From the above equation, we can see that for pixel P5 we compute a red pixel value R5 from the coefficients c1, c3, c7, and c9 of the classifier decision and from existing red and panchromatic pixel values R1, R3, R7, R9, P5, P1, P3, P7, and P9. The scale factor k is nominally one (1), but can be any value from minus infinity to plus infinity. For different colors, such as green and blue, similar computations will be performed.
Taking every possible combination of values for c1, c3, c7, and c9, this amounts to selecting one of 16 possible predictor values. The operations within block 206 (
In
C
R1
=R
1
−P
1
C
R3
=R
3
−P
3
C
R7
=R
7
−P
7
C
R9
=R
9
−P
9
The values CR1, CR3, CR7, and CR9 are the resulting color differences as illustrated in
In
G
S5
=|P
3
−P
7|
G
B5
=|P
1
−P
9|
L
S5=|2P5−P3−P7|
L
B5=|2P5−P1−P9|
S
5
=aG
S5
+bL
S5
B
5
=aG
B5
+bL
B5
GS5 is a slash gradient and GB5 is a backslash gradient for pixel P5. LS5 is a slash laplacian and LB5 is a backslash laplacian for pixel P5. The coefficients a and b are used to tune how much of each gradient and laplacian component goes into the final classifier computation. Typical values for a and b are a=1, b=0 for a gradient-only classifier, a=0, b=1 for a laplacian-only classifier, and a=1, b=1 for a combined gradient-and-laplacian classifier. Another example uses directional median filters. Again referring to
MS5=median (P3, P5, P7)
MB5=median (P1, P5, P9)
S
5
=|M
S5
−P
5|
B
5
=|M
B5
−P
5|
MS5 is the statistical median of the three panchromatic values P3, P5, and P7. MB5 is the statistical median of the three panchromatic values P1, P5, and P9. The third example uses sigma filtering which is a subclass of bilateral filtering. In this case, we compute four classifiers d1, d3, d7, and d9, which correspond to pixels R1, R3, R7, and R9:
d
1
=|P
1
−P
5|
d
3
=|P
3
−P
5|
d
7
=|P
7
−P
5|
d
9
=|P
9
−P
5|
In
c1=1 if d1<t, otherwise c1=0
c3=1 if d3<t, otherwise c3=0
c7=1 if d7<t, otherwise c7=0
c9=1 if d9<t, otherwise c9=0
The threshold value, t, is a function of the inherent noisiness of the image capture device. Classically, this noise is modeled as a Gaussian (normal) distribution with an associated mean and standard deviation. The value t is typically set to a value between 1 and 3 times the standard deviation of this noise model.
In
C
S5=(C3+C7)/2
C
B5=(C1+C9)/2
If the panchromatic classification decision is slash, then the color difference value C5 for pixel P5 is computed as CS5. Otherwise, it is computed as CB5. In the case of the sigma filter a single prediction value responsive to c1, c3, c7, and c9 is computed:
C
5=(c1C1+c3C3+c7C7+c9C9)/(c1+c3+c7+c9)
From the above equation, we can see that for pixel P5 we compute a color difference value C5 from the coefficients c1, c3, c7, and c9 of the classifier decision and from existing color difference values and panchromatic pixel values C1, C3, C7, and C9. The scale factor k is nominally one (1), but can be any value from minus infinity to plus infinity.
Taking every possible combination of values for c1, c3, c7, and c9, this amounts to selecting one of 16 possible predictor values. The resulting full-resolution color difference image 250 will consist of CR, CG, CB, and P pixel values at every pixel location.
Returning to
R=C
R
+P
G=C
G
+P
B=C
B
+P
The operations within block 206 (
The interpolation algorithms disclosed in the preferred embodiments of the present invention can be employed in a variety of user contexts and environments. Exemplary contexts and environments include, without limitation, wholesale digital photofinishing (which involves exemplary process steps or stages such as film in, digital processing, prints out), retail digital photofinishing (film in, digital processing, prints out), home printing (home scanned film or digital images, digital processing, prints out), desktop software (software that applies algorithms to digital prints to make them better—or even just to change them), digital fulfillment (digital images in—from media or over the web, digital processing, with images out—in digital form on media, digital form over the web, or printed on hard-copy prints), kiosks (digital or scanned input, digital processing, digital or scanned output), mobile devices (e.g., PDA or cell phone that can be used as a processing unit, a display unit, or a unit to give processing instructions), and as a service offered via the World Wide Web.
In each case, the interpolation algorithms can stand alone or can be a component of a larger system solution. Furthermore, the interfaces with the algorithm, e.g., the scanning or input, the digital processing, the display to a user (if needed), the input of user requests or processing instructions (if needed), the output, can each be on the same or different devices and physical locations, and communication between the devices and locations can be via public or private network connections, or media based communication. Where consistent with the foregoing disclosure of the present invention, the algorithms themselves can be fully automatic, can have user input (be fully or partially manual), can have user or operator review to accept/reject the result, or can be assisted by metadata (metadata that can be user supplied, supplied by a measuring device (e.g. in a camera), or determined by an algorithm). Moreover, the algorithms can interface with a variety of workflow user interface schemes.
The interpolation algorithms disclosed herein in accordance with the invention can have interior components that utilize various data detection and reduction techniques (e.g., face detection, eye detection, skin detection, flash detection).
The invention has been described in detail with particular reference to certain preferred embodiments thereof, but it will be understood that variations and modifications can be effected within the spirit and scope of the invention.
Reference is made to commonly assigned U.S. patent application Ser. No. 11/341,206, filed Jan. 27, 2006 by James E. Adams, Jr. et al, entitled “Interpolation of Panchromatic and Color Pixels”, the disclosure of which is incorporated herein.