1. Field of the Invention
The present invention relates to digital video cameras. More particularly, the invention relates to a dual sensor video camera including a panchromatic sensor and a color filter array (CFA) sensor.
2. Description of the Related Art
Digital video cameras utilize sensors that capture light from a scene and produce digital information representing images of the scene. The sensors have a large number of sensor sites that each capture light from a particular point within the scene, which is represented as a corresponding pixel in the digital image.
Two basic types of digital video cameras are known in the art: single sensor cameras and 3-sensor cameras. The sensor in a single sensor camera is typically overlaid with an alternating pattern (e.g., Bayer pattern) of color filters referred to as a color filter array (CFA). The color filter array typically comprises a pattern of red color filters, green color filters, and blue color filters, where each color filter is aligned over one of the sensor sites. Thus, the color filter over each sensor site filters either the red, green, or blue component of the light falling onto it so that each sensor site effectively captures either red, green, or blue color information.
A process called demosaicing or CFA interpolation is used to estimate the missing color components for each pixel in the image. For example, if a particular sensor site is overlaid with a red color filter so that it captures the red color component then the demosaicing algorithm estimates the green and blue color components for the corresponding pixel based on the green and blue color components measured by surrounding sensor sites that are overlaid with green and blue color filters.
The demosaicing process works quite well for many images. However, in some images, a problem called color aliasing occurs. For example, in an image with a lot of high frequency information (fine detail), the color information can change as fast as every pixel. In this situation the demosaicing algorithm has difficulty making appropriate estimates as to the missing color components for each pixel, with the result that spurious colors appear in the image.
One approach to this problem has been to overlay a low-pass filter over the sensor. A low-pass filter limits how quickly the image information can change. This solves the problem of color aliasing but destroys fine detail and makes all images fuzzier, whether they suffered from visible color aliasing or not.
The other type of video camera, the 3-sensor camera, uses a beam splitter to split the light into three light beams that are sent to three different sensors. One sensor is overlaid with a red color filter, one is overlaid with a green color filter, and one is overlaid with a blue color filter. The red, green, and blue color components of the image pixels are obtained from the corresponding sensor sites of the respective sensors.
This approach provides a very high quality result with no color aliasing. However, one problem with 3-sensor cameras is the cost and complexity involved in their production. For example, the three sensors are typically precisely mechanically aligned with each other so that their respective sensor sites correspond to the same pixels in the image, which adds manufacturing cost. Also, since the beam splitter splits light into three separate beams, the amount of light that reaches each sensor is reduced. The reduction in light to the sensors results in a lowered signal-to-noise ratio and effectively adds dynamic noise to the image.
Both single sensor and 3-sensor video cameras typically convert the image from RGB 4:4:4 format to an industry standard YCbCr 4:2:2 format.
Various embodiments of a dual-sensor video camera are disclosed. The dual-sensor video camera includes a color filter array (CFA) sensor, i.e., a sensor overlaid with a color filter array. The CFA sensor includes a low-pass filter. The dual-sensor video camera also includes a panchromatic sensor, also referred to as a monochrome sensor.
The dual-sensor video camera also includes a beam splitter configured to split an incoming light beam into two beams, where one of the beams is directed to the CFA sensor and the other beam is directed to the panchromatic sensor.
The dual-sensor video camera also includes one or more computational elements, such as one or more processors or one or more programmable hardware elements, such as an FPGA. The one or more computational elements are operable to receive first image information from the panchromatic sensor and second image information from the CFA sensor and produce an output image from the first image information and the second image information. The output image includes luminance information based on the first image information from the panchromatic sensor and chrominance information based on the second image information from the CFA sensor.
A better understanding of the present invention may be obtained when the following detailed description is considered in conjunction with the following drawings, in which:
While the invention is susceptible to various modifications and alternative forms, specific embodiments thereof are shown by way of example in the drawings and are described in detail. It should be understood, however, that the drawings and detailed description thereto are not intended to limit the invention to the particular form disclosed, but on the contrary, the intention is to cover all modifications, equivalents and alternatives falling within the spirit and scope of the present invention as defined by the appended claims.
Various embodiments of a dual-sensor digital video camera are described herein. One of the sensors comprises a color filter array (CFA) sensor, i.e., a sensor overlaid with a color filter array. The color filter array comprises an alternating pattern of color filters, where each color filter is aligned over one of the sensor sites. The CFA sensor is also overlaid with a low-pass filter for preventing or reducing color aliasing, as described above. The other sensor comprises a panchromatic sensor, also referred to as a monochrome sensor. The panchromatic sensor is not overlaid with color filters, and thus, the light falling onto its sensor sites includes all color components. Also, the panchromatic sensor is not overlaid with a low-pass filter. As described below, the dual-sensor video camera produces images based on the image information from both the CFA sensor and the panchromatic sensor.
In various embodiments, the beam splitter 40 may be configured to send the two light beams at various angles with respect to each other. Thus, the CFA sensor 50 and the panchromatic sensor 52 may be mechanically arranged in various ways within the dual-sensor video camera and at various angles with respect to each other. In the embodiment illustrated in
Also, in various embodiments, the beam splitter 40 may be configured to direct different amounts of light to the two sensors. In other words, the two light beams into which the incoming light beam is split may have various intensities with respect to each other. In particular, in some embodiments, the beam splitter 40 may direct a majority of the light to the panchromatic sensor 52 so that it receives more light than the CFA sensor 50.
The CFA sensor 50 produces image information from the light beam that it receives, e.g., where the image information indicates the amount of light received by each of its sensor sites. Similarly, the panchromatic sensor 52 also produces image information indicating the amount of light received by each of its sensor sites. The image information produced by the CFA sensor 50 is also referred to herein as CFA image information, and the image information produced by the panchromatic sensor 52 is referred to herein as panchromatic image information.
The dual-sensor video camera is operable to produce an output image based on both the CFA image information and the panchromatic image information. For example, the dual-sensor video camera may include one or more computational elements operable to combine the CFA image information and the panchromatic image information to produce the output image. More particularly, the resulting output image may comprise chrominance information (but not luminance information) from the CFA image information and luminance information (but not chrominance information) from the panchromatic image information, as described in more detail below.
Referring now to
As indicated in blocks 201 and 203, respectively, the CFA image information may be received from the CFA sensor 50, and the panchromatic image information may be received from the panchromatic sensor 52. For example, the computational element(s) may be coupled to the CFA sensor 50 and the panchromatic sensor 52 such that it receives the image information from the respective sensors.
As indicated in 205, the computational element(s) may generate an RGB image from the CFA image information received from the CFA sensor 50. The RGB information may comprise an RGB representation of the CFA image information. As described above, generating the RGB image information may comprise performing a demosaicing algorithm to estimate color components of the image pixels.
In 207, the RGB image may be converted to YCbCr format, i.e., may be converted to a YCbCr representation of the RGB image. As known in the art, each pixel in the YCbCr image comprises three components (Y, Cb, Cr), where the Y component is the luminance (brightness) component, and the Cb and Cr components are chroma components.
The panchromatic image information comprises monochrome image information, e.g., simply indicates the luminance value of each pixel in the image. As indicated in 209, the computational element(s) may produce an output image based on the panchromatic image information and the YCbCr image generated in 207 by replacing the Y component (luminance component) of each pixel in the YCbCr image with the luminance value of the corresponding pixel from the panchromatic image.
For example, suppose that the pixels in the YCbCr image are represented as follows: [YC1, Cb1, Cr1], [YC2, Cb2, Cr2], . . . [YCN, CbN, CrN]. And suppose that the corresponding pixels in the panchromatic image are represented as follows: [YP1], [YP2], . . . [YPN]. Thus, the output image generated in 209 may comprise a YCbCr image represented as: [YP1, Cb1, Cr1], [YP2, Cb2, Cr2], . . . [YPN, CbN, CrN].
Thus, the luminance information for the resulting output image comes from the panchromatic image information from the panchromatic sensor 52, and the chrominance (e.g., hue and saturation) information comes from the CFA image information from the CFA sensor 50. As noted above, the CFA sensor 50 includes a low-pass filter. Since the CFA image information is low-pass filtered, color aliasing is reduced or eliminated in the output image. Moreover, human vision perceives image sharpness primarily based on luminance information. Thus, since the luminance information for the resulting output image comes from the panchromatic image information, which has not been low-pass filtered, the output image may be perceptibly sharper and less fuzzy than in traditional single-sensor video cameras.
The dual-sensor video camera may also have advantages over traditional 3-sensor video cameras. For example, the dual-sensor video camera may be less expensive to produce, since it uses two sensors instead of three. Also, as described in more detail below, some embodiments may utilize an electronic alignment technique to align pixels in the two sensors instead of relying on precise mechanical alignment, which may also reduce the manufacturing cost. Also, as noted above, in some embodiments the beam splitter 40 may direct a majority of the light from the incoming light beam to the panchromatic sensor 52. This may result in an increased signal-to-noise ratio in the luminance component of the output image (possibly at the expense of a decreased signal-to-noise ratio in the chrominance components, where those errors are less visible in human vision).
It is noted that
It is noted that the YCbCr output image generated in 209 may be further processed. For example, in a typical embodiment, the YCbCr image may be sub-sampled down to YCbCr 4:2:2 format, which is a common output format used in digital video cameras. It is also noted that in alternative embodiments the dual-sensor video camera may be operable to produce an output image in an image format or color space other than YCbCr. In general, similar techniques as described above may be applied to generate any of various types of output images in which luminance information is represented separately from chrominance information.
In various embodiments, the dual-sensor video camera may include one or more computational elements of any kind operable to produce the output image based on the image information from the two sensors. For example, the dual-sensor video camera may include one or more processors and/or one or more programmable hardware elements operable to produce the output image. Examples of programmable hardware elements include reconfigurable hardware, programmable logic, or field-programmable devices (FPDs), such as one or more FPGAs (Field Programmable Gate Arrays), or one or more PLDs (Programmable Logic Devices), such as one or more Simple PLDs (SPLDs) or one or more Complex PLDs (CPLDs), or other types of programmable hardware.
As shown, the FPGA device 300 includes conversion logic 320. The conversion logic 320 may comprise a portion of the FPGA device 300 (e.g., a subset of its resources, such as memory, gates, multipliers, or other programmable logic elements) configured to convert images from one format to another, e.g., as described above with reference to blocks 205 and 207. For example, the conversion logic 320 may be operable to generate the RGB image from the CFA image information and may convert the RGB image to YCbCr format.
The exemplary FPGA device 300 also includes combining logic 322. The combining logic 322 may comprise a portion of the FPGA device configured to combine the panchromatic image information with the YCbCr image generated in 207, e.g., by replacing the Y component values with luminance values based on the panchromatic image information, as described above.
The exemplary FPGA device 300 also includes sub-sampling logic 324. The sub-sampling logic 324 may comprise a portion of the FPGA device configured to sub-sample the YCbCr output image generated in 209 down to YCbCr 4:2:2 format, or may perform any of various other types of re-sampling.
The method of
In another embodiment, the two sensors may be aligned with each other to within a certain tolerance, but may not necessarily be aligned so precisely that pixels in the exact same position in the two images correspond to each other. In this embodiment, the slight difference in alignment may be compensated for electronically. For example, during the manufacturing process, images generated by the two sensors may be compared to each other to determine the difference in alignment, and information indicating the alignment difference may be stored in a memory medium of the dual-sensor video camera. The computational element(s) may use this information to determine which pixel in the panchromatic image corresponds to a given pixel in the YCbCr image generated from the CFA image information. (The sensors used in the dual-sensor video camera may have slightly more rows and columns than needed in the final image, in order to take the possible alignment differences into account.)
For example, suppose that the dual-sensor video camera is aimed at a scene which produces an image pattern such as shown in
Thus, the manufacturing process for the dual-sensor video camera may comprise aiming the dual-sensor video camera at a target and analyzing the target images produced by the two sensors in order to determine the horizontal and vertical alignment differences. For example, the camera may be pointed at a target with a black dot in the upper left corner, and the pixels in each image may be read in order to determine the difference in where the black dot falls in each image. The horizontal and vertical difference values may be stored in a memory medium of the dual-sensor video camera and used by the computational element(s) in order to determine that corresponding pixels are shifted horizontally and vertically with respect to each other by the indicated number of pixels.
Similarly, the corresponding pixels in the two images may also be rotated with respect to each other, e.g., as illustrated in
Utilizing an electronic alignment technique to electronically align pixels in the two images in this manner may enable the manufacturing cost to be reduced compared to traditional precision alignment techniques.
Although the embodiments above have been described in considerable detail, numerous variations and modifications will become apparent to those skilled in the art once the above disclosure is fully appreciated. It is intended that the following claims be interpreted to embrace all such variations and modifications.
Number | Name | Date | Kind |
---|---|---|---|
4264928 | Schober | Apr 1981 | A |
4876591 | Muramatsu | Oct 1989 | A |
5038216 | Easterly et al. | Aug 1991 | A |
5081525 | Akiyama et al. | Jan 1992 | A |
5347599 | Yamashita et al. | Sep 1994 | A |
5374971 | Clapp et al. | Dec 1994 | A |
5379069 | Tani | Jan 1995 | A |
5486853 | Baxter et al. | Jan 1996 | A |
5515099 | Cortjens et al. | May 1996 | A |
5528274 | Hyodo | Jun 1996 | A |
5528289 | Cortjens et al. | Jun 1996 | A |
5537157 | Washino et al. | Jul 1996 | A |
5579053 | Pandel | Nov 1996 | A |
5598209 | Cortjens et al. | Jan 1997 | A |
5612733 | Flohr | Mar 1997 | A |
5617539 | Ludwig et al. | Apr 1997 | A |
5629734 | Hamilton et al. | May 1997 | A |
5633681 | Baxter et al. | May 1997 | A |
5661525 | Kovacevic et al. | Aug 1997 | A |
5689641 | Ludwig et al. | Nov 1997 | A |
5692159 | Shand | Nov 1997 | A |
5751338 | Ludwig, Jr. | May 1998 | A |
5821987 | Larson | Oct 1998 | A |
5832143 | Suga et al. | Nov 1998 | A |
6072522 | Ippolito et al. | Jun 2000 | A |
6100929 | Ikeda et al. | Aug 2000 | A |
6266093 | Glenn | Jul 2001 | B1 |
6356308 | Hovanky | Mar 2002 | B1 |
6373523 | Jang | Apr 2002 | B1 |
6563537 | Kawamura et al. | May 2003 | B1 |
6594688 | Ludwig et al. | Jul 2003 | B2 |
6639626 | Kubo et al. | Oct 2003 | B1 |
6643462 | Harand et al. | Nov 2003 | B2 |
6724619 | Kwong et al. | Apr 2004 | B2 |
6731334 | Maeng et al. | May 2004 | B1 |
6809358 | Hsieh et al. | Oct 2004 | B2 |
6816904 | Ludwig et al. | Nov 2004 | B1 |
6850265 | Strubbe et al. | Feb 2005 | B1 |
6965705 | Ma et al. | Nov 2005 | B1 |
6980485 | McCaskill | Dec 2005 | B2 |
6993179 | Weinshall et al. | Jan 2006 | B1 |
7035481 | Kim et al. | Apr 2006 | B2 |
7038709 | Verghese | May 2006 | B1 |
7046295 | Hovanky | May 2006 | B2 |
7057647 | Monroe | Jun 2006 | B1 |
7088391 | Glenn et al. | Aug 2006 | B2 |
7088392 | Kakarala et al. | Aug 2006 | B2 |
7202904 | Wei | Apr 2007 | B2 |
20030151685 | Ia Grone | Aug 2003 | A1 |
20040001137 | Cutler et al. | Jan 2004 | A1 |
20040183897 | Kenoyer et al. | Sep 2004 | A1 |
20060082676 | Jenkins et al. | Apr 2006 | A1 |
20060119710 | Ben-Ezra et al. | Jun 2006 | A1 |
20060262333 | Jenkins | Nov 2006 | A1 |
20070139517 | Jenkins | Jun 2007 | A1 |
Number | Date | Country | |
---|---|---|---|
20080030611 A1 | Feb 2008 | US |