The present invention relates to electronic devices, and more particularly to image reformatting methods and related devices such as digital cameras.
Recently, digital cameras have become a very popular consumer appliance appealing to a wide variety of users ranging from photo hobbyists, web developers, real estate agents, insurance adjusters, photo-journalists to everyday photography enthusiasts. Advances in large resolution CCD/CMOS sensors coupled with the availability of low-power digital signal processors (DSPs) has led to the development of digital cameras with both high resolution image and short video clip capabilities, and these capabilities have spread into various consumer products such as cellular phones. The high resolution (e.g., sensor with a 2560×1920 pixel array) provides quality offered by traditional film cameras. U.S. Pat. No. 5,528,293 and U.S. Pat. No. 5,412,425 disclose aspects of digital camera systems including storage of images on memory cards and power conservation for battery-powered cameras.
a is a functional block diagram for digital camera control and image processing; the automatic focus, automatic exposure, and automatic white balancing are referred to as the 3A functions. The image processing typically includes functions such as color filter array (CFA) interpolation, gamma correction, white balancing, color space conversion, and JPEG/MPEG compression/decompression (JPEG for single images and MPEG for video clips) and is referred to as the image pipeline. Note that the typical color CCD consists of a rectangular array of photosites (pixels) with each photosite covered by a filter (CFA): red, green, or blue. In the commonly-used Bayer pattern CFA one-half of the photosites are green, one-quarter are red, and one-quarter are blue.
The current trend of incorporating video capabilities into high resolution digital cameras creates a problem because the camera must satisfy both the high resolution of a still image camera and the high frame rate/low resolution requirements of a video camera. Consequently, most image sensors (CCD or CMOS) employ schemes to average pixel values within the sensor device for video mode. Averaging pixels does two things:
The averaging of pixel values in the sensor device, however, poses challenges for the subsequent image pipeline processing of the pixel data. Indeed, the sensor often outputs video-mode data in some fixed, regular, but locally scrambled format and not in normal raster-scan order. The video format is not consistent across various sensor manufacturers. Thus it is a problem of image pipelines to support all the different video output formats, including adaptation to future formats.
The present invention provides devices and methods for programmable reformatting of image sensor pixel output.
This has advantages including substitutability of image sensors within digital cameras with a single image pipeline.
The drawings are heuristic for clarity.
a-1b are functional block diagrams of preferred embodiment pipeline reformatter and location.
a-3b show functional and hardware blocks of of a digital camera.
Preferred embodiment reformatters and reformatting methods (programmably) convert image sensor (CCD/CMOS) output pixel streams into another (standard) format, such as raster-scan Bayer format. Reformatting enables various image sensor output formats to be used with a single image pipeline. Video-mode output formats vary among image sensor manufacturers due in part to the particular downsampling implementation: a 5 megapixel image (e.g., 2560×1920 pixels) is downsampled by a factor of 4 in both dimensions to yield a VGA 640×480 video output frame, and this large downsampling factor allows for many alternative output stream formats.
a shows a reformatter following faulty pixel correction and prior to memory write or video-mode processing; however, the reformatter may be at other locations in the processing, such as between the optical black clamp and the faulty pixel correction. Note that in order to optimize the dynamic range of the pixel values represented by the image sensor, the pixels representing black should have a 0 value. The black clamp function adjusts for this by subtracting an offset for each pixel. There is only one color channel per pixel at this stage of the processing, and rearranging the pixel location order can be either prior to or after black clamp.
Similarly, image sensor arrays may have faulty (missing) pixels, especially arrays with more than 500,000 elements. The missing pixel values are filled by simple interpolation within the array; a high order interpolation may not be necessary because a later interpolation is also performed in the CFA interpolation stage. Therefore, the main reason for this preliminary faulty pixel correction interpolation step is to make the image processing regular by eliminating missing data. Again, the faulty pixel correction may precede or follow a rearrangement of the pixel location order.
b shows a reformatter with duplicate local memories; this allows the reformatter to write incoming pixels to one memory while the rest of the processing reads reformatted pixels from the other memory. Each of the two memories may store 2560 10-bit or 14-bit pixels to hold two R-B pairs of 640 pixel lines in Bayer format.
Preferred embodiment camera systems and devices, such as digital still cameras and video-capable cellular phones, include preferred embodiment reformatting. The reformatting computations can be performed with digital signal processors (DSPs) or general-purpose programmable processors or application specific circuitry, or systems on a chip such as a DSP, application specific circuitry, and a RISC processor on the same chip with the RISC processor controlling. The reformatting parameters are programmed (ROM, Flash EEPROM, FeRAM, etc.) to adapt to the image sensor used in the camera system. Analog-to-digital converters and digital-to-analog converters provide coupling to the real world, and modulators and demodulators (plus antennas for air interfaces) provide coupling for wireless transmission.
First preferred embodiments write (output) successive input pixels to (local) memory with the write address controlled according to increments and decrements (strides) which are programmable parameters. The programmability (either dynamic or static) of the parameters allows for adaptation to a particular image sensor. The parameters are:
initial value of output address
strides (output address increments and decrements for the input pattern)
number of strides for the input pattern
The following pseudocode implements the first preferred embodiment with output_pointer a pointer to memory for writing the current incoming pixel and stride[index] the increment/decrement to be used to jump to the next address:
As an example, consider a sensor with video-mode output of frames of size 400×300 pixels and with the sensor output pixel stream R B Gr Gb R B Gr Gb . . . for two lines of raster-scan Bayer format. The upper portion of
Hence, after reformatting the corresponding output in memory (addresses and contents) looks like:
Thus the preferred embodiment for this sensor output pattern applies with the initial address (initial_output_address)=0, the number of strides (num_strides)=2, the first stride (stride[0])=401; and the second stride (stride[1])=−400. That is, the write address (output_pointer) has successive values: 0, 401, 1, 402, 2, 403, 3, 404, 4, . . .
With other sensor output pixel patterns the parameters are adjusted to likewise reformat to the raster-scan Bayer format. For a second example, if the foregoing first example were modified by the interchange of B and Gr so that the video-mode output pixel stream is R Gr B Gb R Gr B Gb . . . , then the pixel stream and corresponding address for memory write are:
and the output reformatted in memory is again:
This time the preferred embodiment applies with initial_output_address=0 as before, but with a larger number of strides (num_strides=4), the first stride (stride[0])=1; the second stride (stride[1])=400, the third stride (stride[2])=1; and the fourth stride (stride[3])=−400. That is, the write address (output_pointer) has successive values: 0, 1, 401, 402, 2, 3, 403, . . .
More involved video-mode output pixel patterns could include intermingling pixels from more than a pair of R-B lines of raster-scan Bayer format. For a third example, if two red and two blue lines are multiplexed, then the video-mode output could be R1 Gb1 R2 Gb2 Gr1 B1 Gr2 B2 R1 Gb1 R2 . . . and the corresponding address for memory write would be:
Thus the memory would contain a first Bayer red line (R1s and Gr1s) of 400 pixels followed by a first blue line, a second red line, and a second blue line; and then repeats from subsequent inputs.
For this example the preferred embodiment applies with initial_output_address=0 as before, and again with four strides (num_strides=4), but the first three strides all equal 400 (stride[0]=stride[1]=stride[2]=400); and the fourth stride is a large decrement (stride[3]=−1199). For this sensor the write address (output_pointer) has successive values: 0, 400, 800, 1200, 1, 401, 801, . . . .
In short, the pixel output pattern prescribes a set of strides, and the preferred embodiments take these strides as parameter inputs.
Second preferred embodiments write successive input pixels to (local) memory with the write address controlled according to the following programmable parameters:
and so again the reformatted pixel pattern in memory looks like:
Thus the bit vector (bit pattern repeat) 1010 applied to the input stream has R (to location 0), Gr (to location 1), R (to location 2), . . . going to the first reformatted output line; and the bit vector 0101 has B (to location 401), Gb (to location 402), B (to location 403), . . . going to the second reformatted output line. The initial offset for the second line is 1. Indeed, the values of the control parameters for this example are:
Third preferred embodiment reformatters and methods use a memory pointer for each output line and a control program to apply the pointers to the incoming pixel stream ordering; the methods have the following control parameters:
And the reformatted output (memory address and contents) is:
Thus the sequence of output addresses for the input pixel stream is: 0:0, 1:1, 0:1, 1:2, 0:2, 1:3, 0:3, 1:4, . . . which can be implemented with two output pointers, P0 and P1. The parameter values for this example would be:
The third preferred embodiments comprehend new CCD sensors that can read out pixels from both the left and the right edges of the sensor array, which requires the output address to increment (normal left-to-right sweep) and decrement (reverse right-to-left sweep) in an interleaved manner. The pointer decrement instruction deals with the reverse addressing.
Preferred embodiment hardware can have 8 or more output pointers, a line index of at least 2 bits (for up to 4 output lines) and control programs with at least 16 entries to deal with various sensor output formats. Also, the output lines (Bayer format) are logical and not physical lines in local memory. That is, in
As another example, again consider the foregoing second example:
The control program now has length 4:
control_program[0]=(0:0:++)
control_program[1]=(0:0:++)
control_program[2]=(1:1:++)
control_program[3]=(1:1:++)
In contrast, a control program with four pointers, P0, . . . , P3, where P0 is for R pixels, P1 is for Gr pixels, P2 is for B pixels, P3 is for Gb pixels, could be:
control_program[0]=(0:0:++)
control_program[1]=(0:1:++)
control_program[2]=(1:2:++)
control_program[3]=(1:3:++)
This has two pointers for each output line, and the initializations would be 0, 1, 1, 2, for P0, P1, P2, P3, respectively. The increment would change to:
The preferred embodiments may be varied while retaining one or more of the features of a (programmable) reordering of the sensor array output pixel stream.
For example, the output lines could correspond to a non-Bayer format; the sensor output need not be a downsampling of the full array, such as the third preferred embodiment could be used just to allow right-to-left readout at full resolution; the specific numbers (e.g., output lines with 400 sample in the first preferred embodiments and up to 8 pointers in the third preferred embodiments) could be varied; and so forth.
| Number | Name | Date | Kind |
|---|---|---|---|
| 6630956 | Toi | Oct 2003 | B1 |
| 6933970 | Koshiba et al. | Aug 2005 | B2 |
| Number | Date | Country | |
|---|---|---|---|
| 20060007332 A1 | Jan 2006 | US |