This application is based upon and claims the benefit of priority from prior Japanese Patent Application No. 2009-060934, filed Mar. 13, 2009, the entire contents of which are incorporated herein by reference.
1. Field of the Invention
The present invention relates to an image signal processing device, which is applied to a digital camera, for example. In particular, the present invention relates to an image signal processing device having a dynamic range expansion function, and to an image signal processing method.
2. Description of the Related Art
An device for expanding the dynamic range of a charge coupled device (CCD) and a CMOS (complementary metal oxide semiconductor) image sensor applied to a digital camera and a digital video camera has been developed (e.g., see JP 2008-271368 and JP 2007-124400).
The JP 2008-271368 discloses the following technique. According to the technique, when imaging is carried out in an exposure setup mode, luminance information, for example, luminance histogram is analyzed. In this case, the luminance histogram is analyzed with respect to a synthetic image generated from long-time and short-time exposure image signals. From the analyzed result, the exposure of the short-time exposure image signal is controlled.
On the other hand, the JP 2007-124400 discloses a double-exposure accumulating operation by a photodiode, division read and linear synthesis of a division read signal. Specifically, the following technique is disclosed. According to the technique, long and short exposure time signals are independently converted to analog-to-digital signals for one horizontal scan period, and then, these signals are output. The output two digital signals are added, and thereby, a reduction of an image quality is prevented so that a dynamic range is expanded.
However, the conventional techniques do not give consideration to each image quality of long- and short-time exposure image signals. Rather, a signal-to-noise ratio (SNR) and a quantization error are relatively preferable in the long-time exposure image signal. Thus, preferably, a specific subject, for example, an important subject such as a human face is controlled so that it is included in the long-time exposure image signal. However, the control is not carried out in the conventional case.
Moreover, a dynamic range of a display device for displaying an image signal is relatively narrow. For this reason, there is a need to narrow the dynamic range in an imaging unit if the display device displays an image signal having an expanded dynamic range. So, the expanded dynamic range is compressed using a dynamic range compression technique, for example, high-luminance knee compression. In also case, it is desired that a knee point is optimized based on the luminance level of an important subject such as a face. However, the control is not taken into consideration in the conventional case. Therefore, it is desired to provide image signal processing device and method, which prevent a reduction of the image quality of an important subject while expanding dynamic range.
According to a first aspect of the invention, there is provided an image signal processing device comprising: an imaging unit configured to generate first and second image signals imaged using different exposure time based on a reference read voltage; a synthesis circuit configured to synthesize the first and second image signals generated by the imaging unit; a detection unit configured to detect luminance information of a specified subject using a synthesized image signal outputting from the synthesis circuit; and a controller configured to control the reference read voltage of the imaging unit, the controller configured to determine a first knee point based on luminance information of a specified subject detected by the detection unit, and to control the first knee point according to the reference read voltage.
According to a second aspect of the invention, there is provided an image signal processing method comprising: imaging a specified subject based on a reference read voltage; setting luminance information of the specified subject to a target luminance information using exposure control; and determining a first knee point based on the target luminance information, and controlling the reference read voltage according to the first knee point.
According to a third aspect of the invention, there is provided an imaging signal processing method comprising: imaging a specified subject having an expanded dynamic range based on a reference read voltage; setting luminance information of the specified subject to a target luminance information using exposure control; determining a reference read voltage of the dynamic range based on the target luminance information; calculating a luminance histogram of the determined reference read voltage; accumulating the calculated histogram; and determining a dynamic range based on the accumulated histogram and a predetermined selection reference.
Various embodiments of the present invention will be hereinafter described with reference to the accompanying drawings.
A sensor core unit 11 includes a pixel unit 12, a CDS 13 functioning as a column noise cancel circuit, a column analog-to-digital converter (ADC) 14, a latch circuit 15 and two line memories (MSTS, MSTL) 28-1 and 28-2.
The pixel unit 12 photoelectrically converts a light, which is incident via a lens 17, and then, generates charges in accordance with the incident light. Further, the pixel unit 12 is provided with a plurality of cells (pixels), which are arrayed like a matrix on a semiconductor substrate (not shown). One cell PC comprises four transistors (Ta, Tb, Tc, Td) and a photodiode (PD). Each cell PC is supplied with pulse signals ADRESn, RESETn and READn. The transistor Tb of each cell PC is connected to a vertical signal line VLIN. One terminal of a current path of a load transistor TLM for a source follower circuit is connected to the vertical signal line VLIN, and the other terminal thereof is grounded.
An analog signal corresponding to signal charges generated from the pixel unit 12 is supplied to the ADC 14 through the CDS 13, and thereafter, converted to a digital signal, and then, latched by the latch circuit 15. The digital signal latched by the latch circuit 15 is successively transferred via line memories (MSTS, MSTL) 28-1 and 28-2. For example, 10-bit digital signals SH and SL+SH read from line memories (MSTS, MSTL) 28-1 and 28-2 are supplied to a linear synthesis circuit 31, and thereafter, synthesized by means of the circuit 31.
The following circuit and registers are arranged adjacent to the pixel unit 12. The arranged circuit is a pulse selector circuit (selector) 22. The arranged registers are a signal read vertical register (VR register) 20, an accumulation time control vertical register (ES register, long accumulation time control register) 21 and an accumulation time control vertical register (WD register, short accumulation time control register) 27.
A timing generator (TG) 19 generates pulse signals S1 to S4, READ, RESET/ADRES/READ, VRR, ESR and WDR in accordance with a control signal CONT and a command CMD supplied from a controller 34 described later.
The pulse signals S1 to S4 are supplied to the CDS circuit 13. The pulse signal READ (including a medium read signal Vm described later) is supplied to a pulse amplitude control circuit 29. The amplitude of the pulse signal READ is controlled by means of the pulse amplitude control circuit 29, and thereby, a three-value pulse signal VREAD is generated, and then, supplied to the selector circuit 22. In addition, the pulse signal RESET/ADRES/READ is supplied to the selector circuit 22. The pulse signal VRR is supplied to the VR register 20, the pulse signal ESR is supplied to the ES register 21 and the pulse signal WDR is supplied to the WD register 27. A vertical line of the pixel unit 12 is selected by means of the registers 20, 21 and 27, and then, the pulse signal RESET/ADRES/READ (typically, shown as RESETn, ADRESn, READn in
In the cell PC, a current path of a row select transistor Ta and an amplification transistor Tb is connected in series between a power supply VDD and the vertical signal line VLIN. The gate of a transistor Ta is supplied with a pulse signal (address pulse) ADRESn. A current path of a reset transistor Tc is connected between the power supply VDD and the gate (detection node FD) of the transistor Tb, and further, the gate thereof is supplied with a pulse signal (reset pulse) RESETn. One terminal of a current path of a read transistor Td is connected to the detection node FD, and further, the gate thereof is supplied with a pulse signal (read pulse) READn. The other terminal of the current path of the read transistor Td is connected with a cathode of a photodiode PD. In this case, an anode of the photodiode PD is grounded. Further, a bias voltage VVL is applied to the pixel unit 12 from a bias generator circuit (bias 1) 23. The bias voltage VVL is supplied to the gate of a load transistor TLM.
A VREF generator circuit 24 generates an analog-to-digital conversion (ADC) reference waveform in response to a main clock signal MCK. The VREF generator circuit 24 generates triangular waves VREFTL and VREFTS to carry out two-timeanalog-to-digital conversions for one horizontal scan period, and thereafter, supplies these waves to the ADC 14.
According to the configuration, for example, in order to read an n-line signal of the vertical signal line VLIN, the pulse signal ADRESn is set to an “H” level to operate the amplification transistor Tb and the load transistor TLM. A signal charge obtained by photoelectric conversion of the photodiode PD is accumulated for a predetermined period. In order to remove a noise signal such as a dark current in the detection node FD before read is carried out, the pulse signal RESETn is set to an “H” level to turn on the transistor Tc, and thereby, the detection node FD is set to a VDD voltage=2.8 V. In this way, a reference voltage (reset level) of a state that no signal is included in the detection node FD is output to the vertical signal line VLIN. A charge corresponding to the reset level of the vertical signal line VLIN is supplied to the ADC 14 via the CDS 13.
The pulse signal (read pulse) READn is set to an “H” level to turn on the read transistor Td. Then, an accumulated signal charge generated by the photodiode PD is read to the detection node FD. In this way, a voltage (signal+reset) level of the detection node FD is read to the vertical signal line VLIN. A charge corresponding to the signal+reset level of the vertical signal line VLIN is subjected to correlated double sampling by means of the CDS 13 so that noise is cancelled, and thereafter, supplied to the ADC 14. Automatic gain control (AGC) processing may be carried out between CDS 13 and ADS 14.
Thereafter, a reference waveform level output from the VREF generator circuit 24 is increased (i.e., triangular wave VREF is changed from a low level to a high level), and thereby, an analog signal is converted to a digital signal by means of the ADC 14. The analog-to-digital conversion operation is carried out two times for one horizontal scan period in accordance with triangular waves VREFTL and VREFTS supplied from the VREF generator circuit 24. For example, the triangular wave is 10 bits (0 to 1023 levels). Output data of the ADC 14 corresponding to triangular waves VREFTL and VREFTS is successively held by means of the latch circuit 15, and then, transferred to line memory MSTS and MSTL. In other words, a wide dynamic range (WDR) sensor executes a double-exposure accumulation operation. Therefore, a long-time exposure signal, that is, an SL signal (sensor output is an SL+SH signal) and a short-time exposure signal, that is, an SH signal are detected. These signals are delayed and adjusted by means of line memories MSTS and MSTL so that timing is matched.
Signal SH held by the line memory MSTS and signal SL+SH held by the line memory MSTSL are supplied to a linear synthesis circuit 31. A signal SF synthesized by the linear synthesis circuit 31 is supplied to an image signal processing circuit 32. The image signal processing circuit 32 executes generally various signal processings, for example, a shading correction, a noise cancel and a de-mosaic processing with respect to an input signal. In this way, the input signal is converted from a Bayer-format signal SF to an RGB-format signal SF_RGB. One output signal of the image signal processing circuit 32 is supplied to an AE detection unit 33 while the other output signal thereof is successively supplied to a dynamic range compression unit (D range compression unit) 35 and an output unit 36.
The AE detection unit 33 includes known YUV conversion unit 33a and face detection unit 33b. Signal SFRGB is converted to a luminance signal (Y), a blue-component color difference signal (U) and a red-component color difference signal (V) by means of the YUV conversion unit 33a. The face detection unit 33b makes face detection with respect to a luminance signal to output face luminance information given as a detected important subject. The luminance information is supplied to a controller 34.
The controller 34 comprises a microprocessor, for example. The controller 34 has an auto-exposure (AE) control function and a function of determining a medium read voltage (Vm). Specifically, the controller 34 execute exposure control so that the luminance is set to a target luminance, for example, 650 LSB based on the supplied face luminance information. Further, when exposure control ends and the face luminance is determined, the controller 34 finds a knee point so that the face becomes a signal SL, and then, determines a Vm value from the knee pint. The controller 34 outputs a command CMD, a control signal CONT and the determined Vm value. The command CMD, control signal CONT and Vm value are supplied to a timing generator (TG) 19. The timing generator 19 generates the various pulse signals based on command CMD, control signal CONT and Vm value.
First, at time to, a reset pulse is released so that exposure (photoelectric conversion) is started. According to the high light case, a charge larger than a set medium read voltage (Vm value) is accumulated in a photodiode. For this reason, at time t1, a charge larger than the medium read voltage (Vm value) is partially transferred, and thus, discharged.
Charge (short-time exposure) is again carried out for a short time (TH) from time t1 to t2. Thereafter, a charge more than the Vm value is partially transferred, and then, detected as a signal SH.
At time t3, charges remaining at time t3 are fully transferred, and then, added to the charge of the detected signal SH, and thus, detected as a signal SL+SH.
In other words, two signals; specifically, a short-time exposure signal, that is, signal SH and the sum of a long-time exposure signal and the short-time exposure signal, that is, signal SL+SH are obtained as a sensor output.
As can be seen from
As can be seen from
As can be seen from
SF=G×SH (1)
According to the same condition as above, the exposure ratio is set to 16:1 and 32:1, and thereby, the maximum number of bits is 13 bits and 14 bits. Therefore, a dynamic range is expanded to eight times (WDR×8) and 16 times (WDR×16) as shown in
As can be seen from
Vm=Vn/(TL/(TL−TH)) (2)
Hereinafter, the following modes of the expansion will be set; specifically, a WDR×4 mode (12-bit mode), a WDR×8 mode (13-bit mode) and a WDR×16 mode (14-bit mode) will be explained. However, this embodiment is not limited to the expansions.
According to a conventional case, it is desired that the value of Vm is set to 512 LSB in the light of an expansion. However, the conventional case gives no consideration to the quality of long-time exposure signal (signal SL) and short-time exposure signal (signal SH). Specifically, the quality of signal SL rather than signal SH is relatively preferable in an SNR and a quantization error. For example, it is desired that an important subject such as a human face is controlled so that it is involved in signal SL. As can be seen from
In general, according to the standard suitable as an image, the face luminance is set to around 650 LSB. According to this embodiment, the value of Vm for determining the knee point is determined after being fed-back from luminance information of an important subject in auto-exposure (AE) control.
Specifically, the detected luminance information output from the AE detection unit 33 is supplied to the controller 34. Based on the supplied luminance information, the controller 34 executes exposure control so that the luminance is set to a target luminance of the face as an important subject, for example, 650 LSB (S12, S13). In other words, the controller 34 generates a control signal based on the luminance information supplied from the AE detection unit 33, and thereafter, supplies the signal to the timing generator 19. The timing generator 19 generates various pulse signals in accordance with the control signal, and thereafter, supplies them to the sensor core unit 11. Signals SH and SL+SH read from the sensor core unit 11 are successively processed according to loop of the linear synthesis circuit 31, the image signal processing circuit 32, the AE detection unit 33 and the controller 34.
AE control is carried out in the manner described above; as a result, face luminance information as an important subject converges on 600 LSB, for example. In this case, the controller 34 again converts the luminance information to RGB. For example, if the RGB has a relation of R:G:B=400:600:500 LSB, the maximum value of them, that is, G=600 LSB is obtained. When G=600 LSB, the knee point Vn is Vn=658 from
As described above, the value of Vm is determined in cooperation with AE control, and thereby, it is possible to take an image having an expanded dynamic range without reducing the image quality of an important subject such as a face.
In this case, a compression ratio of signal SH becomes higher than the case of Vm=512 LSB. For this reason, gradation on the signal SH side is relatively lost. This shows the trade-off between signal SL and signal SH given as a main point depending on a knee point position. In other words, according to this embodiment, the trade-off is optimized from luminance information in accordance with a shooting scene, that is, a face as an important subject.
Likewise, even if a face as an important subject is dark in some degree, there is a shooting scene desired to improve gradation characteristic of a high luminance portion. For example, there is the case of taking a scene outside a window from the room, and simultaneously, taking a human face in the room. In this case, a target luminance of the face is reduced to 520 LSB, and simultaneously, the value of Vm is reduced to 448 LSB. In this way, the face is involved in the signal SL side, and the compression ratio of signal SH is relaxed, and thereby, the gradation characteristic is improved. Of course, the value of Vm may be further reduced so that gradation characteristic on the signal SH side is improved.
The series of operation is carried out in a state that a shutter is half-pushed or a preview operating state if this embodiment is applied to a digital camera, for example.
Then, the shutter is operated, and thereafter, a signal SF_RGB of an image expanding the dynamic range is obtained without reducing the image quality of an important subject. Signal SF_RGB is subjected to signal processings such as white balance and linear matrix by means of the image signal processing circuit 32 shown in
According to the first embodiment, AE control is carried out based on luminance information of a face as an important subject. In a state that the AE control converges, the medium read voltage of the sensor, that is, the value of Vm is determined. Therefore, the knee point of the sensor is set higher than the value of Vm, and thereby, the image quality of a face as an important subject is improved.
If the accuracy of the AE control is high and high-accurately converges on a target luminance level, no feedback control may be carried out like the first embodiment. In this case, an accurate value of Vm is anticipated before AE control; therefore, an anticipated value of Vm is set in place of the temporary value of Vm, and thus, feed-forward control may be carried out.
The first embodiment has described the shooting scene to which Vm=448 LSB is applied. In this case, the expansion exceeds four times; for this reason, there is the possibility that data is saturated in the WDR×14 bit mode.
Thus, a method of optimizing a WDR mode will be described below with reference to
In this case, a WDR mode is set to the maximum. According to this modification example, a WDR×16 mode is the maximum, and when Vm=512 LSB is set; a dynamic range is expanded by 14 bits in the maximum. AE control is carried out based on luminance information of a face as an important subject from an image having an expanded dynamic range like the first embodiment (S21 to S23).
The AE control converges, and thereafter, the controller 34 determines the optimum value of Vm in the WDR×16 mode (S24).
As shown in
According to the modification example, the value of Vm is determined based on luminance information of a face as an important subject. Thereafter, a WDR mode is optimized based on accumulated histogram of the luminance information. Therefore, this serves to prevent white defect while to reduce contrast compression on the high-luminance side to the minimum. In this way, it is possible to provide a high image quality in a wide dynamic range image.
As described in the first embodiment, when a display device displays an image having an expanded dynamic range, the image is set to a bit map (BMP) file narrow range, for example, sRGB 8 bits. For this reason, there is a need to effectively compress the image having an expanded dynamic range.
According to the second embodiment, luminance information of a face as an important subject is used to compress a dynamic range. Specifically, as shown in
A knee point position is important in data compression as well as data expansion. Thus, according to the second embodiment, luminance information of a face output from the AE detection unit 33 is supplied to the compressor 35b for executing knee compression. The compressor 35b determines a knee point based on luminance information of a face as an important subject to compress a luminance signal. Therefore, signal linear characteristic of a face is secured while gradation of a high-light portion is compressed to the minimum. In this way, it is possible to generate a high-quality dynamic range image.
The second embodiment has described fixed knee compression of a luminance signal. This embodiment is applicable to dynamic range compression using the property of retina such as a Retinex processing circuit. In this case, knee compression is carried out in compressing illumination light. Therefore, the second embodiment is applied to the knee compression. In this way, it is possible to determine a knee point based on illumination light, and to compress the illumination light.
Additional advantages and modifications will readily occur to those skilled in the art. Therefore, the invention in its broader aspects is not limited to the specific details and representative embodiments shown and described herein. Accordingly, various modifications may be made without departing from the spirit or scope of the general inventive concept as defined by the appended claims and their equivalents.
Number | Date | Country | Kind |
---|---|---|---|
2009-060934 | Mar 2009 | JP | national |