The present disclosure relates to an image processing apparatus, an imaging apparatus, and an image processing method. Particularly, the present disclosure relates to an image processing apparatus, an imaging apparatus, and an image processing method that can control a field of view of a captured image of a lensless camera (lensless imaging apparatus) and that generate a restored image including part of an imaging region.
In recent years, development of lensless cameras that can capture an image without using a lens has progressed. A conventional general camera inputs light transmitted through a lens into an image sensor as an imaging device to capture an image. In place of the lens, the lensless camera uses a mask in which transmission regions and non-transmission regions of light are set as a two-dimensional pattern and inputs the light transmitted through the mask into an image sensor. The image sensor captures an image through the mask. Predetermined image processing can be applied to the data captured through the mask to generate a two-dimensional image corresponding to the captured scene, similar to that of the general camera. Because the lensless camera does not require a lens, downsizing and weight reduction of the imaging apparatus are realized.
As described above, the lensless camera inputs the light transmitted through the mask into the image sensor. Information indicating how the emitted light from the scene (captured scene) is transmitted through the mask and projected to the sensor is defined in advance as a matrix, and the matrix and the image (observed image) projected to the sensor are used to generate an image (restored image) in which the actual scene is reproduced.
Note that the lensless camera is described in, for example, the following literature.
PTL 1 (PCT Patent Publication No. WO2012/040192).
The following process is an example of a reconstruction process of the captured image for generating, from the image (observed image) projected to the image sensor of the lensless camera, the image (restored image) in which the actual scene is reproduced. The emitted light of the captured scene is expressed as luminance values of a plurality of three-dimensional sampling points, and a plurality of sets of sensor observed values projected to the image sensor through the mask is obtained in advance in a simulation or the like.
Furthermore, a mask matrix expressing a mask is calculated from the sets. An inverse matrix of the mask matrix is obtained, and the inverse matrix is applied to observed values, that is, pixel values of the image sensor of the lensless camera to thereby restore the luminance values of the sampling points included in the captured scene. The luminance values of the sampling points are luminance values indicating the image of the captured scene.
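The calibration and restoration steps described above can be sketched numerically as follows. This is a minimal sketch under stated assumptions, not the method of the disclosure: the sizes N, P, and K, the random stand-in for the simulator (M_true), and the use of a least-squares fit to obtain the mask matrix from the sets are all assumptions made for illustration.

```python
import numpy as np

rng = np.random.default_rng(0)

N = 48   # number of sensor pixels (hypothetical)
P = 12   # number of scene sampling points (hypothetical)
K = 40   # number of simulated calibration sets (hypothetical)

# Stand-in for the simulator: a "true" mask response unknown to the estimator.
M_true = rng.integers(0, 2, size=(N, P)).astype(float)

# Simulated calibration sets: each column of X is one scene, and the matching
# column of Y is the sensor observation that scene produces.
X = rng.random((P, K))
Y = M_true @ X

# Least-squares estimate of the mask matrix from the sets: Y = M X, so M = Y X^+.
M_est = Y @ np.linalg.pinv(X)

# Restore a new scene from its observation with the pseudo-inverse of M_est.
x_scene = rng.random(P)
y_obs = M_true @ x_scene
x_restored = np.linalg.pinv(M_est) @ y_obs
```

In this noise-free toy setting the sampling-point luminance values are recovered up to numerical precision. With real sensor noise, a regularized inverse would typically be needed.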
When the light emitted from the P three-dimensional sampling points is written as a scene emitted light vector x of the length P, and the observed values of the two-dimensional sensor corresponding to the number of pixels N that receive the light are expressed as a scene observed value vector y of the length N, the relationship can be represented as follows.
y=Mx
Here, M is a matrix representing a transmission function of the mask, that is, a mask matrix.
To reproduce the emitted light of the scene, x satisfying the relationship of the expression can be obtained from the sensor observed values y of the lensless camera. The resolution of the reproduced scene can be improved by increasing the number P of three-dimensional sampling points included in the scene. However, this increases the time it takes to estimate the scene emitted light vector x, and the reconstruction of the image requires a long time.
The expression y=Mx indicates that all of the light entering the sensor of the lensless camera is superimposed on the sensor, and the field of view (FOV) of the restored image is significantly wide.
PCT Patent Publication No. WO2012/040192
The field of view (FOV) of the lensless camera is wide, and the number of sampling points P expressing the entire scene is large. However, in the case where the lensless camera is used as a camera, it is often desirable to narrow down the FOV to some degree to limit the imaging range. Although it is possible to cut out only part of the captured image, there is a problem that the resolution is reduced in that case.
In addition, although there is the method of increasing the resolution by increasing the number of three-dimensional sampling points P that can be reproduced as a captured image as described above, there is a problem that the method increases the image reconstruction time, and the method is not practical.
The present disclosure has been made in view of, for example, the problems described above, and an object of the present disclosure is to provide an image processing apparatus, an imaging apparatus, and an image processing method that can change a field of view (FOV) of a captured image captured by a lensless camera without reducing resolution or increasing an amount of calculation.
A first aspect of the present disclosure provides an image processing apparatus including a signal processing unit that receives observed image signals as output of an image sensor of a lensless camera to generate a restored image of a restored image region including part of a captured image region of the lensless camera, in which the signal processing unit generates the restored image by executing arithmetic processing of observed image signals in a restored region generated by subtracting observed image signals not included in the restored image region from the observed image signals and an inverse matrix or a pseudo-inverse matrix of a matrix applied to generate the restored image.
Furthermore, a second aspect of the present disclosure provides an imaging apparatus including: a mask in which transmission regions and non-transmission regions of light are set as a two-dimensional pattern; an imaging unit including an image sensor that receives the light through the mask; and a signal processing unit that receives observed image signals as output of the image sensor to generate a restored image of a restored image region including part of a captured image region of the imaging unit, in which the signal processing unit generates the restored image by executing arithmetic processing of observed image signals in a restored region generated by subtracting observed image signals not included in the restored image region from the observed image signals and an inverse matrix or a pseudo-inverse matrix of a matrix applied to generate the restored image.
Furthermore, a third aspect of the present disclosure provides an image processing method executed by an image processing apparatus, in which the image processing apparatus includes a signal processing unit that receives observed image signals as output of an image sensor of a lensless camera to generate a restored image of a restored image region including part of a captured image region of the lensless camera, and the signal processing unit generates the restored image by executing arithmetic processing of observed image signals in a restored region generated by subtracting observed image signals not included in the restored image region from the observed image signals and an inverse matrix or a pseudo-inverse matrix of a matrix applied to generate the restored image.
Note that other objects, features, and advantages of the present disclosure will become apparent from more detailed description based on an embodiment of the present disclosure described later and the attached drawings. Note that in the present specification, a system denotes a logical set configuration of a plurality of apparatuses, and the apparatuses of the configurations may not be in the same housing.
According to the configuration of the embodiment of the present disclosure, the field of view of the captured image of the lensless camera can be controlled, and the configuration of generating the restored image including part of the imaging region is realized. Note that the advantageous effect described in the present specification is illustrative only, and the advantageous effect is not limited. There may also be additional advantageous effects.
Hereinafter, details of an image processing apparatus, an imaging apparatus, and an image processing method of the present disclosure will be described with reference to the drawings. Note that the details will be described according to the following items.
1. Overview and Principle of Lensless Camera
2. Setting of Field of View in Lensless Camera
3. Configuration and Process of Image Processing Apparatus of Present Disclosure
4. Detailed Sequence of Mask Matrix Calculation Process in Mask Matrix Calculation Unit
5. Detailed Sequence of Image Estimation Process in Image Estimation Unit
6. Example of Setting Sampling Points and Example of Generating Image
7. Example of Generation Process of Image with Partially Improved Resolution
8. Processing Example of Dispersing Emitted Light from Sampling Points in Wavelength Direction
9. Advantageous Effects of Image Processing of Present Disclosure
10. Example of Hardware Configuration of Image Processing Apparatus
11. Conclusion of Configuration of Present Disclosure
First, an overview and a principle of a lensless camera will be described.
The overview of the lensless camera (lensless imaging apparatus) will be described by comparing the lensless camera with the configuration of a general camera (imaging apparatus) with reference to
(a) Lensless camera
(b) Camera with lens
(c) Pinhole camera
(c) The pinhole camera includes: a pinhole 21 provided as a hole on a light shielding film 12; and an image sensor (solid-state imaging device) 11. In the case of the imaging apparatus including the pinhole, beams L1 to L3 emitted from different light sources on an object surface transmit through the pinhole 21, and images are formed at pixels I1 to I3 on the image sensor (solid-state imaging device) 11 as illustrated in the diagram of (c) the pinhole camera on the lower right of
In the case of the pinhole camera, of the beams L1 to L3 emitted from the light sources, only the beam corresponding to one pixel forms the image at each pixel on the image sensor (solid-state imaging device) 11. Therefore, a dark image is captured.
(b) The camera with lens on the upper right of
As for the image sensor (solid-state imaging device) 11 in the case of (b) the camera with lens on the upper right of
As illustrated in the diagram of (b) the camera with lens on the upper right of
As described with reference to (b) the camera with lens on the upper right of
However, the size of the imaging apparatus (imaging device) is determined by the imaging lens and the focal length of the imaging lens, and there is a limit to downsizing.
On the other hand, (a) the lensless camera illustrated on the left side of
(a) The lensless camera illustrated on the left side of
Here, the mask 51 is a mask including a mask pattern in which the sizes of the openings 51a and light-shielding portions 51b are randomly set in the horizontal direction and the vertical direction in units with a unit size A as illustrated on the lower part of (a) the lensless camera in
According to the configuration, the beams L1 to L3 are modulated and enter the image sensor 11 based on the unit size A and the size of the distance d.
More specifically, it is assumed that the light sources of the beams L1 to L3 in the upper diagram of (a) the lensless camera of
In the case of the lensless camera, the incident light is modulated by the openings 51a randomly set on the mask 51, and the detection sensitivity of each pixel has directivity according to the incident angle as illustrated on the upper left of
That is, assuming that the light sources included in an object surface 71 are point light sources, beams at the same light intensity emitted by the same point light source enter the image sensor 11. However, the beams are modulated by the mask 51, and the incident angles vary according to the regions on the imaging surface of the image sensor 11. Furthermore, the variations in the incident angle of the incident light according to the regions on the image sensor 11 due to the mask 51 provide light sensitivity characteristics, that is, directivity according to the incident angle. Therefore, the beams at the same light intensity are detected at different sensitivities according to the regions on the image sensor 11 due to the mask 51 provided on the earlier stage of the imaging surface of the image sensor 11, and detection signals are detected at detection signal levels that vary according to the regions.
More specifically, as illustrated on the upper right of
DA=α1×a+β1×b+γ1×c (1)
DB=α2×a+β2×b+γ2×c (2)
DC=α3×a+β3×b+γ3×c (3)
Here, α1 is a coefficient for a detection signal level a that is set according to the incident angle of the beam from the point light source PA on the object surface 71 to be restored, at the position Pa on the image sensor 11. In addition, β1 is a coefficient for a detection signal level b that is set according to the incident angle of the beam from the point light source PB on the object surface 71 to be restored, at the position Pa on the image sensor 11. Furthermore, γ1 is a coefficient for a detection signal level c that is set according to the incident angle of the beam from the point light source PC on the object surface 71 to be restored, at the position Pa on the image sensor 11.
Therefore, (α1×a) of the detection signal level DA is a value indicating the detection signal level based on the beam from the point light source PA at the position Pa. In addition, (β1×b) of the detection signal level DA is a value indicating the detection signal level based on the beam from the point light source PB at the position Pa. Furthermore, (γ1×c) of the detection signal level DA is a value indicating the detection signal level based on the beam from the point light source PC at the position Pa.
Therefore, the detection signal level DA is expressed as a combined value of values obtained by multiplying the components of the point light sources PA, PB, and PC at the position Pa by the coefficients α1, β1, and γ1, respectively. Hereinafter, the coefficients α1, β1, and γ1 will be collectively referred to as a set of coefficients.
Similarly, the set of coefficients α2, β2, and γ2 regarding the detection signal level DB at the position Pb corresponds to the set of coefficients α1, β1, and γ1 regarding the detection signal level DA at the position Pa, respectively. In addition, the set of coefficients α3, β3, and γ3 regarding the detection signal level DC at the position Pc corresponds to the set of coefficients α1, β1, and γ1 regarding the detection signal level DA at the position Pa, respectively.
However, the detection signal levels of the pixels at the positions Pa, Pb, and Pc are values expressed by the sums of products of the light intensities a, b, and c of the beams emitted by the point light sources PA, PB, and PC, respectively, and the coefficients. Therefore, the light intensities a, b, and c of the beams emitted from the point light sources PA, PB, and PC, respectively, are mixed in the detection signal levels, and this is different from the image obtained by forming the image of the object.
That is, the set of coefficients α1, β1, and γ1, the set of coefficients α2, β2, and γ2, the set of coefficients α3, β3, and γ3, and the detection signal levels DA, DB, and DC are used to form simultaneous equations, and the light intensities a, b, and c are solved to obtain the pixel values at the positions Pa, Pb, and Pc as illustrated on the lower right of
In addition, in a case where the distance between the image sensor 11 illustrated on the upper left of
Therefore, the sets of coefficients can be changed to correspond to various distances in one imaging, and this can reconstruct the images of the object surface at various distances from the imaging position.
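Equations (1) to (3) and their solution can be illustrated with a small numerical sketch. The coefficient values and the light intensities below are hypothetical and chosen only so that the system is solvable; they are not taken from the disclosure.

```python
import numpy as np

# Hypothetical sets of coefficients for the positions Pa, Pb, and Pc.
coeffs = np.array([
    [0.9, 0.3, 0.1],   # alpha1, beta1, gamma1 -> DA
    [0.2, 0.8, 0.4],   # alpha2, beta2, gamma2 -> DB
    [0.1, 0.5, 0.7],   # alpha3, beta3, gamma3 -> DC
])

# Hypothetical light intensities a, b, c of the point light sources PA, PB, PC.
intensities = np.array([1.0, 2.0, 3.0])

# Equations (1) to (3): each detection signal level mixes all three intensities.
detected = coeffs @ intensities   # DA, DB, DC

# Solving the simultaneous equations recovers a, b, and c.
recovered = np.linalg.solve(coeffs, detected)
```

Using a different set of coefficients for a different object distance changes only the matrix `coeffs`; the same detection signal levels can then be solved again for that distance.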
As a result, in the imaging using the lensless camera, there is no need to be conscious of a phenomenon such as so-called out-of-focus, in which an imaging apparatus using a lens captures the image while the focal point is deviated. As long as the image is captured such that the object to be imaged is included in the field of view, the sets of coefficients can be changed according to the distance to reconstruct the images of the object surface at various distances after the imaging.
Note that the detection signal levels illustrated on the upper right of
The configuration can realize a so-called lensless camera that does not require an imaging lens or a pinhole. As a result, the imaging lens, the pinhole, and the like are not required components, and the height of the imaging apparatus, that is, the thickness in the incident direction of light in the configuration for realizing the imaging function, can be reduced. In addition, the sets of coefficients can be changed in various ways, and the restored images (final images) on the object surface at various distances can be reconstructed and restored.
Note that hereinafter, the image before reconstruction captured by the image sensor will be simply referred to as an observed image, and the image reconstructed and restored by applying signal processing to the observed image will be referred to as a restored image (final image). Therefore, from one observed image, the sets of coefficients described above can be changed in various ways to reconstruct, as final images, the images on the object surface 71 at various distances.
In the imaging device of a general lensless camera, the unit size of the openings 51a in the mask 51 is uniformly set across the entire region as illustrated for example in
Next, setting of the field of view in the lensless camera will be described.
The image sensor 81 captures the image (observed image) 86 through the mask 82. The captured data transmitted through the mask 82, that is, the observed image 86 on the image sensor 81, is input to the signal processing unit 83. The signal processing unit 83 applies predetermined signal processing to the observed image 86 on the image sensor 81 to generate the restored image (final image) 87 as a two-dimensional image corresponding to the captured scene as in the general camera.
The light emitted from P three-dimensional sampling points of the region to be imaged can be written as a scene emitted light vector x of a length P, and observed values of the image sensor 81 corresponding to the number of pixels N that receive the light can be expressed as a scene observed value vector y of the length N. The relationship can be represented by using the following relational expression.
y=Mx
Here, M is a matrix representing a transmission function of the mask 82.
To reproduce the captured image of the object (=emitted light of scene), x satisfying the relationship of the expression can be obtained from the sensor observed values y. The signal processing unit 83 multiplies the sensor observed values y input from the image sensor 81 by a matrix including an inverse matrix of the matrix representing the transmission function of the mask 82 to calculate the scene emitted light vector x, and thereby calculates the pixel values of the restored image (final image) 87 including the object 85.
Note that a plurality of sets of sensor observed values projected to the image sensor 81 through the mask 82 is obtained in advance in a simulation or the like, and a mask matrix expressing the characteristics (transmission function) of the mask 82 is calculated in advance from the sets. The signal processing unit 83 applies an inverse matrix of the matrix and restores the luminance value of each sampling point of the region to be imaged to generate the restored image (final image) 87 based on the sensor observed values y input from the image sensor 81.
To reproduce the emitted light of the scene, x satisfying the expression, that is, the relationship
y=Mx
can be obtained from the sensor observed values y. The vector x includes an emitted light vector of each sampling point. That is, the vector x indicates the emitted light on the basis of points included in the captured scene, and values obtained by developing the emitted light vector of each sampling point on a two-dimensional plane correspond to configuration pixel values of the restored image (final image) 87.
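The step of developing the estimated vector x on a two-dimensional plane can be sketched as follows. This is an illustrative sketch only: the image size H by W, the pixel count N, and the random stand-in for the mask matrix M are assumptions, and the pseudo-inverse is used as one possible way to obtain x from y.

```python
import numpy as np

rng = np.random.default_rng(1)

H, W = 4, 6   # restored image size (hypothetical); P = H * W sampling points
P = H * W
N = 64        # number of sensor pixels (hypothetical), N >= P in this sketch

M = rng.integers(0, 2, size=(N, P)).astype(float)   # stand-in mask matrix
scene = rng.random((H, W))                           # stand-in scene luminance

y = M @ scene.reshape(P)          # observed values on the sensor (length N)
x_hat = np.linalg.pinv(M) @ y     # estimated scene emitted light vector (length P)
restored = x_hat.reshape(H, W)    # develop the vector on a two-dimensional plane
```

Each element of `restored` then plays the role of one pixel value of the restored image (final image) 87.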
The resolution of the scene reproduced as the restored image (final image) 87 can be improved by increasing the number P of three-dimensional sampling points included in the scene. However, the increase in the number of sampling points P increases the time for estimating the scene emitted light vector x, and the reconstruction of the image, that is, generation process of the restored image (final image) 87, takes a long time.
The expression y=Mx indicates that all of the light entering the sensor is superimposed on the sensor, and the field of view (FOV) of the image to be restored is significantly wide.
The field of view (FOV) of the lensless camera is wide, and the number of sampling points for expressing the entire scene is large. However, in a case where the lensless camera is used as a camera, it is often desirable to narrow down the FOV to some degree to limit the imaging range.
A specific example will be described with reference to
(a) Example of Captured Image with Wide Field of View
(b) Example of Captured Image with Narrow Field of View
The user (photographer) may want to capture an image by limiting the imaging range to only a specific object, such as a region of a person, as illustrated in
For a process of executing this, it is possible to cut out only part of the captured image. However, there is a problem that the resolution is reduced in that case.
Furthermore, as described above, there is also a method of increasing the resolution by increasing the number of three-dimensional sampling points P that can be reproduced in the captured image. However, there is a problem that the image reconstruction time increases, and this is not practical.
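The trade-off between cutting out part of the image and increasing the number of sampling points can be made concrete with simple arithmetic. The sketch below assumes that sampling points are spread uniformly over a square field of view, and all of the numbers (sampling grid, pixel count, and angles) are hypothetical.

```python
def points_after_crop(p_total, fov_full_deg, fov_roi_deg):
    """Sampling points that remain inside the ROI after cropping.

    Assumes sampling points spread uniformly over a square field of view.
    """
    fraction = (fov_roi_deg / fov_full_deg) ** 2
    return round(p_total * fraction)


def points_needed_for_roi(p_roi_target, fov_full_deg, fov_roi_deg):
    """Total sampling points needed so that p_roi_target points fall in the ROI."""
    fraction = (fov_roi_deg / fov_full_deg) ** 2
    return round(p_roi_target / fraction)


# Hypothetical numbers: 100 x 100 sampling points, 120-degree FOV, 30-degree ROI.
P = 100 * 100
N = 100 * 100                                     # sensor pixels
kept = points_after_crop(P, 120.0, 30.0)          # points left after cropping
needed = points_needed_for_roi(P, 120.0, 30.0)    # points to keep P inside the ROI
entries_before = N * P                            # size of the N x P mask matrix
entries_after = N * needed
```

Under these assumptions, cropping leaves 625 of the 10,000 sampling points, while keeping 10,000 points inside the ROI by raising the density everywhere requires 160,000 points and a mask matrix 16 times larger.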
Another example of a method for changing the field of view (FOV) is a method of changing the distance between the image sensor 81 and the mask 82 to restrict, in terms of hardware, the light entering the image sensor 81 as illustrated in
As compared to the setting of
In the example illustrated in
Note that a plurality of LCDs can be arranged in front of the sensor to control transmission/non-transmission of light from an arbitrary direction. Controlling the plurality of layers of LCDs also makes it possible to perform pan-tilt or to project only a plurality of ROIs on the screen. PTL 1 illustrates a method of arranging a MEMS array filter in front of the sensor and switching the transmission/non-transmission of light to thereby change the FOV. However, all of these methods require additional hardware, and an increase in the size and the cost is inevitable. In addition, although an optical filter is used as a mask, measures are not taken except for the transmission wavelength region. For example, an appropriate filter corresponding to the far-infrared region is not mentioned.
Next, a configuration and a process of an image processing apparatus of the present disclosure will be described with reference to
For example, the signal processing unit 101 receives observed image signals as output of an image sensor of the lensless camera 102 and generates a restored image of a restored image region including part of a captured image region of the lensless camera 102.
The lensless camera 102 is a camera including the image sensor 81 and the mask 82 described with reference to
The signal processing unit 101 receives a captured image (observed values yall) as output of the lensless camera 102. The captured image (observed values yall) includes pixel values on the image sensor according to the amount of light received through the mask of the lensless camera 102.
The signal processing unit 101 applies signal processing to the input image (captured image (observed values yall)) and outputs a restored image (x̂roi) set to a predetermined field of view (FOV). Note that (x̂) denotes the character x with a circumflex (^) above it. The same applies to the following description and to other characters; for example, (ŷ) denotes the character y with a circumflex (^) above it.
The user can freely set the field of view (FOV). A captured image (observed values yall) 110 as output of the lensless camera 102 is a captured image with one fixed field of view unrelated to the field of view set by the user.
The signal processing unit 101 receives information including
(input 1) the captured image (observed values yall) 110 as output of the lensless camera 102 and
(input 2) “restored image region corresponding ROI (Region of Interest) information” Rroi equivalent to field-of-view setting information of the restored image (x̂roi) to be output
and generates and outputs output information including
(output 1) the restored image (x̂roi) set to the predetermined field of view (FOV).
Of the restored image (x̂roi) as output data, (x̂roi) is equivalent to the emitted light of each sampling point x included in a captured scene (ROI) in the predetermined field of view. That is, the restored image (x̂roi) is an image obtained by reproducing the object image of the captured scene and is an image similar to the image captured by a normal camera with lens.
The signal processing unit 101 holds
captured image region corresponding ROI information (Rall) 104,
mask information (IM) 105, and
camera configuration information (Ic) 106
in a storage unit. Note that although the storage unit storing the information is illustrated inside of the signal processing unit 101 in
The signal processing unit 101 further includes data processing units including
a captured image region corresponding mask matrix (Mall) calculation unit 107,
a restored image region corresponding mask matrix (Mroi) calculation unit 108, and
an image estimation unit 109.
The captured image region corresponding mask matrix (Mall) calculation unit 107 calculates a mask matrix (Mall) corresponding to the entire captured image (all ROI) captured by the image sensor of the lensless camera 102. The mask matrix (Mall) is a matrix representing the transmission function of the mask described above.
That is, the matrix represents the transmission function of a mask satisfying the relational expression
y=Mx
where the light emitted from the P three-dimensional sampling points of the region to be imaged is written as the scene emitted light vector x of the length P, and the observed values of the image sensor corresponding to the number of pixels N that receive the light are expressed as the scene observed value vector y of the length N.
On the other hand, the restored image region corresponding mask matrix (Mroi) calculation unit 108 calculates a mask matrix (Mroi) corresponding to the restored image (x̂roi) as output data, instead of the entire captured image (all ROI) captured by the image sensor of the lensless camera 102. That is, the restored image region corresponding mask matrix (Mroi) calculation unit 108 calculates a restored image region corresponding mask matrix applied to generate the restored image of the restored image region as part of the captured image region of the lensless camera 102.
For example, as illustrated in
The three-dimensional region is equivalent to the setting region of the sampling points where the scene emitted light vector x of the relational expression
y=Mx
is measured. The captured image region corresponding mask matrix (Mall) calculation unit 107 calculates the mask matrix (Mall) representing the transmission function of the mask that can be applied to the relational expression
y=Mx
in a case where, for example, it is assumed that the sampling points are arranged in the captured image region (all ROI) 121 illustrated in
On the other hand, the restored image region corresponding mask matrix (Mroi) calculation unit 108 calculates a mask matrix (Mroi) corresponding to the restored image region (restored ROI) 122. That is, the restored image region corresponding mask matrix (Mroi) calculation unit 108 calculates the mask matrix (Mroi) representing the transmission function of the mask that can be applied to the relational expression
y=Mx
in a case where, for example, it is assumed that the sampling points are arranged on the restored image region (restored ROI) 122 illustrated in
The captured image region corresponding mask matrix (Mall) calculation unit 107 receives information including
the captured image region corresponding ROI information (Rall) 104,
the mask information (IM) 105, and
the camera configuration information (Ic) 106
and calculates a mask matrix (Mall) corresponding to the captured image region (all ROI) 121.
The captured image region corresponding ROI information (Rall) 104 is, for example, setting range information of the captured image region (all ROI) 121 illustrated in
The camera configuration information (Ic) 106 is information regarding the mask and the image sensor and includes, for example, the mask size, the image sensor size, the number of image sensor pixels (N), and the information of distance between the mask and the sensor. Furthermore, the number (P) of sampling points set in the captured image region and the sampling point setting position (x) information are also recorded.
The captured image region corresponding mask matrix (Mall) calculation unit 107 receives the information and calculates the mask matrix (Mall) corresponding to the captured image region (all ROI) 121.
On the other hand, the restored image region corresponding mask matrix (Mroi) calculation unit 108 calculates the mask matrix (Mroi) corresponding to the restored image region (restored ROI) 122. The restored image region corresponding mask matrix (Mroi) calculation unit 108 receives information including
restored image region corresponding ROI information (Rroi) 103,
the mask information (IM) 105, and
the camera configuration information (Ic) 106
and calculates the mask matrix (Mroi) corresponding to the restored image region (restored ROI) 122.
The restored image region corresponding ROI information (Rroi) 103 is, for example, setting range information of the restored image region (restored ROI) 122 illustrated in
The mask information (IM) 105 is, for example, the two-dimensional pattern information including the transmission regions and the non-transmission regions of light in the mask. The camera configuration information (Ic) 106 includes the distance between the mask and the image sensor, the number of pixels (N) information of the image sensor, and the like. The restored image region corresponding mask matrix (Mroi) calculation unit 108 receives the information and calculates the mask matrix (Mroi) corresponding to the restored image region (restored ROI) 122.
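How a mask matrix could be derived from the mask information, the camera configuration information, and the ROI information can be sketched with a deliberately simplified model. Everything in this sketch is an assumption made for illustration and is not the calculation sequence of the disclosure: the function name build_mask_matrix, the one-dimensional geometry, the binary pass/block ray model (no falloff, no diffraction), and all numeric values are hypothetical.

```python
import numpy as np


def build_mask_matrix(mask_pattern, unit_size, mask_sensor_distance,
                      pixel_positions, sample_x, sample_z):
    """Return an N x P matrix of 0/1 transmissions for a simplified 1D geometry.

    Assumed model: the sensor lies on z = 0, the mask on
    z = mask_sensor_distance, and a ray from a sampling point to a pixel is
    passed or blocked by the mask cell it crosses.
    """
    mask_pattern = np.asarray(mask_pattern, dtype=float)
    pixels = np.asarray(pixel_positions, dtype=float)[:, None]   # shape (N, 1)
    sx = np.asarray(sample_x, dtype=float)[None, :]              # shape (1, P)
    sz = np.asarray(sample_z, dtype=float)[None, :]              # shape (1, P)

    # Point where the ray from (sx, sz) to (pixel, 0) crosses the mask plane.
    cross = pixels + (sx - pixels) * (mask_sensor_distance / sz)

    # Index of the mask cell containing the crossing point (mask centered at 0).
    half_width = mask_pattern.size * unit_size / 2.0
    cell = np.floor((cross + half_width) / unit_size).astype(int)

    # Rays that miss the mask are treated as blocked.
    inside = (cell >= 0) & (cell < mask_pattern.size)
    matrix = np.zeros(cross.shape)
    matrix[inside] = mask_pattern[cell[inside]]
    return matrix


rng = np.random.default_rng(2)
mask = rng.integers(0, 2, size=64)     # stand-in for the mask information (IM)
pixels = np.linspace(-1.0, 1.0, 32)    # N = 32 pixel centers on the sensor
P = 16                                 # number of sampling points
z = np.full(P, 50.0)                   # distance of the sampling plane

# Same number of sampling points, spread over the whole region or only the ROI.
M_all = build_mask_matrix(mask, 0.05, 1.0, pixels, np.linspace(-40, 40, P), z)
M_roi = build_mask_matrix(mask, 0.05, 1.0, pixels, np.linspace(-10, 10, P), z)
```

In this sketch the only difference between the two matrices is where the sampling points are placed, which mirrors the roles of the ROI information (Rall) and (Rroi) as inputs.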
Note that a detailed sequence of a mask matrix calculation process executed by the mask matrix calculation units including
the captured image region corresponding mask matrix (Mall) calculation unit 107 and
the restored image region corresponding mask matrix (Mroi) calculation unit 108
will be described in detail in a later stage with reference to a flow chart illustrated in
The image estimation unit 109 receives the following information including (a) the captured image (observed values yall) as output of the lensless camera 102,
(b) the mask information (IM) 105,
(c) the camera configuration information (Ic) 106,
(d) the captured image region corresponding ROI information (Rall) 104,
(e) the restored image region corresponding ROI information (Rroi) 103,
(f) the captured image region corresponding mask matrix (Mall) calculated by the captured image region corresponding mask matrix (Mall) calculation unit 107, and
(g) the restored image region corresponding mask matrix (Mroi) calculated by the restored image region corresponding mask matrix (Mroi) calculation unit 108.
The image estimation unit 109 receives the information and outputs the restored image (x̂roi) set to the predetermined field of view (FOV). In the restored image (x̂roi) as output data, x̂roi denotes the emitted light of each sampling point x included in the captured scene (ROI) in the predetermined field of view, such as the restored image region (restored ROI) 122 illustrated in
The image estimation unit 109, for example, subtracts the observed image signals originating outside of the restored image region from the observed image signals output by the image sensor of the lensless camera 102 to calculate the observed image signals inside of the restored region and
executes arithmetic processing of the observed image signals inside of the restored region and a pseudo-inverse matrix or an inverse matrix of the restored image region corresponding mask matrix to generate a restored image of the restored image region including part of the captured image region of the lensless camera 102. The detailed sequence of the restored image generation process executed by the image estimation unit 109 will be described later with reference to a flow chart illustrated in
Next, the detailed sequence of the mask matrix calculation process executed by the mask matrix calculation units including
the captured image region corresponding mask matrix (Mall) calculation unit 107 and
the restored image region corresponding mask matrix (Mroi) calculation unit 108
will be described with reference to a flow chart illustrated in
Note that as described above,
the captured image region corresponding mask matrix (Mall) calculation unit 107 calculates the mask matrix (Mall) corresponding to the captured image region (all ROI) 121 illustrated for example in
The captured image region corresponding mask matrix (Mall) calculation unit 107 and
the restored image region corresponding mask matrix (Mroi) calculation unit 108
of the signal processing unit 101
first acquire the mask information IM in step S101. Note that the two mask matrix calculation units including
the captured image region corresponding mask matrix (Mall) calculation unit 107 and
the restored image region corresponding mask matrix (Mroi) calculation unit 108
will be simply referred to as a mask matrix calculation unit in the description.
The mask information IM acquired in step S101 is, for example, the two-dimensional pattern information including the transmission regions and the non-transmission regions of light in the mask as described above.
Next, the mask matrix calculation unit acquires the camera configuration information Ic in step S102. The camera configuration information Ic is information regarding the mask and the image sensor and includes, for example, the mask size, the image sensor size, the number of image sensor pixels (N), the information of distance between the mask and the sensor, and the like.
Next, the mask matrix calculation unit acquires the ROI information in step S103. In the process of step S103, different processes are executed in the two mask matrix calculation units including
the captured image region corresponding mask matrix (Mall) calculation unit 107 and
the restored image region corresponding mask matrix (Mroi) calculation unit 108.
The captured image region corresponding mask matrix (Mall) calculation unit 107 acquires the captured image region corresponding ROI information (Rall) 104 illustrated in
On the other hand, the restored image region corresponding mask matrix (Mroi) calculation unit 108 acquires the restored image region corresponding ROI information (Rroi) 103 illustrated in
Note that the number (P) of sampling points and the setting position (x) information can also be acquired from the camera configuration information Ic 106.
Next, the mask matrix calculation unit uses the number of sensor pixels N, the positions x of the sampling points, and the number of sampling points P to prepare and initialize a P×N matrix M in step S104. That is,
the mask matrix calculation unit generates a matrix including P×N elements (matrix elements) defined by
the number of sensor pixels (N) and
the number of sampling points (P).
Note that the initial value of each element is, for example, 0.
Note that here, the matrices generated by
the captured image region corresponding mask matrix (Mall) calculation unit 107 and
the restored image region corresponding mask matrix (Mroi) calculation unit 108 are different matrices.
The captured image region corresponding mask matrix (Mall) calculation unit 107 assumes that the sampling points are set in the captured image region (all ROI) 121 illustrated in
On the other hand, the restored image region corresponding mask matrix (Mroi) calculation unit 108 assumes that the sampling points are set in the restored image region (restored ROI) 122 illustrated in
Note that
in the matrices including P×N elements generated by
the captured image region corresponding mask matrix (Mall) calculation unit 107 and
the restored image region corresponding mask matrix (Mroi) calculation unit 108,
the number of sensor pixels N is the number of pixels (N) of the image sensor of the lensless camera 102 and is therefore the same for both matrices.
On the other hand, the number of sampling points P can be freely set. For example, the numbers of sampling points P of the matrices including P×N elements generated by the captured image region corresponding mask matrix (Mall) calculation unit 107 and the restored image region corresponding mask matrix (Mroi) calculation unit 108 may be set in proportion to the sizes of the captured image region (all ROI) 121 and the restored image region (restored ROI) 122 illustrated in
In the case where the same number of sampling points P is set, the restored image of the restored image region (restored ROI) 122 can be an image with higher resolution than the case in which the number of sampling points P is set in proportion to the size of the region. As described above, the resolution of the restored image can be improved by increasing the number of sampling points P. That is, a large number of dense sampling points can be set in the restored image region (restored ROI) 122 illustrated in
The process from step S105 is a setting process of a value of each element of the matrix generated in step S104.
First, an element of the matrix (matrix element) as a target of setting the value is selected in steps S105 and S106. Initially, the element (p=0, n=0) on the upper left corner of the P×N matrix is selected. Note that p and n are element identification indices of the P×N matrix.
Next, in step S107, the mask matrix calculation unit obtains a sensor pixel value (y) of an nth pixel at the time that the light of a pth sampling point x(p) is projected to the sensor through the mask. The pixel value calculation process can be executed in a simulation process or an actual measurement process.
Next, in step S108, the mask matrix calculation unit applies the sensor pixel value (y) calculated in the process of step S107,
that is, the value of the sensor pixel value (y) of the nth pixel at the time that the light of the pth sampling point x(p) is projected to the sensor through the mask,
to set
y/x(p)
as a value of the selected element (p=0, n=0) of the P×N mask matrix.
The setting value is a setting value according to the relational expression described above, that is, the relational expression
y=Mx
of the case in which the light emitted from the P three-dimensional sampling points of the region to be imaged is written as the scene emitted light vector x of the length P, and the observed values of the image sensor corresponding to the number of pixels N that receive the light are set as the scene observed value vector y of the length N.
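The rule for the element value can be checked with a short worked equation. The sketch below assumes that each sampling point is simulated or measured in isolation, that is, only the pth sampling point emits light while the value of step S107 is obtained. The index notation M_{n,p} for the element linking pixel n to sampling point p is introduced here for illustration only.

```latex
\begin{align*}
y_n &= \sum_{p'=0}^{P-1} M_{n,p'}\, x(p')
      && \text{(one row of } y = Mx\text{)} \\
y_n &= M_{n,p}\, x(p)
      && \text{(only the $p$th sampling point emits, } x(p') = 0 \text{ for } p' \neq p\text{)} \\
M_{n,p} &= \frac{y_n}{x(p)}
      && \text{(value set in step S108)}
\end{align*}
```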
Steps S109 to S114 are an update process of the element of the matrix (matrix element) as a target of setting the value and a processing end determination process. In step S109, the matrix element index n is incremented by 1.
In step S110, whether
n<N
is determined,
and if
n<N,
the process of steps S107 and S108 is applied to a new matrix element (p, n) to determine the value of the new matrix element (p, n).
Furthermore,
the matrix element index p is incremented by 1 in step S111.
In step S112, whether
p<P
is determined,
and if
p<P,
the process of steps S107 and S108 is applied to a new matrix element (p, n) to determine the value of the new matrix element (p, n).
Eventually, in step S112, if it is determined that the expression
p<P
is not satisfied, the values of all of the elements of the P×N matrix are determined, and the matrix is completed.
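The loop of steps S104 to S112 can be sketched as follows. This is a minimal illustration under stated assumptions, not the disclosed implementation: the function name `build_mask_matrix`, the callable `pixel_value` (standing in for the simulation or actual measurement of step S107), and the toy transmission weights are hypothetical. The P×N elements are arranged here as N rows by P columns so that the product in y=Mx is conformable, and the index n is assumed to restart from 0 for each new p.

```python
from typing import Callable, List


def build_mask_matrix(
    num_points: int,                            # P, number of sampling points
    num_pixels: int,                            # N, number of sensor pixels
    emitted: List[float],                       # x(p), nonzero emitted light of each point
    pixel_value: Callable[[int, int], float],   # hypothetical: y of pixel n when only point p emits
) -> List[List[float]]:
    # S104: prepare the matrix and initialize every element to 0
    M = [[0.0] * num_points for _ in range(num_pixels)]
    p = 0                                   # S105, S106: start from element (p=0, n=0)
    while p < num_points:                   # S112: continue while p < P
        n = 0                               # assumption: n restarts for each new p
        while n < num_pixels:               # S110: continue while n < N
            y = pixel_value(p, n)           # S107: simulated or measured pixel value
            M[n][p] = y / emitted[p]        # S108: element value y / x(p)
            n += 1                          # S109: increment n
        p += 1                              # S111: increment p
    return M


# Toy check with N=3 pixels and P=2 sampling points (made-up transmission weights).
weights = [[0.0, 1.0], [0.5, 0.25], [1.0, 0.0]]
x = [2.0, 4.0]
M = build_mask_matrix(2, 3, x, lambda p, n: weights[n][p] * x[p])
```

In this toy case the recovered matrix equals the transmission weights, because each simulated pixel value is the weight multiplied by the emitted light of the single active sampling point.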
The mask matrix generated in the flow is
a mask matrix according to the relational expression described above, that is, the relational expression
y=Mx
of the case in which the light emitted from the P three-dimensional sampling points of the region to be imaged is written as the scene emitted light vector x of the length P, and the observed values of the image sensor corresponding to the number of pixels N that receive the light are set as the scene observed value vector y of the length N.
Note that the mask matrix generated by the captured image region corresponding mask matrix (Mall) calculation unit 107 according to the flow is the captured image region corresponding mask matrix (Mall) and is the captured image region corresponding mask matrix (Mall) satisfying the relational expression
y=Mx
regarding the sensor pixel values (y) of the case where it is assumed that the image sensor of the lensless camera 102 has received the emitted light of the P sampling points set in the captured image region (all ROI) 121 illustrated for example in
On the other hand, the mask matrix generated according to the flow by the restored image region corresponding mask matrix (Mroi) calculation unit 108 is the restored image region corresponding mask matrix (Mroi) satisfying the relational expression
y=Mx
regarding the sensor pixel values (y) of the case where it is assumed that the image sensor of the lensless camera 102 has received the emitted light of the P sampling points set in the restored image region (restored ROI) 122 illustrated for example in
[5. Detailed Sequence of Image Estimation Process in Image Estimation Unit]
Next, with reference to the flow chart illustrated in
the detailed sequence of the image estimation process in the image estimation unit 109 will be described.
As described above with reference to
(a) the captured image (observed values yall) as output of the lensless camera 102,
(b) the mask information (IM) 105,
(c) the camera configuration information (Ic) 106,
(d) the captured image region corresponding ROI information (Rall) 104,
(e) the restored image region corresponding ROI information (Rroi) 103,
(f) the captured image region corresponding mask matrix (Mall) calculated by the captured image region corresponding mask matrix (Mall) calculation unit 107, and
(g) the restored image region corresponding mask matrix (Mroi) calculated by the restored image region corresponding mask matrix (Mroi) calculation unit 108
and outputs the restored image (x̂roi) set to the predetermined field of view (FOV). In the restored image (x̂roi) as output data, x̂roi denotes the emitted light of each sampling point x included in the captured scene (ROI) in the predetermined field of view, such as the restored image region (restored ROI) 122 illustrated for example in
As described above, the image estimation unit 109 executes, for example, the arithmetic processing of the observed image signals inside of the restored region and the pseudo-inverse matrix or the inverse matrix of the restored image region corresponding mask matrix to generate the restored image of the restored image region including part of the captured image region of the lensless camera.
Hereinafter, the processes of the steps of the flow chart illustrated in
The image estimation unit 109 of the signal processing unit 101 first acquires the mask information IM in step S201. The mask information IM acquired in step S201 is, for example, the two-dimensional pattern information including the transmission regions and the non-transmission regions of light of the mask as described above.
Next, the image estimation unit 109 acquires the camera configuration information Ic in step S202. The camera configuration information Ic is information regarding the mask and the image sensor and includes, for example, the mask size, the image sensor size, the number of image sensor pixels (N), the information of distance between the mask and the sensor, and the like.
Next, the image estimation unit 109 acquires the ROI information in step S203. In step S203, the image estimation unit 109 acquires the captured image region corresponding ROI information (Rall) 104 and the restored image region corresponding ROI information (Rroi) 103 illustrated in
That is, the captured image region corresponding ROI information (Rall) 104 is the ROI information of the case where it is assumed that the image sensor of the lensless camera 102 has imaged the emitted light of the sampling points in the captured image region (all ROI) 121 illustrated in
On the other hand, the restored image region corresponding ROI information (Rroi) 103 is, for example, the setting position (x) information of the sampling points in the restored image region (restored ROI) 122 illustrated in
In this way, in step S203, the image estimation unit 109
acquires the captured image region corresponding ROI information (Rall) 104 and the restored image region corresponding ROI information (Rroi) 103 illustrated in
Next, the image estimation unit 109 acquires the number of sensor pixels N in step S204. That is, the image estimation unit 109 acquires the number of sensor pixels (N) of the image sensor of the lensless camera 102. The information is acquired from, for example, the camera configuration information Ic 106 illustrated in
Next, the image estimation unit 109 acquires the positions x of the sampling points and the number of points P in step S205.
That is,
the number of sampling points (P) and
the position (x) of each sampling point
are acquired.
Note that the information is acquired from, for example, one of
the camera configuration information (Ic) 106,
the captured image region corresponding ROI information (Rall) 104, and
the restored image region corresponding ROI information (Rroi) 103.
The number of sampling points (P) in the captured image region and the position (x) of each sampling point are acquired from the captured image region corresponding ROI information (Rall) 104. Specifically, for example, the number (P) of sampling points and the positions (x) set in the captured image region (all ROI) 121 illustrated in
Next, the image estimation unit 109 in step S206 calculates a pseudo-inverse matrix (M+all) of the captured image region corresponding mask matrix (Mall) calculated by the captured image region corresponding mask matrix (Mall) calculation unit 107. Furthermore, in step S207, the image estimation unit 109 uses the pseudo-inverse matrix (M+all) obtained in step S206 to calculate emitted light estimation values (x̂all) of the sampling points. The emitted light estimation values (x̂all) of the sampling points denote the emitted light of each sampling point x included in the captured scene, such as the captured image region (all ROI) 121 illustrated for example in
A specific example of the calculation process of the emitted light estimation values (x̂all) of the sampling points will be described with reference to
(a) a captured image region ROI (Rall) 201 expressed as two-dimensional plane data,
(b) emitted light (x̂all) 204 of sampling points set in the captured image region ROI (Rall) 201,
(c) a mask including a captured image region corresponding mask matrix (Mall) 202, and
(d) sensor observed values (pixel values) (yall) 203 of the case in which the image sensor has received the emitted light (x̂all) 204 of the sampling points set in the captured image region ROI (Rall) 201.
That is, illustrated are the sensor observed values (pixel values) (yall) 203 of the case in which the image sensor has received the emitted light (x̂all) 204 of the sampling points set in the captured image region ROI (Rall) 201 expressed as two-dimensional plane data, through the mask including the captured image region corresponding mask matrix (Mall) 202. Note that although the captured image region ROI (Rall) is basically set as a three-dimensional region as described with reference to
The sensor observed values (pixel values) (yall) 203 illustrated in
the captured image region corresponding mask matrix (Mall) 202 and
the sensor observed values (pixel values) (yall) 203,
the estimated values of the emitted light (x̂all) 204 of the sampling points can be obtained by calculation processes (processes a and b) illustrated below.
The pseudo-inverse matrix (M+all) of the captured image region corresponding mask matrix (Mall) calculated by the captured image region corresponding mask matrix (Mall) calculation unit 107 is calculated according to the following (Expression 1).
[Math. 1]
$$M_{all}^{+} = \left(M_{all}^{T} M_{all}\right)^{-1} M_{all}^{T} \quad \text{(Expression 1)}$$
Note that
M+all represents a pseudo-inverse matrix of the captured image region corresponding mask matrix (Mall) and
MTall represents a transposed matrix of the captured image region corresponding mask matrix (Mall). In addition,
(MTallMall)−1 denotes an inverse matrix of (MTallMall).
The pseudo-inverse matrix (M+all) of the captured image region corresponding mask matrix (Mall) and
the sensor observed values (pixel values) (yall) 203
are used, and the following (Expression 2) is used to calculate the estimated values of the emitted light (x̂all) 204 of the sampling points.
[Math. 2]
$$\hat{x}_{all} = M_{all}^{+}\, y_{all} \quad \text{(Expression 2)}$$
The (Expression 2) is equivalent to an expression obtained by using the pseudo-inverse matrix (M+) of the matrix M to express the relational expression described above, that is, the relational expression
y=Mx
of the case in which the light emitted from the P three-dimensional sampling points of the region to be imaged is written as the scene emitted light vector x of the length P, and the observed values of the image sensor corresponding to the number of pixels N that receive the light are set as the scene observed value vector y of the length N.
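(Expression 1) and (Expression 2) can be tried on a toy problem as in the sketch below. The random matrix is only a stand-in for a simulated or measured mask matrix, the sizes N=12 and P=5 are arbitrary, and the data are noiseless; all of these are assumptions made for illustration. Note that (Expression 1) requires the mask matrix to have full column rank, which in turn requires at least as many sensor pixels as sampling points.

```python
import numpy as np

rng = np.random.default_rng(0)
N, P = 12, 5                        # sensor pixels, sampling points (toy sizes)
M_all = rng.random((N, P))          # stand-in mask matrix, N rows by P columns
x_true = rng.random(P)              # scene emitted light (unknown in practice)
y_all = M_all @ x_true              # noiseless observed values, y = M x

# Expression 1: M+ = (M^T M)^-1 M^T, valid when M has full column rank
M_all_pinv = np.linalg.inv(M_all.T @ M_all) @ M_all.T

# Expression 2: estimated emitted light of the sampling points
x_hat_all = M_all_pinv @ y_all
```

For a full-column-rank matrix, `np.linalg.pinv(M_all)` yields the same pseudo-inverse through a singular value decomposition and is usually the numerically safer choice.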
Next, in step S208, the image estimation unit 109 extracts light not included in the restored image region ROI (Rroi) in the emitted light (x̂all) 204 of the sampling points and sets the light as sampling point emitted light estimation values (x̂out) outside of the restored image region ROI (Rroi).
Furthermore, in step S209, the image estimation unit 109 obtains observed values (ŷout) corresponding to the sampling point emitted light estimation values (x̂out) outside of the restored image region ROI (Rroi).
The details of the processes of steps S208 and S209 will be described. In step S208, the sampling points not included in the restored image region ROI (Rroi) 211 illustrated in
Next, in step S209, the observed values (ŷout) corresponding to the sampling point emitted light estimation values (x̂out) outside of the restored image region ROI (Rroi) are calculated by using the following (Expression 3).
[Math. 3]
$$\hat{y}_{out} = M_{all}\, \hat{x}_{out} \quad \text{(Expression 3)}$$
The expression is an expression corresponding to the relational expression described above, that is, the relational expression
y=Mx
of the case in which the light emitted from the P three-dimensional sampling points of the region to be imaged is written as the scene emitted light vector x of the length P, and the observed values of the image sensor corresponding to the number of pixels N that receive the light are set as the scene observed value vector y of the length N.
Next, in step S210, the image estimation unit 109 obtains emitted light (x̂roi) estimated values of the sampling points in the restored image region ROI (Rroi).
First,
from (a) the sensor observed values described with reference to
(b) the observed values (ŷout) corresponding to the sampling point emitted light estimation values (x̂out) outside of the restored image region ROI (Rroi) calculated in step S209
are subtracted. That is, a subtraction process
(yall)−(ŷout)
is used to obtain the observed values (ŷroi) of the case in which the light from the sampling points in the restored image region ROI (Rroi) is observed.
Next, emitted light (x̂roi) estimated values of the sampling points in the restored image region ROI (Rroi) are calculated according to the following (Expression 4).
[Math. 4]
$$\hat{x}_{roi} = M_{roi}^{+}\left(y_{all} - \hat{y}_{out}\right), \quad \text{where} \quad M_{roi}^{+} = \left(M_{roi}^{T} M_{roi}\right)^{-1} M_{roi}^{T} \quad \text{(Expression 4)}$$
In this way, the image estimation unit 109 obtains the emitted light (x̂roi) estimated values of the sampling points in the restored image region ROI (Rroi) according to the (Expression 4). The estimated values are specifically equivalent to the pixel values of the image (restored image) illustrating the object in the restored image region ROI (Rroi).
As described above, the image estimation unit 109 generates the restored image of the restored image region including part of the captured image region of the lensless camera 102 in the following process.
(S206) The image estimation unit 109 calculates the pseudo-inverse matrix of the captured image region corresponding mask matrix applied to generate the restored image of the entire imaging range of the lensless camera 102.
(S207) The image estimation unit 109 uses the pseudo-inverse matrix of the captured image region corresponding mask matrix and the observed image signals to calculate the emitted light estimation values of the sampling points in the captured image region.
(S208 to S210) The image estimation unit 109 subtracts the observed image signals originating outside of the restored image region from the observed image signals output by the image sensor of the lensless camera 102 to calculate the observed image signals inside of the restored region, and executes arithmetic processing of the observed image signals inside of the restored region and the pseudo-inverse matrix of the restored image region corresponding mask matrix to generate the restored image of the restored image region including part of the captured image region of the lensless camera 102.
Note that the pseudo-inverse matrix used in the processes may be an inverse matrix.
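Steps S206 to S210 can be put together as in the following sketch. It is an illustration under assumptions rather than the disclosed implementation: the function name `restore_roi` and the boolean array `outside` are hypothetical, the mask matrices are random stand-ins, the data are noiseless, and the sampling points of the restored image region are assumed to be a subset of the sampling points of the captured image region, so that the toy Mroi is simply the matching columns of Mall. `np.linalg.pinv` is used in place of the explicit formula of (Expression 1); the two agree when the matrix has full column rank.

```python
import numpy as np


def restore_roi(M_all, M_roi, y_all, outside):
    """Estimate the emitted light inside the restored ROI.

    outside: boolean array of length P_all, True for sampling points
             that lie outside the restored image region (hypothetical input).
    """
    x_hat_all = np.linalg.pinv(M_all) @ y_all        # S206, S207 (Expressions 1 and 2)
    x_hat_out = np.where(outside, x_hat_all, 0.0)    # S208: keep only points outside the ROI
    y_hat_out = M_all @ x_hat_out                    # S209 (Expression 3)
    y_roi = y_all - y_hat_out                        # S210: subtraction yall - y_hat_out
    return np.linalg.pinv(M_roi) @ y_roi             # S210 (Expression 4)


# Toy check: 20 pixels, 8 sampling points, the last 3 points form the ROI.
rng = np.random.default_rng(0)
N, P_all = 20, 8
M_all = rng.random((N, P_all))
outside = np.array([True] * 5 + [False] * 3)
M_roi = M_all[:, ~outside]          # assumption: ROI points are a subset of all points
x_true = rng.random(P_all)
y_all = M_all @ x_true
x_hat_roi = restore_roi(M_all, M_roi, y_all, outside)
```

In this noiseless toy case the result matches the true emitted light of the three sampling points inside the restored image region. When Mroi is built from a denser set of sampling points, as the text allows, only the last line of the function changes in size.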
In
A virtual pinhole lens 301 and a virtual sensor surface 302 are prepared. Light beams, that is,
x̂all(p1),
x̂all(p2), and
x̂all(p3)
from sampling points 305, 306, and 307 pass through the virtual pinhole lens 301 and intersect the virtual sensor surface 302.
Observed values (pixel values) according to the incident light, that is, observed values (pixel values) including
a virtual sensor pixel y(p1) 303,
a virtual sensor pixel y(p2) 304, and
a virtual sensor pixel y(p3) 305
can be obtained at virtual sensor pixels 303, 304, and 305 on the virtual sensor surface 302, respectively. In this case, the sampling points are in focus in the reproduced image regardless of the distance from the sampling points.
In addition,
x̂all(p1) and
x̂all(p2)
from sampling points 316 and 317 pass through the virtual lens 311, and the light beams are mapped on the virtual sensor surface 313. Pixel values on the virtual sensor 313, that is, values including
a virtual sensor pixel y(p1) 314 and
a virtual sensor pixel y(p2) 315
can be combined on the virtual sensor surface 313 to reproduce an image with lens bokeh in the depth direction.
a captured image region ROI (Rall) 321,
a restored image region ROI (Rroi) 322, and
an ROI (Rout) 323 outside of the restored image region
can be set in the three-dimensional space as illustrated in
The emitted light from each sampling point enters the image sensor through a mask with conversion characteristics (transmission function) according to a captured image region corresponding mask matrix (Mall) 331 illustrated for example in
For example, as illustrated in
[7. Example of Generation Process of Image with Partially Improved Resolution]
A process of the present disclosure can be executed to generate an image with partially improved resolution. A generation process of an image with partially improved resolution will be described with reference to
In a normal camera captured image, the pixel pitch on the screen is constant, and the resolution on one image is constant in any region of the image. In the process of the present disclosure, instead of using uniform sampling points, the density of the sampling points can be changed in each region, for example. In this way, the resolution can be changed region-by-region.
For example, as in a restored image 401 illustrated in
Next, a simulation is performed to calculate a mask matrix (Mroi) in a state in which the density of the sampling points of only the high-resolution region (Rroi) 422 is higher than the other regions. The sampling point emitted light of the case of using the mask with characteristics (transmission function) of the mask matrix (Mroi) is calculated. As a result of the process, a partial region high-resolution image 431 is generated, in which the resolution of only the region with a person is increased.
In this way, the signal processing unit 101 can execute the process by setting the density of the sampling points in the restored image region higher than the density of the sampling points of the other regions in the image captured by the lensless camera 102. The signal processing unit 101 can generate a partial region high-resolution image in which the resolution of only the restored image region is increased.
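Region-dependent sampling density can be illustrated along a single axis as in the sketch below. The helper names `evenly_spaced` and `variable_density_points`, the coordinate range, and the point counts are all hypothetical; an actual setup would place the points in two or three dimensions and then simulate a mask matrix for them.

```python
from typing import List


def evenly_spaced(start: float, stop: float, count: int) -> List[float]:
    """Return `count` evenly spaced positions in [start, stop), stop excluded."""
    step = (stop - start) / count
    return [start + i * step for i in range(count)]


def variable_density_points(
    x_min: float, x_max: float,       # extent of the captured image region
    roi_min: float, roi_max: float,   # extent of the high-resolution region
    coarse_count: int,                # points on each side outside the ROI
    fine_count: int,                  # points inside the ROI
) -> List[float]:
    left = evenly_spaced(x_min, roi_min, coarse_count)
    roi = evenly_spaced(roi_min, roi_max, fine_count)
    right = evenly_spaced(roi_max, x_max, coarse_count)
    return left + roi + right


# Spacing 1.0 outside the ROI [3, 6) and spacing 0.25 inside it.
points = variable_density_points(0.0, 9.0, 3.0, 6.0, coarse_count=3, fine_count=12)
in_roi = [x for x in points if 3.0 <= x < 6.0]
```

Here the region of interest receives four times the sampling density of its surroundings while the total number of sampling points, and therefore the size of the mask matrix, stays modest.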
[8. Processing Example of Dispersing Emitted Light from Sampling Points in Wavelength Direction]
Next, a processing example of dispersing the emitted light from the sampling points in the wavelength direction will be described. Specifically, the process is, for example, a process of observing the emitted light of each sampling point by separating the emitted light into wavelength light beams corresponding to the colors of RGB. The process can be executed to generate a color image.
$$x_{col}(pL + l) \quad \text{(Expression a)}$$
Note that, for example, when the emitted light from the sampling point is separated into three wavelengths (R, G, B), L equals 3, and l takes the values 0 to 2.
An observed value observed after the emitted light enters an nth pixel of N constituent pixels of an image sensor 473 through a mask 472 can be expressed as follows.
$$y_{col}(nL + l) \quad \text{(Expression b)}$$
The relationship between (Expression a) and (Expression b) can be represented as follows.
$$y_{col} = M_{col} \times x_{col} \quad \text{(Expression c)}$$
In the (Expression c), Mcol is an NL×PL matrix (mask matrix of mask 472), where
(N) represents the number of pixels of the image sensor 473,
(L) represents the number of separated wavelength light beams of the emitted light from the sampling points, and
(P) represents the number of sampling points.
The matrix can be obtained by simulating the observed values of the emitted light at different wavelengths from one sampling point.
As a result, the luminance separated into each emitted light (xcol) wavelength of sampling points 501 can be reproduced from the observed values ycol on the image sensor 473. For example, the output values of the colors of RGB can be acquired to reproduce a color image. Note that similar expansion can also be performed for the polarization as in the case of the wavelength.
In this way, the signal processing unit 101 can execute the process by partitioning the wavelength of the emitted light of the sampling points in the restored image region to thereby generate a restored image in a plurality of different wavelength light units. The signal processing unit 101 can also execute the process by partitioning the polarized components of the emitted light of the sampling points in the restored image region to thereby generate a restored image in a plurality of different polarized component units.
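The index layout of (Expression a) to (Expression c) can be sketched as follows. The sizes, the random per-wavelength matrices, and in particular the assumption that wavelength l of a sampling point contributes only to wavelength l of the observed values (no cross-talk between wavelengths) are hypothetical simplifications made for illustration; the text itself only states that Mcol is an NL×PL matrix obtained by simulation.

```python
import numpy as np

N, P, L = 10, 4, 3                           # pixels, sampling points, wavelengths (R, G, B)
rng = np.random.default_rng(1)
M_per_wavelength = rng.random((L, N, P))     # hypothetical: one N x P mask matrix per wavelength

# Interleaved layout: row index n*L + l, column index p*L + l.
M_col = np.zeros((N * L, P * L))
for l in range(L):
    M_col[l::L, l::L] = M_per_wavelength[l]  # assumption: no cross-talk between wavelengths

x_col = rng.random(P * L)                    # x_col[p*L + l], as in Expression a
y_col = M_col @ x_col                        # Expression c; y_col[n*L + l] as in Expression b

# Recover the per-wavelength emitted light and view it as one RGB triple per point.
x_col_hat = np.linalg.pinv(M_col) @ y_col
rgb = x_col_hat.reshape(P, L)                # rgb[p, l] == x_col_hat[p*L + l]
```

Each row of `rgb` then holds the estimated R, G, and B values of one sampling point, which is the information needed to reproduce a color image.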
The process executed by the image processing apparatus of the present disclosure can attain the following advantageous effects. Functions, such as non-mechanical zoom, non-mechanical pan, and removal of background, can be realized at high resolution and low amount of calculation without adding hardware to the lensless camera. Furthermore, the resolution of an unimportant region can be reduced to reduce the amount of calculation without changing the resolution of an important region, or conversely, the resolution of the important region can be improved while maintaining the amount of calculation. The process of the present disclosure is also effective for an image, such as a far-infrared image, using a wavelength other than visible light.
Next, an example of a hardware configuration of an image processing apparatus will be described with reference to
A CPU (Central Processing Unit) 501 functions as a control unit or a data processing unit that executes various processes according to a program stored in a ROM (Read Only Memory) 502 or a storage unit 508. For example, the CPU 501 executes the processes according to the sequences described in the embodiment. The program executed by the CPU 501, data, and the like are stored in a RAM (Random Access Memory) 503. The CPU 501, the ROM 502, and the RAM 503 are connected to each other through a bus 504.
The CPU 501 is connected to an input-output interface 505 through the bus 504, and an input unit 506 including various switches, a keyboard, a mouse, a microphone, a sensor, and the like and an output unit 507 including a display, a speaker, and the like are connected to the input-output interface 505. The CPU 501 executes various processes according to commands input from the input unit 506 and outputs processing results to, for example, the output unit 507.
The storage unit 508 connected to the input-output interface 505 includes, for example, a hard disk or the like and stores the program executed by the CPU 501 and various data. A communication unit 509 functions as a transmission and reception unit of Wi-Fi communication, Bluetooth (registered trademark) (BT) communication, and other data communication through a network, such as the Internet and a local area network, and communicates with an external apparatus.
A drive 510 connected to the input-output interface 505 drives a removable medium 511, such as a magnetic disk, an optical disk, a magneto-optical disk, and a semiconductor memory like a memory card, to record or read data.
The embodiment of the present disclosure has been described in detail with reference to a specific embodiment. However, it is obvious that those skilled in the art can modify or replace the embodiment without departing from the scope of the present disclosure. That is, the present invention has been disclosed illustratively, and the present invention should not be interpreted restrictively. To determine the scope of the present disclosure, the section of the claims should be considered.
Note that the technique disclosed in the present specification can be configured as follows.
(1)
An image processing apparatus including:
a signal processing unit that receives observed image signals as output of an image sensor of a lensless camera to generate a restored image of a restored image region including part of a captured image region of the lensless camera, in which
the signal processing unit generates the restored image by executing arithmetic processing of observed image signals in a restored region generated by subtracting observed image signals not included in the restored image region from the observed image signals and an inverse matrix or a pseudo-inverse matrix of a matrix applied to generate the restored image.
(2)
The image processing apparatus according to (1), in which
the signal processing unit includes a restored image region corresponding matrix calculation unit that generates a matrix applied to generate the restored image based on correspondence information of emitted light from sampling points in the restored image region and observed values of the image sensor.
(3)
The image processing apparatus according to (1) or (2), in which
the signal processing unit includes a restored image region corresponding matrix calculation unit that generates a P×N matrix applied to generate the restored image, the P×N matrix being defined by the number of sampling points (P) in the restored image region and the number of pixels (N) of the lensless camera.
(4)
The image processing apparatus according to any one of (1) to (3), in which
the signal processing unit includes an image estimation unit that generates the restored image by executing arithmetic processing of observed image signals in the restored region, the observed image signals being calculated by subtracting the observed image signals not included in the restored image region from emitted light estimation values of sampling points in the captured image region calculated based on an inverse matrix of a matrix applied to generate a restored image of an entire imaging range of the lensless camera and based on the observed image signals, and an inverse matrix of the matrix applied to generate the restored image of the entire imaging range of the lensless camera.
(5)
The image processing apparatus according to any one of (1) to (4), in which
the signal processing unit further includes a captured image region corresponding mask matrix calculation unit that generates a matrix applied to the captured image region based on correspondence information of emitted light from sampling points in the captured image region and the observed values of the image sensor.
(6)
The image processing apparatus according to any one of (1) to (5), in which
the signal processing unit sets density of the sampling points in the restored image region to density higher than density of the sampling points of other regions of the captured image to generate the restored image at high resolution.
(7)
The image processing apparatus according to any one of (1) to (6), in which
the signal processing unit partitions wavelength of the emitted light of the sampling points in the restored image region to generate a restored image in a plurality of different wavelength light units.
(8)
The image processing apparatus according to any one of (1) to (7), in which
the signal processing unit partitions polarized components of the emitted light of the sampling points in the restored image region to generate a restored image in a plurality of different polarized component units.
(9)
An imaging apparatus including:
a mask in which transmission regions and non-transmission regions of light are set as a two-dimensional pattern;
an imaging unit including an image sensor that receives the light through the mask; and
a signal processing unit that receives observed image signals as output of the image sensor to generate a restored image of a restored image region including part of a captured image region of the imaging unit, in which
the signal processing unit generates the restored image by executing arithmetic processing of observed image signals in a restored region generated by subtracting observed image signals not included in the restored image region from the observed image signals and an inverse matrix or a pseudo-inverse matrix of a matrix applied to generate the restored image.
(10)
An image processing method executed by an image processing apparatus, in which
the image processing apparatus includes a signal processing unit that receives observed image signals as output of an image sensor of a lensless camera to generate a restored image of a restored image region including part of a captured image region of the lensless camera, and
the signal processing unit generates the restored image by executing arithmetic processing of observed image signals in a restored region generated by subtracting observed image signals not included in the restored image region from the observed image signals and an inverse matrix or a pseudo-inverse matrix of a matrix applied to generate the restored image.
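The arithmetic processing recited in configurations (1) and (4) can be illustrated with a minimal NumPy sketch. It is only one possible reading under stated assumptions, not the implementation of the embodiment: the function name `restore_region`, the forward matrix `F_full` (stored here as N rows by P columns, a convention chosen for this sketch), the index array `roi_idx`, and the random test data are all hypothetical. The sketch estimates the emitted light of every sampling point in the captured image region with a pseudo-inverse, subtracts from the observed image signals the portion attributed to sampling points outside the restored image region, and then applies the pseudo-inverse of the matrix restricted to the restored image region.

```python
import numpy as np


def restore_region(y, F_full, roi_idx):
    """Restore only the sampling points listed in roi_idx (illustrative sketch).

    y       : (N,) observed image signals from the image sensor
    F_full  : (N, P) assumed forward matrix for the whole captured image region
    roi_idx : indices of the sampling points inside the restored image region
    """
    y = np.asarray(y, dtype=float)
    roi_idx = np.asarray(roi_idx)
    out_idx = np.setdiff1d(np.arange(F_full.shape[1]), roi_idx)

    # Step 1: estimate emitted light of all sampling points (pseudo-inverse).
    x_full = np.linalg.pinv(F_full) @ y

    # Step 2: observed signals attributed to points outside the restored region.
    y_outside = F_full[:, out_idx] @ x_full[out_idx]

    # Step 3: subtract them to obtain observed signals of the restored region.
    y_roi = y - y_outside

    # Step 4: restore with the pseudo-inverse of the region-specific matrix.
    F_roi = F_full[:, roi_idx]
    return np.linalg.pinv(F_roi) @ y_roi


# Hypothetical noise-free data: 64 sensor pixels, 16 sampling points.
rng = np.random.default_rng(0)
F_demo = rng.random((64, 16))
x_true = rng.random(16)
roi_demo = np.array([2, 3, 6, 7])
x_roi = restore_region(F_demo @ x_true, F_demo, roi_demo)
print(np.allclose(x_roi, x_true[roi_demo]))
```

In this sketch the region-specific matrix is simply a column subset of `F_full`, so the result coincides with the corresponding entries of the full restoration. Building the region-specific matrix from more densely spaced sampling points, as in configuration (6), would require a separately simulated matrix, which is not shown here.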
In addition, the series of processes described in the specification can be executed by hardware, software, or a combination of hardware and software. In the case where the processes are executed by software, a program recording the processing sequence can be installed in a memory of a computer incorporated into dedicated hardware and executed there, or the program can be installed in and executed by a general-purpose computer that can execute various processes. For example, the program can be recorded in advance in a recording medium. Other than being installed in the computer from the recording medium, the program can be received through a network, such as a LAN (Local Area Network) or the Internet, and installed in a recording medium, such as a built-in hard disk.
Note that the various processes described in the specification need not be executed in the chronological order described above, and the processes may be executed in parallel or individually according to the capabilities of the apparatuses that execute the processes or as necessary. In addition, the system in the present specification denotes a logical set configuration of a plurality of apparatuses, and the apparatuses of the configuration need not be in the same housing.
As described above, according to the configuration of the embodiment of the present disclosure, the field of view of the captured image of the lensless camera can be controlled, and the configuration for generating the restored image including part of the imaging region is realized.
Number | Date | Country | Kind |
---|---|---|---|
2018-039315 | Mar 2018 | JP | national |

Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/JP2018/045005 | 12/6/2018 | WO | 00 |