The present invention relates to a stereoscopic image data creating device, a stereoscopic image data reproducing device, and a file management method for having image data accompanied with attribute information at the time of creating image data for performing three-dimensional display.
Conventionally, various methods for displaying a three-dimensional image have been proposed. Of these, a generally used method is the so-called “binocular method”, which uses binocular parallax. This method allows stereoscopic view to be performed by preparing a left-eye image and a right-eye image having binocular parallax and projecting each image on the left and right eyes separately. In the description below, the aforementioned images are referred to as a left-eye image and a right-eye image, respectively. Additionally, 3D and 2D are used as terms meaning three-dimensional or stereoscopic and two-dimensional, respectively, and image data for stereoscopic view is referred to as 3D image data while normal two-dimensional image data is referred to as 2D image data.
Note that Non Patent Literature 1 describes safety guidelines for creating a 3D image using binocular parallax. In a 3D image using binocular parallax, the stereoscopic sense of extrusion and depth can be controlled by adjusting parallax; however, people with a smaller interpupillary distance, or children, feel a stereoscopic effect more strongly even if the parallax is the same. Moreover, in the case of displaying in the retracted direction on a display, creating parallax which exceeds the interpupillary distance of both eyes on the display should be avoided as much as possible because our eyes do not open outward. Therefore, it is stated in the safety guidelines that the interpupillary distance is around 5 cm for a child who is 6 years of age, and this value is used as a representative value for children considering safety according to results of a survey on the interpupillary distance.
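As a rough illustration of the guideline above, the following sketch converts an on-screen parallax from pixels to a physical width and compares it with the 5 cm representative interpupillary distance for children. The function names, the assumption that the image fills the full display width, and the convention that positive values denote parallax in the retracted direction are illustrative assumptions and are not taken from Non Patent Literature 1.

```python
def parallax_pixels_to_cm(parallax_px, image_width_px, display_width_cm):
    """Convert an on-screen parallax from pixels to centimetres.

    Assumes (hypothetically) that the image is shown across the full
    display width, so one pixel spans display_width_cm / image_width_px.
    """
    if image_width_px <= 0:
        raise ValueError("image_width_px must be positive")
    return parallax_px * display_width_cm / image_width_px


def exceeds_interpupillary_limit(parallax_px, image_width_px,
                                 display_width_cm, limit_cm=5.0):
    """Return True when retracted-direction parallax is wider than limit_cm.

    limit_cm defaults to the 5 cm representative value for children
    mentioned in the text; positive parallax is assumed to mean the
    retracted (behind-screen) direction.
    """
    width_cm = parallax_pixels_to_cm(parallax_px, image_width_px,
                                     display_width_cm)
    return width_cm > limit_cm
```

Under these assumptions, the same 60-pixel parallax in a 1920-pixel-wide image is acceptable on a display 100 cm wide but exceeds the limit on a display 200 cm wide, which is why a display size has to be taken into account when adjusting parallax.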
A frame sequential method, a parallax barrier method, and the like are herein proposed as representative binocular methods, and will be described in detail based on conceptual diagrams thereof.
Further,
Further, Patent Literature 1 discloses, as shown in
However, in the method of Patent Literature 1 described above, a set consisting of the maximum parallax value and the minimum parallax value of the parallax range is transmitted together with raw image data from the transmitting device side to the receiving device side, and on the receiving device side, remapping is performed according to the received set to create a display image. Such a method has been problematic in that only one value each of the maximum parallax and the minimum parallax is transmitted; when the receiving device receives only a maximum parallax and a minimum parallax obtained by searching for corresponding points, and these values include errors, parallax adjustment based on this parallax range may result in display with improper parallax.
In addition, conversely, when only imaging mechanism information is received and parallax is adjusted based on the maximum parallax and the minimum parallax calculated from the received imaging mechanism information, the parallax safely falls within the larger, mechanism-derived parallax range, but there has been a problem that the adjustable parallax range is reduced, so that flexibility in changing the presented position of a stereoscopic vision to be stereoscopically displayed is lost.
Further, there has been a problem that, depending on a real parallax value of a stereoscopic vision which is photographed, parallax is changed in a direction opposite to a desired direction for adjustment, and there is a possibility that adjustment opposite to intended parallax adjustment is performed.
The present invention has been devised to solve problems as described above, and an object thereof is to provide a stereoscopic image data creating device, a stereoscopic image data reproducing device, and a file management method for having image data for three-dimensional display with versatility as well as allowing presentation of a stereoscopic vision having safer and more comfortable parallax on a reproducing device side.
A stereoscopic image data creating device of the present invention is a stereoscopic image data creating device for creating image data in a predetermined file format from a plurality of image data corresponding to each of a plurality of viewpoints, comprising: a 3D information creating means for creating and outputting 3D information by using at least one or more maximum parallax or minimum parallax among a first maximum parallax, a second maximum parallax, a first minimum parallax, and a second minimum parallax which are input, wherein the first maximum parallax is a maximum value of parallax geometrically obtained from a mechanism of the imaging means, the first minimum parallax is a parallax which falls within a predetermined parallax range from the mechanism of the imaging means and is at a position nearest from the imaging means, the second maximum parallax is a maximum value of parallax between a left-eye image and a right-eye image of a stereoscopic image, and the second minimum parallax is a minimum value of parallax between the left-eye image and the right-eye image of the stereoscopic image; and a multiplexing means for multiplexing the 3D information and the image data to create stereoscopic image data in a predetermined file format.
Otherwise, a stereoscopic image data creating device of the present invention is a stereoscopic image data creating device for creating image data in a predetermined file format from a plurality of image data corresponding to each of a plurality of viewpoints, comprising: a 3D information creating means for creating and outputting 3D information by using at least one or more maximum parallax or minimum parallax among a first maximum parallax, a second maximum parallax, a first minimum parallax, and a second minimum parallax which are input, wherein the first maximum parallax is a maximum value of parallax geometrically obtained from a mechanism of imaging means, the first minimum parallax is a parallax which falls within a predetermined parallax range from the mechanism of the imaging means and is at a position nearest from the imaging means, the second maximum parallax is a maximum value of parallax between a left-eye image and a right-eye image of a stereoscopic image, and the second minimum parallax is a minimum value of parallax between the left-eye image and the right-eye image of the stereoscopic image; an image compression encoding means for performing compression encoding on the input plurality of image data to output compressed image data; and a multiplexing means for multiplexing the 3D information and the compressed image data to create stereoscopic image data in a predetermined file format.
Further, the stereoscopic image data in the predetermined file format includes the image data in n viewpoints as the plurality of viewpoints where n is three or more.
Further, when 3D information is created from the first maximum parallax, the second maximum parallax, the first minimum parallax, and the second minimum parallax, the 3D information creating means generates parallax unit information that indicates units of the first maximum parallax, the second maximum parallax, the first minimum parallax, and the second minimum parallax, while the stereoscopic image data in the predetermined file format includes the parallax unit information.
Further, the parallax unit information has a feature that a unit of the first maximum parallax, the second maximum parallax, the first minimum parallax, and the second minimum parallax is any of a pixel unit, a sub-pixel unit, a unit of length, a unit of distance, or percentage to a width of an entire image.
Further, when 3D information is created from the first maximum parallax, the second maximum parallax, the first minimum parallax, and the second minimum parallax, the 3D information creating means generates parallax target image information indicating, from which two viewpoint image data in combination among the plurality of image data, the first maximum parallax, the second maximum parallax, the first minimum parallax, and the second minimum parallax are obtained, and the stereoscopic image data in the predetermined file format includes the parallax target image information.
Otherwise, a stereoscopic image data reproducing device of the present invention is a stereoscopic image data reproducing device for reproducing a plurality of image data corresponding to each of a plurality of viewpoints from image data in a predetermined file format, comprising: a demultiplexing means for demultiplexing, from the file format, image data and 3D information indicating at least one or more maximum parallax or minimum parallax among first maximum parallax as a maximum value of parallax geometrically obtained from a mechanism of imaging means, second maximum parallax as a maximum value of parallax between a left-eye image and a right-eye image of a stereoscopic image, first minimum parallax which falls within a predetermined parallax range from the mechanism of the imaging means and is parallax at a position nearest from the imaging means, and second minimum parallax as a minimum value of parallax between the left-eye image and the right-eye image of the stereoscopic image; a 3D information analyzing means for analyzing the 3D information; and a stereoscopic intensity converting means for adjusting parallax for the image data, wherein the 3D information analyzing means analyzes the 3D information, and the stereoscopic intensity converting means uses at least one or more maximum parallax or minimum parallax among the first maximum parallax, the second maximum parallax, the first minimum parallax, and the second minimum parallax to adjust parallax of the image data for reproduction.
A stereoscopic image data reproducing device of the present invention is a stereoscopic image data reproducing device for reproducing a plurality of image data corresponding to each of a plurality of viewpoints from image data in a predetermined file format, comprising: a demultiplexing means for demultiplexing, from the file format, compressed image data and 3D information indicating at least one or more maximum parallax or minimum parallax among first maximum parallax as a maximum value of parallax geometrically obtained from a mechanism of imaging means, second maximum parallax as a maximum value of parallax between a left-eye image and a right-eye image of a stereoscopic image, first minimum parallax which falls within a predetermined parallax range from the mechanism of the imaging means and is parallax at a position nearest from the imaging means, and second minimum parallax as a minimum value of parallax between the left-eye image and the right-eye image of the stereoscopic image; a 3D information analyzing means for analyzing the 3D information; an image decoding means for decoding the compressed image data; and a stereoscopic intensity converting means for adjusting parallax for image data in which the compressed image data is decoded, wherein the 3D information analyzing means analyzes the 3D information, and the stereoscopic intensity converting means uses at least one or more maximum parallax or minimum parallax among the first maximum parallax, the second maximum parallax, the first minimum parallax, and the second minimum parallax to adjust parallax of the image data for reproduction.
Further, in the case of analyzing information indicating the first maximum parallax, the second maximum parallax, the first minimum parallax, and the second minimum parallax, the 3D information analyzing means analyzes parallax unit information indicating units of the first maximum parallax, the second maximum parallax, the first minimum parallax, and the second minimum parallax, and uses the analyzed parallax unit information to analyze information indicating the first maximum parallax, the second maximum parallax, the first minimum parallax, and the second minimum parallax.
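To illustrate how analyzed parallax unit information might be used to interpret the recorded parallax values, the following minimal sketch converts a value to pixels. The enumeration names, the `sub_pixel_steps` parameter and its default of 4 are hypothetical; the unit of length and the unit of distance are omitted because converting them would require display or sensor dimensions that are not specified here.

```python
from enum import Enum


class ParallaxUnit(Enum):
    """Hypothetical encoding of part of the parallax unit information."""
    PIXEL = "pixel"
    SUB_PIXEL = "sub_pixel"
    PERCENT_OF_WIDTH = "percent_of_width"


def to_pixels(value, unit, image_width_px, sub_pixel_steps=4):
    """Interpret a recorded parallax value and return it in pixels.

    sub_pixel_steps is an assumed number of sub-pixel steps per pixel.
    """
    if unit is ParallaxUnit.PIXEL:
        return float(value)
    if unit is ParallaxUnit.SUB_PIXEL:
        return value / sub_pixel_steps
    if unit is ParallaxUnit.PERCENT_OF_WIDTH:
        return value * image_width_px / 100.0
    raise ValueError("unsupported parallax unit: %r" % (unit,))
```

Normalizing all four parallax values to one unit before comparison avoids comparing, for example, a percentage against a pixel count.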
Further, the 3D information includes parallax target image information indicating, from which two viewpoint image data in combination among the plurality of image data, the first maximum parallax, the second maximum parallax, the first minimum parallax, and the second minimum parallax are obtained, and the 3D information analyzing means analyzes the parallax target image information, while the stereoscopic intensity converting means adjusts parallax for the image data indicated by the parallax target image information.
Further, in the case of adjusting parallax for the image data, the stereoscopic intensity converting means compares the magnitude of the first maximum parallax to that of the second maximum parallax, and when the second maximum parallax is larger than the first maximum parallax, judges that a value of the second maximum parallax is inappropriate, and adjusts parallax based on the first maximum parallax.
It is preferable that in the case of adjusting parallax for the image data, the stereoscopic intensity converting means compares the magnitude of the first minimum parallax to that of the second minimum parallax, and when the second minimum parallax is smaller than the first minimum parallax, adjusts parallax so that a value of the first minimum parallax becomes a value of the second minimum parallax.
It is preferable that in the case of adjusting parallax for the image data, the stereoscopic intensity converting means compares the magnitude of the first minimum parallax to that of the second minimum parallax, and when the second minimum parallax is smaller than the first minimum parallax, reduces and displays the image data so that a value of the first minimum parallax becomes a value of the second minimum parallax.
It is preferable that in the case of adjusting parallax for the image data, the stereoscopic intensity converting means compares the magnitude of the first minimum parallax to that of the second minimum parallax, and when the second minimum parallax is smaller than the first minimum parallax, judges that a value of the second minimum parallax is inappropriate to stop stereoscopic display, and displays in 2D, or performs 2D-3D conversion for image data in a viewpoint to perform 3D display.
It is preferable that in the case of adjusting parallax for the image data, the stereoscopic intensity converting means compares the magnitude of the first minimum parallax to that of the second minimum parallax, and when the second minimum parallax is smaller than the first minimum parallax, judges that a value of the second minimum parallax is inappropriate, and performs 2D-3D conversion on image data in any one viewpoint among the image data to perform 3D display.
It is preferable that in the case of adjusting parallax for the image data, the stereoscopic intensity converting means compares the magnitude of the first maximum parallax to that of the second maximum parallax, and when the second maximum parallax is smaller than the first maximum parallax, adjusts parallax based on a value of the second maximum parallax.
It is preferable that in the case of adjusting parallax for the image data, the stereoscopic intensity converting means compares the magnitude of the first minimum parallax to that of the second minimum parallax, and when the second minimum parallax is larger than the first minimum parallax, adjusts parallax based on a value of the second minimum parallax.
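The comparisons described above between the first (mechanism-derived) and second (measured) parallax values can be sketched as follows. This is only an illustration under stated assumptions: the class and function names are hypothetical, equal values are treated as valid (the text only addresses the larger and smaller cases), and the handling of an out-of-range second minimum parallax is left to the caller because several alternatives are described (parallax adjustment, reduced display, 2D display, or 2D-3D conversion).

```python
from dataclasses import dataclass


@dataclass
class ParallaxSelection:
    max_for_adjustment: float   # maximum parallax to base adjustment on
    second_max_valid: bool      # False when second max exceeds first max
    min_for_adjustment: float   # minimum parallax to base adjustment on
    second_min_valid: bool      # False when second min is below first min


def select_parallax(first_max, second_max, first_min, second_min):
    """Choose which parallax values to use for adjustment."""
    # A measured maximum beyond the geometric maximum is judged
    # inappropriate, so the first maximum parallax is used instead.
    second_max_valid = second_max <= first_max
    max_used = second_max if second_max_valid else first_max

    # A measured minimum below the mechanism-derived minimum is flagged;
    # the caller decides which fallback to apply. The first minimum is
    # returned here only as a placeholder for that case.
    second_min_valid = second_min >= first_min
    min_used = second_min if second_min_valid else first_min

    return ParallaxSelection(max_used, second_max_valid,
                             min_used, second_min_valid)
```

For example, with a first maximum parallax of 30 and a second maximum parallax of 40, the second value is flagged as invalid and 30 is used for adjustment.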
It is preferable that the 3D information analyzing means analyzes the parallax target image information, and the stereoscopic intensity converting means adjusts parallax for the image data indicated by the parallax target image information.
A file management method of the present invention is a file management method of managing 3D information as attribute information for stereoscopic display with image data, wherein the 3D information is comprised of parallax target image information indicating a combination of viewpoint images, parallax unit information indicating units of first and second maximum parallax and first and second minimum parallax, first maximum parallax, second maximum parallax, first minimum parallax, second minimum parallax, and an assumed display size indicating a size of a display for stereoscopic display of the image data.
Further, the 3D information includes a parallax recording feasibility flag indicating whether parallax information of each of the first and second maximum parallax and the first and second minimum parallax is recorded in the 3D information.
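One possible in-memory representation of the 3D information described above is sketched below. All field names and types are hypothetical; in particular, representing an unrecorded parallax value as `None` and deriving the parallax recording feasibility flags from it is an assumption made for illustration.

```python
from dataclasses import dataclass
from typing import Dict, Optional, Tuple


@dataclass
class ThreeDInfo:
    """Hypothetical container for the 3D information fields."""
    parallax_target_views: Tuple[int, int]  # combination of viewpoint images
    parallax_unit: str                      # e.g. "pixel" (assumed encoding)
    assumed_display_size: float             # unit of size is an assumption
    first_max_parallax: Optional[float] = None
    second_max_parallax: Optional[float] = None
    first_min_parallax: Optional[float] = None
    second_min_parallax: Optional[float] = None

    def recording_flags(self) -> Dict[str, bool]:
        """Report, per parallax value, whether it is recorded."""
        names = ("first_max_parallax", "second_max_parallax",
                 "first_min_parallax", "second_min_parallax")
        return {name: getattr(self, name) is not None for name in names}
```

A reproducing side could consult these flags first and fall back to whichever of the four parallax values are actually present.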
Further, a file to be managed is comprised of a file header, the 3D information, management information to be used for recording information which is not directly related to a three-dimensional image, and the image data, and the file header, the 3D information, the management information, and the image data are arranged from the head of the file in an order of the file header, the 3D information, the management information, and the image data.
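The arrangement from the head of the file can be sketched as below. The byte-level encoding is entirely an assumption for illustration: each section is stored as a 4-byte big-endian length followed by its payload, and the section names are hypothetical. Only the order of the sections follows the text.

```python
import struct

# Order of sections from the head of the file, as described in the text.
SECTION_ORDER = ("file_header", "3d_info", "management_info", "image_data")


def pack_file(sections):
    """Concatenate the sections in SECTION_ORDER, each length-prefixed."""
    out = bytearray()
    for name in SECTION_ORDER:
        payload = sections[name]
        out += struct.pack(">I", len(payload))  # assumed 4-byte length field
        out += payload
    return bytes(out)


def unpack_file(blob):
    """Read the sections back in the same order."""
    sections = {}
    offset = 0
    for name in SECTION_ORDER:
        (length,) = struct.unpack_from(">I", blob, offset)
        offset += 4
        sections[name] = blob[offset:offset + length]
        offset += length
    return sections
```

With this kind of layout, a different arrangement of the same sections would only require changing the order of the tuple.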
Further, a file to be managed is comprised of the 3D information, management information to be used for recording information which is not directly related to a three-dimensional image, a file header, and the image data, and the 3D information, the management information, the file header, and the image data are arranged from the head of the file in an order of the 3D information, the management information, the file header, and the image data.
Further, the image data is comprised of both of left-eye image data and right-eye image data.
It is preferable that the image data of a first file to be managed is left-eye image data, the image data of a second file to be managed is right-eye image data, and the first file and the second file are a set of image data for a left eye and a right eye for forming a 3D image, and managed in a same dedicated folder.
It is preferable that the image data of the first file to be managed is left-eye image data, the image data of the second file to be managed is right-eye image data, and the first file and the second file are a set of image data for a left eye and a right eye for forming a 3D image, which is managed by providing a file name with an index so as to be distinguished from the other set of image data.
It is preferable that the image data of the first file to be managed is left-eye image data, the image data of the second file to be managed is right-eye image data, and a third file to be managed is a 3D management information file which stores 3D management information indicating that the first file and the second file are a set of image data for a left eye and a right eye for forming a 3D image.
It is preferable that the 3D management information file is a metafile and manages a name of the first file and a name of the second file indicating a set of image data for a left eye and a right eye for forming a 3D image by describing them in the metafile.
It is preferable that a file to be managed is comprised of a first file header, first 3D information, first management information, first image data, a second file header, second 3D information, second management information, and second image data, in which the first file header, the first 3D information, the first management information, the first image data, the second file header, the second 3D information, the second management information, and the second image data are arranged from the head of the file in an order of the first file header, the first 3D information, the first management information, the first image data, the second file header, the second 3D information, the second management information, and the second image data, where the first image data and the second image data are a set of image data for a left eye and a right eye for forming a 3D image.
It is preferable that a file to be managed is comprised of a file header, first 3D information, first management information, first image data, second 3D information, second management information, and second image data, in which the file header, the first 3D information, the first management information, the first image data, the second 3D information, the second management information, and the second image data are arranged from the head of the file in an order of the file header, the first 3D information, the first management information, the first image data, the second 3D information, the second management information, and the second image data, where the first image data and the second image data are a set of image data for a left eye and a right eye for forming a 3D image.
It is preferable that a file to be managed is comprised of third 3D information, third management information, a first file header, first 3D information, first management information, first image data, a second file header, second 3D information, second management information, and second image data, in which the third 3D information, the third management information, the first file header, the first 3D information, the first management information, the first image data, the second file header, the second 3D information, the second management information, and the second image data are arranged from the head of the file in an order of the third 3D information, the third management information, the first file header, the first 3D information, the first management information, the first image data, the second file header, the second 3D information, the second management information, and the second image data, where the first image data and the second image data are a set of image data for a left eye and a right eye for forming a 3D image, the third 3D information includes 3D information in a common part of the first and second image data, the first 3D information includes individual 3D information of the first image data, and the second 3D information includes individual 3D information of the second image data.
It is preferable that a file to be managed is comprised of third 3D information, third management information, a file header, first 3D information, first management information, first image data, second 3D information, second management information, and second image data, in which the third 3D information, the third management information, the file header, the first 3D information, the first management information, the first image data, the second 3D information, the second management information, and the second image data are arranged from the head of the file in an order of the third 3D information, the third management information, the file header, the first 3D information, the first management information, the first image data, the second 3D information, the second management information, and the second image data, where the first image data and the second image data are a set of image data for a left eye and a right eye for forming a 3D image, the third 3D information includes 3D information in a common part of the first and second image data, the first 3D information includes individual 3D information of the first image data, and the second 3D information includes individual 3D information of the second image data.
It is preferable that a file to be managed includes a plurality of viewpoint image information comprised of a file header, 3D information, management information, and viewpoint image data created for each different viewpoint image data, and the plurality of different viewpoint image information are repeatedly arranged from the head of the file in an order of the file header, the 3D information, the management information, and the viewpoint image data.
It is preferable that a file to be managed stores common information of 3D information of the plurality of viewpoint image information which are created for each of a plurality of viewpoint image data, management information of an entire file, and the plurality of viewpoint image information created for each of the plurality of viewpoint image data, in which the common information of 3D information, the management information of the entire file, and the plurality of viewpoint image information are arranged from the head of the file in an order of the common information of 3D information, the management information of the entire file, and the plurality of viewpoint image information, and the 3D information of the viewpoint image information which is created for each of the plurality of viewpoint image data includes individual 3D information created for each viewpoint image data.
According to the stereoscopic image data creating device, the stereoscopic image data reproducing device, and the file management method of the present invention, it is possible to perform parallax adjustment suited to a display size to be displayed safely and appropriately with higher flexibility.
Hereinafter, description will be given in detail for preferred embodiments of a stereoscopic image data creating device, a stereoscopic image data reproducing device and a file management method according to the present invention with reference to the accompanying drawings. Additionally, in the description below, like numerals represent like elements even in configurations of various drawings, which description will be thus omitted.
Description will be given for a stereoscopic image data creating device according to a first embodiment with reference to drawings.
Next, description will be given for the first parallax information. The first parallax information is obtained by geometric calculation from camera mechanism information, such as the orientation, arrangement and angle of view of the lenses of the camera used for photographing. Its value depends on whether the camera photographing method is the parallel viewing method or the cross-eyed viewing method and, in the case of the cross-eyed viewing method, on the angle of view of the camera. In the embodiment, the first parallax information is comprised of first maximum parallax, which is the parallax to a background at infinite distance where the physical parallax is at its maximum, and first minimum parallax, which is the parallax, likewise obtained from the camera mechanism information, of an object placed at the recommended camera-specific photographing distance, that is, at a position to which the object can physically approach the camera while its parallax value still allows a comfortable stereoscopic view.
First, description will be given for a method of obtaining the first parallax information in photographing by the parallel viewing method.
Moreover, description will be given for the way of obtaining the first minimum parallax constituting the first parallax information in photographing by the parallel viewing method. In photographing by the parallel viewing method, a position of an object from a camera allowing photographing as a stereoscopic vision is controlled by the base line length T and an angle of view θ. The camera's angle of view θ as shown in
For example, where the distance from the base line connecting the camera 12 and the camera 13 to the object 14 is Ln, the distance from the base line to the background 15 is Lf, the base line length is T, and the parallax of the object 14 in a photographed image is Dn, the similarity relation of a formula (1) is obtained.
(Lf−Ln):Ln=Dn:T (1)
From the formula (1), the parallax Dn is obtained by a formula (2).
Dn=T×(Lf−Ln)/Ln (2)
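Formula (2) can be evaluated directly, as in the following minimal sketch. The function and parameter names are hypothetical; the result is expressed in the same unit of length as the base line length T, and Ln is assumed to be positive.

```python
def parallel_parallax(base_line_t, dist_object_ln, dist_background_lf):
    """Parallax Dn of an object per formula (2): Dn = T * (Lf - Ln) / Ln.

    All three arguments must use the same unit of length; the result
    is in that unit as well.
    """
    if dist_object_ln <= 0:
        raise ValueError("dist_object_ln must be positive")
    return base_line_t * (dist_background_lf - dist_object_ln) / dist_object_ln
```

For instance, with T = 6.5, Ln = 100 and Lf = 300 (all in the same unit), the sketch returns Dn = 13.0, and the parallax falls to zero as the object approaches the background distance.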
Further, in
Additionally, a stereoscopic viewable parallax magnitude varies between individuals, and a user may thus input a recommended individually suitable photographing distance to a camera in advance. The camera may store the value above in a storage area thereof so as to be allowed to output the value any time as individual first minimum parallax for the user.
Next, description will be given for the way of obtaining first parallax information in photographing by the cross-eyed viewing method.
Wd0/W=Hd/H (3)
Because the characteristic point 24 is located on the optical axis 16, a position of the characteristic point 24 in a left-eye image is a center position of the left-eye image. Therefore, parallax of the characteristic point 24 is indicated as Hd. In the diagram, a distance from the camera 12 to the characteristic point 24 is L, a distance from the camera 12 to the convergence point 23 is Lo, a distance from the camera 13 to the line segment 25 is A, a distance from the convergence point 23 to the point 26 is A1, and a distance from the convergence point 23 to the camera 13 is A2. Further, where the center of the camera 12 is a point 27, the center of the camera 13 is a point 28, and a right endpoint of the line segment 25 is a point 29, a relation represented by a formula (4) is obtained by focusing on a triangle formed of the point 26, the point 28 and the point 29.
tan(θ/2)=W/(2×A) (4)
From the formula (4), the length W of the line segment 25 is obtained by a formula (5).
W=2×A×tan(θ/2) (5)
Relations represented by a formula (6) and a formula (7) are obtained by focusing on a triangle formed of the convergence point 23, the characteristic point 24 and the point 26, where an angle at which the optical axis 16 intersects with the optical axis 17 is α.
A1=(L−Lo)×cos α (6)
Wd0=(L−Lo)×sin α (7)
Additionally, relations represented by a formula (8) and a formula (9) are obtained by focusing on a triangle formed of the convergence point 23, the point 27 and the point 28.
A2=T/sin α (8)
Lo=T/tan α (9)
Here, from A=A1+A2, the formula (5), the formula (6), the formula (8) and the formula (9), the length W of the line segment 25 is represented as a formula (10).
W=2×((L−T/tan α)×cos α+T/sin α)×tan(θ/2) (10)
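From the formula (3), the formula (7) and the formula (10), and noting that the term (L−T/tan α)×cos α+T/sin α in the formula (10) reduces to L×cos α+T×sin α, in which T×sin α is neglected compared to L×cos α, parallax Hd is represented as a formula (11).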
Hd=H×(L−T/tan α)×sin α/{L×cos α×2×tan(θ/2)} (11)
Here, in the cross-eyed viewing method, the farther the characteristic point 24 moves away from the convergence point, the larger the parallax Hd becomes; therefore, a maximum value Hdmax of the parallax Hd is obtained by making L infinite. When L approaches infinity in the formula (11), T/tan α can be neglected compared to L, and Hdmax is thus represented as a formula (12).
Hdmax=H×tan α/{2×tan(θ/2)} (12)
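The behaviour of formulas (11) and (12) can be checked numerically with the following sketch. The function and parameter names are hypothetical, angles are assumed to be given in radians, and H is taken to be the width against which Hd is measured in formula (3).

```python
import math


def cross_eyed_parallax(h, l, t, alpha, theta):
    """Parallax Hd per formula (11).

    h: width H from formula (3); l: distance L to the characteristic point;
    t: base line length T; alpha: angle between the optical axes (radians);
    theta: angle of view (radians).
    """
    return (h * (l - t / math.tan(alpha)) * math.sin(alpha)
            / (l * math.cos(alpha) * 2 * math.tan(theta / 2)))


def cross_eyed_max_parallax(h, alpha, theta):
    """Maximum parallax Hdmax per formula (12), the limit of Hd as L grows."""
    return h * math.tan(alpha) / (2 * math.tan(theta / 2))
```

Evaluated at L = Lo = T/tan α, the sketch gives zero parallax, consistent with the convergence point having no parallax, and for increasing L the value approaches Hdmax from below.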
As described above, the maximum parallax value by the cross-eyed viewing method can be approximated by the value of the formula (12), which is decided according to the display size for displaying, the angle of view, and the angle at which the optical axes intersect; this maximum parallax value is output as the maximum parallax in the first parallax information.
Further, the minimum parallax that composes the first parallax information together with the maximum parallax is given, in both the parallel viewing method and the cross-eyed viewing method, as the parallax value in the case where an object is placed at the recommended camera-specific photographing distance described in a camera's instruction manual or the like.
Additionally, as with the case by the parallel viewing method, a stereoscopic viewable parallax magnitude varies between individuals, and a user may thus input a recommended individually suitable photographing distance to a camera in advance. The camera may store the value above in a storage area thereof so as to be allowed to output the value any time as individual first minimum parallax for the user.
Next, the second parallax information is described. The second parallax information is composed of second minimum parallax, which is the parallax at the nearest point (the position nearest to the camera), and second maximum parallax, which is the parallax at the farthest point (the position farthest from the camera). The second minimum parallax and the second maximum parallax are obtained, respectively, as the parallax of the object at the nearest point and the parallax of the object at the farthest point in a screen, by searching for corresponding points with a stereo matching method.
Here, the stereo matching method uses a set of two images photographed with two cameras disposed on the right and left, computes by area correlation which part of the image photographed with the right camera corresponds to the image photographed with the left camera, and estimates the three-dimensional position of each point by triangulation using that correspondence. As a method other than stereo matching, a user may search for corresponding points in the images and input the parallax of the objects viewed at the nearest point and the farthest point.
As described above, two kinds of parallax information are calculated, output, recorded and transmitted: the first parallax information, which covers the physical parallax range obtained by geometric calculation from camera mechanism information such as the orientation, arrangement and angle of view of the lenses specific to the camera used for photographing, and the second parallax information, which covers the actual parallax range calculated by stereo matching, a user's manual entry or the like. This allows display in a comfortable and safe parallax range when displaying on a large screen, even in the event of errors caused by stereo matching or the like.
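The corresponding-point search described above can be illustrated with a small block-matching sketch on a single scanline. It is only an illustration under stated assumptions: the sum of absolute differences (SAD) is used here as a simple stand-in for the area correlation mentioned in the text, the search runs in one direction only, and all names are hypothetical.

```python
def match_disparity(left_row, right_row, x, half_window, max_disparity):
    """Find the horizontal offset d (0..max_disparity) at which the window
    centered at x in left_row best matches the window centered at x - d in
    right_row, using the sum of absolute differences (SAD) as the cost.

    The caller must ensure that x + half_window is a valid index of left_row.
    """
    best_d, best_cost = 0, float("inf")
    for d in range(max_disparity + 1):
        if x - d - half_window < 0:
            break  # the window would leave the image on the left side
        cost = sum(
            abs(left_row[x + k] - right_row[x - d + k])
            for k in range(-half_window, half_window + 1)
        )
        if cost < best_cost:
            best_d, best_cost = d, cost
    return best_d
```

Running such a search over the points of a screen and taking the smallest and largest offsets found would give candidate values for the parallax at the nearest point and at the farthest point.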
At step S1 of
At step S3, the first parallax information is created by a method in accordance with the way of obtaining the first parallax information explained with reference to
At determination step S4, it is determined whether or not the second parallax information has been manually input by a user, through the user input means, to the stereo matching means 31 inside the parallax calculating means 2. The process moves to step S6 if it has been input, and to step S5 otherwise.
At step S5, with use of the left-eye image data and the right-eye image data input to the stereo matching means 31, the second parallax information is created by the stereo matching method.
At step S6, the first parallax information output from the mechanism parallax calculating means 30 and the second parallax information output from the stereo matching means 31 are input to the parallax correcting means 32.
At determination step S7, determination is made on whether or not the parallax shift is performed with reference to the first parallax information and the second parallax information input to the parallax correcting means 32, and the process moves to step S8 in the case of performing the parallax shift, otherwise the process moves to step S10.
Here, the parallax shift is described. In general stereoscopic display, the left-eye image data and the right-eye image data of identical size are displayed at the same position in a frame sequential method, whereas in a parallax barrier method they are displayed so that the right and left images are vertically arranged in alternate order in units of pixels or sub-pixels. In this case, continuously shifting at least one of the right and left images to the right or left, uniformly over the entire screen from its original display position, varies the parallax of corresponding points of the right and left images; this is referred to as the parallax shift. A user may select whether or not to perform the parallax shift. For example, an image may first be stereoscopically displayed with no parallax shift and then shifted by the user until it has the preferred parallax.
Further, at least one piece of parallax information may be selected from among the four pieces of parallax information, namely the first maximum parallax information and the first minimum parallax information in the first parallax information and the second maximum parallax information and the second minimum parallax information in the second parallax information. When the absolute value of the selected parallax information is judged to be larger than a predetermined value, a parallax shift may be performed so that the absolute value becomes smaller than the predetermined value.
Additionally, it is determined from the four pieces of parallax information whether all objects burst from the display surface or all objects recede deep into it. When all objects burst from the surface, or all recede deep into it, the parallax shift may be performed so that the image includes both objects that burst from the surface and objects that recede deep into it. For example, when photographing by the parallel viewing method, a background at infinite distance is displayed on the display surface, and all objects in front of the background are displayed so as to burst from the display surface. In this case, the entire screen may be uniformly shifted in the direction that decreases the parallax of all the objects in front of the background.
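The threshold-based decision described above can be sketched as follows. This is a minimal illustration, assuming signed parallax values in a single unit and a shift that is simply added to every parallax value; it brings the selected value exactly to the limit, whereas an implementation may prefer a margin so that the value ends up strictly below it. The function name is hypothetical.

```python
def shift_to_limit(parallax, limit):
    """Return the uniform shift that brings |parallax + shift| down to limit.

    Returns 0 when the selected parallax is already within the limit.
    Assumes the shift is added to every parallax value on the screen.
    """
    if abs(parallax) <= limit:
        return 0
    if parallax > 0:
        return limit - parallax   # negative shift: reduce positive parallax
    return -limit - parallax      # positive shift: reduce negative parallax
```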
At step S8, by the parallax correcting means 32 of
At step S9, by the image cutout means 33 of
Next, a method of changing parallax by uniformly shifting the entire screen of either the left-eye image data or the right-eye image data is described. Note that, in general, when the display position of the left-eye image data is shifted to the left relative to the right-eye image data for stereoscopic display, all objects are displayed farther back than before the parallax shift, whereas when the display position of the left-eye image data is shifted to the right, all objects are displayed farther forward than before the parallax shift.
Here, the left-eye image data is shifted to the left by Dn1, which produces areas with no corresponding points: the area 40 of the left-eye image data 34 and the area 41 of the right-eye image data 37, each surrounded by a thick frame. The images of the area 40 and the area 41, which run off the edge, are cut out because they have no corresponding points. At the time of display, the horizontal display area is narrowed by the cut-out part. Parallax adjustment by a parallax shift of this kind is a commonly used method, and when parallax adjustment is performed, the values of the first parallax information and the second parallax information are corrected by the parallax-shifted amount.
The above description covers a case where the right and left cutout images are smaller than the display size of the display for stereoscopic display. However, the CCD of the camera may be made larger in advance than the image actually displayed, so that the cutout images do not become smaller than the display size even when parallax is adjusted. In this case, the parallax shift may be performed not by shifting the screen but by shifting the positions at which the right and left images are cut out.
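The bookkeeping after a parallax shift, namely correcting the recorded parallax values and narrowing the horizontal display area, can be sketched as below. The container type, the field names and the sign convention (a positive shift increases every parallax value) are assumptions made for illustration only.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ParallaxInfo:
    """Hypothetical container for the four recorded parallax values."""
    first_max: float
    first_min: float
    second_max: float
    second_min: float


def correct_for_shift(info, shift):
    """Correct all four parallax values by the parallax-shifted amount."""
    return ParallaxInfo(
        info.first_max + shift,
        info.first_min + shift,
        info.second_max + shift,
        info.second_min + shift,
    )


def remaining_width(width, shift):
    """Horizontal display width left after cutting out the non-overlapping part."""
    return width - abs(shift)
```

Keeping the corrected values alongside the cut-out images means that the recorded parallax still describes the images as they are actually stored.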
At step S10, the left-eye image data and the right-eye image data cut out by the image cutout means 33 of
At step S11, by the 3D information creating means 3 of
Moreover, a user may separately input the assumed display size to the 3D information creating means 3 from the outside, or it may be input to the 3D information creating means 3 from the camera as part of the camera mechanism information. Here, the parallax target image information is unnecessary when recording images in two right and left viewpoints, but is required when processing images in three or more viewpoints. Since parallax is obtained from images in two viewpoints, it is necessary to designate from which two viewpoint images, among the images in three or more viewpoints, the parallax was calculated.
Additionally, the parallax unit information is described here. The parallax unit information indicates the unit used for recording the magnitudes of the first and second maximum parallax and the first and second minimum parallax. These magnitudes may be handled in units of pixels, in an absolute unit such as “mm” or “cm”, or as a percentage of the horizontal width of the screen. For example, the parallax unit information may indicate the pixel unit when its value is 0, an absolute unit such as “mm” or “cm” when its value is 1, and a percentage of the horizontal width of the screen when its value is 2.
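As an illustration of how the three example unit codes relate to one another, the following sketch converts a recorded parallax value to millimetres. It assumes that the absolute unit is “mm” and that the display width in millimetres and the number of horizontal pixels are known on the reproducing side; the constant and function names are hypothetical.

```python
PIXEL = 0             # parallax recorded in pixels
ABSOLUTE_MM = 1       # parallax recorded in an absolute unit (mm assumed here)
PERCENT_OF_WIDTH = 2  # parallax recorded as a percentage of the screen width


def parallax_to_mm(value, unit, display_width_mm, horizontal_pixels):
    """Convert a recorded parallax value to millimetres on a given display."""
    if unit == PIXEL:
        return value * display_width_mm / horizontal_pixels
    if unit == ABSOLUTE_MM:
        return value
    if unit == PERCENT_OF_WIDTH:
        return value * display_width_mm / 100.0
    raise ValueError(f"unknown parallax unit code: {unit}")
```

For instance, on a display 1000 mm wide with 1920 horizontal pixels, 96 pixels and 5 percent of the width both correspond to 50 mm.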
In
With a configuration including a plurality of such sets, the sets of parallax unit information can be used selectively according to the usage state. For example, when the pixel pitch is not known, or when dot-by-dot display is not performed (including display with scaling and the like), parallax information expressed as a percentage of the horizontal width of the screen is used. When parallax is easier to handle in pixel units, such as when a parallax shift is performed on the display side, parallax information expressed in pixel units is used. Moreover, when confirming that parallax displayed stereoscopically in the depth direction does not exceed the interpupillary distance of human eyes, the parallax is preferably handled as an absolute value.
As described above, a configuration including a plurality of sets of parallax unit information in the 3D information is very convenient, since parallax information in the desired unit can be obtained without converting the parallax unit on the reproducing side.
At step S12, the left-eye image data and the right-eye image data input from the parallax calculating means 2 of
At step S13, multiplexed data is created by the multiplexing means 5 of
Hereinafter, a recording format for recording multiplexed data in an IC memory is described. Generally, when an IC memory is used as a recording medium, a file system such as FAT (File Allocation Table) is established on the IC memory, and data is recorded as files. As the file format, an existing format may be used, or a newly defined unique format may be used.
On the other hand, in the case of using a new format as described in
a) shows an example where left-eye image data and right-eye image data are integrated into one piece of image data to be recorded in an image data area of
Furthermore,
Note that, in the case of recording two right and left image data as separate files as shown in
Further, as a file format for recording a still image, 3D information may be inserted into the “CIPA DC-006, stereo still image format for digital still cameras” or the “CIPA DC-007, multi-picture format” of CIPA standards.
Note that, description has been given for a case where an arrangement order of right and left images and a storing order in a file are fixed, however, these orders may be variable. In the case of variable orders, information of orders may be recorded in 3D information.
At determination step S14 of
At step S15, the process is finished since image data is not input to the stereoscopic image data creating device 1.
As described above, the stereoscopic image data creating device 1 creates, as stereoscopic image data, multiplexed data including compressed image data and 3D information. The 3D information includes the first parallax information composed of the first maximum parallax and the first minimum parallax, and the second parallax information composed of the second maximum parallax and the second minimum parallax. The multiplexed data thus carries both the first parallax information, which is the limit of parallax obtained from the camera's mechanism information, and the second parallax information, which is the parallax actually generated.
Further, the above example multiplexes the 3D information with compressed image data created in the image compression encoding means 4; however, the image compression encoding means 4 may be omitted, and the 3D information may be multiplexed with non-compressed image data without compressing the input left-eye image data and right-eye image data. Moreover, although the above description records all of the first maximum parallax, the second maximum parallax, the first minimum parallax and the second minimum parallax, only one or more of these maximum and minimum parallax values may be recorded.
Additionally, in the above process, a parallax recording feasibility flag may be recorded in the 3D information for each of the first maximum parallax, the second maximum parallax, the first minimum parallax and the second minimum parallax, indicating whether or not that value is recorded in the 3D information. In that case, only those parallax values that the parallax recording feasibility flags mark as recorded are written into the 3D information.
Next, a reproducing device that stereoscopically displays, as a three-dimensional image, the image data created by the stereoscopic image data creating device 1 is described.
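The role of the parallax recording feasibility flags can be sketched as follows. The dictionary layout and the field names are purely illustrative assumptions and do not represent the recording format of the 3D information.

```python
FIELDS = ("first_max", "second_max", "first_min", "second_min")


def pack_parallax(values, flags):
    """Build a record that holds one flag per parallax value and only the
    values whose flag is set."""
    record = {"flags": {name: bool(flags.get(name, False)) for name in FIELDS}}
    for name in FIELDS:
        if record["flags"][name]:
            record[name] = values[name]
    return record


def unpack_parallax(record):
    """Return the recorded parallax values; unrecorded ones are absent."""
    return {name: record[name] for name in FIELDS if record["flags"][name]}
```

A reproducing side can then check the flags first and fall back to whichever values are present.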
The demultiplexing means 101 reads multiplexed data that is multiplexed in a predetermined format from a recording device or a communication device to demultiplex into compressed image data and 3D information. In a case where voice and music are multiplexed, which is not illustrated in
The 3D information analyzing means 102 analyzes 3D information to extract parallax target image information, parallax unit information, first maximum parallax, second maximum parallax, first minimum parallax, second minimum parallax and a set value of an assumed display size for outputting to the stereoscopic intensity converting means 104.
The image decoding means 103 decodes the input compressed image data to output the decoded image data to the stereoscopic intensity converting means 104.
The stereoscopic intensity converting means 104 receives the parallax target image information, the parallax unit information, the first maximum parallax, the second maximum parallax, the first minimum parallax, the second minimum parallax, the assumed display size and the decoded image data. Using this information, it creates from the decoded image data a three-dimensional video image with adjusted parallax and outputs it to an external display device.
Description will be given in detail for parallax adjustment in the stereoscopic intensity converting means 104 at the time.
First parallax information obtained from camera's mechanistic information and second parallax information actually generated by stereo matching or the like may be used for parallax adjustment.
For example, when comparing the first maximum parallax with the second maximum parallax shows that the second maximum parallax would recede beyond the position at which a stereoscopic image is presented by the first maximum parallax, the second maximum parallax is judged to be inappropriate and is not used; the first maximum parallax is used instead. In this case, the magnitude of the first maximum parallax is obtained from the first maximum parallax, the assumed display size and the parallax unit information, and parallax is adjusted so that this value does not exceed the distance between human eyes. This distance is set to 5 cm in consideration of the distance between children's eyes, as described in Non Patent Literature 1.
Moreover, when comparing the first minimum parallax with the second minimum parallax shows that the second minimum parallax would appear in front of the position at which a stereoscopic image is presented by the first minimum parallax, this indicates that an object is positioned nearer than the recommended camera-specific photographing distance given in the camera's instruction manual or the like. The stereoscopic intensity converting means 104 may then determine that the image is inappropriate for stereoscopic viewing and, in order to make the second minimum parallax larger than the first minimum parallax, perform parallax adjustment by a parallax shift that shifts the entire screen horizontally, perform parallax adjustment by reduced display of the stereoscopic image, stop stereoscopic display and perform 2D display, or perform stereoscopic display by 2D-3D conversion using either one of the images for the right and left eyes.
Additionally, in a case where the second maximum parallax is smaller than the first maximum parallax when comparing the magnitude of the first maximum parallax to that of the second maximum parallax, parallax adjustment may be performed based on the value of the second maximum parallax.
Moreover, in a case where the second minimum parallax is larger than the first minimum parallax when comparing the magnitude of the first minimum parallax to that of the second minimum parallax, parallax adjustment may be performed based on the value of the second minimum parallax.
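The four comparisons above can be summarized in a short sketch. It assumes a sign convention in which larger parallax values lie farther behind the display surface and smaller values lie nearer to the viewer, and it only reports that correction is needed rather than choosing among the corrective measures; the names are hypothetical.

```python
EYE_LIMIT_MM = 50.0  # 5 cm, the value for children cited from Non Patent Literature 1


def select_parallax(first_max, second_max, first_min, second_min):
    """Choose the maximum and minimum parallax on which to base adjustment.

    Returns (used_max, used_min, needs_correction).
    """
    # A measured maximum beyond the geometric limit is judged inappropriate,
    # so the first (geometric) maximum is used in its place.
    used_max = second_max if second_max < first_max else first_max
    # A measured minimum nearer than the recommended photographing distance
    # marks the image as needing correction (shift, reduced display, 2D, ...).
    needs_correction = second_min < first_min
    used_min = first_min if needs_correction else second_min
    return used_max, used_min, needs_correction


def within_eye_limit(parallax_mm, limit_mm=EYE_LIMIT_MM):
    """Check that a depth-direction parallax in mm does not exceed the limit."""
    return parallax_mm <= limit_mm
```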
Further, the above parallax adjustment is calculated with the assumed display size; however, when the display size actually used differs from the assumed display size, the actual display size may be input to the stereoscopic image data reproducing device 100 from the outside by a user's manual input, communication among devices or the like, and parallax may be obtained using the input display size. Note that the viewing distance here is assumed to be three times the height of the assumed display size or of the display size actually used.
Furthermore, although the above description performs parallax adjustment using all of the first maximum parallax, the second maximum parallax, the first minimum parallax and the second minimum parallax, parallax adjustment may be performed using one or more of these maximum and minimum parallax values. Further, when not all of the first maximum parallax, the second maximum parallax, the first minimum parallax and the second minimum parallax are recorded in the 3D information, parallax adjustment may be performed using one or more of the recorded maximum and minimum parallax values.
Note that, in the above process, when the multiplexed data input to the demultiplexing means 101 is composed of 3D information and non-compressed image data, the image decoding means 103 may output the input non-compressed image data as-is, in place of the decoded image data, without performing decoding processing. As described above, in the system composed of the stereoscopic image data creating device 1 and the stereoscopic image data reproducing device 100 according to the first embodiment of the present invention, the first parallax information and the second parallax information are used to perform appropriate parallax adjustment, which allows safer and more comfortable stereoscopic display. Further, although the above embodiment describes the case of two viewpoints, the present invention may also be applied to so-called multi-viewpoint cases having three or more viewpoints.
Next, as a second embodiment of the present invention, the case where images in n viewpoints (n is an integer greater than or equal to 3) are input to the stereoscopic image data creating device is described.
Among the means constituting the stereoscopic image data creating device 200, the 3D information creating means 3 and the image compression encoding means 4 are the same as those in the first embodiment, so their explanation is omitted here; the operations of the parallax calculating means 201 and the multiplexing means 202 are described below.
First, description will be given for the parallax calculating means 201.
Image data in n viewpoints is input to the image selecting means 204, which selects and outputs image data in two viewpoints from the input image data. The two viewpoints are selected as a combination of viewpoints that is consistent with the camera mechanism information. For example, when the camera mechanism information comes from cameras having viewpoints next to each other, the data may be selected from any combination of cameras whose viewpoints correspond to that information. The parallax calculating means 201 then outputs information on the selected viewpoint image data to the 3D information creating means 3 as parallax calculation viewpoint information. Further, in this embodiment, the image selecting means 204 may select a plurality of combinations of viewpoints that are consistent with the camera mechanism information. In this case, the stereo matching means 31 may obtain parallax for each of the combinations and output the largest parallax among them.
Next, the image cutout means 205 is described. Image data in n viewpoints is input to the image cutout means 205, which performs a parallax shift on the input image data by the parallax-shifted amount designated by the parallax correcting means 32 and then outputs the cutout images as image data in n viewpoints. In the parallax shift, the images may be cut out so that every combination of adjacent viewpoint image data among the n viewpoints is shifted by the same amount, that is, so that the parallax of all of the viewpoint image data varies by the same amount; alternatively, the parallax shift may be performed only on the combination of two viewpoint image data indicated by the parallax calculation viewpoint information.
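The selection of several viewpoint combinations and the output of the largest parallax among them can be sketched as below. It assumes that the consistent combinations are pairs of adjacent viewpoints and that a caller supplies a function measuring the parallax of one pair; both the helper names and the `measure` callback are hypothetical.

```python
def adjacent_pairs(n):
    """List the index pairs of viewpoints that are next to each other."""
    return [(i, i + 1) for i in range(n - 1)]


def largest_parallax(pairs, measure):
    """Measure the parallax of every pair and return the largest value.

    measure(a, b) is a caller-supplied function, for example one based on
    stereo matching between viewpoint images a and b.
    """
    return max(measure(a, b) for a, b in pairs)
```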
As described above, the stereoscopic image data creating device 200 creates, as stereoscopic image data, multiplexed data that includes 3D information and compressed image data in which the viewpoint image data in n viewpoints is compressed. The 3D information includes the first parallax information composed of the first maximum parallax and the first minimum parallax, the second parallax information composed of the second maximum parallax and the second minimum parallax, and the parallax calculation viewpoint information. This makes it possible to create multiplexed data for viewpoint image data in n viewpoints that carries both the first parallax information, which is the limit of parallax obtained from the camera's mechanism information, and the second parallax information, which is the parallax actually generated.
Next, a stereoscopic image data reproducing device 300 that stereoscopically displays, as a three-dimensional image, the image data created in the stereoscopic image data creating device 200 is described.
The stereoscopic intensity converting means 301 receives, as the 3D information analyzed and extracted in the 3D information analyzing means 102, the parallax target image information, the parallax unit information, the first maximum parallax, the second maximum parallax, the first minimum parallax, the second minimum parallax and the assumed display size, together with the viewpoint image data in n viewpoints decoded in the image decoding means 103. The stereoscopic intensity converting means 301 selects, from the input viewpoint images in n viewpoints, the image data in two viewpoints indicated by the parallax target image information. For the selected image data, as with the stereoscopic intensity converting means 104, it uses the parallax unit information, the first maximum parallax, the second maximum parallax, the first minimum parallax, the second minimum parallax and the assumed display size to create and output a three-dimensional video image with adjusted parallax. The stereoscopic intensity converting means 301 may also perform a parallax shift by the same amount on the other adjacent viewpoint images that were not selected.
As described above, in the system composed of the stereoscopic image data creating device 200 and the stereoscopic image data reproducing device 300 according to the second embodiment of the present invention, the first parallax information and the second parallax information are used to perform appropriate parallax adjustment, which allows safer stereoscopic display even when image data in n viewpoints is processed, as in the case of two viewpoints.
Further, although the above embodiments describe a case where a plurality of images are input, the present invention is also applicable to a case where an adapter for stereography is mounted on a monocular imaging device. Adapters for stereography include a stereo adapter for photographing images for the right and left eyes on one screen, and an adapter for photographing a plurality of viewpoint images on one screen. When these adapters are used, the parallax calculating means 2 and the parallax calculating means 201 described above separate the two images required for calculating parallax and calculate the parallax. It is then selected whether the image is output as one image as-is or output with the viewpoints separated individually, and the result is output to the image compression encoding means 4 so as to suit the desired format of compressed image data. The means other than the parallax calculating means 2 and the parallax calculating means 201 operate in the same way as described above, so their description is omitted.
As described above, according to the stereoscopic image data creating device, the stereoscopic image data reproducing device and the file management method of the present invention, the stereoscopic image data creating device creates, as 3D information, first maximum parallax as a maximum value of parallax geometrically obtained from the mechanism of the imaging means, first minimum parallax as parallax that falls within a predetermined parallax range and is at the position nearest to the imaging means, second maximum parallax as a maximum value of parallax of a stereoscopic image, and second minimum parallax as a minimum value of parallax of the stereoscopic image. It then creates and transmits multiplexed data in which the created 3D information and the plurality of image data constituting the stereoscopic image data are multiplexed. The stereoscopic image data reproducing device receives and demultiplexes the multiplexed data and uses the first maximum parallax, the first minimum parallax, the second maximum parallax and the second minimum parallax decoded from the 3D information, which allows parallax adjustment suited to the display size to be performed safely and appropriately with higher flexibility.
Further, according to the present invention, the first parallax information and the second parallax information are recorded and reproduced as 3D information, which allows parallax adjustment using both the limit value of parallax determined by the camera mechanism and the actual parallax value. In addition, comparing the magnitude of the first maximum parallax with that of the second maximum parallax, and the magnitude of the first minimum parallax with that of the second minimum parallax, provides the advantageous effect that it can be determined whether or not the values of the second maximum parallax and the second minimum parallax are appropriate.
Additionally, according to the present invention, when stereoscopic image data is displayed on a large screen, the intensity of the stereoscopic effect is adjusted by a parallax shift or the like using the limit value of parallax determined by the camera mechanism, which provides the advantageous effect of safer adjustment of the stereoscopic effect.
Furthermore, according to the present invention, the first maximum parallax, the second maximum parallax, the first minimum parallax and the second minimum parallax are formed into metadata, which provides the advantageous effect of giving image data for three-dimensional display greater versatility.
The above embodiments should be considered illustrative in all respects and not restrictive. For example, although the present embodiments illustrate the stereoscopic image data creating device, the stereoscopic image data reproducing device and the file management method, the present invention is not limited to these; it need only create or reproduce a 3D video signal. The present invention may also be widely applied to equipment capable of outputting or displaying a 3D video signal, such as a 3D digital camera, a 3D digital movie, a 3D television, a digital video recorder, a portable movie player, a cellular phone, a car navigation system, a portable DVD player, a PC or the like.
The stereoscopic image data creating device, the stereoscopic image data reproducing device and the file management method according to the present invention are capable of performing, when displaying a stereoscopic image, safer parallax adjustment with higher reliability and higher flexibility that is suited to the display size, based on a limit value of parallax determined by the camera mechanism and a limit value of parallax of the stereoscopic image.
Number | Date | Country | Kind |
---|---|---|---|
2010-248567 | Nov 2010 | JP | national |
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/JP2011/075289 | 11/2/2011 | WO | 00 | 3/14/2013 |