The perception of 3D scenes by human vision is largely based on two mutually interacting visual adaptation processes—stereoscopy and the eye's focal accommodation to object distance. Because the typical eye separation is about 60 mm, fixation upon objects at different distances gives different angles of convergence between the axes of the eyes, known as the “stereovision” effect or “stereoshift.” Though usually unconscious, this convergence angle is registered by the brain and contributes to the perception of object distance. To provide high resolution imaging of objects at different distances, the eye adjusts the shape, and thus the optical power, of the lens so it sharply focuses objects at a selected distance, a phenomenon known as “accommodation.” These two processes cooperate and engender accurate depth perception. For natural viewing of an object, the convergence and accommodation of the viewer's eyes should both be correct for the distance to the object.
There are several types of stereo displays that use stereo effects to simulate the perception of the 3D vision. In U.S. Pat. No. 4,734,756, a stereovision system generates on a screen stereoshifted images of the scene in two colors. The observer is wearing eye glasses with lenses of two different colors, and can see only one of the two images with each eye. The mixture of two images in the brain creates monochrome stereovision perception. This method usually is referenced as the anaglyph technique. U.S. Pat. Nos. 5,537,144; 5,594,843; 5,745,164 disclose glasses with perpendicularly aligned polarizing lenses for separate delivery of stereoshifted images to the left and right eyes. In U.S. Pat. No. 5,821,989, the stereoshifted images for left and right eye are repeatedly generated in a time sequence and liquid crystal shutter glasses are used to expose each eye in time with the respective image. U.S. Pat. No. 5,886,675 proposed an autostereoscopic display that does not need the use of special glasses. Two projectors generate images upon a holographic screen that conjugates the exit pupil of one projector with the pupil of the observer's left eye and the exit pupil of the second projector with the pupil of the right eye.
However, in all these techniques, the actual image viewed by the observer is at a fixed single distance from the observer, so that “objects” at supposed different distances are in fact all in focus at the same accommodation of the lens of the eye. This creates unnatural perception of a 3D scene that contains a number of objects at different supposed distances, for example, scenes with a close object in front of a landscape background. A true 3D display has not only to provide for the imaging of objects at different supposed distances stereoshift simulation but also to present the visible (usually virtual) image of the object at a distance from the observer's eye that adequately simulates the supposed distance to the object, so that the observer's eye can use both vision distance adaptation processes—stereovision and distance accommodation.
U.S. Pat. No. 5,956,180 proposed to use several screens at different distances from the observer with a beam combiner for 3D scene simulation. The problem is that the number of distance “slices” is in practice restricted to 2, and such an arrangement has problems with simulation of combinations of close and remote scenes together. In other words, the dynamic range of distance simulation is very limited. Another approach for comprehensive 3D scene simulation is found in displays that use variable computer generated holograms, as proposed in US patent application 2006/0187297. The holographic approach may provide comprehensive 3D scene perception but will experience problems with dynamic scenes due to its extremely high computation burden, as well as the limitations associated with RGB projection and image resolution.
The present invention provides autostereoscopic dynamic scene projection with improved depth perception over the prior art.
An embodiment of the presently proposed autostereoscopic display will have two scene projectors. The exit pupil of one projector is conjugated with the pupil of the left eye of the observer, while the exit pupil of the second projector is conjugated with the pupil of the right eye of the observer. The projector pupil diameter exceeds the eye pupil diameter to provide a reasonably sized “eye box,” the region within which the eye must be positioned to see the projected image fully. This provides for comfortable vision, by allowing some movement of the eye without losing the view of the image. The two projectors deliver to the observer's eyes 2D “depth-slice” images of the 3D scene with a stereoshift.
In one embodiment, variable-curvature membrane micromachined mirrors are incorporated into the projection scheme to provide appropriate real time image distance simulation by generating an image of each “slice” of the 3D scene at the correct distance from the observer for the objects in that slice. An alternative embodiment uses multiple layered liquid crystal lenses that perform a similar function. The pairs of slice images are generated so as to have the corresponding stereoshift for the slice distance. The observer can focus his or her eyes on a chosen scene “slice,” and the focusing accommodation can then be consistent with the convergence induced by the stereoshift. Consistent distance perception can thus be achieved.
With current technology, at least five depth “slices” can be projected during each image frame. The minimum frame refresh time is typically around thirty milliseconds, or 30 cps, to avoid a visible flicker. However, more than five depth slices can be provided as long as there is sufficient brightness for each “image” slice to provide sufficient luminous flux for each slice, and as long as the image generating and focusing elements of the system can change from slice to slice sufficiently quickly. There is no real limit to the number of slices that can be handled by a typical observer. This novel approach has many applications including: more realistic 3D games, military and civilian simulations, opthalmological testing, to name a few.
In an embodiment, an achromatic negative doublet is positioned in the path of light to and from said deformable mirror. The doublet is selected to shifting the required range of powers of the deformable mirror for the desired apparent slice distances, so that in normal operation the mirror is always concave, optionally including a flat position at one end of its range.
Aspects of the invention also provide methods of displaying a 3D image that comprises supplying slice images corresponding to parts of a scene at different distances from a viewer, and displaying each slice image in turn using different settings of a variable power optical element so as to create an apparent image of each slice image at an appropriate apparent distance from an observer position.
In an embodiment, the method comprises displaying different images to each eye of an observer, and stereoshifting the slice images displayed to different eyes to give parallax and eye convergence consistent with the apparent distances of the different slices.
The above and other aspects, features and advantages of the present invention will be apparent from the following more particular description thereof, presented in conjunction with the following drawings wherein:
A better understanding of various features and advantages of the present invention may be obtained by reference to the following detailed description and accompanying drawings, which set forth illustrative embodiments in which principles of the invention are utilized.
Referring to the drawings, and initially to
As shown by way of example in
Table 1 lists the optical-prescription surface list for the preferred embodiment shown in
The image source (Surface 0 in Table 1) of the projector shown in
The beamsplitter 102 (Surface 1 in Table 1) directs the LCD output to the assembly of negative achromatic doublet 104 (bounded by surfaces 2, 3, and 4) and a Micromachined Membrane Deformable Mirror (MMDM) 105 (surface 5) made by Flexible Optics Corp or equivalent component from another manufacturer. This mirror can produce different curvatures at very rapid rates, 1000 Hz, which exceeds the projector requirements. Each curvature corresponds to a particular depth slice. The mirror 105 is synchronized with the LC display 101, so that each slice frame from the LCD display is reflected off the MMD mirror 105 at the correct mirror curvature to produce the appropriate image position for the slice.
After reflection by the membrane mirror 105, each depth slice passes back through the achromatic doublet 104 (Surfaces 6, 7, and 8 are Surfaces 4, 3, and 2 in reverse) and the beamsplitter (inactive surface 9) and is projected to the eye by a reverse telephoto lens (surfaces 10 through 23). Optically inactive surface 16 of Table 1 is an aperture stop of the reverse telephoto lens 106, and is separately enumerated for convenience. The position of the exit pupil of the projection lens 106 is conjugated with the eye pupil of the observer. The diameter of the projector exit pupil exceeds the pupil diameter of eye 107, in order to accommodate small shifts in the observer's head position, and also so that the distance between the two projectors does not need to be adjusted too critically for each observer. The reverse telephoto lens 106 is calculated to have sufficient back focus release for mounting the membrane mirror 105 and the LC display 101, and sufficient long distance exit pupil release for conjugation with an observer's eye pupil 100 mm from the last optical surface of the lens 106.
In the projector of
The optical system 100 was designed for four configurations, listed in Table 2 below. Dimensions are in millimeters. In configurations 1, 2, and 3 the radius of curvature of the mirror 105 is 150 mm (i.e., curvature=0.006666 in Table 1) and the eye position laterally (in the Y direction) is on center or off center one millimeter. At this value of mirror radius of curvature the lens assembly 106 projects the image of LCD source 101 with a flat wavefront (i.e. from infinity, at depth-slice #1). The design assumes that a typical human eye focal length when it is focused at infinity is 17 mm.
The system 100 of
The normal human retina can register 200 line pairs per mm, or 2.5 microns resolution, about the size of the cone cells in the retina. Only 100% modulation, however, is visible at this highest of all retinal spatial frequencies and no incoherent optical system can deliver that 100% modulation.
In
The optical prescription of a second embodiment of a suitable projector with extended field of view and larger eye box is shown in Table 3.
The image source of projector 700 shown in
Membrane mirror 704 (surface 5) which has a 10 mm clear aperture, changes its radius of curvature from infinity to 150 mm. The curved membrane mirror, in combination with the reverse telephoto lens, creates a virtual image of the display output, at a distance from the exit pupil that is controllable by varying the curvature of the membrane mirror. The optical system was designed in four configurations listed in Table 4 below.
In configurations 1, 2, and 3 the radius of curvature of the mirror is 150 mm, or the curvature is 0.006666 in Table 1. When membrane mirror has 150 mm radius of curvature, the image is projected from infinity and a typical human eye, focused on an object at infinity, has a focal length of 17 mm. Using this system the eye can observe images with a field of view of 6 degrees field, which can be increased in a further development. The system was optimized for three eye positions: centered (Configuration 2), and shifted ±2.5 mm (Configurations 1 and 3) and for waveband 0.45-0.65 microns.
In the second embodiment the MMDM operates with a 10 mm clear aperture. A mirror with 10 mm clear aperture and with 150 mm radius of curvature has an 80 microns sag.
Current commercially available Flexible Optics Membrane Micromachined Deformable mirrors designed for real time adaptive optics wavefront correction have a maximum correction span of 25 microns. Nevertheless the 80 microns or more sag can be achieved with currently available technology. (Private communication with Dr. G. Vdovin of Flexible Optics Corp.)
The projector shown in
The image quality (MTF) for configuration 2, with the observer's eye in the center of the eye box and focused at infinity, is shown in
In
The mirror response time is about 1 millisecond. Currently available LC displays can operate with 150 Hz frequencies. So the system is able to generate up to 5 depth image “slices” during each 33-millisecond frame. At every frame the observer will receive five pairs of stereoshifted image “slices” located at five different distances from observer. The observer can focus his or her eyes on the chosen depth in accordance with the distance perception given by the stereo disparity of each depth slice.
While in the first and second preferred embodiments shown above the MMDM was used as an optical element of variable power, other technologies can be also be used. One example of a competitive technology to MMDM is to use a stack (sandwich) of electro-switchable LC Fresnel lenses. Another feasible competitive technology can be the stack of electro-switchable LC Fresnel zone plate lenses. A suitable lens system is described in Y. Fan, H. Ren, S. Wu “Switchable Fresnel lens using polymer-stabilized LC”, Opt. Express, Vol. 11, No. 23, 2003, which is incorporated herein by reference in its entirety. In both of these technologies the electro-optical lenses can be switched on and off during the imaging frame to create an array of precalculated focal powers. At any moment only one lens will be activated. In this case the number of projected depth slices will be equal to the number of LC Fresnel lenses packaged in the stack. A more sophisticated algorithm includes the use of LC Fresnel lenses switchable in combination, allowing in principle up to 2n−1 depth slices for n lenses.
Referring to
The displays 101, 701 and the mirrors 105, 705 or other optical elements of variable power are controlled by a driver, shown functionally in
The projection system shown in
For example, the pair of slice images for a single slice will typically be identical except for small zones, particularly at the edges of occluding objects in slices nearer the observer, as well as side surfaces of objects in each slice, and for the offsetting of objects at different depths within each slice. For example, the images for successive frames of an animated or otherwise moving image will similarly often have only small differences. Techniques for the efficient compression of images that are only slightly different are well known and, in the interests of conciseness, are not described here.
The binocular projector system disclosed herein can provide the observer with a natural perception of 3D scenes. It can be used in new generation of 3D TV systems, 3D displays, 3D head mounted displays, video games stations, flight simulators, Unmanned Aerial Vehicles control console simulators, Unmanned Ground Vehicle control console simulators, and other such 3D video systems.
Although specific embodiments have been described, the person skilled in the art will understand how variations may be made, and how features of different embodiments may be combined. For example, the number of slices proposed was based on the persistence of vision of the human eye, for which a frame refresh time of 33 milliseconds, corresponding to the 30 frames per second that is standard for television and video in the U.S.A., is reasonable for avoiding perceptible flicker, given the response speed of the available LC displays. If a faster display is available, the number of frames per second may be increased to reduce flicker. Alternatively, or in addition, the number of depth slices may be increased, though only at the expense of flux throughput, requiring brighter illumination. Conversely, the frame rate may be reduced, to reduce the demand on system resources, or free up resources to increase the number of slices, if a more noticeable flicker is acceptable.
In a desirable embodiment, at the display's working distance the diameter of the exit pupil of each projector is 60 mm, and the centers of the exit pupils are 60 mm apart, corresponding to the separation of the eyes of a typical human observer. Thus, the exit pupils of the two projectors constitute two touching circles. An observer needs only to place the pupil of left eye anywhere in the exit pupil of the left projector and the right eye anywhere in the exit pupil of the right projector. The proposed 3D display thus does not need adjustment to eye pupil diameter and eye spacing of different observers, and can permit sufficient movement of the observer's eyes and head to permit of comfortable viewing.
In an embodiment of a process for using the projection systems described, the generation of the still 3D scene begins with a standard geometrical procedure of calculating the obscuration of objects by other objects from the observer's viewpoint, and revealing the array of the active visible points at the scene. Then the array of the angular stereo disparities for the active points will be calculated. Because of the stereo disparity, the active visible points in partly obscured slices are different for the two eyes. The calculations can be made for the standard 60 mm observer eye separation, or adjusted for the eye spacing of a specific customer or other observer or category of observers.
In an embodiment, to generate the 3D scene the whole depth space from 250 mm distance to infinity will be divided into 5 zones at equal increments of eye accommodation power, which is proximately 1 diopter of accommodation for each zone. All objects and associated stereoshift data in the scene will be combined into 5 depth-slice files in accordance with the zone in which each object is located. Arrays of the angular stereoshifts will be transformed into arrays of linear lateral shifts in the focal plane of the projectors, and five slices will be generated in a cycle of 33 milliseconds. Each depth slice will be generated with the deformable mirror set to the radius of curvature associated with the position of that slice. For a dynamic scene, the still scene simulation algorithm will be repeated per 33 millisecond cycle with a new position of any moving object in each cycle.
In the above description, it has been assumed that the slice images are generated in pairs, one for each eye at a common depth, and that the pairs are generated in sets of five, one pair for each of the five depth slices, from or for a single 3D image, or a single 3D frame of a video sequence. It has also been assumed that the images are projected in their pairs, with the two projectors operating in synchrony. Those constraints are not strictly necessary, but as a practical matter it is usually most efficient to render a single 3D frame into five pairs of slices, because much of the analysis can be more efficiently used. For example, a single calculation of occlusion of objects in more distant layers by objects in nearer layers can then be used in generating all of the layers involved.
The preceding description of the presently contemplated best mode of practicing the invention is not to be taken in a limiting sense, but is made merely for the purpose of describing the general principles of the invention. The full scope of the invention should be determined with reference to the Claims.
This application claims benefit of U.S. Provisional Patent Application No. 61/273,743, filed Aug. 7, 2009, which is incorporated herein by reference in its entirety.
Number | Date | Country | |
---|---|---|---|
61273743 | Aug 2009 | US |