This invention relates to digital cameras, and to enhancing auto focusing performance and image processing.
Cameras and other imaging devices normally have a single plane of focus, with a range of acceptable focus (“depth of field”) near that range. Large apertures are useful for low light imaging, but create a narrower focal range. This means that it is impossible in some circumstances to generate sharp images of multiple subjects at different focal distances without the aid of external lighting, narrower apertures, and other measures that can affect desired images.
Modern digital cameras may employ a capability called “focus stacking” in which a fixed camera images a stationary inanimate subject (such as for product photography) and takes a series of many images at regular focal distance intervals. Each image is at an incrementally different focal distance to cover the range of distances from the closest to the farthest point of the subject, with the distances being selected for even spacing in the range, without regard to the elements of the subject or their distance. The image distance intervals are narrow enough to ensure that the intervals are less that the depth of focus of each image to that all subject points are in focus for at least one of the images. The images are then post-processed into a single image that uses the sharpest image segment for each area on the subject to generate an overall sharp image.
While effective for stationary subjects this is not useful for moving subjects like people. Even a person sitting relatively still for a portrait may move enough to generate misalignment of the images. Moreover, the number of images required can be in the dozens or even hundreds to cover large subjects, requiring extended periods when motionlessness is required. For example, even a fast 20 frames per second shutter with a limited 20-frame image will require one second of motionlessness, which is beyond the capacity for hand-holding, subject motion, and image stabilization. Moreover, the appearance of having all points of a subject in focus is unnatural and may be undesirable in instances when only selected elements (at different focal distances) are desired to be in focus. For example, a sharp image of each person in a small group (or of all facial features of a model) while the background is blurred to eliminate distractions.
Accordingly, there is a need for a camera system comprising a body that contains a lens with a range of focus settings and an image sensor operable to record an image. The camera system has a controller operably connected to the sensor to receive the image, and the controller is operably connected to the lens to control the focus setting. The controller is operable to focus the lens on a selected point, and the controller is operable to determine at least two different first and second subject elements. The controller is operable to focus the lens on the first subject and record a first image, and the controller is operable to focus the lens on the second subject and record a second image.
The system operates to image a scene 24 shown in
When the two subjects are at different focal distances such as is illustrated, the camera operates to take two images in rapid succession.
The sequence of captured images may simply be stored, for (in one basic usage) the user later to select which is the desired single subjects in an otherwise normally focused image. This avoids the need for the photographer to select from among subjects during the imaging event. An example might include a sideline image of a football line of scrimmage, with a rapid sequence of each of the linemen being imaged in focus. Notably for all embodiments, the system does not simply take a sequence of images at limited intervals irrespective of the subjects' actual locations in hope of getting everything approximately in focus. It operates to select image focal distances based on the location of actual subjects, and operates preferably to focus specifically on selected and identified subjects, with only that many images captured, and each subject optimally in focus. Even two subjects at very different focal distances will have images captured rapidly in sequence without intervening images to create an undesired delay in capturing the two (or more) most critical subjects.
In more advanced embodiments, the image is processed and composited using the techniques associated with conventional focus stacking or bracketing to create a single image with both (or all) subjects in focus. In a simple example, the sharply focused face is overlaid on the same subject's blurry face of the other image, in the sense of photoshopping a face in a group image to eliminate an eye blink. In the simple example, the face may be positioned in the same location on the frame, with the assumption that the brief interval between shots does not cause an objectionable offset.
In a more advanced system, the blurred image of the subject is analyzed to establish location datum points, and these are correlated with location datum points in the sharply focused face image, so that the sharp face is pasted onto the main image in registration with the face (or any subject) of the reference image. For instance, the blurry eyes of the left subject in
More than just the face or other key element of the second image may be composited with the first image. While the boundary of the face may be identified and pasted onto the other image, more advanced approaches may be employed. As with focus bracketing systems, each location may be assessed to determine which of the two or more images is in closer focus to represent that location more sharply. This will ensure smooth transitions where each image is about equally out of focus (or sharp) at a transition between images. And as with location of the sharp element in registration with the location of the blurred image of the same element, the transition areas may be stretched and shafted to ensure that there is registration potentially at many different points along a perimeter between images.
Foreground and background blur control may be employed. In the illustrated case of the nearer and farther subjects of
The nearer subject would be captures along with foreground subjects and other subjects closer to the camera than a midpoint between subjects that are rendered equally sharp in either image. The background and beyond-midpoint subjects would be taken from the image in which the background subject is sharp. A third option is to do the reverse, and to use the blurrier background from the image focused on the nearer subject to provide an often desired, and the blurrier foreground (if any) from the image focused on the farther subject. This provides a sometimes-desired greater blur of non-subject elements. A basic version toward this result where only background is a concern is to bias toward using the foreground subject image as the master image to get the blurriest background, and compositing in only the rear subject as needed. In any embodiment, these alternatives can be all tested in post processing and the user (or expert system) given the choice between the different looks provided, and to avoid any approaches that introduce unwanted artifacts be rejected in favor of better results.
An additional implementation for a person's face 40 is depicted in
The system may be programmed to recognize any of these specific elements, and to shoot images in rapid succession, with each in focus, changing focus between images. As above, the images may be composited in the manner of focus stacking systems, or the sharp selected features may be composited into a selected master image.
In an alternative embodiment, the nearest and farthest desired subjects may be identified, and the camera operated to automatically shoot a number of evenly spaced images at focal distances between the selected subjects, creating an effective depth of field. The intervals may be evenly spaced, and may be based on an analysis by the processor or controller of whether the selected subjects are at far apart enough focal distances to necessitate one of more intermediate images to generate sharp images (such as the nose of a subject when focusing primarily on a near and far eye). Also, if the subjects are separated (as with two people in foreground and background) how many images are required between the two selected focal distance extremes. This may also determine whether other subjects are in view in the middle distance between the two primary subjects. Because the intermediate subject matter may be of lesser importance, the camera may first image the two or more primary subjects detected and identified as important, then capture the intermediate focal distance images that are less problematic iflost due to subject movement of camera shake.
Conventional focus stacking may be improved (made faster and potentially hand-holdable) by employing similar principles by using a detected subject such as a face or eye at the starting point, and optionally a second detected subject as an end point, using appropriate intervals as determined by automatic image analysis (e.g. more images at tighter intervals for large aperture fast lens settings with thin depth of focus, fewer for smaller aperture slow lens settings).
The image processing may be in camera or post processing, and may be done interactively with the user who may select from the recorded images.
An alternative focus stacking feature may employ manual selection of at least one limit of focus by user input. This may be made as noted by selecting a subject or image area using a cursor or touch screen, or by selecting a focal distance by manually or automatically focusing to a limit (such as a near subject) and then indicating to the camera the selection such as by a button, voice command, or other input. While useful to select both limits, in some cases only a single user-selected input is needed, such as when the other limit is at infinity or at closest focus.
A focus stacking embodiment may differ from conventional focus stacking in that at least one of the image distances may be based on a subject's focal distance, either as manually selected or at automatically determined. Conventional systems employ a sequence of image distances that range from closer to the desired subject(s) to beyond the subjects, with the understanding that the small change in focus will mean that all elements are in sharp focus; the system does not base focus distances on the actual subject distance, but merely over a range that includes the subject but is not focused on it—no one image is focused on a subject, only at incremental distances based on things other than the actual subject distance. The preferred embodiment may base one of the several images on one subject, and then have a range that extends in one or both directions toward and away from the subject by appropriate increments. As an alternative, a second (or more additional) subject may be forced for one of the incremental distances, either as an extra shot in the range of distances, or as a primary second point, with the interval between first and second image to be subdivided into appropriate mathematical increments.
In such an application with only one subject 102 deliberately focused in plane 120C as shown in
Alternatively as shown in
With respect to the imaging of two subjects at different focal distances, it is notable that the camera has a single lens. While it normally has numerous lens elements, it is a single lens in the sense that it focuses only at one distance. Only one plane of focus is provided, and anything away from that distance will be depicted less sharply based on how far it is from the focal distance.
This is to be contrasted with bifocal systems that are essentially two lenses, each having its own focal distance. These enable to simultaneous focus on different-distance subjects, such as for broadcast of a footrace with two different runners at different distances, each in a different half of the frame. Such bifocal systems are simultaneously in focus on two different subjects, and do not shift focus from one to the other. They require complex dedicated systems having multiple lenses (or lens segments) and can focus only on as many subjects as they have lenses (typically two). Such systems will typically divide the frame in half arbitrarily based on the field of coverage of each lens, and this dividing line is preestablished without respect to the subject image. Such bifocal lens systems are unable to change the number of subjects in focus, and are unable to base the number of images captured and composited based on the number of desired subjects. For instance, the preferred embodiment may generate an image of basketball players on a court with each player in focus (acknowledging that in this example subject motion may be a challenge at current frame rates and a group portrait may be a more apt example).
In the preferred embodiment the two images are composited along a potentially complex boundary that may be as simple as an irregular oval around an eye or a face, or a complex set of contour lines based on lines of equivalent defocus. The analogy is superimposing two topographic maps in which elevation is the analog of sharpness, and at each point on the map, employing the data from the higher elevation map at that point. The boundaries between areas drawn from the first map and the second map will be the lines of equal elevation. Analogously, the boundaries between the first image and the second image will be the lines of equal defocus. The system can make exceptions to tolerate minor differences of defocus degree to avoid capturing large and irrelevant areas of an image, tolerating a threshold of difference to close a gap in the boundary around a desired subject such as a face or an eye.
In alternative embodiments, the compositing process may take note of whether the color or brightness of a location has varied significantly from one image to the other in a way that is not explained by a focus difference, but more likely due to subject motion. Such instances may be verified or discovered by defocusing the shaper image at that location to compare. If a difference is found, the compositing process may seek a second choice nearby area or boundary path that avoids the area with shifted subject but has tolerable similarity of defocus adequate to avoid an objectionable discontinuity between sharp and blurred. In some embodiments, the original images are preserved to enable various choices to be offered to the user for approval.
In further alternatives, a rapid sequence of images continuing more than one per subject may be imaged, allowing the generation of a composite image based not just upon sharpness, but on subject expression (avoiding a blink, for instance). This may be attained either with at least two rapid-fire image of the first subject in focus, then without delay two or more images of the second subject in focus. The images just before and after the focus transition are most likely to be aligned in the event of some subject motion, but the prior and subject images may offer usable substitutes in the event these are flawed.
As discussed, the proposed system is for images captured in rapid succession to minimize the time between images to minimized subject movement and misalignment. The images are captured “without delay” in the sense of no added delay or pause other than as necessary for the system. The capture is “immediate not in the sense of simultaneous, but only in the sense of one being as immediately after the other as is technically possible. This is understood to convey that there is no pause or unnecessary delay between the images, and that they are recorded as rapidly as possible under the conditions and capabilities of the equipment. When a mechanical shutter is desired, it may have a slower rate than an electronic or global shutter.
It is anticipated that future camera, sensor, and processor advancements may generate effective shutter speeds comparable to that of video with frame rates of 60, 120, and 240 FPS and up. In these instances, the rate of lens focus may be a limiting factor, and “as fast as possible” relates to the time required to physically move lens elements to change the point of focus. In such instances, the system is imaging sequentially without delay, other than the necessary time to focus. In extreme alternatives, the imaging may be continuous at the fastest available frame and the lens focus swept at highest speed from one focal distance to the other even if images are captured of intermediate distances between the subjects.
In different embodiments, the compositing and other image processing following capture may be done either in the camera to generate one image, with or without preserving the source images. The compositing may also be done in post process with a different processor other than the camera (such as a computer workstation).
This application is a Continuation-in-Part of U.S. patent application Ser. No. 17/345,677 filed on Jun. 11, 2021, entitled “DIGITAL CAMERA WITH MULTI-SUBJECT FOCUSING,” which is hereby incorporated by reference in its entirety for all that is taught and disclosed therein.
Number | Date | Country | |
---|---|---|---|
Parent | 17345677 | Jun 2021 | US |
Child | 17669405 | US |