Embodiments of the present invention generally relate to systems and methods for determining a set of orthonormal polynomials from a complete set of non-orthogonal polynomials over a circular or noncircular pupil. More particularly, embodiments relates to systems, methods, and software for wavefront analysis in optical design and testing for wavefront representation.
An optical system for imaging or propagation of laser beams generally has a circular pupil. It is quite common to analyze the aberrations of such a system with Zernike circle polynomials, which are orthogonal across a unit circle. However, often the pupil is circular for on-axis point objects only. For off-axis point objects, the pupil can be elliptical or irregular because of vignetting. For large optical systems, the primary mirror may be segmented, where each segment is hexagonal. Similarly, high-power laser beams may have rectangular or square cross sections. The use of circle polynomials for such cases may not be appropriate. Thus, there is a need for polynomials that are orthogonal over noncircular pupils.
Since the circle polynomials form a complete set, in principle, the aberration function can be expressed in terms of them regardless of the shape of the pupil. However, such an expansion can take a large number of terms and the advantages of orthogonality may be lost. For example, the expansion coefficients may not be independent of each other and their values may change as the number of aberration terms changes. Moreover, the variance of the aberration may not be equal to the sum of the squares of the aberration coefficients. Moreover, their physical significance may not be of much use since they may not correspond to balanced aberrations.
In practice, the aberrations of a system can be determined either by tracing rays through it or by testing it interferometrically. Thus, the aberrations may be known only at a discrete set of points, and the coefficients thus obtained may be in error even for an axial point object, since the Zernike circle polynomials, which are orthogonal over the full circular region, may not be orthogonal over the discrete points of the aberration data set.
Therefore, in light of above, it would be desirable to provide improved methods, systems, and software for determining a set of orthonormal polynomials over noncircular pupils or the discrete data set and techniques for obtaining the orthonormal aberration coefficients. Embodiments of the present invention provide solutions for at least some of these needs.
Advantageously, embodiments of the present invention provides systems, methods, and software for determining a set of orthonormal polynomials over a noncircular pupil from a complete set of polynomials using analytical and numerical approaches. Embodiments of the present invention can be generally useful for reducing the error in wavefront fitting and reconstruction due to the incorrect use of non-orthogonal polynomials, such as Zernike circle polynomials as described by V. N. Mahajan, Optics Imaging and Aberrations, Part II: Wave Diffraction Optics (SPIE, 2004 Printing), the full disclosures of which are incorporated herein by reference. Embodiments of the present invention can provide enhanced accuracy of wavefront information in data reduction in optical design and testing.
Moreover, embodiments of the present invention can be readily adapted for use with existing wavefront analysis software in optical design and testing. One particular embodiment provides a general approach in determining of a set of orthonormal polynomials over noncircular pupils, such as an annulus, a hexagon, an ellipse, a rectangle, a square, or an irregular shape.
In one aspect, embodiments of the present invention encompass methods of determining a set of orthonormal polynomials F over a domain Z. The method can include, for example, selecting a complete set of polynomials P as the basis for the set of orthonormal polynomials F, calculating a conversion matrix M comprising an inner product of the complete set of polynomials P over the domain Σ, and determining the set of orthonormal polynomials F based on the conversion matrix M and the complete set of polynomials P. The domain Σ may be one-dimensional, two-dimensional, or multi-dimensional. The domain Σ can be circular or noncircular. In some cases, the domain includes a noncircular pupil. Relatedly, a noncircular pupil can include an annulus, a hexagon, an ellipse, a rectangle, a square, an irregular shape, and the like. A complete set of polynomials P can include a power series. A complete set of polynomials P can include Zernike circle polynomials. A complete set of polynomials P can include a Fourier series. A complete set of polynomials P can include Taylor monomials. A complete set of polynomials P can include Jacobi polynomials. A complete set of polynomials P can include Chebyshev polynomials. A complete set of polynomials P can include Legendre polynomials. A complete set of polynomials P can include Laguerre polynomials. A complete set of polynomials P can include Hermite monomials. In some cases, an inner product of polynomials is calculated numerically as summation. In some cases, a conversion matrix M is determined by a classical Gram-Schmidt orthogonalization process. In some cases, a conversion matrix M is determined by a non-recursive method, which may involve an analytical method or a numerical method, and may involve the use of a Cholesky decomposition or a QR factorization. In some cases, a conversion matrix M is determined by a matrix transformation.
In another aspect, embodiments of the present invention encompass methods of determining a set of orthonormal polynomials over a noncircular pupil. Methods can include, for example, selecting a complete set of polynomials P as the basis for the new orthonormal polynomials F, determining a set of sub-areas of the noncircular pupil, formulating a simple relationship between x and y within each of the said sub-area such that polynomial P is integratable within each sub-area, calculating the conversion matrix M whose elements are the inner products of the complete polynomials over the noncircular pupil, and obtaining the set of orthonormal polynomials from the conversion matrix M and the complete set of basis P. A noncircular pupil can include an annulus, a hexagon, an ellipse, a rectangle, a square, an irregular shape, or the like. A complete set of polynomials P can include a power series, Zernike circle polynomials, a Fourier series, Taylor monomials, Jacobi polynomials, Chebyshev polynomials, Legendre polynomials, Laguerre polynomials, Hermite monomials, or the like. An inner product of polynomials can be calculated numerically as summation. A conversion matrix M can be determined by a classical Gram-Schmidt orthogonalization process, a modified Gram-Schmidt orthogonalization process, a non-recursive method, or a matrix transformation. A non-recursive method can include an analytical method or a numerical method. A non-recursive method can involve the use of a Cholesky decomposition or a QR factorization.
In another aspect, embodiments of the present invention provide a computer program stored on a computer-readable storage medium. The computer program can include code for calculating a pre-chosen complete set of polynomials P in a discrete, pre-chosen coordinate system, code for calculating the conversion matrix M whose elements are the inner products of the complete polynomials over a domain Σ, and code for determining the set of orthonormal polynomials from the conversion matrix M and the complete set of basis P. The computer program can also include code for integrating over the noncircular pupil of the inner products of the complete set of polynomials P symbolically. The computer program can also include code for integrating over the noncircular pupil of the inner products of the complete set of polynomials P numerically. The domain Σ can be one-dimensional, two-dimensional, or multi-dimensional. A two-dimensional domain can be circular or noncircular. A noncircular pupil can include an annulus, a hexagon, an ellipse, a rectangle, a square, an irregular shape, or the like. A complete set of polynomials P can include a power series, Zernike circle polynomials, a Fourier series, Taylor monomials, Jacobi polynomials, Chebyshev polynomials, Legendre polynomials, Laguerre polynomials, Hermite monomials, or the like. An inner product of polynomials can be calculated numerically as summation. A conversion matrix M can be determined by a classical Gram-Schmidt orthogonalization process, a modified Gram-Schmidt orthogonalization process, a non-recursive method, or a matrix transformation. A non-recursive method can include an analytical method or a numerical method. A non-recursive method can involve the use of a Cholesky decomposition or a QR factorization.
In another aspect, embodiments of the present invention encompass a system for determining a set of orthonormal polynomials over a domain Σ. The system can include, for example, a module configured to select a complete set of polynomials P as the basis for the new orthonormal polynomials F, a module configured to calculate the conversion matrix M whose elements are the inner products of the complete polynomials over the domain Σ, and a module configured to determine the set of orthonormal polynomials from the conversion matrix M and the complete set of basis P. The domain Σ can be one-dimensional, two-dimensional, or multi-dimensional. A two-dimensional domain can be circular or noncircular. A noncircular pupil can include an annulus, a hexagon, an ellipse, a rectangle, a square, an irregular shape, or the like. A complete set of polynomials P can include a power series, Zernike circle polynomials, a Fourier series, Taylor monomials, Jacobi polynomials, Chebyshev polynomials, Legendre polynomials, Laguerre polynomials, Hermite monomials, or the like. An inner product of polynomials can be calculated numerically as summation. A conversion matrix M can be determined by a classical Gram-Schmidt orthogonalization process, a modified Gram-Schmidt orthogonalization process, a non-recursive method, or a matrix transformation. A non-recursive method can include an analytical method or a numerical method. A non-recursive method can involve the use of a Cholesky decomposition or a QR factorization.
In another aspect, embodiments of the present invention encompass a system for determining a set of orthonormal polynomials over a noncircular pupil. The system can include, for example, a module configured to select a complete set of polynomials P as the basis for the new orthonormal polynomials F, a module configured to determine a set of sub-areas of the noncircular pupil, a module configured to formulate a simple relationship between x and y within each of the sub-area such that polynomial P is integratable within each sub-area, a module configured to calculate a conversion matrix M whose elements are the inner products of the complete polynomials over the noncircular pupil, and a module configured to determine the set of orthonormal polynomials from the conversion matrix M and the complete set of basis P.
For a fuller understanding of the nature and advantages of the present invention, reference should be had to the ensuing detailed description taken in conjunction with the accompanying drawings.
FIGS. 12B(i) and 12B(ii) illustrate wavefront fitting error as a function of obscuration ratio for various terms for an annular domain according to embodiments of the present invention.
FIGS. 12C(i), 12C(ii), and 12C(iii) illustrate wavefront fitting error as a function of aspect ratio for various terms for an elliptical domain according to embodiments of the present invention.
FIGS. 12D(i), 12 D(ii), and 12 D(iii) illustrate illustrates wavefront fitting error as a function of aspect ratio for various terms for a rectangular domain according to embodiments of the present invention.
FIGS. 14C(i) and 14C(ii) illustrate wavefront reconstruction error as a function of obscuration ratio for various terms for an annular domain.
FIGS. 14D(i), 14D(ii), and 14D(iii) illustrate wavefront reconstruction error as a function of aspect ratio for various terms for an elliptical domain according to embodiments of the present invention.
FIGS. 14E(i), 14E(ii), and 14E(iii) illustrate wavefront reconstruction error as a function of aspect ratio for various terms for a rectangular domain according to embodiments of the present invention.
Embodiments of the present invention can be readily adapted for use with existing laser systems, wavefront measurement systems, and other optical measurement devices. Although the systems, software, and methods of the present invention are described primarily in the context of a laser eye surgery system, it should be understood the present invention may be adapted for use in alternative eye treatment procedures, systems, or modalities, such as spectacle lenses, intraocular lenses, accommodating IOLs, contact lenses, corneal ring implants, collagenous corneal tissue thermal remodeling, corneal inlays, corneal onlays, other corneal implants or grafts, and the like. Relatedly, systems, software, and methods according to embodiments of the present invention are well suited for customizing any of these treatment modalities to a specific patient. Thus, for example, embodiments encompass custom intraocular lenses, custom contact lenses, custom corneal implants, and the like, which can be configured to treat or ameliorate any of a variety of vision conditions in a particular patient based on their unique ocular characteristics or anatomy.
Embodiments of the present invention involve the representation of ocular aberrations. In some cases, embodiments involve the representation of aberrations in other optical systems. Often, Zernike polynomials, circle polynomials, or Zernike circle polynomials are used to characterize ocular aberrations. Such polynomials are orthogonal over a circular pupil. Embodiments of the present invention provide polynomials that are orthogonal over any of a variety of shapes. These polynomials can be used with optical systems that have a pupil or aperture that is circular or noncircular. For example, these polynomials can be used with human eyes that have elliptical or irregularly shaped pupils. Similarly, these polynomials can be used with optical systems such as telescopes which may have hexagonal apertures. Relatedly, these polynomials can be used with optical systems such as high power lasers which may have square or rectangular apertures. It is understood that these polynomials can be used with any of a variety of optical devices. Embodiments of the present invention provide sets of polynomials, where each is orthonormal over a specific pupil or aperture. For example, some embodiments provide a set of elliptical polynomials for use with an elliptical pupil or aperture. Some embodiments provide a set of rectangular polynomials for use with a rectangular pupil or aperture. Polynomials may be orthonormal, which can be defined as orthogonal and normal. Orthonormal polynomials can have various properties. For example, the truncation of the wavefront expansion may not change the expansion coefficients. In some embodiments, each orthonormal polynomial except for the piston term has a mean value of zero. Further, a wavefront mean value can equal the piston coefficient. In some embodiments, each orthonormal polynomial has a minimum variance. What is more, a wavefront variance can be the sum of squares of each expansion coefficient, excluding the piston coefficient. Embodiments of the present invention provide methods for establishing or constructing orthonormal polynomials over pupils or apertures having various shape. These sets of polynomials can be orthonormal over their respective pupil or aperture shape. In some embodiments, the terms “pupil” and “aperture” can be used interchangeably.
Turning now to the drawings,
Laser 12 generally comprises an excimer laser, ideally comprising an argon-fluorine laser producing pulses of laser light having a wavelength of approximately 193 nm. Laser 12 will preferably be designed to provide a feedback stabilized fluence at the patient's eye, delivered via delivery optics 16. The present invention may also be useful with alternative sources of ultraviolet or infrared radiation, particularly those adapted to controllably ablate the corneal tissue without causing significant damage to adjacent and/or underlying tissues of the eye. Such sources include, but are not limited to, solid state lasers and other devices which can generate energy in the ultraviolet wavelength between about 185 and 205 nm and/or those which utilize frequency-multiplying techniques. Hence, although an excimer laser is the illustrative source of an ablating beam, other lasers may be used in the present invention.
Laser system 10 will generally include a computer or programmable processor 22. Processor 22 may comprise (or interface with) a conventional PC system including the standard user interface devices such as a keyboard, a display monitor, and the like. Processor 22 will typically include an input device such as a magnetic or optical disk drive, an internet connection, or the like. Such input devices will often be used to download a computer executable code from a tangible storage media 29 embodying any of the methods of the present invention. Tangible storage media 29 may take the form of a floppy disk, an optical disk, a data tape, a volatile or non-volatile memory, RAM, or the like, and the processor 22 will include the memory boards and other standard components of modern computer systems for storing and executing this code. Tangible storage media 29 may optionally embody wavefront sensor data, wavefront gradients, a wavefront elevation map, a treatment map, a corneal elevation map, and/or an ablation table. While tangible storage media 29 will often be used directly in cooperation with a input device of processor 22, the storage media may also be remotely operatively coupled with processor by means of network connections such as the internet, and by wireless methods such as infrared, Bluetooth, or the like.
Laser 12 and delivery optics 16 will generally direct laser beam 14 to the eye of patient P under the direction of a computer 22. Computer 22 will often selectively adjust laser beam 14 to expose portions of the cornea to the pulses of laser energy so as to effect a predetermined sculpting of the cornea and alter the refractive characteristics of the eye. In many embodiments, both laser beam 14 and the laser delivery optical system 16 will be under computer control of processor 22 to effect the desired laser sculpting process, with the processor effecting (and optionally modifying) the pattern of laser pulses. The pattern of pulses may by summarized in machine readable data of tangible storage media 29 in the form of a treatment table, and the treatment table may be adjusted according to feedback input into processor 22 from an automated image analysis system in response to feedback data provided from an ablation monitoring system feedback system. Optionally, the feedback may be manually entered into the processor by a system operator. Such feedback might be provided by integrating the wavefront measurement system described below with the laser treatment system 10, and processor 22 may continue and/or terminate a sculpting treatment in response to the feedback, and may optionally also modify the planned sculpting based at least in part on the feedback. Measurement systems are further described in U.S. Pat. No. 6,315,413, the full disclosure of which is incorporated herein by reference.
Laser beam 14 may be adjusted to produce the desired sculpting using a variety of alternative mechanisms. The laser beam 14 may be selectively limited using one or more variable apertures. An exemplary variable aperture system having a variable iris and a variable width slit is described in U.S. Pat. No. 5,713,892, the full disclosure of which is incorporated herein by reference. The laser beam may also be tailored by varying the size and offset of the laser spot from an axis of the eye, as described in U.S. Pat. Nos. 5,683,379, 6,203,539, and 6,331,177, the full disclosures of which are incorporated herein by reference.
Still further alternatives are possible, including scanning of the laser beam over the surface of the eye and controlling the number of pulses and/or dwell time at each location, as described, for example, by U.S. Pat. No. 4,665,913, the full disclosure of which is incorporated herein by reference; using masks in the optical path of laser beam 14 which ablate to vary the profile of the beam incident on the cornea, as described in U.S. Pat. No. 5,807,379, the full disclosure of which is incorporated herein by reference; hybrid profile-scanning systems in which a variable size beam (typically controlled by a variable width slit and/or variable diameter iris diaphragm) is scanned across the cornea; or the like. The computer programs and control methodology for these laser pattern tailoring techniques are well described in the patent literature.
Additional components and subsystems may be included with laser system 10, as should be understood by those of skill in the art. For example, spatial and/or temporal integrators may be included to control the distribution of energy within the laser beam, as described in U.S. Pat. No. 5,646,791, the full disclosure of which is incorporated herein by reference. Ablation effluent evacuators/filters, aspirators, and other ancillary components of the laser surgery system are known in the art. Further details of suitable systems for performing a laser ablation procedure can be found in commonly assigned U.S. Pat. Nos. 4,665,913, 4,669,466, 4,732,148, 4,770,172, 4,773,414, 5,207,668, 5,108,388, 5,219,343, 5,646,791 and 5,163,934, the complete disclosures of which are incorporated herein by reference. Suitable systems also include commercially available refractive laser systems such as those manufactured and/or sold by Alcon, Bausch & Lomb, Nidek, WaveLight, LaserSight, Schwind, Zeiss-Meditec, and the like. Basis data can be further characterized for particular lasers or operating conditions, by taking into account localized environmental variables such as temperature, humidity, airflow, and aspiration.
User interface input devices 62 may include a keyboard, pointing devices such as a mouse, trackball, touch pad, or graphics tablet, a scanner, foot pedals, a joystick, a touchscreen incorporated into the display, audio input devices such as voice recognition systems, microphones, and other types of input devices. User input devices 62 will often be used to download a computer executable code from a tangible storage media 29 embodying any of the methods of the present invention. In general, use of the term “input device” is intended to include a variety of conventional and proprietary devices and ways to input information into computer system 22.
User interface output devices 64 may include a display subsystem, a printer, a fax machine, or non-visual displays such as audio output devices. The display subsystem may be a cathode ray tube (CRT), a flat-panel device such as a liquid crystal display (LCD), a projection device, or the like. The display subsystem may also provide a non-visual display such as via audio output devices. In general, use of the term “output device” is intended to include a variety of conventional and proprietary devices and ways to output information from computer system 22 to a user.
Storage subsystem 56 can store the basic programming and data constructs that provide the functionality of the various embodiments of the present invention. For example, a database and modules implementing the functionality of the methods of the present invention, as described herein, may be stored in storage subsystem 56. These software modules are generally executed by processor 52. In a distributed environment, the software modules may be stored on a plurality of computer systems and executed by processors of the plurality of computer systems. Storage subsystem 56 typically comprises memory subsystem 58 and file storage subsystem 60.
Memory subsystem 58 typically includes a number of memories including a main random access memory (RAM) 70 for storage of instructions and data during program execution and a read only memory (ROM) 72 in which fixed instructions are stored. File storage subsystem 60 provides persistent (non-volatile) storage for program and data files, and may include tangible storage media 29 (
Bus subsystem 54 provides a mechanism for letting the various components and subsystems of computer system 22 communicate with each other as intended. The various subsystems and components of computer system 22 need not be at the same physical location but may be distributed at various locations within a distributed network. Although bus subsystem 54 is shown schematically as a single bus, alternate embodiments of the bus subsystem may utilize multiple busses.
Computer system 22 itself can be of varying types including a personal computer, a portable computer, a workstation, a computer terminal, a network computer, a control system in a wavefront measurement system or laser surgical system, a mainframe, or any other data processing system. Due to the ever-changing nature of computers and networks, the description of computer system 22 depicted in
Referring now to
More specifically, one wavefront measurement system 30 includes an image source 32, such as a laser, which projects a source image through optical tissues 34 of eye E so as to form an image 44 upon a surface of retina R. The image from retina R is transmitted by the optical system of the eye (e.g., optical tissues 34) and imaged onto a wavefront sensor 36 by system optics 37. The wavefront sensor 36 communicates signals to a computer system 22′ for measurement of the optical errors in the optical tissues 34 and/or determination of an optical tissue ablation treatment program. Computer 22′ may include the same or similar hardware as the computer system 22 illustrated in
Wavefront sensor 36 generally comprises a lenslet array 38 and an image sensor 40. As the image from retina R is transmitted through optical tissues 34 and imaged onto a surface of image sensor 40 and an image of the eye pupil P is similarly imaged onto a surface of lenslet array 38, the lenslet array separates the transmitted image into an array of beamlets 42, and (in combination with other optical components of the system) images the separated beamlets on the surface of sensor 40. Sensor 40 typically comprises a charged couple device or “CCD,” and senses the characteristics of these individual beamlets, which can be used to determine the characteristics of an associated region of optical tissues 34. In particular, where image 44 comprises a point or small spot of light, a location of the transmitted spot as imaged by a beamlet can directly indicate a local gradient of the associated region of optical tissue.
Eye E generally defines an anterior orientation ANT and a posterior orientation POS. Image source 32 generally projects an image in a posterior orientation through optical tissues 34 onto retina R as indicated in
In some embodiments, image source optics 46 may decrease lower order optical errors by compensating for spherical and/or cylindrical errors of optical tissues 34. Higher order optical errors of the optical tissues may also be compensated through the use of an adaptive optic element, such as a deformable mirror (described below). Use of an image source 32 selected to define a point or small spot at image 44 upon retina R may facilitate the analysis of the data provided by wavefront sensor 36. Distortion of image 44 may be limited by transmitting a source image through a central region 48 of optical tissues 34 which is smaller than a pupil 50, as the central portion of the pupil may be less prone to optical errors than the peripheral portion. Regardless of the particular image source structure, it will be generally be beneficial to have a well-defined and accurately formed image 44 on retina R.
In one embodiment, the wavefront data may be stored in a computer readable medium 29 or a memory of the wavefront sensor system 30 in two separate arrays containing the x and y wavefront gradient values obtained from image spot analysis of the Hartmann-Shack sensor images, plus the x and y pupil center offsets from the nominal center of the Hartmann-Shack lenslet array, as measured by the pupil camera 51 (
While the methods of the present invention will generally be described with reference to sensing of an image 44, a series of wavefront sensor data readings may be taken. For example, a time series of wavefront data readings may help to provide a more accurate overall determination of the ocular tissue aberrations. As the ocular tissues can vary in shape over a brief period of time, a plurality of temporally separated wavefront sensor measurements can avoid relying on a single snapshot of the optical characteristics as the basis for a refractive correcting procedure. Still further alternatives are also available, including taking wavefront sensor data of the eye with the eye in differing configurations, positions, and/or orientations. For example, a patient will often help maintain alignment of the eye with wavefront measurement system 30 by focusing on a fixation target, as described in U.S. Pat. No. 6,004,313, the full disclosure of which is incorporated herein by reference. By varying a position of the fixation target as described in that reference, optical characteristics of the eye may be determined while the eye accommodates or adapts to image a field of view at a varying distance and/or angles.
The location of the optical axis of the eye may be verified by reference to the data provided from a pupil camera 52. In the exemplary embodiment, a pupil camera 52 images pupil 50 so as to determine a position of the pupil for registration of the wavefront sensor data relative to the optical tissues.
An alternative embodiment of a wavefront measurement system is illustrated in
The components of an embodiment of a wavefront measurement system for measuring the eye and ablations may comprise elements of a WaveScan® system, available from VISX, INCORPORATED of Santa Clara, Calif. One embodiment includes a WaveScan system with a deformable mirror as described above. An alternate embodiment of a wavefront measuring system is described in U.S. Pat. No. 6,271,915, the full disclosure of which is incorporated herein by reference. It is appreciated that any wavefront aberrometer could be employed for use with the present invention. Relatedly, embodiments of the present invention encompass the implementation of any of a variety of optical instruments provided by WaveFront Sciences, Inc., including the COAS wavefront aberrometer, the ClearWave contact lens aberrometer, the CrystalWave IOL aberrometer, and the like.
Referring now to
Referring to
A. Complete Set of Polynomials
A complete set of polynomials can be defined as a set of polynomials with which any function or polynomial term can be accurately represented without error. A complete set of polynomials can be the set of Zernike circle polynomials, power series, Fourier series, and Taylor monomials. It can also be any other polynomials such as Jacobi polynomials, Chebyshev polynomials, Legendre polynomials, Laguerre polynomials, Hermite polynomials, and the like. A complete set of polynomials can be a series that can continue indefinitely.
Zernike circle polynomials can be a complete set of polynomials that is orthogonal over circular pupils. Their use in optical design and testing is appropriate and effective when the pupil is circular. However, when the pupil is noncircular, their use may not be appropriate. Zernike circle polynomials can be defined as
Zj(ρ,θ)=Znm(ρ,θ)=Rnm(ρ)Θm(θ), (1)
where j is the single-index and n and m are the double-index. n can be referred to as the radial degree and m as the azimuthal frequency. Typically, n−m is even and n−m≧0. The Zernike radial polynomials are related to Jacobi polynomials and can be expressed analytically as
and the triangular function is
However, we can also use power series
Pj(ρ,θ)=Pnm(ρ,θ)=ρnΘm(θ) (4)
to represent any function, because the set of power series is also a complete set. There may be advantages to using power series. First, it is a set of monomials, so it is simple. Second, any function can be naturally expressed as a function of ρ and θ, when it is expressed as a linear combination of power series. Other immediate complete sets of polynomials include Fourier series and Taylor monomials. Fourier series are often defined as sinusoidal functions that can be easily implemented with fast Fourier transforms. Taylor monomials are similar to power series. They can be expressed as
where u and v are the variables in the frequency domain, and p and q are the radial degree and azimuthal frequency, respectively.
In some embodiments, any continuous function, such as a wavefront, can be expressed in terms of any complete set of polynomials. For example, a wavefront can be expressed in terms of Zernike circle polynomials or power series as
where aj and bj are the expansion coefficients, respectively.
B. Determination of Orthonormal Polynomials
As discussed above, when the pupil is noncircular, Zernike polynomials may not be an appropriate set of polynomials for wavefront analysis. For aberration balancing to obtain minimum variance, it may be helpful to construct a set of orthonormal polynomials over a certain noncircular pupil.
Many noncircular pupils are common in practice. Annular, hexagonal, elliptical, rectangular, and square pupils are common. Irregular pupil shapes can also be common due to vignetting of rays. Examples of irregular pupil shapes are shown in
The inner product of two terms from two polynomials Fj and Gj′ can be defined as
where A is the area of the domain Σ. The orthonormality of F can be expressed as
Fj|Fj′=δjj′, (10)
where δjj′ is the Kronecker symbol.
Recursive and non-recursive approaches can be used to determine F. A recursive approach is the Gram-Schmidt orthogonalization process, and a non-recursive approach is a matrix transformation approach.
1. The Gram-Schmidt Orthogonalization Process
Because any continuous function can be expressed as linear combination of a complete set of polynomials, we may write
where M is a conversion matrix relating the orthonormal polynomials F to P. Calculation of F can be done either recursively by:
F1=G1=P1, (12a)
We now show how to derive a set of orthonormal polynomials over circular pupils from the set of power series, according to some embodiments of the present invention. Because the domain is a unit circle, the integration will be for the unit circle. The area A is π Following the process in Eqs. (12), we have
F1=G1=P1=1. (13a)
c2,1=−P2|F1=−ρ cos θ=0, (13b)
G2=P2=ρ cos θ, (13c)
F2=G2[G2|G2]−1/22G2=2ρ cos θ. (13d)
Continuing this process for higher terms, we obtain the next few terms as
It is quite clear that this set of orthonormal polynomials F can be exactly the same as the orthonormal Zernike circle polynomials. Thus, they can be derived from the power series using the Gram-Schmidt orthogonalization process.
For noncircular pupils, the procedure is the same, according to some embodiments. A first example is an annular pupil, as shown in
As another example, for hexagonal pupils, as shown in
Continuing this process, we obtain
The isometric plots (top), interferograms (left), and point-spread functions (right) for the first 4 orders H1 through H15 are shown in
For yet another example, for elliptical pupils, as shown in
E1=G1=P1=1. (17a)
c2,1=−P2|E1=−ρ cos θ=0, (17b)
G2=P2=ρ cos θ, (17c)
E2=G2[G2|G2]−1/2=2G2=2ρ cos θ. (17d)
Additional elliptical polynomials are given below:
These polynomials reduce to circle polynomials as e→1. The isometric plots (top), interferograms (left), and point-spread functions (right) for the first 4 orders elliptical polynomials E1 through E15 are shown in
For yet another example, for rectangular pupils, as shown in
Additional rectangular polynomials are given below:
The isometric plots (top), interferograms (left), and point-spread functions (right) for the first 4 orders rectangular polynomials R1 through R15 are shown in
For yet another example, for square pupils, as shown in
S1=G1=P1=1. (21a)
c2,1=−P2|S1=−ρ cos θ=0, (21b)
G2=P2=ρ cos θ, (21c)
S2=G2[G2|G2]−1/2=√{square root over (6)}G2=√{square root over (6)}ρ cos θ. (21d)
Additional square polynomials are given below:
The isometric plots, interferograms, and point-spread functions for the first 4 orders square polynomials R1 through R15 are shown in
It should be noted that the Gram-Schmidt orthogonalization process can also be used over irregular pupils, such as one shown in
2. Dai-Mahajan Technique of Orthonormality
As discussed before, orthonormal polynomials can be calculated from a set of non-orthogonal basis functions by means of the classical Gram-Schmidt orthogonalization process. In some cases, the recursive nature of the process can be slow and laborious. Furthermore, it can be numerically unstable and even lose orthogonality, as pointed out by Å. Björck, Numerical Methods for Least Square Problems, SIAM, 1996, the entire contents of which are incorporated herein by reference.
Consider a complete set of polynomials P in an arbitrary coordinate system, e.g., polar coordinates, over an arbitrary domain Σ. The completeness of P allows any other sets of polynomials to be represented as linear combinations of P, including an orthonormal set F that is orthonormal over domain Σ. Both sets of polynomials P and F can be one-dimensional, two-dimensional, or multi-dimensional. Therefore, we have
where Mij is a conversion matrix. If the set of P is known and the conversion matrix M can be determined, the orthonormal polynomials F can be determined. Using the notation of inner products of polynomials, the orthonormality of F can be written as
Fi|Fj=δij. (24)
In some embodiments it possible to calculate CPP using the inner products of the complete set of polynomials P, for example based on a procedure described in Dai et al. “Nonrecursive Determination of Orthonormal Polynomials with Matrix Formulation” Optics Letters 32(1):74-76 (2007), the entire contents of which are incorporated herein by reference for all purposes. This can be done instead of obtaining M directly from the inner product of the complete set of polynomials.
Multiplying Pi′ on both sides of Eq. (23) and integrating over domain Σ, we obtain
Equation (25) can be written in a matrix form as
CPF=CPPMT, (26)
where CPF is a matrix of inner products between the first N polynomials of P and the first N polynomials of F and CPP is a matrix of inner products of the first N polynomials of P themselves. MT is the transpose of the conversion matrix M.
Similarly, multiplying on both sides of Eq. (23), integrating over the domain Σ, and using the orthonormality of F in Eq. (24), we have
Equation (27) can be written in a matrix form as
MCPF=1 (28)
Substituting Eq. (26) into Eq. (28), we get
MCPPMT=1. (29)
If we let
M=(QT)−1, (30)
Eq. (29) can be written as
QTQ=CPP. (31)
Because Pi|Pi=Pj|Pi and real, matrix CPP is Hermitian. Solution of matrix Q in Eq. (31) can be performed with a standard Cholesky Decomposition, as described in W. H. Press, S. A. Teukolsky, W. Vetterling, and Flannery, Numerical recipes in C++, Cambridge University Press, 2002, the full contents of which are incorporated herein by reference. Therefore, the conversion matrix M can be calculated using Eq. (30). As such, the orthonormal polynomials F can be calculated with Eq. (23).
Zernike polynomials and power series can be used as polynomials P because they are complete sets of polynomials. Similarly, Fourier series and Taylor monomials can also be served as polynomials P, since they are also complete. Furthermore, Jacobi polynomials, Chebyshev polynomials, Legendre polynomials, Laguerre polynomials, Hermite polynomials, and the like can also serve as polynomials P. In some embodiments, power series may be preferred because they are simple and they represent naturally in polar coordinates. In the following examples, we will use power series as polynomials P.
As a first example, we derive annular polynomials. As discussed previously, the integration limits are [1,ε; 0,2π] in polar coordinates and the area A is π(1−ε2). So the matrix CPP for the first four polynomials is
Solution of Q for the first four polynomials using Eq. (31) is
Equation (30) can be used to solve the conversion matrix M for the first four polynomials as
From Eq. (32c), the annular polynomials for the first four terms can be written as
It should be noted that when ε=0, Eqs. (33) reduce to the corresponding Zernike circle polynomials.
As a second example, we derive hexagonal polynomials. As discussed previously, to do the integration according to some embodiments, we divide the hexagon into three sub-areas, rectangular ACDF 610B, and two similar triangles ABC 620B, and DEF 630B. The x and y integration limits for 610B are [−1/2,1/2; −√{square root over (3)}/2, √{square root over (3)}/2]. Similarly, those for 620B and 630B are [1/2, 1; −√{square root over (3)}(1−x), −√{square root over (3)}(1−x)] and [−1,−1/2; −√{square root over (3)}(1+x), −−√{square root over (3)}(1+x)], respectively. The area A is 2√{square root over (3)}/9. So the matrix CPP for the first four polynomials is
Solution of Q for the first four polynomials using Eq. (31) is
Equation (30) can be used to solve the conversion matrix M for the first four polynomials as
From Eq. (34c), the hexagonal polynomials for the first four polynomials can be written as
As a third example, we derive elliptical polynomials. As discussed previously, the integration limits are [−√{square root over (e2−y2)}/e, √{square root over (e2−y2)}/e; −e, e] in Cartesian coordinates. The area A is πe. So the matrix CPP for the first four terms is
Solution of Q for the first four terms using Eq. (31) is
Equation (30) can be used to solve the conversion matrix M for the first four terms as
From Eq. (36c), the elliptical polynomials for the first four terms can be written as
As a next example, we derive rectangular polynomials. As discussed previously, the integration limits are [−a, a; −√{square root over (1−a2)}, √{square root over (1−a2)}] in Cartesian coordinates. The area A is 4a√{square root over (1−a2)}. So the matrix CPP for the first four polynomials is
Solution of Q for the first four polynomials using Eq. (31) is
Equation (30) can be used to solve the conversion matrix M for the first four polynomials as
From Eq. (38c), the first four rectangular polynomials can be written as
As a final example, we derive square polynomials. As discussed previously, the integration limits are [−1/√{square root over (2)}, 1/√{square root over (2)}; −1/√{square root over (2)}, 1/√{square root over (2)}] in Cartesian coordinates. The area A is 2. So the matrix CPP for the first four polynomials is
Solution of Q for the first four polynomials using Eq. (31) is
Equation (30) can be used to solve the conversion matrix M for the first four polynomials as
From Eq. (40c), the square polynomials for the first four terms can be written as
They can also be obtained from the rectangular polynomials by letting a→1/√{square root over (2)}.
When the pupil is not regular, as the one shown in
where N is the total number of discrete data points within the irregular pupil. With Eq. (42), matrix CPP can be calculated. Therefore, Q and M matrices can also be calculated, all numerically.
C. Error Analysis
Zernike polynomials are commonly used in optical design and testing. Their use is proper when the pupil is circular. However, when the pupil is noncircular, their use may imply error. For any orthonormal polynomials F, suppose we use them to represent a wavefront W(ρ, θ) as
where the expansion coefficient cj and be calculated from the orthonormality of F as
The average of the wavefront can be calculated as
Comparison of Eqs. (44) and (45), we get,
Equation (46) indicates that the mean value of a wavefront within the domain Σ equals the piston coefficient of the orthonormal polynomials. The variance of the wavefront can be expressed as
Equation (47) indicates that the variance of a wavefront within the domain Σ equals the sum of the square of the coefficients of the orthonormal polynomials, excluding the piston coefficient. Because of this property, the orthonormal polynomials may be preferred than any non-orthogonal polynomials.
If we use Zernike circle polynomials instead of the orthonormal polynomials, we can express the wavefront as
From Eqs. (43) and (48), we obtain the error when we use Eq. (48) as
Substituting Eqs. (43) and (48) into (49) and some laborious arithmetic, we obtain
Equation (50) can be written in a matrix form as
Δ=ATA−2BTCZFA+BTCZZB, (51)
where A stands for a column vector containing the ci coefficients and B denotes a column vector containing the aj coefficients. Matrices CZZ and CZF are similar to the matrix CPP mentioned previously.
To derive the relationship between the F-polynomials coefficients ci and the Zernike coefficients aj, we begin by considering the situation where a wavefront map is known and to be fitted with Zernike circle polynomials within the domain Σ. From a least square point of view, one way to use Zernike polynomials is to make Ŵ=W. Hence from Eqs. (43) and (48), we have
Multiplying Zj′ on both sides of Eq. (52) and integrating over the domain Σ, we get
Equation (53) can be written in a matrix form as
CZZB=CZFA. (54)
When the domain Σ is a circular pupil, F-polynomials becomes Zernike circle polynomials and therefore, B=A. For noncircular pupils, solution of B from Eq. (54) can be done with a matrix inversion as
B=[CZZ]−1CZFA. (55)
Because the matrix CZZ is well behaved, its inverse can be determined analytically. Equation (55) is the correct way of calculating the Zernike coefficients when the pupil is noncircular. It results in no wavefront fitting error.
However, because we may not have the orthonormal polynomials, A may be unknown and Eq. (55) may be unusable in practice. When Zernike circle polynomials are used, perhaps incorrectly, over noncircular pupil, we have
Equation (56) can be written in a matrix form as
B=CZFA. (57)
Substituting Eq. (57) into Eq. (51) yields
Δ=AT(1−P)A, (58)
where
P=(CZF)T(2−CZZ)CZF. (59)
Matrix P is termed the deviation matrix of wavefront fitting. It indicates how much it is different from the unitary matrix 1. When the domain Σ is a circular pupil, F=Z so matrix P is 1, because both matrices CZZ and CZF are 1. In this case, Δ=0 so there is no fitting error by using Zernike circle polynomials. However, when the domain Σ is noncircular, Δ will not be zero.
In wavefront fitting, improper calculation of Zernike coefficients can result in error in the fitted wavefront.
For annular pupils, the fitting error is negligible when ε<0.2, as can be seen in FIGS. 12B(i) and 12B(ii). At ε=0.2, the greatest error comes from A11, which is less than 5%. This explains it is rather safe to use Zernike circle polynomials as basis functions for astronomical applications. However, as ε increases, the fitting error for terms with m=n, such as A2, A3, A5, A6, A9, A10, A11, and A15, the fitting error can be as much as 50 times the input wavefront error.
FIGS. 12C(i) to 12C(iii) show the fitting error as a function of the aspect ratio a for elliptical pupils, which is very different from the fitting error for annular pupils. The least error for elliptical pupils occurs for E6, E10, and E14, where m=n.
For rectangular pupils, the fitting error as a function of the aspect ratio a is shown in FIGS. 12D(i) to 12D(iii). Again, the error in this case is very different from that for annular and elliptical pupils. The least error for many terms occurs when 0.5≦a≦0.9, but the greatest for other terms.
Consider the case where the wavefront is reconstructed from slope-type data using modal reconstruction approach. Taking derivatives with respective to x and toy of both sides of Eq. (43) for each of the k data points (l=1, 2, . . . , k) yields
Equation (60) applied to all k data points can be written as a matrix form as
S=GA, (61)
where S is an array of the x- and y-slope data, A is an array of coefficients and G is a matrix of the derivatives of the F-polynomials at the measurement points and can be written as
Similarly, taking derivatives with respect to x and y of both sides of Eq. (48) for each of the k data points (l=1, 2, . . . , k) yields
Equation (63) can be written as a matrix form as
Ŝ=HB, (64)
where Ŝ is an array of estimated x- and y-slope data, B is an array of coefficients and H is a matrix of Z-derivatives at the measurement points and can be written as
To calculate the matrix H, it is helpful to know the derivatives of the Zernike circle polynomials. The least squares solution of Eq. (65) is
B=(HTH)−1HTS=H+S, (66)
where HT denotes the transpose of matrix H and it is the generalized inverse of matrix H.
Substituting Eq. (66) into Eq. (51), we get
Δ=AT(1−Q)A, (67)
where the matrix Q is
Q=(H+G)T(2CZF−CZZH+G). (68)
Similar to matrix P, matrix Q is termed the deviation matrix of wavefront reconstruction. It indicates how much it is different from the unitary matrix 1. When the domain Σ is a circular area, F=Z so matrix Q=1, because both matrixes CZZ and CZF as well as H+G are 1. In this case Δ=0 so there is no additional reconstruction error by using the Zernike circle polynomials. However, when domain Σ is non-circular, there can be an additional error due to use Zernike circle polynomials for wavefront reconstruction.
To investigate the wavefront reconstruction error, we can assume some specific configurations of data sampling points over which wavefront slopes are given.
For annular pupils, the reconstruction error as a function of ε is shown in FIGS. 14C(i) and 14C(ii) for 101×101 configuration. Again, error for some terms shows monotonical changes as a function of ε, but that for other terms fluctuates severely.
FIGS. 14D(i) to 14D(iii) show the reconstruction error as a function of the aspect ratio a for configurations 101×101, according to some embodiments. For most terms, the reconstruction error increases as a decreases, although the increase of the error for E11 and E12 is not monotonical. It is also clear that E3, E8 and E12 have relatively smaller error and E6, E and E13 have relatively greater error, according to some embodiments.
For rectangular pupils, R4 and R12 show the greatest reconstruction error as compared to other terms, as can be seen in FIGS. 14E(i) to 14E(iii), representing configurations of 101×101.
Each of the calculations or operations described herein may be performed using a computer or other processor having hardware, software, and/or firmware. The various method steps may be performed by modules, and the modules may comprise any of a wide variety of digital and/or analog data processing hardware and/or software arranged to perform the method steps described herein. The modules optionally comprising data processing hardware adapted to perform one or more of these steps by having appropriate machine programming code associated therewith, the modules for two or more steps (or portions of two or more steps) being integrated into a single processor board or separated into different processor boards in any of a wide variety of integrated and/or distributed processing architectures. These methods and systems will often employ a tangible media embodying machine-readable code with instructions for performing the method steps described above. Suitable tangible media may comprise a memory (including a volatile memory and/or a non-volatile memory), a storage media (such as a magnetic recording on a floppy disk, a hard disk, a tape, or the like; on an optical memory such as a CD, a CD-R/W, a CD-ROM, a DVD, or the like; or any other digital or analog storage media), or the like.
A variety of modifications are possible within the scope of the present invention. For example, a complete set of polynomials may be changed to Fourier series or Taylor monomials. Furthermore, other complete sets of polynomials, such as Jacobi polynomials, Chebyshev polynomials, Legendre polynomials, Laguerre polynomials, and Hermite polynomials, can also be used. A variety of domains can be considered in the determination of orthonormal polynomials. These domains can be one-dimensional, two-dimensional, or multi-dimensional. Within two-dimensional domains, a variety of pupil shapes, such as circular, annular, hexagonal, elliptical, rectangular, square, and irregular can be used. While the specific embodiments have been described in some detail, by way of example and for clarity of understanding, a variety of adaptations, changes, and modifications will be obvious to those of skill in the art. Although the invention has been described with specific reference to optical testing and design, other suitable applications in fields such as physics, engineering, mathematics, applied mathematics, and the like can find applications for the determination of orthonormal polynomials from a set of simple, non-orthogonal polynomials. Therefore, the scope of the present invention is limited solely by the specific claims.
This application is a nonprovisional of, and claims the benefit of the filing date of, U.S. Provisional Patent Application No. 60/819,587 filed Jul. 10, 2006, the entire disclosure of which is incorporated herein by reference for all purposes.
Number | Name | Date | Kind |
---|---|---|---|
5121337 | Brown | Jun 1992 | A |
5287273 | Kupfer et al. | Feb 1994 | A |
5446681 | Gethner et al. | Aug 1995 | A |
6556961 | Lafe | Apr 2003 | B1 |
7088457 | Zou et al. | Aug 2006 | B1 |
7311400 | Wakil et al. | Dec 2007 | B2 |
7553022 | Neal et al. | Jun 2009 | B2 |
7796784 | Kondo et al. | Sep 2010 | B2 |
20010016695 | Mihashi et al. | Aug 2001 | A1 |
20020186346 | Stantz et al. | Dec 2002 | A1 |
20030086063 | Williams et al. | May 2003 | A1 |
20030184736 | Fukuhara et al. | Oct 2003 | A1 |
20030194144 | Wenzel et al. | Oct 2003 | A1 |
20030206289 | Matsuyama | Nov 2003 | A1 |
20040130705 | Topa | Jul 2004 | A1 |
20050012898 | Chernyak et al. | Jan 2005 | A1 |
20050057723 | Wakil et al. | Mar 2005 | A1 |
20050152583 | Kondo et al. | Jul 2005 | A1 |
20050288910 | Schlessinger et al. | Dec 2005 | A1 |
20070165242 | Scott et al. | Jul 2007 | A1 |
20080086520 | Epelbaum | Apr 2008 | A1 |
20090036980 | Norrby et al. | Feb 2009 | A1 |
20090284753 | Neal et al. | Nov 2009 | A1 |
20100114076 | Reinstein et al. | May 2010 | A1 |
20100192709 | Wilcox et al. | Aug 2010 | A1 |
20100299113 | Robinson et al. | Nov 2010 | A1 |
20110027723 | Suzuki et al. | Feb 2011 | A1 |
20110149241 | Dai | Jun 2011 | A1 |
20110178785 | Tinnemans et al. | Jul 2011 | A1 |
20120206719 | Tanaka | Aug 2012 | A1 |
Entry |
---|
“Orthogonal Polynomial Vectors and Least Sqaure Approximation for Discrete Inner Product” by M. Van Barel et al published in 1995, pp. 1-23, Electronic Transactions on Numerical Analysis vol. 3 Mar. 1995. |
Zernike Annular Polynomials and Optical Aberrations of Systems with Annular Pupils; by Virendra N. Mahajan; Supplement to Optics & Photonics News vol. 5 No. 11 Nov. 1994; pp. 8126-8127. |
Orthonormal polynomials in wavefront analysis: analytical solution; Virendra N. Mahajan1,* and Guang-ming Dai2; J. Opt. Soc. Am. A/vol. 24, No. 9/Sep. 2007; pp. 2994-3016. |
Dai, Guang-ming et al., “Nonrecursive determination of orthonormal polynomials with matrix formulation”, OPTICS LETTERS, vol. 32, No. 1, Jan. 1, 2007; copyright 2006 Optical Society of America, pp. 74-76. |
Mahajan, Virendra N. et al., “Orthonormal polynomials for hexagonal pupils”, OPTICS LETTERS, vol. 31, No. 16, Aug. 15, 2006; copyright 2006 Optical Society of America, pp. 2462-2464. |
Mahajan, Virendra N. et al., “Orthonormal polynomials in wavefront analysis: analytical solution”, J. Opt. Soc. Am. A, vol. 24, No. 9, Sep. 2007; copyright 2007 Optical Society of America, pp. 2994-3016. |
Mahajan, V., ‘Zernike annular polynomials for imaging systems with annular pupils,’ The Journal of the Optical Society of America, 1984, vol. 1, No. 6, p. 685. |
Mahajan, V., ‘Zernike annular polynomials for imaging systems with annular pupils,’ The Journal of The Optical Society of America, 1981, vol. 71, No. 1, pp. 75-85. |
Dunkl, C., ‘Orthogonal Polynomials on the Hexagon,’ Society for Industrial and Applied Mathematics, 1987, vol. 47, No. 2, pp. 343-351. |
Number | Date | Country | |
---|---|---|---|
20080077644 A1 | Mar 2008 | US |
Number | Date | Country | |
---|---|---|---|
60819587 | Jul 2006 | US |