Material on one (1) compact disc accompanying this patent and labeled COPY 1, an exact copy of which is also included and is labeled as COPY 2, is incorporated herein by reference in it's entirety for all purposes. The compact disc has a software appendix thereon in the form of the following three (3) files:
A portion of the disclosure of this patent document contains material to which a claim of copyright protection is made. The copyright owner(s) has/have no objection to the facsimile reproduction by any one of the patent document or the patent disclosure as it appears in the Patent and Trademark Office patent file or records, but reserves all other rights in the copyrighted word.
It has long been known that elastic, electromagnetic or acoustic waves in homogeneous and layered environments in the frequency range of a fraction of a cycle per second up to hundreds of millions of cycles per second and higher can be propagated through many solids and liquids. Elastic waves are waves that propagate through solids, and have components of particle motion both parallel (longitudinal, or pressure, wave) and perpendicular (shear wave) to the direction of propagation of the wave energy itself. In contrast to this, acoustic waves are those waves that generate particle motion that is exclusively parallel to the propagation of wave energy. Electromagnetic waves have components of variation of field strength solely in the direction perpendicular to the direction of propagation. All of these types of waves may be used to image the acoustic longitudinal wavespeed and absorption, the electromagnetic wavespeed and absorption, the shear wavespeed, and the density of the material through which the wave energy has travelled.
It is also known that scattering is produced not only by spatial fluctuations in acoustic impedance, which is the product of mass density times wavespeed, but also by independent fluctuations in electromagnetic permeability, permittivity and conductivity, elastic compressibility, shear modulus, density, and absorption. These lead to variations in phase speed (which is the speed of propagation of fronts of constant phase) and in impedance (for the electromagnetic case, the ratio of the electric to the magnetic field strength). The net property of an object which describes the phenomenon of scattering in a given modality, is called the “scattering potential”.
Other imaging methods have been applied to one or the other modality, or restricted to acoustic or elastic media, the method described in this patent is applicable to any type of wave motion, whether electromagnetic, elastic (including shear wave effects) or acoustic (scalar approximation valid in liquid and gases).
Furthermore, the ambient media may have some forms of structure (layering) or microstructure (porosity) relevant to the medical, geophysical, or nondestructive imaging applications envisioned for this technology. In the prior art the presence of this layering or porosity has greatly diminished the effectiveness of the imaging program. The method of this patent minimizes the obscuring effect of such structures in the ambient media. In addition, we have made several changes to the previous U.S. Pat. No. 4,662,222 that significantly extend the applicability and speed of our algorithm. These changes are described, in part, below:
As discussed in U.S. Pat. No. 4,662,222, by one of the present authors, these principles can be used to image scattering bodies within a given medium. Some definitions will clarify the procedures and methods used:
The direct or forward scattering problem is concerned with a determination of the scattered energy or fields when the elastic or electromagnetic properties of the scattering potential are known.
The inverse scattering problem, by contrast, consists in the use of scattered electromagnetic, elastic, or acoustic waves to determine the internal material properties of objects embedded in a known (ambient) medium. An incident wave field is imposed upon the ambient medium and the scatterer. The scattered field is measured at detectors placed a finite distance from the scattering objects. The material parameters of the scatterer are then reconstructed from the information contained in the incident and scattered fields. In other words, as defined herein, acoustic or electromagnetic imaging using inverse scattering techniques is intended to mean electronic or optical reconstruction and display of the size shape, and unique distribution of material elastic or electromagnetic and viscous properties of an object scanned with acoustic, electromagnetic or acoustic energy, i.e., reconstruction of that scattering potential which, for a given incident field and for a given wave equation, would replicate a given measurement of the scattered field for any source location.
The description of prior art summarized in U.S. Pat. No. 4,662,222 (referred to as “the previous Patent”) is also relevant here. It is sufficient to say here, that the use of the conjugate gradient algorithms discussed in the previous patent brought the attainment of inverse scattering into a practical reality. Previous to that Patent (U.S. Pat. No. 4,662,222), only linear approximations to the full inverse scattering problem were amenable to solution, even with the aid of modern high speed digital computers. The resolution achievable with the inverse scattering methods described in the previous patent was far superior to any ultrasound imaging in medicine previous to that time.
This invention is designed to provide improved imaging of bodies using wave field energy such as ultrasound energy, electromagnetic energy, elastic wave energy or Biot wave energy. In particular it is designed to provide improved imaging for medical diagnostic applications, for geophysical imaging applications, for environmental imaging applications, for seismic applications, for sonar applications, for radar applications and for similar applications where wave field energy is used to probe and image the interior of objects.
In particular, this invention has important applications to the field of medical diagnosis with specific value to improved breast cancer detection and screening. The present method of choice is mammography. This choice is not supported by the outstanding performance of mammography, but rather because it is the most diagnostically effective for its present cost per exam. Another reason for the wide spread use of mammography is the inertia of changing to other modalities. It is known that hand-probe-based, clinical reflection ultrasound can match the performance of mammography in many cases, but only in the hands of specially trained ultrasound radiologists. Today and in the foreseeable future, there are more mammography machines and radiologists trained to use them than there are trained ultrasound radiologists that have access to high quality ultrasound scanners. Sophisticated breast cancer diagnostic centers use both mammography and ultrasound to compensate for weakness in either approach when used alone. When used alone, ultrasound and mammography require a biopsy to remove a sample of breast tissue for pathology lab analysis to determine whether a sampled lesion is cancerous or benign.
Even when mammography and ultrasound are used together, the specificity for discriminating between cancer and fibrocystic condition or between cancerous tumor and a fibroadenoma is not high enough to eliminate the need for biopsy in 20 to 30 percent of lesions. Given that early diagnosis of breast cancer can ensure survival and that one woman in eight will have breast cancer in her life, it is important to the general population for cancer to be detected as early as possible. Detection of cancer in an early stage for biopsy is thus very important. However, biopsy of benign lesions is traumatic to the patient and expensive. A mammogram costs about $90 but a biopsy is about ten times more expensive. Thus, it is important that a breast cancer diagnostic system have high specificity and sensitivity to eliminate unnecessary biopsies. Increasing the rate of diagnostic true positives to near 100 percent will identify all lesions as a cancer that are cancer without the need to biopsy. However, it is also necessary to increase the rate of true negatives to near 100 percent to eliminate biopsy of benign lesions. Neither mammography or ultrasound or their joint use has provided the combination of sensitivity and specificity to eliminate biopsy or to detect all cancers early enough to insure long term survival after breast cancer surgery for all women that have had breast exams.
There does not seem to be any obvious improvements in present mammography or hand-probe-based, clinical reflection ultrasound that can significantly improve these statistics. However, there is reason to believe that inverse scattering ultrasound tomography or electric impedance tomography can provide improved diagnostic sensitivity or specificity when used separately or jointly with themselves and with ultrasound and/or mammography. Inverse scattering ultrasound imaging provides several advantages over present clinical reflection ultrasound that uses hand held ultrasound probes. A hand held probe is a transducer assembly that scans an area of the patient's body below the probe. Probes consisting of mechanical scanning transducers and probes comprising electronically scanned arrays of transducer elements are both well developed technologies.
Inverse scattering has the following advantages over said clinical reflection ultrasound. Inverse scattering images have the following features: (1) two separate images can be made, one of speed of sound and one of acoustic absorption; (2) these images are a quantitative representation of actual acoustic bulk tissue properties; (3) the images are machine independent; (4) the images are operator independent; (5) the images are nearly free of speckle and other artifacts; (6) all orders of scattering tend to be used to enhance the images and do not contribute to artifacts or degrade the images; (7) the images of speed of sound and absorption create a separation of tissues into classes (fat, benign fluid filled cyst, cancer, and fibroadenoma), although the cancer and fibroadenoma values tend to overlap; (8) the speed of sound and acoustic absorption images have excellent spatial resolution of 0.3 to 0.65 mm at or below 5 MHz; and (9) the speed of sound images can be used to correct reflection tomography for phase aberration artifacts and improve ultrasound reflectivity spatial resolution. Inverse scattering easily discriminates fat from other tissues, while mammography and present clinical ultrasound cannot.
Because of the similar values of speed of sound and acoustic absorption between cancer and fibrocystic condition (including fibroadenoma), it is not known whether inverse scattering will provide the required high level of specificity to eliminate biopsy. Perhaps this performance could be achieved if inverse scattering were combined with reflection ultrasound (including reflection tomography) or with mammography or with both.
A traditional problem with inverse scattering imaging is the long computing times required to make an image. Diffraction Tomography is an approximation to inverse scattering that uses a first order perturbation expansion of some wave equation. Diffraction Tomography is extremely rapid in image computation, but suffers the fatal flaw for medical imaging of producing images that are blurred and not of acceptable quality for diagnostic purposes and it is not quantitatively accurate. In our last patent application, we addressed the computational speed problem with inverse scattering and showed how to increase the calculation speed by two orders of magnitude over finite difference methods and integral equation methods. This speed up in performance was achieved by use of a parabolic marching method. This improvement in speed was sufficient to allow single 2D slices to be imaged at frequencies of 2 MHz using data collected in the laboratory. However, making a full 3-D image with a 3D algorithm or even from several, stacked 2-D image slices made with a 2D algorithm would have required computing speed not available then and only becoming available now.
Another imaging modality that has been investigated for detecting breast cancer is EIT (electrical impedance tomography). This modality has been investigated for many years and many of its features are well known. One of its great advantages for breast cancer detection is its high selectivity between cancer and fibrocystic conditions including fibroadenoma. Cancer has high electrical conductivity (low electrical impedance) while fibrocystic conditions and fibroadenoma have low electrical conductivity (high electrical impedance). However, EIT has poor spatial resolution. Also EIT requires the use of inversion algorithms similar (in a general sense) to those of inverse scattering. In addition EIT algorithms have mostly been used to make 2-D images. Work on making 3-D EIT images is less developed because of the increased computer run time of 3-D algorithms.
Other problems with mammography may be listed, such as the pain associated with compressing the breast between two plates to reduce the thickness of the breast in order to improve image contrast and cancer detection potential. Another problem is the accumulated ionizing radiation dose over years of mammography. It is true that a single mammogram has a very low x-ray dose, especially with modern equipment. However, if mammograms are taken every year from age 40 or earlier, by age 60 to 70 the accumulated dose begins to cause breast cancer. The effect is slight, and the benefits of diagnosis outweigh this risk. Nevertheless, it would be an advantage to eliminate this effect. Mammography is not useful in many younger women (because of the tendency for greater breast density) or older women with dense breasts (such as lactating women). About 15 percent of women have breasts so dense that mammography has no value and another 15 percent of women have breasts dense enough that the value of mammography is questionable. Thus 30 percent of all mammograms are of questionable or zero value because of image artifacts from higher than normal breast density.
The present invention provides for a wave field imaging method including transmitting incident wave fields into an object., the incident wave fields propagating into the object and, at least, partially scattering. A transmission wave field transmitted, at least in part, through the object can be measured. The transmission wave field can be defined, in part, by the incident wave field and the object. A reflected wave field reflected from the object can also be measured. The reflected wave field can be defined, in part, by the incident wave field and the object. The reflected wave field can be processed to form a reflection data set. The transmission wave field can also be processed to form a transmission data set.
The method above further describes obtaining speed-of-sound and attenuation values from the transmission data set. The speed-of-sound values can correspond to a speed of ultrasound wave fields through tissue, and the attenuation values can correspond to an amount of attenuation of ultrasound wave fields propagating through the tissue.
The method of above further describes constructing a speed-of-sound image from the transmission data set. An attenuation image can be constructed from the transmission data set, and the speed-of-sound image and the attenuation image can be displayed.
The method above further describes obtaining speed-of-sound values from the transmission data set. The speed-of-sound values can correspond to a speed of ultrasound wave fields through tissue and the reflection data set can be modified based on the speed of sound values. The reflection data set can also be modified based on the speed of sound values to correct the reflection data set for refraction.
The method above further describes processing includes constructing a B-scan image from the reflection data set. Moreover, the object can be a breast and the processing can generate the reflected and transmission data sets of a breast during a breast scan.
The method above further describes the transmission data set can represent quantitative estimates of a speed-of-sound associated with the object. The transmission data set can also represent quantitative estimates of an attenuation associated with the object.
The method above further describes forming and displaying at least one of a transmission image and reflected image from the transmission and reflection data sets. Moreover, the transmission and reflected images can be formed and co-displayed from the transmission and reflection data sets.
The method above further describes the incident wave field can include at least one of acoustic energy, elastic energy, electromagnetic energy and microwave energy.
The method above further describes the transmitting and measuring can be performed by a transmitter and receiver, respectively, and at least one of the transmitter and receiver can be moved with respect to the object. Additionally, the transmitting and measuring can be repeated at multiple slices within the object to generate a 3D volume of data.
The method above further describes displaying 2-D images representative of slices through the object based on at least one of the transmission and reflection data sets. Additionally, a 3-D image of at least a portion of the object can be displayed based on at least one of the transmission and reflection data sets.
The present invention also describes an ultrasound scanner device including first and second arrays of transducer elements, and at least one of the first and second arrays can transmit incident wave fields into an object. The incident wave fields can propagate into the object, and at least partially scattering. Additionally, at least one of the first and second arrays can measure a transmission wave field transmitted, at least in part, through the object. The transmission wave field can be defined, in part, by the incident wave field and the object. At least one of the first and second arrays can also a reflected wave field reflected from the object with the reflected wave field being defined, in part, by the incident wave field and the object. A processing module can process the reflected wave field to form a reflection data set and can also process the transmission wave field to form a transmission data set.
The device above also describes that the processing module can obtain speed-of-sound and attenuation values from the transmission data set. The speed-of-sound values can correspond to a speed of ultrasound wave fields through tissue, and the attenuation values can correspond to an amount of attenuation of ultrasound wave fields propagating through the tissue. Additionally the processing module can construct a speed-of-sound image from the transmission data set, and can also construct an attenuation image from the transmission data set and displays the speed-of-sound image and the attenuation image. The processing module can also obtain speed-of-sound values from the transmission data set. The speed-of-sound values can correspond to a speed of ultrasound wave fields through tissue, and the processing module can modify the reflection data set based on the speed of sound values. The processing module can also obtain speed-of-sound values from the transmission data set, and modify the reflection data set based on the speed of sound values to correct the reflection data set for refraction. The processing module can also obtain speed-of-sound values from the transmission data set, and can modify the reflection data set based on the speed of sound values to improve ultrasound reflectivity spatial resolution of the reflection data set. The processing module can also generate, based on the transmission and reflection data sets, a reflection tomographic image having spatial resolution of at least 0.65 mm. The processing module can also generate, based on the transmission and reflection data sets, a reflection tomographic image having spatial resolution of between 0.3 nm and 0.65 mm.
The method above further describes that the incident wave field can be transmitted into the object at a fundamental frequency of approximately 5 MHz.
The method above further describes opposed first and second compression plates with the first and second arrays being held by the first and second compression plates, respectively, and the first and second plates being moveable toward one another to compress there between an object to be scanned. Additionally, the first and second arrays can be held by the first and second compression plates, respectively, facing one another and aligned parallel to one another.
The method above further describes that the scanner constitutes a breast scanner including opposed first and second compression plates with the first and second arrays being held by the first and second compression plates, respectively, facing one another and aligned parallel to one another. The first and second compression plates can be configured to compress a breast during a mammogram.
The method above further describes that the first and second arrays each include 1-D arrays. Additionally, the first and second arrays can each include 2-D arrays.
The method above further describes that the transmission data set can represent quantitative estimates of a speed-of-sound associated with the object. Additionally, the transmission data set can represent quantitative estimates of an attenuation associated with the object.
The method above further describes a display that can display a B-scan image generated based on the reflection and transmission data sets.33. Additionally, a display can display at least one of a transmission image and reflected image from the transmission and reflection data sets. A display can also co-display transmission and reflected images from the transmission and reflection data sets. A display can also display a 3-D image of at least a portion of the object based on at least one of the transmission and reflection data sets.
In order that the manner in which the above-recited and other advantages and objects of the invention are obtained, a more particular description of the invention briefly described above will be rendered by reference to specific embodiments thereof which are illustrated in the appended drawings. Understanding that these drawings depict only typical embodiments of the invention and are not therefore to be considered limiting of its scope, the invention will be described and explained with additional specificity and detail through the use of the accompanying drawings in which:
Definition 1. An inverse scattering imaging method is any imaging method which totally or partially uses wave equation modeling and which utilizes a nonlinear operator relating the inversion data to the image components.
Definition 2. The inversion data is defined by that transformation of the wavefield data which is the range of the nonlinear operator. The domain of the nonlinear operator is the image components.
Definition 3. A parabolic propagation step is an arithmetic operation which computes the wavefield on a set of surfaces given the wavefield on another, disjoint set of surfaces using a parabolic differential (or difference?) equation in the derivation of the arithmetic operation.
Definition 4. By λ we mean the wavelength in the imbedding medium for the maximum temporal frequency in the wavefield energy.
Definition 5. By Δ we mean the average spatial separation of the points used to discretize the discretized wavefield in the computer implementation of the method.
Definition 6. By a short convolution operation we mean a discrete convolution which is faster if done by direct summation rather than by FFT.
The following explanation of notation is given to facilitate obtaining an understanding of the invention. Unless otherwise noted, the notation will essentially be the same as U.S. Pat. No. 4,662,222. The scattering potential γ changes from point to point within an object or body as well as changing with frequency of the incident field. Thus,
γωj≡γω(xj),xj∈R3 or R3
is used to signify the scattering potential at pixel j or point j, for the incident field at frequency ω.γω can be considered to be the vector composed of all values of j, i.e. for all pixels:
γω≡(γω1γω2 . . . γωN)T,
where T indicates “transpose”. For our purposes of exposition and simplicity, we will hence forth consider the case where γ is independent of frequency, although this is not necessary. This convention also is essentially the same as that employed in the U.S. Pat. No. 4,662,222.
Notation
Vector Field Notation
The following notation will be used throughout the patent to denote a vector field describing elastic waves, or electromagnetic waves:
(T-denotes “transpose”) is used to represent the total field.
(T-denotes “transpose”) is used to denote the incident field. The incident field is the field that would be present if there was no object present to image. In the case of layering in the ambient medium, it is the field that results from the unblemished, piecewise constant layering.
The scattered field is the difference between the total field and the incident field, it represents that part of the total field that is due to the presence of the inhomogeneity, i.e. the “mine” for the undersea ordnance locater, the hazardous waste cannister for the hazardous waste locator, the school of fish for the echo-fish locator/counter, and the malignant tumor for the breast scanner:
T-denotes “transpose” fωφinc(r) denotes the scalar incident field coming from direction (source position) Φ at frequency ω. The r could represent either a 3 dimensional, or a 2 dimensional vector of position.
Scalar Field Notation
A scalar field is indicated by a nonbold f(r), rεR3
Acoustic Scattering—Scalar Model Equations
This first example is designed to give the basic structure of the algorithm and to point out why it is so fast in this particular implementation compared to present state of the art. The background medium is assumed to be homogeneous medium (no layering). This example will highlight the exploitation of convolutional form via the FFT, the use of the Frechet derivative in the Gauss-Newton and FR-RP algorithms, the use of the biconjugate gradient algorithm (BCG) for the forward problems, the independence of the different view, and frequency, forward problems. It will also set up some examples which will elucidate the patent terminology. The field can be represented by a scalar quantity f. The object is assumed to have finite extent The speed of sound within the object is c=c(x). The speed of sound in the background medium is the constant co. The mass density is assumed to be constant. The attenuation is modelled as the imaginary part of the wavespeed. These are simplifying assumptions, which we make so as to focus on the important aspects of the imaging algorithm. By no means is the imaging algorithm restricted to the scalar case, either in theory, or, more importantly, in practical implementation.
We consider the two dimensional case first of all (equivalently the object and incident fields are assumed to be constant in one direction). Given an acoustic scatterer with constant mass density and constant cross-section in z, illuminated by a time harmonic incident field, fωφinc, with eiωt dependence and source position (incident angle), φ, the total field satisfies the following integral equation.
where
{overscore (ρ)}=(x,y)
and where
f≡fωφ
is the field internal to the object at frequency co and resulting from the incident field from source position φ,:fωφinc. The 2-D Helmholtz Green's function is given by:
where H0(2) is the Hankel function of the second kind, and zeroth order.
Now it is required to compare the field measured at the detectors, with the field as predicted by a given guess, γ(n) for γ. To accomplish this, first define the scattered field at the detectors as the total field minus the incident field.
fωφsc({overscore (ρ)})≡fωφ({overscore (ρ)})−fωφinc({overscore (ρ)})
This represents the field due entirely to the scattering potential. Using this relation to rewrite (1), gives
fωφsc({overscore (ρ)})=∫∫γ({overscore (ρ)}′)fωφ({overscore (ρ)}′)gω(l{overscore (ρ)}−pl)dx′dy′ (2)
These two equations, (1) and (2) are the basis of the imaging algorithm. They must be discretized, then exported to the computer in order to solve practical imaging problems. The purpose of the apparatus and method herein described, is to solve the discretized form of these equations without making any linearizing assumptions (such as used in conventional diffraction tomography), and in real time, with presently available computers.
Discretization of the Acoustic Free-Space Lippmann Schwinger Integral Equation and Green's function—2D free space case
Let us for a moment drop the frequency and source position subscripts. The scalar field “γf” is given by
Discretization of the integral equation is achieved by first decomposing this function into a linear combination of certain (displaced) basis functions
where it has been assumed that the scatterer γ lies within the support [0, Nxδ]x[0, Nyδ]−a rectangular subregion.
The basis functions S can be arbitrary except that we should have cardinality at the grid nodes:
whereas for all other choices of n′,m′,
An example of such a function is the 2-D “hat” function. Our algorithm uses the “sinc” function has its basic building block—the Whittaker sinc function which is defined as:
The two dimensional basis functions are defined by the tensor product:
If the equality in equation (3) is presumed to hold at the grid points x=nδ, y=mδ: the coefficients αnm can be determined to be precisely the value of the field γf:
Now this expression for the product if can be substituted into (1):
In particular this equation holds at
for which we get:
where the 2-D discrete Green's function is defined as:
Although it is not clear that the dependence of g on m and n is as so stated, in is indeed the case, since we have by the substitution of the transformation
x′→x′+nδ
y′→y′+m′δ
into this last equation shows explicitly that the discretized Green's function “g”, does in fact depend only on the differences n-n′, and m-m″. Thus the discrete equation (7) has inherited the convolutional form of the continuous equation (1). It is this fact which allows the use of the 2-D FFT to compute the summation in (8) in only order NXNYlog2(NxNy) arithmetic operations and is therefore critical to the real-time speed of the algorithm.
Expression of the Discretized Lippmann-Schwinger Equation in Compact Notation
The discretized Lippmann-Schwinger equation can be rewritten using a compact operator notation. We write (7) as:
f107 φi=(I−Gω[γ])fωφ (9)
where Gω represents 2-D discrete convolution with the discrete, 2-D Green's function for frequency ω and [γ] denotes the operator consisting of pointwise multiplication by the 2-D array γ. I denotes the identity operator.
In exactly the same manner, and with the same definitions, we can write the scattered field equation (2) as:
fωφsc=Gω[γ]fωφ (10)
for the scattered field inside of the object, and finally
fωφmeas≡T(fωφsc)=T(Gω[γ]fωφ) (11)
for the measured value of the scattered field at the detectors. The T operator is used to indicate the “truncation” of the calculated scattered field, calculated on the entire convolution range (a rectangle of samples [1 ,Nx]x[1,Ny] covering γ), onto the detectors which lie outside the support of γ. (see FIGS. 1A/B/C.). In other words T is simply the operation of “picking” out of the rectangular range of the convolution, those points which coincide with receiver locations.
Extension of the Method to Remote Receivers with Arbitrary Characteristics
In the event that the receivers do not lie within the range of the convolution and/or they are not simple point receivers, we need to modify the measurement equations (11). It is well known that, given source distributions within an enclosed boundary, the scattered field everywhere external to the boundary can be computed from the values of the field on the boundary by an application of Green's theorem with a suitably chosen Green's function, i.e.:
{overscore (ρ)} outside boundary B (12) where P is the Green's function the integral is about the boundary, and dl′ is the differential arclength (in the 3-D case, the integral is on an enclosing surface). Equation (12) allows for the construction of a matrix operator which maps the boundary values of the rectangular support of the convolution (2Nx+2Ny−4 values in the discrete case) to values of the scattered field external to the rectangle. Furthermore, this “propagator matrix” can be generalized to incorporate more complex receiver geometries. For example, suppose that the receiver can be modeled as an integration of the scattered field over some support function, i.e.;
vn≡signal from receiver n=∫∫fs({overscore (ρ)})Sn({overscore (ρ)}ds) (13)
where Sn is the support of receiver n. Then from (12):
{overscore (ρ)} outside boundary
Discretizing the integral gives the matrix equation:
where
Nb=2Nx+2Ny−4
is the number of boundary (border) pixels and Nd is the number of receivers. Equation (15) defines the matrix that we shall henceforth refer to as P or the “propagator matrix” for a given distribution of external receivers. The equation:
vωφmeas=Pω(Gω[γ]fωφ) (16)
includes (11) as a special case for which the receivers are point receivers inside the convolution support. Note that P is a function of frequency, but is not a function of source position. vωφmeas is a vector of dimension Nd.
The added flexibility of this propagator matrix formulation is particularly advantageous when interfacing our algorithms with real laboratory or field data. Often times the precise radiation patterns of the transducers used will not be known a priori. In this event, the transducers must be characterized by measurements. The results of these measurements can be easily incorporated into the construction of the propagator matrix P allowing the empirically determined transducer model to be accurately incorporated into the inversion. Equations (9) and (16) then provide in compact notation the equations which we wish to solve for the unknown scattering potential, γ. First we consider the forward problem, i.e. the determination of the field fωφ for a known object function γ and known incident field, fωφi. Then we establish how this forward problem is then incorporated into the solution of the inverse problem, i.e., the determination of γ when the incident fields, and the received signals from a set of receivers are known. Note that the internal field to the object is also—along with the object function γ, an unknown in the inverse problem.
Since (9) is linear in fωφ it can be solved by direct inversion of the linear system:
f=(1−G[γ])−1fi (17)
In discrete form, this is just a matrix equation (after linearly ordering the 2-D supports of the domain and range into vectors). Since the dimension of the range and domain of the linear system is NxNy, the dimension of the linear system is the same giving a total of (NxNy)2 elements. The arithmetic required to solve the system by standard direct means is thus order (NxNy)3. In other words, a doubling of the edge dimension of the problem space will increase the CPU time required by a factor 26=64 times! The arithmetic work required will quickly become intolerable as the size of the problem increases. It is precisely this large growth rate which has convinced many researchers not to pursue inverse scattering approaches based on integral equations. One could, of course, go to an iterative method. A single iteration of an iterative linear system solver, such as bi-conjugate gradients requires order (NxNy)2 operations (essentially a matrix-vector product). It can be shown that the growth rate in the number of required iterations for sufficient convergence for the BCG algorithm is order N for this equation. Thus, the overall computational complexity is order N5—only one order of N has been saved over direct inversion. Since inverse problems in 2-D generally require order N views, the iteration must be done N times and we are back to order N6 computation for BCG.
The key to overcoming this objection is the convolutional form of (7). If this is exploited by use of the FFT algorithm, the computation needed to perform a matrix-vector product is only order NxNylog2 (NxNy). This allows the BCG algorithm to be applied to a single view with order N3log2(N) operations and to all views with order N4log2(N) operations. Due to the slow growth rate of log2(N), this is essentially a reduction of two orders of N over nonconvolutional methods. Also, this convolutional approach avoids the necessity of storing the order N4 computer words needed to contain the linear system in full matrix form. Only order N2 words are needed due to the shift invariance of the discrete Green's kernel in (7). It is these two savings, more than any other advance that we have made, that allows us to perform inverse scattering in reasonable time for meaningfully sized problems. The use of BCG over the original CG algorithm also represents a major advance since it converges in essentially the square root of the number of iterations needed by CG. This combination of FFT and CG algorithm was originally developed in [Borup, 1989, Ph.D. dissertation, Univ. of Utah, Salt Lake City], herein incorporated by reference.
The Imaging or Inverse Problem
In order to solve the imaging problem, we need a set of equations relating the unknown scattering potential, γ, and total fields, fωφ, with measurements of the scattered field on a set of external detectors. These detector equations are given in (16). Equation (16), and the internal field equations (9) are the equations which are solved simultaneously to determine γ and the fωφ. There are NxNy unknowns corresponding to the γ values at each of the grid points, and Nx×Ny×Ω×Φ, unknowns corresponding to the unknown fields, Ω is the number of frequencies, and Φ is the number of source positions (angles). We have improved upon the state of the art by considering the internal field equations to define fωφ. for a given γ. Thus the total number of unknowns is reduced to NxNy.
The total number of measurement equations is Nd×Ω×Φ where Nd is the number of detectors. In the general case, where, the sources and detectors do not completely surround the object, the problem of determining γ is “ill-posed”, in a precise mathematical sense, therefore, in order to guarantee a solution, the number of equations Nd×Ω×Φ>NxNy, is chosen to be larger than the number of pixel values for over determination. Then the system is solved in the least squares sense. More specifically the solution of (9,16) for γ and the set of fields, fωΦ, in the least squares sense is obtained by minimizing the real valued, nonlinear functional:
subject to the satisfaction of the total field equations, (9), as constraints. The vector rωφ of dimension Nd is referred to as the “residual” for frequency, ω., and angle, φ.
The methods used to solve the nonlinear optimization problem in our inverse scattering algorithms are, thus far, all “gradient methods” in that second derivative information is not used (as it is in the full Newton and quasi-Newton methods). The principal computation involves the calculation of the gradient vector of (18). A straight forward calculation gives the gradient formula: ∇f(x)=JH(x)r(x) where the superscript H denotes the Hermitian transpose (complex conjugate transpose of the matrix) and J is the Jacobian of the nonlinear system:
The simplest gradient algorithm is the Gauss-Newton (GN) iteration. The GN iteration for finding a solution to ∇f=0 is given by:
xn=y−a(x)(n)) (20.1)
δx(n)=(JnHJn)−1JnHr(n) (20.2)
x(n+1)=x(n)+δx(n) (20.3)
where a is the vector of nonlinear equations. This iteration is well defined assuming that the columns of Jn remain linearly independent. Since (20.2) is equivalent to the quadratic minimization problem:
it can be solved by the minimum residual conjugate gradient method (MRCG). This approach also ensures that small singular values in Jn will not amplify noise if care is taken not to overconverge the iteration.
In the previous patent U.S. Pat. No. 4,662,222 the fields fωφ were considered to be independent variables. In this present patent these fields are considered to be dependent variables, with dependence upon γ given implicitly by
fωφ(n)=(I−Gω[γ(n)])−1fωφinc
In order to find the Jacobian expression we must then differentiate the residual vector defined in (18) with respect to γ. The details of this calculation are given in [Borup, 1992.]. The result is:
The final Gauss-Newton algorithm for minimizing (18) subject to equality constraints (9) is:
GN 1. Select an initial guess, γ(0)
GN 2 Set n=0
GN 3 Solve the forward problems using (biconjugate gradients) BCG:
fωφ(n)=(I−Gω[γ(n)])−1fωφ′ φ=1, . . . ,Φinc ω=1, . . . ,Ω
GN 4 Compute the detector residuals: rωφ(n)=vωφmeas−PωGω[fωφ(n)]γ(n), ω=1, . . . ,Ωφ=1, . . . ,Φ
GN5 If ωφΣ∥rωφ(n)∥2<∈, stop.
GN6 Use MRCG to find the least squares solution to the quadratic minimization problem.
for δγ(n)
GN7 Update γ. γ(n+1)=γ(n)+δγ(n).
GN8 Set n=n+1, go to GN3
The crux of the GN iteration is GN 6 where the overdetermined quadratic minimization problem is solved for the scattering potential correction. This correction is approximated by applying a set of M iterations of the MRCG algorithm. The details of GN 6 are:
GN.6.1 Initialize the zeroth iterate of MRCG: δγ0=0
GN.6.2 Initialize the MRCG residuals equal to the GN outer loop residuals: rωφ0=rωφ(n)
Note that iterates pertaining to the MRCG iteration are indexed without the ( ) in order to distinguish them from outer GN loop iterates.
GN.6.3 Compute the gradient of the MRCG quadratic functional:
GN.6.4 Set the initial search direction equal to minus the gradient: p0=−g0
GN.6.5 Set m=0
GN.6.6 Compute tωφn=pωGω(I−[γ(n)]Gω)−1[fωφ(n)]pmmφ=1, . . . ,Φω=1, . . . ,Ω
GN.6.7 Compute the quadratic step length:
GN.6.8 Update the solution of the quadratic minimization: δγm+1=δγm+αmpm
GN.6.9 Update the MRCG search residuals: rωφm+1=rωφm+αmtωφm
GN.6.10 Compute the gradient of the MRCG quadratic functional:
GN.6.11 Compute βm=∥gm+1∥2/∥gm∥2
GN.6.12 Update the MRCG search direction: pm+1=−gm+1+βmpm
GN.6.13 If m=M, go to CN.6.15
GN.6.14 m=m+1. go to GM.6.8
GN.6.15 Equate the solution of the quadratic minimization problem with the last iterate of MRCG: δγ(n)=δγM
GN.6.16 Return to the GN algorithm at GN 7
A problem with the algorithm above is the presence of the inverse of the transposed total field equation (I−[γ(n)]Gω)−1 in the computation of the Jacobian and its adjoint in 6.3 and 6.10 Since we do not invert (or even store) these large linear systems, the occurrence of these inverse operators must be dealt with. This problem can however be overcome by computing the action of this operator on a given vector, as needed during the MRCG iteration, by a few iterations of BCG. When performed in this way, the shift invariance of the Green's function is exploited in a maximal way. No large matrices need to be inverted or stored. This is because the Jacobian implementation now consists exclusively of shift invariant kernels (diagonal kernels such as pointwise multiplication by γ or f and the shift invariant kernel composed of convolution with the Green's function) Such shift invariant kernels can be implemented efficiently with the FFT as previously described.
An even greater increase in numerical efficiency can be obtained in cases for which the approximation:
(I−[γ(n)]Gω)−1≈(I+[γ(n)]Gω) (23)
(which is similar to the Born approximation of the forward problem) can be used in the Jacobian. This has been found to be the case for many acoustic scattering problems for biological tissue, and for EM problems for which the contrast in dielectric contrast is small. The other two gradient algorithms that are used to solve the inverse scattering equations are the Fletcher-Reeves and Ribiere-Polak algorithms. For the inverse scattering equations, they are given by the iteration:
RP 1. Select an initnal guess, γ(0)
RP 2. Solve the forward problems using (biconjugate gradients) BCG
fωφinc=(I−Gω[γ(0)])fωφ′ φ=1, . . . ,Φ(0) ω=1, . . . ,Ω
for the internal fields, fωφ(0)
RP 3 Computer the detector residuals: rωφ(0)=PωGω[fωφ(0)]γ(0)−vωφ, φ=1, . . . ,Φmeas ω=1, . . . ,Ω
RP 4 Compute the gradient: rωφ(0)=PωGω[fωφ(0)]γ(0)−vωφ, φ=1, . . . ,Φmeas ω=1, . . . ,Ω
RP 5 computer the search direction: p(0)=−g(0)
RP 6 n=0
RP 7 compute the Jacobian operating on the search direction:
tωφ(0)=PωGω(I−[γ(n)]Gω)−1[fωφ(0)]p(0), ω=1, . . . ,Ωφ=1, . . . ,Φ
RP 8 compute the step length (quadratic approximation):
RP 9 update the solution: γ(n+1)=γ(n)+αnp(n)
RP 10 Solve the forward problems using (biconjugate gradients) BCG:
fωφinc=(I−Gω[γ(n+1)])fωφ′ φ=1, . . . ,Φ(n+1) ω=1, . . . ,Ω
RP 11 Compute the detector residuals: rωφ(n+1)=PωGω[fωφ(n+1)]γ(n−1)−vωφ, φ=1, . . . ,Φmeas ω=1, . . . ,Ω
RP 13 Computer the gradient:
RP 14 Compute:
Fletcher-Reeves
Ribere-Pollack
RP 15 Update the search direction: p(n+1)=−g(n+1)=βnp(n)
RP 16 n=n+1, go to RP 7
The distinction between FR and RP lies in the calculation of βin RP 14. It is generally believed that the RP calculation is more rapidly convergent. Our experience indicates that this is true and so we generally use the RP iteration rather than FR.
Comparison of the linear MRCG iteration and the nonlinear RP iteration reveals that they are very similar. In fact, RP is precisely a nonlinear version of MRCG. Note that the only difference between them lies in the fact that the RP residuals, computed in steps RP.10 and RP.11 involve recomputation of the forward problems while the MRCG residuals are simply recurred in 6.9 (additional, trivial, differences exist in the computation of the a and β's). In other words, the RP iteration updates the actual nonlinear system at each iteration, while the MRCG simply iterates on the quadratic functional (GN linearization). The overall GN-MRCG algorithm contains two loops—the outer linearization loop, and the inner MRCG loop, while the RP algorithm contains only one loop. Sine the RP algorithm updates the forward solutions at each step, it tends to converge faster than GN with respect to total iteration count (number of GN outer iterations times the number, M, of inner loop MRCG iterates). The GN method is, however, generally faster since an MRCG step is faster than an RP step due to the need for forward recomputation in the RP step. The overall codes for the GN-MRCG algorithm and the RP algorithm are so similar that a GN-MRCG code can be converted to an RP code with about 10 lines of modification.
Before leaving this example it is important to note that the examples given in this patent all assume that the different frequencies, ω, and views, φ are all computed in serial fashion. It is important to note however, that another important link in the real time implementation of our algorithm, is the fact that the different frequencies and different views are independent in both the forward problem and Jacobian calculations, and therefore can be computed independently and in parallel. This should be clear by examining, for example, the computations in GN.3 and GN.6.3. The forward problem calculations, GN.3 are completely independent and in the gradient calculation, GN.6.3, the computations are independent in frequency and view up to the point at which these contributions are summed to give the gradient. The GN and RP algorithms could thus be executed on a multinode machine in parallel with node intercommunication required only 2 to 3 times per step in order to collect sums over frequency and view number and to distribute frequency/view independent variables, such as scattering potential iterates, gradient iterates, etc., to the nodes.
Image Reconstruction in Layered Ambient Media Using Optimization of a Bilinear or Quadratic Objective Function Containing All Detector Measurement Equations, and with Internal Fields Represented as a Function of the Scattering Potential
The examples in the previous patent U.S. Pat. No. 4,662,222 were characterized by the consideration of both the internal fields, and the object function y to be independent variables of equal importance. The example given above, however, has the distinguishing characteristic that the object function γ is considered to be the sole independent variable, and the internal field resulting from the incident field and γ interaction is considered to be a function of this γ. We may write
fθωsc≡fθωsc(γ)∀ω=1, . . . ,Ωθ=1, . . . ,Θ (24)
for the scattered field.
The difference between this example, and the previous one is that the background medium, in which the object is assumed to be buried, is assumed to be homogeneous in the previous case, whereas it is here assumed that the background is layered. The residual is defined in the same way as the previous example, with dθω used to represent the Nd-length vector of measured scattered field at the detector positions. That is, the residual vectors for all ω, θ are defined in the following way.
rθω≡fθωsc(γ)−dθω=0∀ω=1, . . . , Ωθ=1, . . . ,Θ (25)
The functional F(γ), dependent upon γ is defined in the same way as example 1:
F(γ)≡Σθω∥rθω∥22 (26)
This is the functional, dependent upon the object function γ in a highly nonlinear manner, which must be minimized in order to determine the γ values at each gridpoint in the object space. It is again necessary to compute the Jacobian:
where θ again refers to the multiple views and the o to the multiple frequencies available to us. That is, we again assume that the noise level in the system is zero. We apply the Gauss Newton algorithm to the associated least squares problem and get the same result. This leads to the overdetermined system described above (the notation is identical to the previous example, the difference lies in the actual form of the layered Green's function).
which must be solved for the γ-update δγ The left hand side of the above formula is given by
Again, we can use the complex analytic version of the Hestenes overdetermined conjugate gradient algorithm, adapted for least squares problems to iteratively solve this system. This is equivalent to finding the minimum norm solution.
The formula for the Jacobian in the layered medium situation in the presence of multiple frequencies, is,
where Gω is the Layered Green's function for the frequency ω. Therefore, in effect, to determine the γ-update, δγ, we merely solve the multiple view problem for each particular frequency, that is, we solve the overdetermined system:
which in component form is
[T∘([I−Gk·[γ]]−1Gk)][fjk]δγ(n)=−[rjk(n)]∀j+1, . . . ,Θk+1, . . . ,Ω (32)
See also the computer code in this patent for the complete description of this procedure. For multiple view and multiple frequency inversion then, the inversion scheme in layered media reads:
STEP 1. Choose an initial guess for the scattering potential, γ(o).
STEP 2. Set n=0
STEP 3. solve the forward problems (I−G[γ])fθω=fθωinc,∀ω=1, . . . ,Ωθ=1, . . . ,Θusing biconjugate gradients, and use the result to compute the φ-forward maps φθω(γ(n)) and
rθω(n)≡φθω(γ(n))−dθω=0∀ω=1, . . . ,Ωθ=1, . . . ,Θ
STEP 4. Determine the L2 norm of the total residual vector:
Step 6. Solve the NdθΩ by N overdetermined system:
in the least squares sense for
δγ(n), using the conjugate gradient algorithm adapted for least squares problems.
STEP 7. Update 7 via the formula
γ(n+1)=γ(n)+δγ(n)
STEP 8. Set n=n+1, go to begin
The actual values of the angles of incidence θ are chosen to diminish as much as possible the ill conditioning of the problem. Equally spaced values 0<θ<2Π are ideal, but experimental constraints may prohibit such values, in which case the multiple frequencies are critical. For biological contrasts (γ<0.15 say) the following approximation, which is essentially identical to the assumption in the previous example, is valid:
[I−Gω[γ]]−1=[I+Gω[γ]] (33)
This is the same Born-like approximation restricted to the Jacobian calculation only, that is assumed in example 1. This approximation has a much larger radius of convergence than the standard born approximation, but substantially reduces the computational complexity of the problem.
How to Implement the Convolution and Correlation in Layered Medium Imaging
All of the above calculations have been carried out without using the fact that the operation of matrix multiplication with G is in fact the sum of a convolution and a correlation, which is transformed into a pair of convolutions. Symbolically GL=GR+GV, where GR is the correlation and GV is a convolution operator. The actual numerical implementation of the above formulas, therefore, is somewhat different than would naively appear. The implementation of the correlation with convolution is a little more challenging than the convolution in the homogeneous case, in that a change of variables must be used to convert the correlation to a convolution before the Fast Fourier Transform (FFT)s can be applied. A “reflection” operator must be applied to γ before the convolution is applied, this operator is denoted by Z, and is defined by:
The use of this operator is incorporated into the action of the Lippmann-Schwinger equation on the field f That is, the equation for the internal fields, which in example 1 read:
(I−Gγ)f=finc(I−Gγ)
Now becomes
(I−Gvγ−GRZγ)f=finc
This change has non-trivial ramifications for the construction of the algorithms discussed above. The FFT implementation of this is given below. To prevent the notation from getting out of hand we use [] to indicate that a diagonal matrix is constructed out of an n vector, so that [γ(n)]fφ(n) denotes pointwise multiplication of the vectors γ and f. Also F is used to denote the Fourier transform, and * is used to indicate “convolution”. In this notation, the Lippmann-Schwinger equation becomes, in the symbolic form adopted above,
(I−Gvγ−GRZγ)f=(f—Gv*([γ]f)−GR*Z([γ]f))=(f−F−1{(F(Gv)·F([γ]f))−(F(GR)·F{Z([γ]f)})}) (35)
Substantial savings is achieved through the observation that the Fourier Transforms of GV and GR need only be done once, then stored, and finally, that the reflection operator, Z, will commute with Fourier transformation in the x and y (horizontal) directions, since it only involves reflection in the z (vertical) direction.
There are changes also, for example in the computation of f. The biconjugate gradient algorithm requires the use of the adjoint in the solution of the above equation. This adjoint differs from the homogeneous ambient medium case with the introduction of the correlation operator. Also, there are substantial changes in the implementation of the overdetermined conjugate gradient algorithm used to solve (in the least squares sense) the overdetermined system of equations. In this section, as before, we incorporate the ksc2 factor into the GV and GR terms. Of course, rather than carry out the inversion of a matrix, we use biconjugate gradients and Fast Fourier Transforms (FFT's), which require the Lippmann Schwinger operator, which we denote by LS, and its adjoint, which are defined by the following formulas:
(LS)f≡(I−F−1[({overscore (G)}v•F)+({overscore (G)}R•F∘Z)])([γ]f) (36)
Where
{overscore (G)}v≡F(Gv) and {overscore (G)}R≡F(GR)
are the Fourier Transforms of GV and GR and need only be calculated once, then stored. The unitarity of F is used below to determine the adjoint (LS)H.
(LS)H≡((I−F−1[({overscore (G)}R·F∘Z)])[γ])H=[{overscore (γ)}](I−[(F−1{overscore (G)}VH)+(ZHF−1{overscore (G)}RH)](F−1)H)=({overscore (γ)})(I−[(F−1{overscore (G)}V)+(ZF−1{overscore (G)}R)]F) (37)
where we have used the unitarity of Fourier Transformation F:FH=F−1, and the fact that point wise multiplication is used, and-the fact that ZH=Z. For practical problems it is best to obtain a reasonable first guess using low frequency data, then to build on this using the higher frequency information.
The Actual Construction of the Layered Greens Function
Up to this point, we have assumed the existence of the layered Green's function, but have not shown explicitly how to construct it. This is an essential step. For simplicity we deal with the scalar case, although the inclusion of shear wave phenomena is conceptually no more difficult than this case. The details are given in [Wiskin, 1991]. The general idea is based upon standard ideas in the literature concerning reflection coefficients in layered media. See for example [Muller, G., 1985, Journal of Geophysics, vol. 58] which is herein incorporated by reference. The novelty is the combination of the reflection coefficients into a bona fide Green's function, and the utilization of this in a forward problem, then more importantly, in the inverse problem solution. The procedure involves decomposing the point response in free space into a continuum of plane waves. These plane waves are multiply reflected in the various layers, accounting for all reverberations via the proper plane wave reflection/transmission coefficients. The resulting plane waves are then resummed (via a Weyl-Sommerfeld type integral) into the proper point response, which in essence, is the desired Green's function in the layered medium. The final result is:
R− and R30 are the recursively defined reflectivity coefficients described in Muller's paper, u is the horizontal slowness,
Recall that Snell's law guarantees that u will remain constant as a given incident plane wave passes through several layers.
ω is the frequency
bsc is the vertical slowness for the particular layer hosting the scattering potential
This Green's function for imaging inhomogeneities residing within Ambient Layered Media must be quantized by convolving with the sinc (really Jinc) basis functions described below. This is done analytically in [Wiskin, 1991], and the result is given below.
Discretization of Layered Medium Lippmann-Schwinger Equation
Unfortunately the basis functions that were used in the free space case (example 1) cannot be used to give the discretization in the layered medium case because of the arbitrary distribution of the layers above and below the layer which contains the scattering potential. The sinc functions may continue to be used in the horizontal direction, but the vertical direction requires the use of a basis function with compact support (i.e. should be nonzero only over a finite region). The sampled (i.e. discrete) Green's operator in the layered case is defined by
GL(y,k,m)=GL(jδ,kδ,mδ) (40)
where the three dimensional basis functions are of the form:
B(x)=B(x,y,z)=Sj,δ)(x)Sk,δ(y)Λm(z), (41)
(I is the set of integers) and where:
The basis functions in the horizontal (x-y) plane S(j,δ) are based upon the Whittaker
The basis functions in the vertical (z) direction, on the other hand, are given by
Λm(Z)=Λ(z−mδ),
where Λ(z) is the “tent” function:
The form of the discretized Lippmann Schwinger equation is
with the vectors Gπ/δR and Gπ/δV are the result of first convolving the Green's function with the basis functions, and then sampling at the gridpoints. The superscript R refers to the correlation part whereas the superscript V refers to the convolution part of the Green's function. Both are computed via Fast Fourier Transforms, and are very fast. In the free space case the correlation part is zero. That is, we make the definition for L=V or R- (ie. convolution or correlation):
substitution into equation (17) yields thee form (42). In matix notation, this equation gives:
f(I−ksc2GV*γL−ksc2ĜR*Zγl)−1finc (44)
where now * represents discrete convolution in 3D. Now that we have constructed the Integral equations in discrete form, we must derive the dosed form expression for the layered Green's operators. The construction of the closed form expression of the sampled and convolved acoustic layered Green's function Gπ/δL from the layered Green's function given in equations (38 and 39) above is given by performing the integration in (43) analytically. This process is carried out in [Wiskin, 1991]. The final result is, for the convolutional part,
In the above formulas
Jo≡Jo(uω|m|δ)
is the zeroth order Bessel Function, and the upper limit uδ is defined as:
for wavenumber
with bsc defined as the vertical slowness in the layer containing the scattering point. When this layer is assumed to have acoustic wave velocity, it is given explicitly by:
These expressions give the discretized 3D Acoustic layered Green's function, which is used in the layered media discretized Lippmann-Schwinger equation to obtain the solution field within the inhomogeneity. It is also used in the detector equations (9), which correspond to the detector equations (11).
This patent differs from the previous art in the important aspect of including the correlation part of the green's function. This correlation part is zero for the free space case. This correlational part is directly applicable as it occurs above to the fish echo-locator/counter, and to the mine detection device in the acoustic approximation. (There are specific scenarios, where the acoustic approximation will be adequate, even though shear waves are clearly supported to some degree in all sediments). The inclusion of shear waves into the layered media imaging algorithm (the technical title of the algorithm at the heart of the hazardous waste detection device, the fish echo-locator, and the buried mine identifier) is accomplished in Example 6 below. Actually a generalized Lippmann-Schwinger equation is proposed. This general equation is a vector counterpart to the acoustic Layered Green's function, and must be discretized before it can be implemented. The process of discretization is virtually identical to the method revealed above. Furthermore, the BCG method for solving the forward problem, the use of the sinc basis functions, and the use of the Fast Fourier Transform (FFT) are all carried out identically as they in the acoustic (scalar) case.
Imaging with Electromagnetic Wave Energy
The use of electromagnetic energy does not greatly affect the basic idea behind the imaging algorithm, as this example will demonstrate. Furthermore, we will see that it is possible to combine the layered Green's function with the electromagnetic free space Green's function to image materials within layered dielectrics. In fact this process of combining the layered Green's function with Green's functions derived for other structures and/or modalities than the free space acoustic case can be extended almost indefinitely. Another example is given below, where the combination of the Acoustic Biot Green's function with the layered Green's function is carried out. Further extensions that are not detailed here, are: (1) Combining Elastic (including shear wave energy) Wave equations with the layered Green's function, (3) Combining the Elastic Biot equations with the layered Green's function, (4) Combining the elastic and electromagnetic wave equations to model transducers. For simplicity, we consider a homogeneous, isotropic, non-magnetic background material. The time dependence will be eiωl, the magnetic properties of free space are summarized in {circumflex over (z)}o≡iωuo, and the electric properties are summarized in ŷo≡iω∈o. Within the object being imaged the magnetic properties are {circumflex over (z)}≡{circumflex over (z)}o≡iωuo (the equivalence with the free space value is the nonmagnetic media assumption). The electric properties of the object being imaged are summarized in
ŷ≡σ+ŷo≡σiω∈r∈o,
which is the complex admittivity of the object. The larger σ is, the less able the medium is able to support electromagnetic wave transport. These properties are combined in
which is a measure of the number of wave cycles present per unit distance for a given frequency ω, when the wave speed of propagation is co. The object's electrical properties (recall it is assumed non-magnetic for simplicity) are summarized in the “object function”, γ. (the normalized difference from the surrounding medium):
The total electric field equations are:
where Ei(r) is the 3-D incident field, E(r)is the 3-D total field.
The construction of the sinc basis discretization and the GN and RP algorithms for this 3-D EM case is essentially equivalent to the 2-D scalar acoustic case described above. See [Borup, 1989] for the details of the discretization and solution of the forward problem by FFT-BCG. The vector-tensor nature of the fields—Green's function is the only new feature and this is easily dealt with.
For simplicity, we now look at the situation where there is no z dependence in either the object function, i.e. γ(x,y,z)=γ(x,y), neither in the incident field. The vector ρ=(x,y) is used for the position vector in (x,y)-space.
in matrix form the equation (50) looks like:
From this form of the equation, it is clear that the electric field in the z-direction is uncoupled with the electric field in the x-y plane. Thus, in this situation the field consists of two parts. The so-called transverse electric (TE) mode, in which the electric field is transverse to the z direction, and the transverse magnetic (TM) mode, in which the electric field has a nonzero component only in the z direction. The TM mode is governed by the scalar equation:
which is identical to the 2-D acoustic equation discussed previously. Thus, the 2-D TM electromagnetic imaging algorithm is identical to the 2-D acoustic case discussed in detail above. The TE mode is given by the following equation:
The field is a two component vector and the Green's function is a 2x2 tensor Green': function:
In compact notation:
This equation also has a convolution form and can thus be solved by the FFT-BCG algorithm as described in [Borup, 1989] The construction of the GN-MRCG MRCG and RP imaging algorithms for this case is identical to the 2-D acoustic case described above with the exception that the fields are two component vectors and the Green's operator is a 2×2 component Green's function with convolutional components.
Finally, note that the presence of layering parallel to the z direction can also be incorporated into these 2-D algorithms in essentially the same manner as above. Special care must be taken, of course, to insure that the proper reflection coefficient is in fact, used. The reflection coefficient for the TM case is different from the TE case.
The Electromagnetic Imaging Algorithm
The electromagnetic imaging algorithm is virtually the same as the imaging algorithm for the scalar acoustic mode shown in the previous example, and therefore, will not be shown here. The Microwave Nondestructive Imager is one particular application of this imaging technology, another example is the advanced imaging optical microscope.
Advanced Imaging Microscope
The microscope imaging algorithm requires the complex scattered wave amplitude or phasor of the light (not the intensity, which is a scalar) at one or more distinct pure frequencies (as supplied by a laser). Since optical detectors are phase insensitive, the phase must be obtained from an interference pattern. These interference patterns can be made using a Mach-Zehnder interferometer. From a set of such interference patterns taken from different directions (by rotating the body for example), the information to compute the 3-D distribution of electromagnetic properties can be extracted. A prototype microscope system is shown in
The problem of extracting the actual complex field on a detector (such as a CCD array face) from intensity measurements can be solved by analysis of the interference patterns produced by the interferometer. It is possible to generate eight different measurements by: (1) use of shutters in path A and path B of the interferometer; (2) inserting or removing the sample from path A; and (3) using 0 or 90 degree phase shifts in path B. It is seen that all eight measurements are necessary.
The procedure to extract amplitude and phase from the interference patterns can be seen after defining variables to describe the eight measurements. Let:
fo(det)φωxA=Aoφωxeiωt+iα(φ,ω,x)
be the incident field on the detector from path A (with no energy in path B) when no sample is in place,
Aoφωx=|Aoφωx|,
fodetφωxB=B1φωneiωt+iβ(φωx)
be the incident field on the detector from reference path B1 when no additional phase shift is introduced in path B (with no energy in path A),
B1 φωx=|B1φωx|
f1(det)100ωxB=B2φωxeiωt+iβ(φ,ω,x)+iπ/2
be the incident field on the detector from reference path B when π/2 additional phase shift is introduced in path B (with no energy in path A),
B2φωx=|B2φωx|,
fo(det)φωxC=Co1φωxeiωt+θ(φ,ω,x)
be the field on the detector when both path A and path B1 are combined when no sample is in place,
Co1φωx=|Co1φωx|,
fo(det)φωxC=Co2φωxeiωt+θ(φ,ω,x)+iπ/2
be the field on the detector when both path A and path B2 are combined when no sample is in place,
Co2φωx=|Co2φωx|,
f(det)φωxA=A1φωxeiωt+φ(φ,ω,x)
be the field on the detector from path A (with no energy in path B) when a sample is present,
Aφωx=|Aφωx|,
f(det)φωxC=C1φωxeiωt+iσ(φ,ω,x)
be the field on the detector when both path A and path B, are combined when the sample is in place,
C1φωx=|C1φωx|
f(det)φωxC=C2φωxeiωt+iσ2(φ,ω,x)
be the field on the detector when both path A and path B2 are combined when the sample is in place,
C2φωx=|C2φωx|,
φ be the rotation angle of the sample holder
ω be the frequency of the light
x be the 2-D address of a pixel on the detector.
Now consider the one pixel for one rotation angle and at one frequency of light so that we may suppress these indices temporarily. The measurement made by the CCD camera is not of the field but rather of the square of the modulus of the field (i.e., the intensity). Thus, eight intensity fields can be measured on the detector:
M12=A02φωxM22=B12
M32=B22φωxM42=C012
M52=Co22φωxM62=A2
M72=C12φωxM82=C22
Because of the nature of an interferometer, the eight measurements are not independent. In particular, M42 is dependent on M12 and M22. M52 is dependent on M12 and M32 M72 is dependent on M12, M2 2, M42 and M62 M82 is dependent on M12, M32, M52 and M62. Also,
M32=K M22 (where, K=1 for a perfect phase shifter).
Thus,
B22 =KB12
There are four ways to add the two beams on specifying sample in or sample out and specifying 0-degree phase shift or 90-degree phase shift. For sample out:
M42=Co1φωx2=|Aoeiωt+iα(φ, ω,x)+B1eiωt+iβ(φnωx)|2=Ao2+B12+2AoB1cos(β−α),M52=Co2φωx2=|Aoeiωt+iα(φ,ω,x)+B2eiωx+iβ(φ,ω,x)+iπ/2|2=Ao2+B22+2AoB2cos(β−α+π/2)=Ao2+B22+2AoB2sin(β−α)(β−α)=−arctan{M3[M42−M12−M22]/M2[M52−M12−M32]}
For sample in:
M72=C1 φωx2=|Aφωxeiωt+iφ(φ,ω,x)+B1φωxeiωt+iβ(φ,ω,x)|2=A2+B12+2AB1 cos(β−Ψ),M82=C2φωx2=|Aφωxeiωt+iφ(φ,ω,x)+B2φωeiωx+iβ(φ,ω,x)|2=A2+B22+2AB2cos(β−Ψ+π/2)=A2+B22−2AB2sin(β−Ψ).
(β−Ψ)=−arctan{M3└M72−M62−M22┘/M2└M82−M62−M32┘}
(β−α)−(β−Ψ)=(Ψ−α)=phase shift introduced by sample
The scattering equations are
f(x)−finc(x)+C(x−y)*([γ(y)])Df(y)), f(x)
f(sc−det)(z)=f(det)(z)−f(inc)(z)=D(z−x)*([f(x)]Dγ(x))
Here, the symbol * means convolution and the symbol [.]D means the vector [.] is replaced by a diagonal matrix. When written in terms of the phase shift introduced by the sample these equations become:
(f(x)/f(inc)(x))−1=C(x−y)*([γ(y)]D(f(y))/f(inc)(x)))
f(sc−det)(z)=f(inc)(z)=(f(det)(z)/f(inc)(z))−1=D(z−x)*([f(x)/f(inc)(z)]Dγ(x))
If the incident field is known then then these equations can be solved. If the incident field is not known explicitly, but is known to be a plane wave incident normal to the detector, then the phase and amplitude is constant and can be divided out of both pairs of equations. It is the latter case that is compatible with the data collected by the interferometer. If the incident field is not a plane wave then, the amplitude and phase everywhere must be estimated, but the constant (reference) phase and amplitude will still cancel. The incident field can be estimated by inserting known objects into the microscope and solving for the incident field from measured interference patterns or from inverse scattering images.
The basic microscope has been described in its basic and ideal form. A practical microscope will incorporate this basic form but will also include other principles to make it easier to use or easier to manufacture. One such principle is immunity to image. degradation due to environmental factors such as temperature changes and vibrations. Both temgerature changes and vibration can cause anomalous differential phase shifts between the probing beam and reference beam in the interferometer. There effects are minimized by both passive and active methods. Passive methods isolate the microscope from vibrations as much as possible by adding mass and viscous damping (e.g. an air supported optical table). Active methods add phase shifts in one leg (e.g. the reference leg) to cancel the vibration or temperature induced phase shifts. One such method we propose is to add one or more additional beams to actively stabilize the interferometer. This approach will minimize the degree of passive stabilization that is required and enhance the value of minimal passive stabilization.
Stabilization will be done by adjusting the mirrors and beam splitters to maintain parallelness and equal optical path length. A minimum of three stabilizing beams per/mirror is necessary to accomplish this end optimally (since three points determine a plane uniquely including all translation). These beams may be placed on the edge of the mirrors and beam splitters at near maximum mutual separation. The phase difference between the main probing and main reference beam occupying the center of each mirror and beam splitter, will remain nearly constant if the phase difference of the two component beams for each stabilizing beam is held constant. Note that one component beam propagates clockwise in the interferometer, while the other propagates counterclockwise. Stabilization can be accomplished by use of a feed back loop that senses the phase shift between the two component beams for each stabilizing beam, and then adjusts the optical elements (a mirror in our case) to hold each phase difference constant. This adjustment of the optical elements may be done by use of piezoelectric or magnet drivers.
We now describe how use of the several stabilizing beams operates to stabilize the interferometer and the microscope. Since the stabilization of a single beam is known to those working with optical interferometers, the description given here will be brief. The description is easily seen by a simple mathematical analysis of the interaction of the two components of the stabilizing beam. The electric field of the light at a point in space may be represented E=Eoexp (i2π ft), where Eo contains the spatial dependence and exp(i2π ft) contains the temporal component, where f is the frequency of oscillation and t is time. For the remainder of this discussion we assume exp(i2π ft) time dependence at a single frequency f and this factor will be omitted. Let the electric field of the output of the interferometer for one component beam be A and let the electric field output of the second component be B expiθ. Then the combined output is [A+Bexp(iθ)]. The light intensity of the combined output that is detected is
|A+Bexp(iθ)|2=A2+B2+2ABcosθ.
The feedback conditions can be easily derived from the above equation. The feedback signal can be derived by: (1) synchronous detection of the summed beams when one of the beams, say beam B, is modulated; or (2) by use of the differences in intensity of the transmitted and reflected beam exiting the final beam splitter. We now describe the operation of the first method of synchronous detection.
Let θ=(θo+∈sin(ωt)) represent phase modulation of phase θ around equilibrium phase θo by amount ∈ at frequency ω. The frequency co is typically 100 Hz to 1000 Hz. Suppose that phase difference of zero degrees between beam A and beam B is required. Setting θo=δ and noting that both ∈ and δ are small we see that cosθ=cos(δ+∈sin(ωt)). By use of the trigonometric identity for the cosine of the sum of two angles we get cosθ=cosδ cos(∈sin(ωt))−sinδ sin(∈sin(ωt)). On expanding this expression to second order in δ we obtain
We wish to force δ to zero for optimal phase lock. There are two ways to accomplish this end: (1) by maximizing the modulus of the signal component ∈2(1−δ2)sin2(ωt); or (2) by minimizing the modulus of the signal component δ∈sin(ωt). The former method is accomplished by mixing (multiplying) |A+Bcosθ|2 by sin2(ωt), low pass filtering and then maximizing the signal by feedback changes to δ.
To lock the phase difference to π/2 a modification of the above procedure may be used. Set
θ=π/2+δ+∈sin(ωt).
Then
cosθ=cos(π/2+δ+∈sin(ωt))=sin(δ+∈sin(ωt))=−sinδcos(∈sin(ωt))+cosδsin(∈sin(ωt)).
On expanding this expression to second order in δ we obtain
Again, we wish to force δ to zero for optimal phase lock. There are again two ways to accomplish this end: (1) by maximizing the modulus of the signal component (1−δ2/2)∈sin(ωt); or (2) by minimizing the modulus of the signal component δ(1−(½)∈2sin2sin2(ωt)). The former method is accomplished by mixing |A+Bcosθ|2 with sin(ωt), low pass filtering and maximizing the result by changing to δ through feedback. The latter method is accomplished by mixing |A+Bcosθ|2 with sin2(ω), low pass filterings and minimizing the resulting signal by feedback to change δ.
We now describe a second or non-hetrodyning method of phase locking beam A and B to a constant phase difference. The net intensity of the respective transmitted and reflected beams from a beam splitter have different dependency of the phase difference from the that combines two beams. Let the two beams coincident on the beamsplitter be A and B cos θ Then the two exit beams from the beam splitter have respective intensities |A+B cos θ|2 and |A−B cos θ|2. This follows from a difference in phase of 180 degrees in the reflected beam depending on whether the incident medium has a higher or lower dielectric constant than the medium beyond the reflecting surface. The difference of the signals from detectors that detect transmitted and reflected beams is given by |A+B cos θ2−|A−B cos θ|2=4AB cos θ. If the desired phase angle is π/2 and the error is δ then 4AB Cos θ=−4AB sin δ≈−4ABδ4. Thus, the error signal δ can be used in a feedback loop to drive the difference in the two optical paths to π/2. This technique can be refined by dividing by the product of A and B. Since A is proportional to B dividing by A.sup.2 will be equivalent. The value of A is proportion to the sampled laser power that is used to generate beam A and beam B cos θ.
Extension to Biot Theory (Acoustic Approximation)
In the article [Boutin, 1987, Geophys. J. R. astr. Soc., vol. 90], herein incorporated by reference, a greens function for the determination of a point response to a scattering point located in an isotropic, homogeneous, porous medium that supports elastic waves is developed. The implementation of this Greens' function into an imaging algorithm has never been carried out before. In this section, we have adapted their approach to derive an acoustic approximation to the fully elastic Biot theory that enables us to present a simplified practical tool for the imaging of porosity-like parameters in a geophysical context. The implementation of the full elastic Biot imaging algorithm using the Green's function of [Boutin, 1987] in place of the one derived in [Wiskin, 1992] is no different from the discretization employed here. The use of the sinc basis functions, the FFT's, the biconjugate gradients, and so on, is identical.
We have developed a model that incorporates the parameters of fluid content, porosity, permeability, etc, but instead of the 3+1 degrees of freedom of the two phase elastic model, has only 1+1, corresponding to the two independent variables, Ps′ and Pl′ the pressure field within the solid (modelled as a liquid in this approximation) and truly liquid phase respectively. As with the previous example. The use of the Blot theory Green's function can be combined profitably with layering, to image material in a layered background.
Define (see [Boutin, 1987]., for more complete discussion of the parameters):
The “acoustic two phase Green's function” is the Green's function obtained by solving the above system with a distributional right hand side, where δ(x) is the Dirac delta distribution. That is we seek the solution to the following matrix operator equation:
BGBiot+I2×2.5δ(x)=0 where BGBiot:
R2,3→R2 is a function on R3 or R2and I2×2 is the two dimensional identity matrix. B is given by:
This author obtained
where w: R3→R2 is the 2D vector defined by:
and x∈R3 for the acoustic two-phase 3D model, and where d and v are 2 component vectors (representing the two phases) given by
A very similar analysis gives an equation and corresponding Green's Operator for the acoustic two-phase 2D (two spatial independent variables) case, in which w: R2→R2 and x∈R2 The operator is very similar except that the w vector contains Hankel functions of the zeroth order:
Notice the existence of the two wavenumbers δ1 and δ2 in this acoustic approximation to Biot theory guarantee the existence of a fast and slow compressional wave—a distinguishing characteristic of the classical elastic Biot theory.
More importantly this algorithm provides us with a computationally feasible means of inverting for phenomenological parameters derived from the classical elastic two phase Biot Theory.
Finally, it is important to point out that convolving the above functions with the sinc basis function analytically is done in exactly the same manner with GBIot=G2×2 above as it was done above because x only occurs in the form of the scalar acoustic Green's operator. This is why it is possible to create very efficient algorithms based upon the sinc-Galerkin method and FFT's as described in [Borup, 1992], using the Green's Operator described above.
It is now possible to obtain an algorithm for the inversion of the 2D Biot porous medium by analogy with the algorithm based upon the Lippmann-Schwinger equation used earlier. This new algorithm is based upon a “two phase” Lippmann-Schwinger type equation derived in a future publication.
the object functions γj are given by
This equation is discretized in the same manner as above, and the convolution character is preserved. With the explicit form of the discretized Biot Lippmann-Schwinger Integral Equation the imaging problem is solved by applying the analogous conjugate gradient iterative solution method to the Gauss-Newton Equations in a manner exactly analogous to the discussion above (see future publication for details).
Finally, as mentioned above, it is certainly possible to construct, using the above principles, a layered Green's function for the Biot theory in exact analogy with the construction for the acoustic layered Green's Operator. This operator will be a matrix operator (as is the acoustic two phase Green's Operator constructed above), and will consist of a convolutional and correlational part (as does the acoustic single phase Green's Operator constructed earlier). Because of these convolutional and correlational parts, the FFT methods discussed above are directly applicable, making the algorithm feasible from a numerical standpoint.
Non-Perturbative Inversion of Elastic Inhomogeneous Structures Buried Within a Layered Half Space
In elastic media (by convention in this patent, media that supports shear wave activity) the relevant parameters are γ, μ, (Lame′ parameters), ρ (density) and absorption. The inversion for these elastic parameters (i.e. the first and second Lame′ parameters, γ and μ follows virtually the same prescription as was outlined above for the acoustic scalar case. In a manner similar to the Electromagnetic Inversion problem discussed above, it is possible to break up the arbitrary 3 dimensional vector u(x,y,z), representing the displacement into components that propagate independently. The exact description of this procedure is given in [Wiskin, 1991], and [Muller, 1985]. This example will not deal with this decomposition since the idea behind this is more easily seen by looking at the electromagnetic example given earlier.
The idea here is to incorporate the above solution to the layered inversion problem directly into the Green's operator. As before, in the electromagnetic and the acoustic case, this Green's function is used in the integral equation formulation of the inversion problem of imaging an inhomogeneous body extending over an arbitrary number of layers.
The following is the general elastic partial differential equation which governs the displacement vector field u in the general case where X and p both depend upon x ∈R3. When λ and μ are independent of x, we have the homogeneous elastic case.
fi(x) represents the applied body force, and μ(x) and λ(x) are the Lame′ parameters, their dependence upon x ∈R3 is the result of both the inhomogeneit to be imaged and the ambient layered medium. In particular ui({right arrow over (y)}){right arrow over (y)}∈R3 is the ith component (i=1,2,3) of the displacement field at point {right arrow over (y)}∈R3
ui0({right arrow over (y)}) is the ith component of the incident field.
ρ1({right arrow over (x)})+ρ0(z)=ρ({right arrow over (x)}) is the total density variation, it consists of the 3-D variation in ρ1 and the vertical 1-D variation in ρo
λ1({right arrow over (x)})+λo(z)=λ({right arrow over (x)}) is the total variation in λ the first Lame′ parameter, λ1 has 3-D variation, and λ0 has 1-D vertical variation.
μ1({right arrow over (x)})+μo(z)=μ({right arrow over (x)}) is the total variation in the second Lame′ parameter .mu.. The μ1 has variation in 3-D, the μ2 has variation in 1-D (vertical).
({right arrow over ({right arrow over (x)})})=density=.ρ1({right arrow over (x)})+ρ0(z),
where ρ0({right arrow over (x)}) represents the layered medium without the object, and ρ1({right arrow over (x)}) represents the object to be imaged.
λ0(z) and μ0 (z) are the z-dependent Lame′ parameters representing the layered medium
λ1({right arrow over (x)}) and μ1({right arrow over (x)}) are the Lame′ parameters representing the object to be imaged
The above differential equation is converted to the following elastic “Generalized Lippmann-Schwinger equation
by means of the kernel Gim(LAY)({right arrow over (y)},{right arrow over (x)}), which is constructed below. Where the volume integral is performed over the finite volume that contains the image space (and the object being imaged, for example, an ore body, submarine mine, oil reservoir, etc). This is the basic integral equation which forms the theoretical foundation for the Layered Green's Operator approach to Inverse Scattering in a layered medium.
This kernel is a 3 by 3 matrix of functions which is constructed by a series of steps:
Step 1:
Beginning with the acoustic (scalar or compressional) Green's function given by
Step 2:
The free space elastic Green's matrix is a 3 by 3 matrix of functions, built up from the free space Green's function. It's component functions are given as
Step 3:
Next the layered Green's function, GimL({right arrow over (y)},{right arrow over (x)}), for a layered elastic medium is defined. It is defined in terms of Gim, the components i,m=1, . . . , 3, of the elastic Green's function in homogeneous elastic media given above. The components of the layered Green's matrix are integrals over Bessel functions and reflection coefficients in essentially the same manner as the acoustic layered Green's function consisted of integrals over wavenumber, of the acoustic reflection coefficients. This dyadic is patterned after [Muller, 1985], in the manner discussed in [Wiskin, 1992].
Step 4:
Finally the layered Green's kernel Gim(LAY), is constructed {right arrow over (x)}∈R3
where GimL({right arrow over (y)},{right arrow over (x)}) is layered Green's function for the elastic layered medium. The progressive constructions can be represented in the following way, beginning with the acoustic free space Green's function:
Alternatively, in words:
Using the last “kernel” in this series in the generalized elastic layered Lippmann-Schwinger Equation gives the vector displacement.
which is then discretized, using the sinc basis in exactly the same way as for the previous examples, and, the FFT, and biconjugate gradient, and conjugate gradient algorithms are then applied to this vector equation, in the same way as done above. Thus the elastic modality (including shear waves) is accounted for.
The construction of the layered Green's function in the full elastic case (with shear wave energy included is slightly more complicated than the purely scalar case. For this reason, we look at the construction in more detail, see also [Wiskin, 1993]
For discretization of the resulting equations see the electromagnetic case discussed in example 3.
The Construction of the Continuous Form of the Layered Green's Operator GL-3D and with full elastic mode conversion Case,
Now we proceed with the construction of the continuous variable form of the Layered Green's operator. The closed form expression for the discretized form of the bandlimited approximation to this continuous variable Green's operator is constructed in a manner exactly analogous to the vector electromagnetic case discussed in example 3.
By analogy with the acoustic (scalar) situation, the construction of the layered Green's operator can be viewed as a series of steps beginning with the 3D free space Green's operator.
Step a) Decompose a unit strength point source into a plane wave representation (Weyl-Sommerfeld Integral). For the elastodynamic (vector) case we must consider the the three perpendicular directions of particle motion, which we take to be the horizontal direction, the direction of propagation, and the direction perpendicular to the previous two. Muller has given the representation for a point source in elastic media in [Muller, 1985]
Step b) Propagate and reflect the plane waves through the upper and lower layers to determine the total field at the response position. The total field will consist of two parts: uup(r), the upward propagating particle velocity, and ud(r), the downward propagating particle velocity at position r. This propagation and reflection is carried out analytically by means of the reflection coefficients, which in the elastic case, are the matrices R− and R+(R− and R+ are 2×2 matrices), and the scalars, r−, and r+. The matrices correspond to the P-SV (compressional and shear vertical polarized waves) for the case of horizontal stratification. (x is the horizontal, and z is the vertical coordinate, z is positive downward). The scalar coefficients correspond to the SH (horizontally polarized) shear waves, which propagate without mode conversion to the other types for the case of horizontally layered media.
R− and r−, represent the wave field reflected from the layers below the source position, and R+, and r+ represent the cumulative reflection coefficient from the layers above the source position, for the matrix and scalar cases respectively.
These total reflection coefficients are formed recursively from the standard reflection coefficients found in the literature (e.g. Muller [1985], and Aki and Richards, Quantitative Seismology, Freeman and Co., 1980, herein included as a reference].
FIGS. 1-C and 27 should be kept in mind throughout the following discussion. As in the scalar case, in order to have a single reference position R+ and R− are both given with respect to the top of layer that contains the scattering point, which is denoted by “sc”. That is, R+ represents the reflection coefficient for an upgoing wave at the interface between layer sc and sc-1.
In our case the expressions Su and Sd, Sd and Su, and are those derived from the Sommerfeld integral representation of a point source of unit strength, given in Muller. [see also Aki and Richards, 1980]. For example:
Sd=eiwb
Su=eiwb
for the scalar case.
As shown in detail, by Muller, the total contribution of the upward travelling wave is given by
Su+R−R+Su+R−R+R−R+Su+R−R+R−R30 Su+ . . . =[1+R−R++R−R+R−R++R−R+R−R+R−R++ . . . ]Su=[1−R−R+]−1Su
for the transverse shear wave (SH or horizontal shear wave), and
Su+R−R+Su+R−R+R−R+Su+R−R+R−R+Su+ . . . =[1+R−R++R−R+R−R++R−R+R−R+R−R++ . . . ]Su=[1=R−R+]−1Su
for the P-SV matrix case. Note that some care must be exercised in the convergence of the matrix series, however, for practical situations we can omit this detail.
Step c) The process of forming the total field at the response point must be broken up into two cases:
Case 1: the response point is above the scatter point, that is Δz−Δz′<0(Δz is the distance from the interface above the scattering layer, to the response point, the z axis is positive downward), and
Case 2, where the response point is below the scattering point Δz−Δz′<0.
Furthermore each case consists of an upward travelling, and a travelling wave:
Upward travelling wavefield:
First, in case 1:
[1−R−R+]−1Su and [1−R−R+]−1Su represent the contribution to u.sup.up from the upward travelling part of the source. A similar expression can be formed for the contribution from the downward travelling part of the source, it is
[1−R−R+]−1R−Sd, and [1−R−R+]
Thus, the total upward component of the wave field at the response point is formed from:
Case 2: the response point is below the scatter point, that is Δz−Δz′>0
The result here is similar, the change occurring in the coefficient of the Su, and Su.
Downward Component of Wavefield
Case 1: A similar expression gives the downward component of the total wave field at the response point r, for case 1:
Case 2: For case 2, the result is similar, here the response point resides below the scatter point, that is Δz−Δz′>0
Step d) The final step in the process is the recombination of the plane waves to obtain the total wavefield (upgoing and downgoing waves) at the response position:
For the scalar case 1 Δz−Δz′<0, and:
GL(x−x′,y−y′,z/z′)=fu=fd,
with
uup=∫J0(uω/r−4\)[1−R−R+]−1[Su+R−Sd]expiωbsc(z−zsc)du
ud=∫J0(uω/r−r′|)[1−R−R+]−1[R+Su+R+R−Sd]exp−iωbsc(z−zsc)du
so that
GL(x−x′,y−y′,z/z′)=∫J0(uω/r−r′|)[1−R−R+]−1[Su+R−Sd]{eiωb
While for case 2, Δz−Δz′>0, similar algebra gives:
GL(x−x′,y−y′,z/z′)=∫J0(uω/r−r′|)[1−R−R+]−1R+Su+Sd[eiωb
For case 1 this can be rewritten as:
GL(r−r′,z/z′)+A∫u=0∞C(u:r−r′){Sueiωb
where the coefficient function C(u:r−r′) is given by:
For case 2, GL can be written as:
GL(x−x′,y−y′,z/z′)=∫J0(uω/r−r′|)−1{R+R−Sueiωb
Finally, using the definitions for Su and Sd and recognizing that products such as Sdeiωb
Sdeiωb
the operator GL turns out (after rearrangement of the above equations) to be the sum of a convolution and a correlation kernel:
GL(r−r′,z/z)=GR(r−r′,z+z)+Gv(r−r′,z−z′).
Case I has (Δz−Δz′)≦0, where Δz=z−zsc and Δz′=z′−zsc with zsc being the z co-ordinate of the top part of the layer that contains the scatterer. In fact GR and GV turn out to be given by the following formulae (they appear to differ from the integrals above because of the rearrangement that leads to the decomposition into convolutional and correlational parts):
GR(r−r′, Δz/Δz′)=A ∫u=0∞J0(uω|r−r′|)C(u){R+e−iωb
GV(r−r′, Δz/Δz′)=A ∫u=0∞J0(uω|r−r′|)C(u){R+R−e−iωb
where, now:
C(u)=(1−R+R−)−1
R− and R+ are the recursively defined reflectivity coefficients described in Muller's paper, u is the horizontal slowness,
Recall that Snell's law guarantees that u will remain constant as a given incident plane wave passes through several layers.
ω is the frequency
bsc is the vertical slowness for the particular layer hosting the scattering potential
Case II consists of the case where (Δz−Δz′≧0. The GR and GV are now given by the following:
GV(r−r′, Δz/Δz′)=A ∫u=0∞J0(uω|r−r′|)C(u){R+R−e−iωb
and
GR(r−r′, Δz/Δz′)=A ∫u=0∞J0(uω|r−r′|)C(u){R−e−iωb
These expressions can be combined into one equation by the judicious use of absolute values. The correlational part of the Green's operator can be transformed into a convolution by a suitable change of variables. The resulting Green's function for imaging inhomogeneities residing within Ambient Layered Media must be quantized by convolving with the sinc (really Jinc) basis functions described below. This is done analytically and the result iS given below. The same process is carried out as detailed above, in order to determine the P-SV total wavefield (the 2 by 2 matrix case), and is not repeated.
The matrix case is handled in exactly the same manner, with allowance for the differing algebra associated with matrices, the preservation of the convolution is preserved for the same reason as shown above in the scalar situation.
On avoiding convergence problems due to the presence of derivatives in the elastic Green's function (i.e. with shear wave motion)
One difficulty with this formulation is the presence of four derivatives acting upon the acoustic free space Green's function in the construction of the dyadic G(LAY). This problem we have overcome (see related code in Appendix E) by the following method:
Given an inhomogeneous distribution of isotropic density ρ and Lame parameters λ and μ, imbedded in a homogeneous medium with density ρ0 and Lame parameters λ0, and μ0, the total infinitesimal displacement vector u satisfies the partial differential equation
ω2ρui+(λujj)i+[μ(uij+uji)]j=0 (1)
while the incident displacement field ui satisfies
ω2ρuii+(λ0ujji)i+[μ0(uiji+ujii)]j=0 (2)
where ρ0, λ0, and μ0 are the homogeneous parameters of the imbedding medium. Subtracting (2) from (1) and rearranging results in
for the scattered displacement field us=u−ui. The inhomogeneous term f is given by
where the respective shear-wave and compression-wave velocities cs and cp and corresponding wave numbers ks and kp are given by
for the imbedding medium. Introducing the scattering potentials
Solution of (3) by the introduction of the free-space Green's function results
uis=Gij*fi=∫vfi(r′)Gij(F−r′)dv′ (8)
where * denotes 3-D convolution, v is the support of the inhomogeneity, and the Green's function is given by [Aki and Richards, Quantitative Seismology, Freeman and Co., 1980, herein included as a reference]:
where g(kR) is the scalar Helmholtz Green's function
where e−iωt time dependence has been assumed. Inserting (7) into (8) and integrating by parts yields the following integral wave equation:
uis=Gij*{γρuj}+Gij,j*{γλuk,k}+Gij,k*={γu(uj,k+ukj)} (11)
For now, consider the case where γμ=0 and note that:
using
(∇2+k2)g(kR)=−δ(|r−r′|) (13)
reduces (12) to:
which will henceforth be denoted Ci. We have now arrived at the integral equation:
since ukk=∇·u
The integral equation (15) can be solved by application of the 3-D FFT to compute the indicated convolutions, coupled with the biconjugate gradient iteration, or similar conjugate gradient method. [Jacobs, 1981]. One problem with (15) is, however, the need to compute ∇·u at each iteration. Various options for this include taking finite differences or the differentiation of the basis functions (sinc functions) by FFT. Our experience with the acoustic integral equation in the presence of density inhomogeneity indicates that it is best to avoid numerical differentiation. Instead, the system is augmented as:
Iterative solution of (16) for the four unknown fields ux, uy, uz, and ∇·u now involves no numerical differentiation of u, since all derivative operators have now been applied analytically to the Green's function. The incident component ∇·ui is assumed known. Equation (16) can be written symbolically as
Ui=(I−G[γ]diag)U (18)
where Ui and U are the augmented 4-vectors in Eq. (16), [γ]diag is the diagonal operator composed of γρ and γλ, and is the 4×4 dyadic Green's function in Eq. (16).
Inclusion of γμ Scattering to Give the General Elastic-wave Integral Equations We now give a method for solving integral equations in the general case for the inhomogeneous material properties ρ, λ, and μ.
Because the inclusion of the more complicated term in γμ. causes the matrix notation used above to require breaking the equations into parts on several lines, we elect for efficiency of space to use the more compact tensor notation. This should cause no difficulty because the translation is clear on comparing the special case of γμ=0 in the general tensor equations which follow with the previous matrix notation for this same case.
First we give again (see Eq (11)) the integral equations (which in practice have been discretized by sinc basis functions) for inhomogeneous ρ, λ, and μ. Here * means convolution (the integration operation):
uis=Gij*{{γρuj}+Gij,j*[γu(uj,k+ukj)] (19)
We note that the displacement field uj and its derivatives also ujk appear. We choose not to compute derivatives of the fields numerically and avoid the associated numerical instability. Instead, we augment the above equation with additional equations to compute the derivatives in a more stable way by solving for them directly. Thus, solving for (ui,1+u1,i) directly is more efficient and replaces computation of nine fields with only six. Forming the six unique symmetric-derivative pairs, we obtain an integral equation for these pairs.
ui,js+ul,js=(Gij,l+Glj,i)*[γρuj]+(Gij,kl+Gl,j,ki)*[γμ(ujk—ukj)]+(Gij,jl+Glj,ji)*[γλuk,k] (20)
Note that
We now augment the system Eq. (19) for the three components of uis with the system Eq. (20) which has six pairs of unique component derivatives.
These nine equations can also be placed in a matrix form similar to that of the augmented system given in Eq (16). The augmented system of nine components could be solved for γρ, γλ, and γμ, assuming knowledge of the fields ui and derivatives ui,k. Since the fields and derivatives are also unknown, we must solve for them simultaneously with γρ, γλ, and γμ. This is done by adding these constraint equations. These nine equations are composed of the three equations for ui
uii=ui−Gij*(γpuj)−Gij,j*[γλuk,k]−Gij,k* [γμ(uj,k+ukj)] (22)
and the six equations for uj,k+ukj
ui,li+ul,ii=(ui,l−ul,i)−(Gij,l+Glj,i)*[γpuj]−(Gij,kl+Glj,ki)*[γμ(ujk+ukj)]−(Gij,jl+Glj,ji)*[γλuk,k] (23)
These 9 equations can also be placed in a matrix form similar to that of the augmented system given in Eq (16). Solving these twelve equations (Eq. (19, 22, 23)) for (γρ, γλ, and γμ.), the fields, and their derivatives is accomplished using the Frechet derivative methods described next. As in Eq. (16) for the μ=μ0 case, Eq. (23) can be written symbolically as
Ui=(I−G[γ]diag)U (24)
where the operators and [γ]diag are now 9×9, and the vectors U and Ui consist of the three components of u and its six symmetric derivative sum-pairs.
Cylindrical Coordinate Methods for Numerical Solution of the SIE
The above examples all use the FFT-BCG algorithm for imaging. This method is one of three methods discussed in this patent. The other two methods are
(1) The cylindrical coordinate recursion method (Cylindrical Recursion)
(2) Rectangular recursion of scattering matrix method (Rectangular Recursion for short reference)
We now discuss the cylindrical recursion method:
7.1 Cylindrical Coordinate SIE Formulation and Discretization
We begin with the acoustic scattering integral equation (SIE) for the constant density case:
Let each field be expanded in a Fourier series in the angular variable:
Then using Graf's addition theorem:
results in the cylindrical coordinate form of (1.1):
where the kernel is
separably symmetric:
Henceforth assume that k0=1. Discretizing the radial integral by trapezoidal rule using an increment of Δ results in:
Notice that the extra 1′ resulting from the ρ′ in the integral in (1.3) has been placed in the definition of (γf)1′n′. Equation (1.5) is the final discrete linear system for the solution of the scattering integral equation in cylindrical coordinates. It can be rewritten as:
where the vector components are the Fourier coefficients:
where, or, equivalently,
{fl}n=fl,nn=−N, . . . ,N
where N is the range of the truncated Fourier series (The vectors are length 2N+1). The notation [x] denotes a diagonal matrix formed from the vector elements:
and the notation [γ1′*] denotes a convolution (including the 1′ factor):
Writing (1.8) out in full gives the matrix equation:
Notice that the kernel is composed of a L×L block matrix with 2N+1×2N+1 diagonal matrix components and that the L×L block matrix is symmetric-separable, i.e.:
where Lnm is one of the L×L component matrices.
For reference in the next section, we make the definitions:
7.2 Recursive Solution of the Cylindrical Coordinate SIE
The recursion for the 2-D cylindrical equation is:
Initialize:
pL=0 (2.1)
{sL}n=fLns/hL,n
For l=L, . . . ,1
sl−1=s1−[j1][γ1*]{[h1]s1+[j1]p1}
pl−1=p1−[h1][γ1*]{[h1]s1+[j1]p1}
Next l.
Thus far, we have assumed that the number of angular coefficients is the same for each radius l. In fact, the number of angular coefficients should decrease as l decreases. We find that for Δ=λ/4, an accurate interpolation is achieved with Nl=2l−1, n=−Nl, . . . , Nl Fourier coefficients. To introduce this modification, we must slightly define the operators as:
where the vector operated on is always length N where N is now the maximum number of Fourier coefficients, N=NL. The total field at each layer, 1, is given by:
In order to eliminate the need to know the starting values, sL, we note that at each iteration, sl and pl are linear functions of sL:
sl=AlsL+bl,pl=ClSL+dl (2.5)
where the matrices Al, C1 (dimension 2N+1×2N+1) and the vectors b1, d1 (dimension 2N+1) have the initial values:
AL=I,CL=0,b1_b =0,d1=0 (2.6)
Using (2.5) and (2.6) in (1.1) and equating common terms leads a matrix and a vector recursion:
Initialize:
AL=I,CL=0 bL=0,dL=0 (2.7)
for l=L, . . . ,1
Al−1=Al−[jl][γl*]{[hl]Al+[jl]Cl}
bl−1=bl−[jl][γl*]{[hl]bl+[jl]dl+fli}
Cl−1=Cl−[jl][γl*]{[hl]Al+[jl]Cl}
dl−1=dl−[jl][γl*]{[hl]bl+[jl]dl+fli}
Then using the fact that SL=0 leads to the solution:
AosL+bo=0,sL=−Ao−1bo,fL,ns=sL,n/hL,n (2.8)
for the scattered field at the outer boundary. Iteration (1.1) with (1.4) can then be used to evaluate the total field internal to the object.
Notice that the LHS matrix recursion in (2.7) is independent of the incident field. Once it is used to compute A.sub.0, the RHS vector recursion can be done for any number of incident fields. Concurrent solution for any number of incident fields can be obtained by replacing the RHS vector recursion with the matrix recursion:
BL=0,DL=0 (2.9)
Bl−1=Bl−[ji][γl*]{[hl]Bl[jl]Dl+Fli}
Dl−1=Dl+[hl][γl*]{[hl]Bl+[jl]Dl+Fli}
where the matrices Bl, Dll are 2N+1×Nv where Nv is the number of views and the matrix of incident fields, Fli is given by:
where the superscript after i is the view number (note that although the incident matrix is written as 2N+1×Nv, the entries are zero for the row index N1<|n|≦N) . A more compact recursion can be obtained by concatenating the two recursions to give:
GL=[Al,BL]=[I,O],HL=[CL,DL]=[0,0] (2.11)
For l=L, . . . ,1
Gl−1=Gl−[ji][γl*]{[hl]Al+[jl]Cl+[0,Fli]}
Hl−1=Hl+[hl][γl*]{[hl]Al+[jl]Cl+[0,Fli]}
where the matrices Gl, Hl are 2N+1×2N+1+Nv and in the notation [A,B] the first matrix is 2N+1×2N+1 and the second is 2N+1×Nv Then G0=[A0,B0] and the solution for all views is given by:
A slightly modified recursion can be shown to yield the scattering matrix of the object. Recall that the total field is given by:
fl=[hl]si+[jl]pl+fl
Any externally generated incident field can be expanded in a bessel series which implies that there exists a sequence, g−Ni, . . . ,gNi such that:
(recall that we are assuming that k0=1) which means that
fli=[jl]gi (2.15)
Redefining pl→pl+gi then gives fl=[hl]sl+[jl]pl and leads easily to the iteration:
GL=[I,O],HL=[0,I] (2.16)
For l=L, . . . ,1
Gl−1=Gl−[ji][γl*]{[hl]Al+[jl]Cl}
Hl−1=Hl+[hl][γl*]{[hl]Al+[jl]Cl}
where the matrices are now 2N+1×2(2N+1). The last iterate G0=[A0,B0] yields the scattering matrix:
S(γ)=A0−1B0 (2.17)
for the body γ which relates the incident field coefficients to the scattering coefficients: gs=Sgi where
for all incident field coefficient vectors, gi. Notice that in the previous notation:
gni=fL,ni/jL,n,gns=fL,ns/hL,n (2.19)
7.3 Computational Formulas for the Jacobian and its Adjoint
In order to apply the Gauss-Newton iteration to the solution of the imaging problem we must first derive recursive formulas for the application of the Jacobian and its adjoint. The Jacobian of the scattering coefficient vector, s.sub.L, operating on a perturbation in γ is given by:
From (2.8) we get:
s′L=−A0−1(b′0+A′0sL) (3.2)
where the prime denotes differentiation followed by summation over the elements of δγ
Equation (3.2) provides the formula for computing J.delta.γ is recursive formulas for A′0 and b′ can be found. Define the notation:
(note the lower limit of the l′ summation). Then using the LHS recursion in (2.7), it is simple to show that:
A′l−1=A′l−[jl]{[γl*{ht]A′l+[jl]C′l}+[δγl*]{[hl]Al+[jl]Cl}} (3.5a)
C′l−1=C′l+[hl]{[γl*{ht]A′l+[jl]C′l}+[δγl*]{[hl]Al+[jl]Cl}} (3.5b)
A further reduction in computational requirements can be achieved by noting from (3.2) that we do not need A′0 but rather A′0SL where SL is the matrix whose columns are the sL,s for each view. The matrix A′0 is 2N+1×2N+1 while the matrix A′.)SL is 2N+1×Nv. Thus define:
A′ls≡A′lSL,C′ls≡C′lSL,Als≡AlSL,C′ls≡ClSL (3.6)
Then we get the recursion:
ALs=ALSL=ISL=SL,CLs=0,A′ls=0,C′Ls=0 (3.7a)
For l=L, . . . ,1
A′l−1=A′ls−[jl]{hl]A′ls+[jl]C′ls}+[δγl*]{[hl]Als+[jl]Cls}} (3.7b)
C′l−1=C′ls+[hl]{hl]A′ls+[jl]C′ls}+[δγl*]{[hl]Als+[jl]Cls}} (3.7c)
A′l−1=A′ls−[jl]{hl]A′ls+[jl]C′ls} (3.7d)
C′l−1=C′ls+[hl]{hl]A′ls+[jl]C′ls} (3.7e)
Similarly, for the b's and d's we get the recursion:
BL=0,DL=0,B′L=0,D′L=0 (3.8a)
For L=L, . . . ,1
B′l−1=B′l−[ji]{[γl*]{[hl]Bl+[jl]D′l}+[δγl]{[hl]Bl+[jl]Dl+Fli}} (3.8b)
D′l−1=D′l+[hi]{[γl*]{[hl]B′l+[jl]D′l}+[δγl]{[hl]Bl+[jl]Dl+Fli}} (3.8c)
Bl−1=Bl−[ji][γl*]{[hl]Bl+[jl]Dl+Fli} (3.8d)
Dl−1=Dl−[ji][γl*]{[hl]Bl+[jl]Dl+Fli} (3.8e)
where the matrices are all 2N+1×Nv The final Jacobian is then:
S′L=−A0−1(B′0+A′0s) (3.9)
where the columns of S′L are the vectors Jnδγ for each view, n=1, . . . , Nv. We can of course concatenate these two recursions to give:
GL=[SL,0],HL=[0,0],G′Ls=[0,0],H′L=[0,0] (3.10a)
For l=L, . . . ,1
G′l−1=G′l−[ji]{[γl*]{[hl]G′l+[jl]H′l}+[δγl*]{[hl]Gl+[j1]Hl+[0,Fli]}} (3.10b)
H′l−1=H′l+[hi]{[γl*]{[hl]G′l+[jl]H′l}+[δγl*]{[hl]Gl+[j1]Hl+[0,Fli]}} (3.10c)
Gl−1=Gl−[ji][γl*]{[hl]Gl+[j1]Hl+[0,Fli]} (3.10d)
Hl−1=Hl+[hi][γl*]{[hl]Gl+[j1]Hl+[0,Fli]} (3.10e)
where, at the last iterate, G′0=[A′0s,B′0]. The matrices (G's and H's) in this recursion are all 2N+1×2Nv. This is the form of the Jacobian recursion used in the imaging programs.
Rectangular Scattering Matrix Recursion
This section describes a new recursive algorithm the used scattering matrices for rectangular subregions. The idea for this approach is an extension and generalization of the cylindrical coordinate recursion method discussed in the previous section. The computational complexity is even further reduced over CCR. The CCR algorithm derives from the addition theorem for the Green's function expressed in cylindrical coordinates. The new approach generalizes this by using Green's theorem to construct propagation operators (a kind of addition theorem analogue) for arbitrarily shaped, closed regions. In the following, it is applied to the special case of rectangular subregions, although any disjoint set of subregions could be used.
Consider two rectangular subregions A and B of R.sup.2 as shown in
Let the incident field due to sources external to boundary C, evaluated on boundary A be denoted fAi and similarly define fBi. For the total problem (A and B both containing scatterers) there exists a net, inward moving field at boundary A. Denote this field fAin and similarly define fBin. total field leaving boundary fAout=SAfAin. Knowledge of the radiated field on a closed boundary due to internal sources allows the field external to the boundary to be computed at any point. Let the operator that maps fAout onto the boundary B be denoted TBA (rectangular translation operator from A to B). Similarly denote the operator mapping fBout to boundary A be denoted TAB.
The total, inward moving field at boundary A has two parts—that due to the incident field external to C and that due to sources internal to boundary B. From the forgoing definitions, it should be obvious that the inward moving fields satisfy:
fAin=FAi+TABSBfBin (1a)
fBin=FBi+TBASAfAin (1a)
Solving for the total inward moving fields gives:
The total scattered field at boundary A has two components—one from inside A and one from inside B. It should be obvious that the total scattered fields at boundaries, A and B are given by:
Combining (2) and (4) gives:
Assuming that the scattering operators are invertible, then we have the equivalent form:
The scattered field on the boundary C can be obtained from fAs and fBs by simple truncation (and possible re-ordering depending on how the boundaries are parameterized). Let the operator that does this be denoted:
Let the incident field on boundary C due to external sources be denoted fci There exist operators DA and DB that operate on fCi to give fAi and fBi (similar to the external translation operators TAB and TBA). Thus:
from which we see that the scattering matrix for boundary C is given by:
Equation (9) then gives the core computation for our rectangular scattering matrix recursion algorithm. Technical details concerning the existence and discrete construction of the translation and other needed operators has been omitted in this write-up. We have, however written working first-cut programs that perform (9).
An O(N3) Algorithm for Computing all Scattering Views Based on Rectangular Scattering Matrix Recursion
Consider a region containing scattering material covered by an N×N array of rectangular sub regions as shown in
Again, although drawn as disjoint, assume that the subregions touch. Assume that the scattering matrix for each subregion is known (for example, if each subregion is only λ/2 on edge, the calculation of S is trivial given the material enclosed). Now coalesce 2×2 sets of these scattering matrices into larger scattering matrices (this coalesce of 4 subregions into one is similar to the algorithm defined above for coalescing two subregions). There are N/2×N/2 such 2×2 blocks to coalesce. When done, we have scattering matrices for the set of larger subregions shown in
This process is continued until, at the final stage, the scattering matrix for the total region is computed. Note that the physical parameters of the scattering media (speed of sound, absorption, etc.) are used only in the first stage (computation of the N×N array of scattering matrices).
Assuming that N is a power of two, the algorithm will terminate after log2(N) such stages with the scattering matrix for the whole region. A careful accounting of the computation required reveals that the total computation is O(N3). The end resulting scattering matrix then allows fast calculation of the scattered field anywhere external to the total region for any incident field (angle of view).
Although derived assuming that N is a power of 2, the algorithm can be generalized by including 3×3 (or any other sized) coalescing at a stage, allowing thereby algorithms for any N (preferably N should have only small, say 2,3,5, prime factors). Also, there is no reason that the starting array of subregions cannot be N×M (by using more general nxm coalescing at some stages).
In addition, the existence of layering can be included. If layering occurs above and below the total region so that the total inhomogeneity resides in a single layer, then the algorithm proceeds as before. Once the total region scattering matrix has been obtained, its interaction with the external layering can be computed. If inhomogeneous layer boundaries lie along horizontal borders between row of subscatterers, then the translation matrices can be modified when coalescing across such boundaries, properly including the layer effects. This is an advantage over our present layered Green's function algorithms which require that the inhomogeneity lie entirely within a single layer (This can be fixed in our present algorithms but at the expense of increased computation).
The O(N3) computation of this approach is superior to the O(N4log2(N)) computation of our original FFT-BCG approach and our present recursion based on cylindrical coordinates which is OO(N3log2(N).
Modeling System Transfer Function Including Driving Voltage, Transmitting Transducers Receiving Transducers, Preamplifiers, Analog to Digital Converter Etc.
Let the transfer function of the transmitting waveform generator, power amplifier, transmitting multiplexer, transmitting transducer, ocean/sediment, receiving transducers, preamplifiers, differential amplifier, differential waveform generator, and analog to digital converter be, respectively, TTWG, TPA, TTM, TTT, TO/S, TRT, TRM, TDA, TDWG, and TDAC. These separate transfer function can be identified with the system hardware. Then the total transfer function is:
Ttotal=TDAC(TDA1 TRM TRT TO/S TTT TTM TPA TTWG-TDA2 TDWG)
Note that the term TDA2 TDWG is subtracted in order to remove direct path energy and to remove reverberations in the transducers and the platform; this subtraction effectively increases the analog to digital converter dynamic range. The signal in the differential waveform generator is programmed to produce a net zero signal output from the analog to digital converter for the case of no sediment present.
Recall that the equation for the scattered field f(sc) (from the sediment) at a transducer (not the output voltage) at a given temporal frequency is given in terms of the transducer-to-sediment Green's function D, the sediment's acoustic properties γ and the internal field in the sediment f by
f(sc)=Dγf
The field (at a given temporal frequency) within the sediment itself f is given in terms of the incident field f(inc), the sediment's acoustic properties γ, and the sediment-to-sediment Green's function C by
f(inc)=(I−Cγ)f.
On combining these two equations we eliminate the internal field and find the scattered field in terms of the incinet field and the sediment properties
f(sc)=Dγ(I−Cγ)−1f(inc)
These equations involving C and D are true for both the free space Green's function [“Nonperturbative Diffraction Tomography Via Gauss-Newton Iteration Applied to the Scattering Integral Equation, Borup, D. T. et al., Ultrasonic Imaging] and our newly developed layered Green's functions [Johnson, S. A., D. T. Borup, M J. Berggren, Wiskin, J. W., and R. S. Eidens, 1992, “Modelling of inverse scattering and other tomographic algorithms in conjunction with wide bandwidth acoustic transducer arrays for towed or autonomous sub-bottom imaging systems,” Proc. of Mastering the Oceans through Technology (Oceans 92), 1992, pp 294-299.]. We now combine the scattering equations with the transfer functions. First, identify f(sc) with TO/S TTT TTM TPA TTWG and f(inc) with TTT TTM TPA TTWG. Next we note that measuring f(inc) is a direct way of finding the product TTT TTM TPA TTWG. We also note that Ttotal can be written as
Ttotal(γ,TTWG)=TDAC(TDA1 TRM TRT Dγ(I−Cγ)−1 TTT TTM TPA TTWG-TDA2 TDWG).
Then Ttotal (γ,TTWG) is a nonlinear operator that transforms (γ,TTWG into recorded signals Ttotal-measured. Thus, for a given set of Ttotal-measured measurements and for a given TTWG, we may in principal find γ by a nonlinear inverse operator
γ=Ttotal−1(Ttotal-measured(γ,TTWG))
Since the exact form of Ttotal (γ,TTWG) is not known, we find γ by a Gauss-Newton iteration method. This requires that the Jacobian of Ttotal(γ,TTWG) be computed. The Jacobian is readily computed in closed form [Borup, 1992] and is given by
J(γ)=−∂Ttotal(γ(n),TTWG)/∂γJ(γ).
Then the Gauss-Newton iteration for computing γ is given by: (1) set n=1, estimate a value for γ(n); (2) compute J(γ(n)); (3) solve
JT(γ(n))J(γ(n))δγ(n)=−JT(γ(n))(γ(n))[Ttotal-measured−Ttotal(γ(n),T
for δγ(n); (4) update γ(n) by the formula γ(n+1)=γ(n)+δγ(n); (5) if
[Ttotal-measured−Ttotal(γ(n),TTWG) <∈
then set γ=γ(n+1) and quite, else go to step 2.
The extra dynamic range provided by the differential waveform generator/analog to digital converter circuit raises questions as to the optimal setup procedure (e.g. how many bits to span the noise present with no signal). We have modeled the signal to noise ratio that can be achieved by a beamformer which delays and sums multiple channel signals, each channel being such a circuit. We find as a rule of thumb, that the lowest order one or two bits should span either the noise or the signal, depending which ever is smaller (i.e., for signal to noise ratios greater than unity the noise should be spanned, but for signal to noise ratios less than unity the signal should be spanned). Upon using commercial 16 or 18 bit analog to digital converters this method may well extend their range to 20 bits or more.
2. Model Electrical Crosstalk, Acoustic Crosstalk
Electrical cross talk can be removed by use of knowledge of the cross coupling matrix M. Let Vn(true) be true electrical signal at transducer n and let Vm(meas) be the measured signal at transducer m. Then Vn(meas)=MnmVm(true). We observe for small cross talk that matrix M has the form M=D2(I+∈)D1, where I is the identity matrix, D1 and D2 are diagonal matrices and ∈ is the differential cross talk matrix whose elements are small in value. We seek Vn(true)=M−1Vn(meas). By the binomial theorem (I+∈)−1≈(I−∈). Thus, M−1≈D1−1(I−∈)D2−1. Once D1, D2, and M are measured the problem of removing cross talk is quite inexpensive numerically (even if ∈ is not small the exact inverse M−1 can be computed once and stored). If the matrix M turns out to be noninvertible (as can be the case for large magnitude coupling) then we can alternatively concatenate M onto the inverse scattering equation to give:
vωφ(meas)=MωPωGω[fωφ]γ
to which the inverse scattering algorithm can be directly applied.
We believe that cross talk can be removed by good fabrication techniques including careful shielding. Nevertheless, the above numerical method can be used if necessary.
Acoustic cross talk can be removed by several methods: (1) acoustic baffling of each transducers; (2) calibration of individual (isolated) transducers, computing the acoustic coupling in the array from wave equations methods, then inverting the model by a cross talk matrix as above; (3) direct measurement of the coupling in a finished array to find the cross talk matrix and then inverting as shown above.
A more difficult type of cross talk to remove is the direct mechanical coupling between transducers. This problem will be attacked by using vibration damping techniques in mounting each transducer on the frame. We believe that such damping methods will eliminate direct mechanical coupling. As a backup we note that modeling of the coupled system are theoretically possible and has been successfully accomplished by the university's AIM Lab for circular mounting geometries (by derivation of a new “total system Green's function” for the imaging system that that includes cross coupling between elements).
Inclusion of Transducer Coupling
In the event that significant coupling exists between the object to be imaged and the transducers (and/or coupling between transducers is not negligible), a computationally efficient means of incorporating this coupling into the inverse scattering algorithm is needed. Equivalently, the transducers must be included as part of the scattering problem This section details a computational algorithm for achieving this incorporation.
Consider an object to be imaged, γ, illuminated by a transmitter, T, with the scattering received by a receiver, R, as shown in
Let S be the scattering matrix of γ which, given the incident field generated from sources outside of C, gives the outward moving scattered field evaluated on C. This operator can be computed by solving a sufficient number of forward scattering problems for the object γ.
Let PR denote the operator that computes the field impinging on R due to sources inside of C from the scattered field evaluated on C. This is a simple propagation operator computable by an angular spectrum technique.
Let PT denote the operator that computes the field impinging on T due to sources inside of C from the scattered field evaluated on C. This is a simple propagation operator computable by an angular spectrum technique.
Let ART denote the operator that computes the field impinging on R due to scattering from T (it operates on the net total field incident on T). This operator is computed by a moment method analysis of the transmitter structure.
Let ATR denote the operator that computes the field impinging on T due to scattering from R (it operates on the net total field incident on R). This operator is computed by a moment method analysis of the receiver structure.
Let BT denote the operator that computes the field on C due to scattering from T (it operates on the net total field incident on T). This operator can be computed by a moment method analysis of the transmitter structure.
Let BR denote the operator that computes the field on C due to scattering from R (it operates on the net total field incident on R). This operator can be computed by a moment method analysis of the receiver structure.
Assume that the transmitter T also produces a field, fi, due to eternally applied excitation (electrical). Denote by fci the values of this field on C. Denote by fRi the values of this field on the receiver surface. We assume that these field values are known (i.e., we know the free-space radiation characteristics of the transmitter).
Given these definitions, the total field incident from outside of C evaluated on C is given by:
fctot=fci+BTfTtot+BRfRtot 1
where the superscript tot indicates the field incident on the particular element due to all other sources. For T and R we have:
fTtot=PTSfctot+ATRfRtot 2
fRtot=fRi+PTSfctot+ARTfTtot 3
Note that the formula for fTtot has no superscript i term since the incident field emanates from its interior. Solving 1-3 for the tot fields gives:
The size of this matrix operator is O(N×N) and so computation of its inverse does not require much CPU time (mere seconds). In order to compute the signal received by the receiver transducer, we take fRtot computed in 4 and compute the surface currents (EM case) or surface velocities (acoustic case) from which the signal from the receiver can be computed.
This procedure for analyzing a scatterer in the presence of coupling between the T/R pair includes all orders of multiple interaction allowing transducers with complex geometries to be incorporated into our inverse scattering algorithms.
Frequency Dependent Scattering Parameters
Throughout the previous sections it has been assumed that y is independent of frequency. Suppose now that γ is a function of frequency. In the event that only a single frequency is needed (transmission mode with encircling transducers and only one complex parameter to be imaged) this is not a problem—the algorithm will simply image 2 or 3-D distribution of y evaluated at that frequency. However, in reflection mode or when imaging multiple parameters, multiple frequencies are needed. This increases the number of unknowns to Ω*Nx*Ny if we naively seek a separate image at each frequency. Since multiple frequencies were already needed to complete the data for a frequency independent unknown, we have no way of correspondingly increasing the number of equations by a factor of Ω. Instead, consider approximating the frequency variation with a set of parameters at each pixel:
γnm(ω)≈γnm(0)Ψ(0)(ω)+γnm(1)Ψ(1)(ω)+ . . . +γnm(q−1)Ψ(q−1)
where the basis functions Ψ are selected based on the physics of the frequency dependence (we may, for example, simply use the monomial basis: Ψ(n)(ω)=ωn, perhaps a rational function basis might be chosen since simple relaxation dispersion is a rational function). The formula for the GN update for this case is:
PωGω((I−[γω]Gω)−1[fωφ]Mωjδγ(i)=−Tωφ)
where summation over j is assumed and the matrix M is given by:
Solution for the q×Nx×Ny unknowns, assuming that q is sufficiently smaller than Ω can then carried out via the GN-MRCG or RP algorithms. We have already verified the success of this approach using quadratic polynomial models of the frequency variation
(q=3, Ψ(n)(ω)≡ωn).
We have discussed 3 separate algorithms for imaging:
(1) FFT-BCG
(2) Cylindrical Recursion
(3) Rectangular Recursion
We have given examples of how this technology is used in various scenarios. We have shown how various modalities of wave propagation are amenable to the imaging algorithm. Any propagating substance or energy that can be successfully modelled with the wave equation is suitable. We have also included examples of simulated and actual data, thereby establishing the concept as workable.
The apparatus and method of the present invention holds promise for many useful applications in various fields, including seismic surveying, nondestructive testing, sonar, radar, ground penetrating radar, optical microscopes, x-ray microscopes, and medical ultrasound imaging, to name just a few. For purposes of illustrating the utility of the present invention, the detailed description which follows will emphasize the apparatus and method of the invention in the context of a system for use in performing ultrasound imaging of human organs, such as the breast. However, it will be appreciated that the present invention as claimed herein may employ other forms of energy such as microwaves, light or elastic waves and furthermore may be used in other fields and is not intended to be limited solely to medical acoustic imaging.
1. The Scanner and Transducer Configuration
Reference is first made to
A stepping motor 46 mounted within the fixed base 32 is joined by a shaft 48 to a small pinion gear 50. Pinion gear 50 engages a large drive gear 52. Pillars 54-57 are rigidly ioined at one end to the top of drive gear 52 and at the opposite end to the underside of movable carriage base 34. Bearing block 58 supports drive gear 52 and movable carriage base 34.
Stepping motor 46 may be operated to turn the drive gear 52 which in turn will cause the movable carriage base 34 to rotate on top of the cylindrical pillar 36 within the tracks defined by shoulders 42-45. As hereinafter more fully described, rotation of the movable carriage base 34 may be employed to insure that an obiect is fully scanned from every possible angle.
With continued reference to
The ring of transducer arrays 70 is electrically connected through line 74 to components of an electronic system which may be housed in part within the chamber 64, as schematically indicated at 76. As hereinafter more fully described, the electronic system is used to control transmission and reception of acoustic signals so as to enable reconstruction therefrom of an image of the object being scanned.
Circular bracket 78 is attached to the top of the outer wall 62 of movable carriage base 34. A flexible, transparent window 80 extends between circular bracket 78 and the inner cylindrical wall 60 so as to enclose the transducer arrays 70 and stepping motor 66 within the chamber 64. The length of flexible window 80 is greater than the distance between bracket 78 and inner cylindrical wall 60. Window 80 thus serves as a flexible yet water-tight seal which permits vertical motion of the transducer arrays 70 for purposes of vertical focusing. Acoustically transparent window 80 may be made of any suitable material, such as plastic or rubber.
A stationary water tank generally designated 86 is adapted to fit within the movable carriage base 34. Water tank 86 consists of a fixed top plate 88 rigidly attached to vertical support bars 82 and 84. Support bars 82 and 84 are mounted on the fixed base 32. The length of support bars 82 and 84 is chosen such that the fixed top plate 88 of water tank 86 will be slightly suspended above the bracket 78 of movable carriage 34. Thus, a space 87 is provided between bracket 78 and fixed top plate 88. Additionally, a space 89 will be provided between side 94 and bottom 95 of water tank 86 and cylindrical wall 60 and bottom 61 of movable carriage 34. A third support bar 83 extends through a central hole (not shown) provided in block 58 and drive gear 52. Support bar 83 also extends through a watertight opening 85 provided in the bottom 61 of movable carriage 34. Support bar 83 thus helps to support water tank 86 in spaced relation from movable carriage 34. Since water tank 86 is suspended in spaced relation from movable carriage base 34, water tank 86 will remain stationary as movable carriage 34 is rotated. As hereinafter more fully described, rotation of the carriage 34 permits the transducer arrays 70 to scan the object 98 from every possible position around the object 98.
Fixed top plate 88 has a short downwardly extending lip 90 which extends over the end of circular bracket 78. A rubber-covered window 92 extends between the lip 90 and side 94 of the water tank. Window 92 encloses within space 89, water 97, or some other suitable acoustic medium so as to acoustically couple the transducer array 70 to the water 96 contained in tank 86. The rubber-covered window 92 also permits acoustic energy signals to be transmitted therethrough by the transducer arrays 70 and insures that the patient will be protected in the event window 92 should be broken.
The scanning apparatus generally described above may be employed to scan various parts of the human anatomy as, for example, a patient's breast, as schematically illustrated at 98. The scanning apparatus generally described above may be used for nonmedical applications and may employ acoustic or electromagnetic energy wavefields. For example, the same scheme may be used with acoustic transducers for scanning nonbiological specimens such as plastic parts. As further example, the same scheme may be used with microwave wavefields. In this latter case, the use of a coupling medium such as water 96 and water 97 is optional. Indeed, at frequencies over one GH.sub.z, the microwave absorption of water limits the size of the apparatus to a few centimeters. However, the use of air permits the use of electromagnetic energy over the spectrum from DC to far ultraviolet and soft x-rays. For microwave imaging, the ring of acoustic transducer arrays 70 is replaced with a corresponding ring of microwave transducer (i.e. antennas).
Reference is next made to
Each of the transducer arrays 100-115 may be any of several well-known types of transducers. For example, transducers 100-115 may be piezoelectric transducers which produce ultrasound energy signals directly from high-frequency voltages applied to the transducer. Alternatively, the transducer arrays 100-115 may be magnetostrictive transducers having a magnetic coil (not shown) which receives the electrical oscillations and converts them into magnetic oscillations which are then applied to the magnetostrictive material to produce ultrasound energy signals.
With continued reference to
Commutation of the transmitter arrays 100-107 permits acoustic energy to be transmitted from every possible position about the object, thus insuring that the data received (i.e. scattered acoustic energy) is complete. Commutation of the receiver arrays 108-115 insures that all spaces between receiver arrays 108-115 (known as “sound holes”) will be covered, thus providing for accurate collection of all acoustic energy that is transmitted through or scattered by the object 98. However, commutation of the receiver arrays 108-115 is not necessary where transmitter arrays 100-107 are also used to receive acoustic signals. The circular configuration of transducer arrays 100-115 permits certain parts of the body to be scanned which would otherwise be inaccessible because of bones or other obstructions of the tissue.
The method for commuting the arrays 100-115 is best understood by reference to
After rotation of the arrays 100-115 to a second position, each of the transmitter arrays 100-107 is again sequentially triggered and data are again collected through receiver arrays 108-115. This procedure is repeated until acoustic energy has been transmitted at each possible point about the object.
Where the arrays 100-107 are used only for transmitting acoustic energy, a second series of rotations must then be effected to cover the sound holes between each pair of receiver arrays 108-115. For example, by rotating transmitter array 101 to the position occupied by the transmitter array 100, receiver arrays 109, 111, 113 and 115 will, because of their longer arcuate (arcwise measured) length, cover the spaces previously occupied by transmitter arrays 101, 103, 105 and 107. This procedure is repeated until all sound holes have been covered. It should be noted that for a fixed circumference by decreasing the length of each array and increasing the number of arrays, electronic commutation may be used to reduce the angle through which the ring of transducer arrays must be rotated to achieve complete collection of both echo and transmission data.
It should also be noted that, in principal, no mechanical rotation of the array of detectors is necessary if every element is small enough and can be made to act as either a receiver or transmitter. Such an arrangement is more expensive than the technique illustrated in
The transducer array 70 generally describes an implementation for an acoustic scanner and, in particular, of a breast scanner. In this case, the transducer elements generate and detect acoustic energy wavefields. However, can be seen that the acoustic transducers can be replaced with microwave transducers (antennas). The functions of interleaving, rotation, commutation are identical, for either microwave or acoustic transducers.
2. The Electronic System
Reference is next made to
As hereinafter more fully described, the electronic system generates the acoustic energy that is propagated through and scattered by the object 98. The electronic system thereafter detects and processes the acoustic energy signals that are scattered by and transmitted through the object 98, and then communicates the processed signals to a computer (CPU) which interprets the signals and outputs the result in the form of a visual display or printed output.
In the transmission mode, CPU 118 causes an oscillator 128 to output a waveform which is amplified by a power amplifier 130 before being sent through multiplexer 132 to one of the transmitters. CPU 118 controls the multiplexer 132 so as to sequence each transmitter array 100-107 used to generate the acoustic energy propagated through the acoustic medium and object. If desired, after it is amplified, the waveform can also be input to an impedance matching transformer (not shown) and to a series of RC or RLC networks connected in parallel across the transmitter arrays 100-107 as illustrated and described in U.S. Pat. No. 4,222,274 (hereinafter the Johnson '274 patent), which is incorporated herein by reference. The impedance matching transformer may be used to achieve maximum power transfer while the RC or RLD networks may be used to distribute power across each transmitter array 100-107 in a way that deceases the side lobes in the transmitted signal.
Each of the acoustic receivers 108-115 (
The operation and components of phase detector 138 are best illustrated in
Once the received signals have been digitized, they are input and stored in the memory of an array processor 120. Alternately, a parallel processor or other special purpose high-speed computational device may be used for even higher computational speed. As hereinafter more fully described, CPU 118 in combination with the array processor 120 are programmed to then reconstruct the acoustic image using inverse scattering techniques, several alternatives of which are described more fully in Examples 1-3 of section 3 below. Once the acoustic image is reconstructed, it may be output either visually at a display 124 or in printed form at printer 122, or stored on a disc or other storage medium 123. Mechanical scan devices represented at 126 correspond to the motors 46 and 66 (
The electronic circuits of
Such one-dimensional and two-dimensional arrays of receivers and transmitters have a direct application to advanced medical imaging instruments where motion of the array is undesirable or in seismic exploration in which such movements are difficult.
In the receiving mode, signals received at transducer element 131a are typically less than one diode voltage drop (about 0.7 volt) and thus are isolated from transmitter 135a, since point 135a is a ground and diodes 139 are not conducting. Diodes 141 and 143 are not conducting and therefore output 133a is not shunted. Thus, the preamplifier following 133a would only see an impedance of resistor 145 plus resistor 147 plus that of the transducer element 131a. In practice, resistor 145 plus resistor 147 can be made about equal to or less than the impedance of transducer element 131a to minimize signal loss. It is clear that the principles illustrated in
The use of multiple frequencies or signals containing many frequencies also has the advantage of obtaining data that may be used to accurately reconstruct frequency dependent material properties.
In the electronic system of
As shown in
With each clock pulse from counter-timer 184, the sample and hold circuit 182 retrieves one of the analog signals from DAC 180 and then holds the value of that signal for the duration of the clock pulse. Sample and hold circuit 182 thus outputs the approximate shape of the desired waveform which is then input to a low pass filter circuit 186 which is used to smooth the shape of the waveform before amplification and transmission through multiplexer 132 to one of the transmitters.
The electronic system of
Optical Microscope
The operation of the inverse scattering microscope can be seen in block diagram form in
The operation of main beams 515 and 516 is now described. Beam 515 is the precursor of a beam that will pass through the interferometer in a clockwise direction and will pass through and scatter from the sample and will act as a probing beam. Main beam 516 is the precursor of a beam that will not pass through the sample, but will act as a reference beam as it traverses the interferometer in a counterclockwise direction. The probing beam and the reference beam will be combined to produce various interference patterns from which phase and modulus information can be obtained. Beam 515 passes through shutter 520 and then is reflected by mirror 522 to become beam 519. Beam 519 passes through objective lens 523 into sample holder 525. The light that is scattered by the sample and the sample holder is collected by objective lens 526 and passed as beam 527 to beam splitter 528. Companion main beam 516 passes from beam splitter 514 through shutter 530 to phase shifter 532. Phase shifter 532 can be set to introduce either a 0 degree phase shift or a 90 degree phase shift. Beam 533 is the output of phase shifter 532.
The purpose of objective lens 526 is to collect as large an angle of scattered light as possible. Objective lens 523 serves several purposes including: (1) that of focusing a small spot of light onto the sample holder of higher intensity than normally would be present in beam 519 in order to improve the signal to noise ratio; and (2) that of restricting the illumination to a narrow, focal spot region with desirable, plane-wave-like phase fronts with lateral amplitude anodizing.
Light beam 531 from shutter 530 passes through transmission-type phase shifter 532 where its phase is shifted either 0 degrees or 90 degrees. Beam 533 exits transmission-type phase shifter 532 and passes to mirror 536. At mirror 536 beam 533 is reflected to become light beam 539. Mirror 536 is driven by optional mirror driver 538 in small amplitude, periodic motion at frequency co and in a direction perpendicular to the mirror face in its resting position. As mirror 538 vibrates it introduces a small periodic phase shift modulation to beam 537, but this phase shift is negligible. The purpose of either transmission-type phase shifter 534 or reflection phase shifter 538 is to impart a small periodic phase shift, typically from 1 degree to 10 degrees at frequency (o into beam 518 in order to stabilize the main beams 527 and 541 as combined to form beam 543. Either of these phase modulation methods is workable either separately or in combination. Usually, for cost reasons, only one of these phase modulation methods will be implemented in hardware.
Beam 539 emerges from mirror 538 and passes through beam transformer 540 and thence to beam splitter 528. Beam transformer 540 serves the purpose of adjusting the phase fronts and lateral amplitude of beam 541 to match those of beam 527 at the mirror 528 and at camera 544. While-such matching may not be strictly necessary it does provide a more simple interference pattern that is mostly a function of the inserted object and not a function of the interferometer itself. Beam transformer 540 could be constructed in a number of ways, but one obvious way would be to replicate objective lens pair 523 and 526, however with the following modifications: (1) the sample holder and object is omitted; or (2) the sample holder is present but no object is present.
Beams 541 and 527 are combined by beam splitter 528, now acting as a beam combiner, and produce an interference pattern on the sensitive one-dimensional or two-dimensional solid state array of detector cells in camera 544. The intensity pattern on the array of detector cells is then converted into an electrical signal that is sent to the digitizer and computer 545 where this signal is digitized and stored in memory. Computer 545 controls the angular position of sample holder 525 by means of rotation device 524. Rotation device 524 also serves the function, under direction of computer 545, of inserting or removing the object from the beam. Rotation device 524 may be a stepping motor or a worm gear driven rotation stage as is common in optical technology. Computer 545 also controls the transmission of shutters 520 and 530, controls the phase of phase shifter 532, and controls the digitizing of camera 544. Computer 545 performs the important role of creating different diffraction patterns by changing the combinations of: (1) sample in or sample out; (2) main beam A only or main beam B only or their sum; or (3) phase sifter in main beam set at 0 degrees or at 90 degrees. From these different diffraction patterns, as described by the eight measurements and equations in the summary of invention section, computer 545 can compute the scattered detected field from the object. By including various test objects of know properties the incident field present at object 526 can be calculated. The set of scattered fields calculated for each of the various respective adjusted angular position of the object holder 525 is used as the detected scattered fields in the inverse scattering algorithms. Likewise, the incident field in and around the sample holder is taken as the incident fields in the inverse scattering algorithms.
Returning to
Locking amplifier system 546 functions in the following manner. The signal from detector 542a passes into locking amplifier system to amplifier 547. Output of amplifier is the input of multiplier 548. Multiplier has a second input 550a. The product of the output of amplifier 547 and input 550a passes through low pass filter 549 thence both to non inverting amplifier 550h and to inverting amplifier 550i. Switch 550j selects either the output of noninverting amplifier 550h or that of inverting amplifier 550i. The output of switch 550j passes to switch 550 s and thence to summing amplifier 550g. Input 550a to multiplier 548 originates in oscillator 550f at frequency ω. The output of oscillator 550f is shown as signal 550l and is split into two parts: one part, signal 552n, is an input to summing amplifier 550g; the other part, signal 550p, is in turn split into two signals 550c and signal 552q. Signal 550c is one input to switch 550b. Signal 552q is imposed on both inputs of multiplier 550e. The effect of multiplying a signal by itsself is to produce sum and difference signal. The difference signal is at zero frequency while the sum frequency is at double the input frequency. The output of multiplier 550e passed through high pass filter 550d to switch 550b. High pass filter 550d rejects the zero frequency signal but passes the doubled frequency. An offset constant voltage 550m is also an input of the summing amplifier 550g. Summing amplifier 550g sums signals 550m, 550k and 552n. The output of summing amplifier 550g is then sent to a phase modulator which may be mirror driver 538 or phase modulator 534. If output of summing amplifier 550g is sent to mirror driver 538 then this signal is first amplified by amplifier 507. If output of summing amplifier 550g is sent to phase modulator 534 then this signal is first amplified by amplifier 505.
The phase locking circuits can function in several modes. In a first mode, heterodyning techniques are used. In the summary of the invention section four different heterodyning methods are described. The hetrodyning mode is now described. One of two heterodyne methods to lock the two component beams at the same phase is now described. The output of amplifier 547 is multiplied by signal of frequency ω at multiplier 548 and the resulting signal is low pass filtered in low pass filter 549. The output is then passed through inverting amplifier 550i or noninverting amplifier 550h and thence through switches 550j and 550s to summing amplifier 550g and thence to phase modular 534 or phase modulator 538. The feedback loop adjusts the phase shifters 534 or 538 to minimize the signal component at ω present in the signal output of amplifier 547.
One of two heterodyne methods to lock the two component beams in quadrature (i.e. 90 degree phase difference) is now described. The output of amplifier 547 is multiplied by signal of frequency 2ω from high pass filter 550d. The signal from high pass filter 550d passes through switch 550b to multiplier 548 and thence to low pass filter 549. From low pass filter 549 the signal passes through either inverting amplifier 550i or noninverting amplifier 550h to switch 550j and thence to switch 550s and thence to input 550k to summing amplifier 550g. Output from summing amplifier 550g then drives either phase modulator 534 or mirror driven 538 to maximize the signal component at 2ω present in the signal output of amplifier 547.
The phase locking circuit can operate in a second mode where no heterodyning techniques are used. In this mode, output signal from detector 542a and output signal from detector 542b are passed to differential amplifier 552q, whose output is proportional to the difference in voltage between these two input signals. The output of differential amplifier is low pass filtered by low pass filter 550r to remove high frequency noise and passed to switch 550s and thence to summing amplifier 550g. The output of summing amplifier is then sent to either mirror driver 538, or phase modular 534. The advantage of this method is a reduction in parts, since no heterodyning circuits are required. Also significant is the elimination of phase modulators. Only the mirror driver for zero frequency baseband stabilization adjustments is necessary. A possible disadvantage may be an insensitivity to systematic errors.
Referring to
The paths of the stabilization beams are now described. The stabilization beams are comprised of three parallel beams. These beams enter mirror e1, as respective beams b1, c1, and d1. They leave in the clockwise direction as respective beams b4, c4, and d4 and reach mirror e3. They leave mirror e3 as respective beams b5, c5, and d5 and reach beam splitter e4. Beams b1, c1, and d1 also incident at beam splitter e1 produce respective counterclockwise directed beams b2, c2 and d2 which leave beam splitter e1 and are reflected from mirror e2 to become respective beams b3, c3, d3. Beams b3, c3 and d3 reach beam splitter e4 where they combine with respective beams b5, c5 and d5. This combination then exits beam splitter e4 as respective pairs b6, c6 and d6 to the right, and as respective pairs b7, c7, and d7 to the top. Beams b6, c6, and d6 are detected by respective detectors m1, n1 and p1. Beams b7, c7 and d7 are detected by detectors m2, n2, and p2, respectively. Outputs of m1 and m2 are combined to form a first feedback signal for mirror and beam splitter alignment. Outputs of n1 and n2 are combined to form a second feedback signal for mirror and beam splitter alignment. Outputs of p1 and p2 are combined to form a third feedback signal for mirror and beam splitter alignment. The stabilization scheme show in
The function of the feedback signals is now described. The signal outputs of detectors p1 and p2 are respectively q1 and q2 and these pass into differential amplifier r1. The output of differential amplifier V1 passes to linear actuator t1 that moves mirror e2. A solid frame is not shown on which linear actuators t1, t2 and t3 are mounted as well as components e1, e3, e4, i, j, m1, n1, p1, m2, n2, p2, and h. Differential actuator provides motion parallel to normal of mirror e2. Detectors m1 and m2 produce respective signals U1 and U2 that pass to differential amplifier r2 and thence becomes signal S2 that drives linear actuator t2 at mirror e2. Detectors n1 and n2 produce respective signals v1 and v2 that pass to differential amplifier r3 and thence as signal s3 to linear activator t3 on mirror e2. Once mirrors e2 and e3 and beam splitters e1 and e4 are aligned, the stabilizing feedback loops just described will maintain alignment.
Inverse Scattering Scenarios
The layers 1302, 1304, 1306, and 1316 are arbitrary both in vertical dimension, as well as speed of sound associated with them. The sources 1314, and receivers 1298, are in both boreholes 1308.
Software Description
Having described some examples in the previous section which illustrate mathematical and physical models which may be used to model scattered wavefield energy and to process data so as to reconstruct an image of material parameters using inverse scattering, theory, reference is next made to
This figure schematically illustrates a flow diagram for programming the CPU No. 118 or combination of CPU 118 and array processor 120 or parallel processor 146 in order to enable CPU 118 or said combination to control the electronic system of
Once the scattered wavefield energy has been detected, processed, and digitized, CPU 118 next determines the incident field f107 φinc, as indicated at step 204. The incident field is determined from the wavefield energy [i.e., frequency and amplitude components of the transmitted waveform] and transducer geometry from which the incident field is transmitted. The process of determining this incident field may involve direct measurement of the incident field or calculation of the incident field using the appropriate Green's function whether it be Biot, elastic, electromagnetic, scalar, vector, free space or some combination or permutation of these. The incident field is determined at each pixel in the imaged space for each source position φ=1, . . . ,Φ and for each frequency ω, which varies from 1 to Ω. In step 206, the scattered wavefield is measured, recorded and stored at the wavefield transducer receivers for each ω, varying from 1 to Ω each source position 1-Φ, and each pixel element at each wavefield transducer receiver position. As described in step 208, the initial estimate for the scattering potential is input directly into CPU 118 or estimated within CPU 118 from data input to it from some outside source. The initial estimate for the scattering potential can be set at either 0 or at an average value which is derived from the average material parameters estimated a priori for the object. When applicable, a lower resolution image of γ, such as produced by a more approximate method such as the time of flight method described in U.S. Pat. No. 4,105,018 (Johnson) may also be used to advantage (in the acoustic case) to help minimize the number of iterations. Similar approximations for other forms of wavefield energy may be used to advantage to help minimize the number of iterations.
CPU 118 next determines in step 210 an estimate of the internal fields for each pixel in the image space and for each frequency ω, and each source position Ω. This estimate is derived from the incident field and the estimated scattering potential. This step corresponds to step 222 in
The total fields are then stored in central processor 382 of
Before applying this algorithm, it is required to discretize equation 1 or 17 of the Summary of Invention section so that it may be input into the CPU. This process is described in detail in the Summary of the Invention. The subroutine that solves the forward problem is given explicitly in Appendix B for the two-dimensional case and in Appendix F for the three-dimensional forward problem solution. Our algorithm uses the Whittaker sinc function which is defined on page 18, equation E of the Summary of the Invention. We use the sinc function because of its optimal approximating qualities. However, the infinite support of the sinc function precludes its use in certain situations. More specifically in the presence of layering, one-dimensional “hat” functions are required. This is outlined in detail after eqn (41) in the Summary of the Invention. The number of frequencies that are used in the inverse scattering problem is a function of the source receiver geometry, among other things.
All forward problems for all possible frequencies, and all possible source positions are solved with one call of the subroutine in Appendix B for the two-dimensional case, and Appendix F for the three-dimensional case, whichever is relevant.
In step 352, back in
We now return to step 224 of
Step 1150 of
Now consider
Step 1118 of
The input to
In step 1128, the matrix P, constructed earlier is used to propagate the field to remote detectors. In step 1130 of 21, control is returned to the program which called the subroutine. The matrix P will change when correlation is present; the fundamental form of the propagation algorithm remains unchanged, however.
These scattered fields are stored in vector dωφ. Control is then transferred to step 228 in
There are in effect at least two possibilities regarding the use of the Jacobian. First, the full Jacobian could be used or, secondly, an approximation to the Jacobian could be used. The determinator of which path to follow is the programmer, who bases his decision on the magnitude of the γ which is to be solved in the full inverse scattering problem. If the γ is large, the exact Jacobian must be used. On the other hand, if the y is reasonably small, although still larger than required for the Born or Rytov approximation, the approximation to the full exact Jacobian which is shown on
Now we step through
Step 212 of
We have just concluded a general description of the overall inverse scattering algorithm leaving out many important details. We now look at each of the steps in more detail. First, we explain how the δγ0 shown in Step 240 of
The appendix A contains the main program for imaging in two dimensions, which represents the code associated with the
It is important to note that the code in Appendices A, B, C, F, and G, deal with the most general case. This means that they can account for layering as is discussed in the Summary of the Invention Example 2. This means that the Lippman-Schwinger operator described earlier incorporates a convolution and correlation part. If it is known a priori that there is negligible layering in the geophysical or non-destructive testing or other scenario envisioned then it is appropriate to use the code given in Appendix H and Appendix I which deals with the case of no correlation part in the Green's function. The great simplification of the code that results from this omission and specialization is reflected in the fact that the code in Appendix H contains both the main imaging program and the forward problem solution. Furthermore, this code also computes the two-dimensional free space Green's function from scratch. This is in contradistinction to the codes in Appendix A or B or C where in the two-dimensional case in the presence of layering, three separate programs are used to solve the inverse scattering problem, the forward problem and the construction of the layered Green's function. The construction of this layered Green's function which is discussed in Example 2 for the acoustic scalar case, requires substantial amount of time for its computation. However, it is important to note that once the Green's function has been constructed for a given layered scenario, in a geophysical context, for example, it is not necessary to recalculate this Green's function iteratively. Rather the Green's function once it has been determined by methods discussed in the literature, see for example [Johnson et al. 1992, or Wiskinet al, 1993], is used in the code in Appendix A and Appendix B to complete the imaging problem. To display the versatility of this inverse imaging code note that the code in Appendix H deals with the transmission mode imaging problem with the particular modality being electromagnetic wave propagation in particular in the microwave frequency range. Note also that Appendix H incorporates within it the situation where the detectors are at some finite distance away from the image space, i.e. they are not juxtaposed to the image space. The implication being that Green's theorem or some type of “scattering matrix” is required to propagate the field from the border pixels of the image space to the detector positions at some finite distance away from the image space. Such a scattering matrix is constructed earlier.
It is important to note that all of the Appendices deal with the rectangular iterative method. It is also important to recognize that the construction of the layered Green's function as shown in Summary of the Invention, Example 2 shows explicitly how the convolution and correlation are preserved even though the distribution of the layers above and below the layer containing the image space are arbitrarily distributed. The preservation of convolution and correlation is a critical element of the ability to image objects in a layered or Biot environment (Biot referring to the Biot theory of wave propagation in porous media) in real time. The reflection coefficients which are used in the construction of the layered Green's function in the acoustic, elastic and the electromagnetic case are well known in the literature. See, for example [Muller, 1985] or Aki and Richards, 1980. The incorporation of this Green's function for layered media in the acoustic, elastic and electromagnetic case for the express purpose of imaging objects buried within such a layered medium is novel. The ability to image in real time is critical to practical application of this technology in the medical, geophysical, non-destructive and testing microscopic environments.
We now elucidate the steps shown in
This n is the counter that keeps track of the number of conjugate, gradient steps that have been performed in the pursuit of δγ, the update to the scattering potential estimate. Step 246 of
It is important to note that the inverse of this operator, (I−γG)−1 , is never explicitly calculated in Step 714. The actual computation of the inverse of this operator, which in discrete form is represented as a matrix would be computationally very expensive. Therefore, the inverse action of the operator (I−γG))−1 acting on the point-wise product calculated in Step 712 is calculated by using bi-conjugate gradients or BiCGStab as indicated in Step 714 of
The loop just described and displayed in
It is appropriate to study in detail the application of the Lippman-Schwinger operator and the Jacobian in the situation where layering is present. It is important to note that the discussion heretofore, has emphasized the special situation where layering is not present so that the free space or similar Green's function not involving any correlation calculation, has been used. This specialization is appropriate for pedagogical reasons, however, the importance of the layered Green's function makes it necessary to discuss in detail the application of the Lippman-Schwinger operator in this case. Before doing this however, we look at
In Step 600 the full Hermitian conjugate of the Jacobian calculation is undertaken with the disk files containing the Green's functions and the disk files containing the internal fields for each ω and each φ being rewound. In Step 604 ω is set equal to 1, i.e. the first frequency is considered. In Step 606 the complex Fast Fourier Transform (FFT) of the Green's function at this particular frequency ω is retrieved from disk where it has been stored previously. In Step 608 φ is set equal to 1 indicating we are dealing with the first source position. Step 610 retrieves the internal-field fωφ, for this source position φ at frequency ω. Step 612 propagates the scattered field from the detector positions to the border pixels of the image space. Also in Step 612 the field values at the non-border pixels of the image space are equated to 0. In Step 614 the Fast Fourier Transform (FFT) of the field resulting from Step 612 is computed and stored in vector x. In Step 616 the complex conjugate of the Fast Fourier Transform (FFT) of the Green's function G is point-wise multiplied by x, the vector computed in Step 614. As before the actual computation of this inverse operator is computationally prohibitive. Therefore, the application of the inverse of I−G*γ* to the vector x is carried out by using bi-conjugate gradients or BiCGStab as discussed above. In Step 620 either the inverse of the operator I−G*γ* or the approximation (I+G*γ*) is applied to this result. The system [I−G*γ*]γ=x is solved for y using BCG. See FIGS. 11A-1/11A-2. In Step 622 of FIG. F01 the point-wise multiplication by the complex conjugate of fσφ, the internal field for this frequency and source position, is carried out. This point-wise multiplication is the product with the result of Step 620. The Step 624 stores the result of this point-wise multiplication. Step 626 determines if φ is equal to Φ, the maximum number of source positions. If the maximum number of source positions has not yet been achieved, then the φ is incremented by 1 to indicate movement to the next source position. Transfer of control is now made to Step 610, where the internal field for this particular source position is retrieved. The loop from Step 610 to Step 626 is performed iteratively until the maximum number of source positions has been achieved. When this occurs, it is determined if ω is equal to Ω, the maximum number of frequencies, in Step 630. If the maximum number of frequencies has been achieved then control is transferred to Step 634 which transfers control back to
It is appropriate now to turn to that part of
It is very important to note that the correlation that is calculated in the case of layering is, without exception, calculated as a convolution by a change of variables. It is the presence of the convolution which allows the introduction of the Fast Fourier Transform (FFT) and allows this algorithm to be completed in real time when the Fast Fourier Transform (FFT) is used in conjunction with the other specific algorithms and devices of this apparatus.
It is now appropriate to consider in detail the action of the Lippmann-Schwinger operator and the Jacobian in the case when layering is present.
It is also important to note regarding the Jacobian calculation with correlation, that both the reduced and the exact form have their counterpart in the Hermitian conjugate. This is discussed in FIGS. 19A/B. In Step 654, the re-winding of the disk with the Green's function and the disk with the internal fields is carried out, and ω is set equal to 1. In Step 658 the complex Fast Fourier Transform (FFT) of the Green's function at this particular frequency Co is retrieved. Step 658 also sets φ (source position) equal to 1. Step 660 retrieves the internal field for this φ. Step 662 calculates the Hermitian conjugate of the truncation operator which puts values in border positions of the image space of the appropriate field and zeros everywhere else. Step 664 point-wise multiplies the result of 662 by complex conjugate of the Fast Fourier Transform (FFT) of the Green's function. Step 668 takes the inverse Fast Fourier Transform (FFT) and stores this in vector y. Now, in Step 670 depending upon the value for JACFLAG, either the full inverse (I−G*γ*)−1 (via BCG or BiSTAB, etc.) or (I+G*γ*) is applied directly to y. The second alternative is carried out in the standard manner. That is, one takes the point-wise multiplication of the complex conjugate of γ, the scattering potential estimate at this time with the vector y. The one takes the Fast Fourier Transform (FFT) of the result and takes the point-wise multiplication of the complex conjugate of the Fast Fourier Transform (FFT) with this result. Finally one adds the vector y and stores it in vector y, (overwriting y). The Step 672 multiplies the final result point-wise, with the complex conjugate of the internal field for this particular frequency and source position, and stores the result. Step 672 transfers control to 674 where it is determined if φ is equal to Φ. If no, then φ is incremented by 1 and returned to Step 660 in Step 676. If yes, control is transferred to 680 where it is determined if ω is equal to Ω; if yes, increment ω by 1 in Step 682, and transfer control to Step 656. If no, transfer control to Step 684 where control is transferred back to the inverse scattering main algorithm,
It is important to note at this point that when the operator I−Gγ is applied to the vector T1, and also when the Lippmann-Schwinger operator I−Gγ is applied to any input vector in the layering scenario,
The step 558 passes control to step 560 where the vector of pointwise products shown explicitly there, is formed. This pointwise produce of γ, the scattering potential estimate, and F, the internal field, is Fourier transformed in step 564 and in parallel simultaneously in step 562, the reflection operator is applied to this pointwise product γF computed in step 560. In step 566, following step 562, the Fourier Transform of the reflected product γF is taken. In step 570 and in step 568, there is pointwise multiplication by the correlational part and the convolutional part of the layered Green's function, respectively. The correlational part of the layered Green's function is pointwise multiplied by the Fast Fourier Transform (FFT) computed in step 566, the convolutional part of layered Green's function fast Fourier transformed is pointwise multiplied by the Fast Fourier Transform (FFT) which was computed in step 564. In step 572, the vector W, which is the difference of the results of step 568 and 570, is computed and stored. In step 574, the inverse Fast Fourier Transform (FFT) of the vector W is computed. In step 576, the final answer which is the action of the Lippmann-Schwinger operator acting upon the field f is computed for the layered medium case, i.e., the case in which correlation exists. This process of applying a Lippmann-Schwinger operator in the presence of layering established in
The operator described by
The input to
In 15 control is transferred to step 1242, where the pointwise product is formed, between the complex conjugate of the scattering potential estimate, and the input to this subroutine, x. The Fast Fourier Transform (FFT) is then applied to this product in step 1244. In step 1246 the pointwise multiplication by G*, the complex conjugate of the Fast Fourier Transform (FFT) of the Green's function at frequency ω, is effected. In step 1248, the Inverse Fast Fourier Transform (FFT) of the product is taken, and finally the result is stored in y. In step 1252, the difference is formed between the input to this subroutine, x, and the computed vector y, and the result is stored in y.
In
If the detectors are remote, the field is propagated with the aid of the propagator matrix P (described in section EXTENSION OF THE METHOD TO REMOTE RECEIVERS WITH ARBITRARY CHARACTERISTICS), in step 1128. Control is then transferred to the calling subroutine in the situation where the detectors are remote.
In
In
Having seen in the previous pages how the full nonlinear inversion yields substantial increases in resolution and utility of image, we are now prepared to discuss the advanced marching technique employed in both the forward problem and the Jacobian calculations (as well as the closely related Hermitian conjugate of the Jacobian calculation) in the following algorithm.
Scientific Background to Advanced Parabolic Marching Method
The parabolic equation method is a very efficient method for modelling acoustic wave propagation through low contrast acoustic materials (such as breast tissue). The original or classical method requires for its applicability that energy propagate within approximately ±20° from the incident field direction. Later versions allow propagation at angles up to ±90° from the incident field direction. Further modifications provide accurate backscattering information, and thus are applicable to the higher contrasts encountered in nondestructive imaging, EM and seismic applications. [M. D. Collins, A two-way parabolic equation for acoustic backscattering in the ocean, Journ. Acoustical Society of America, 1992, 91, 1357-1368, F. Natterer and F. Wubbeling, “A Finite Difference Method for the Inverse Scattering Problem at Fixed Frequency,“Lecture Notes in Physics, 1993, vol. 422:157-166, herein included as reference]. The source/receiver geometry we use in this device, and the relatively low contrast of breast tissue, allow us to utilize this efficient approximation. The resulting speed up relative to the Gauss Newton, conjugate gradient method [Borup et al., 1992] described in the previous patent, is dependent upon the contrast but is estimated to be 100-400 times. This allows us, for the first time, to apply inverse scattering to fully 3-D reconstruction of the human breast. Furthermore the coarse grain parallelization employed in the integral equation method is equally applicable to the parabolic algorithm. The basic structure of the inversion algorithm is essentially unchanged over our previous patent. The main difference is in the use of the Parabolic equation approximation, and more exactly, the “split step Fourier Method” [R. H. Hardin and F. D. Tappert “Applications of the split-step fourier method to the solution of nonlinear and variable coefficient wave equations,” SIAM Rev. 15, 423 (1973). and M. D. Collins, “A Split-step Pade” solution for the parabolic equation method,” J. Acoust. Soc. Am. 93, 1736-1742 (1993), herein included as reference] in the implementation of both the Jacobian of the residual function, and the solution to the forward problems for each view. The source/receiver geometry requirements and the relatively low contrast of breast tissue, allow us to utilize this efficient approximation for breast cancer scanner applications, for example. In particular because we may elect to use only transmission data in the breast problem, we are able to use this “parabolic approximation” in a straightforward, simple manner [see Hardin and Tappert].
To derive and elucidate our parabolic equation method in 2-D, we begin with the 2-D Helmholtz wave equation governing wave propagation in inhomogeneous media:
Now Fourier transforming y to λ results in:
and the * notation denotes convolution:
Eqn (2) can be factored in the sense of pseudo-differential operators [M. E. Taylor, “Pseudo-differential Operators,” Princeton University Press, Princeton, 1981, herein included as reference]
An intuitive feel for the manner in which the Parabolic equation arises can be seen by looking at particularly simple case: when k is a constant. Then we have, upon Fourier Transforming (1):
where the square root is now simply the square root of a scalar value. From 6, it is clear that in this special case, two solutions exist:
for an arbitrary function g, which represents a rightmoving wave for e1017 t time dependence and:
representing a wave which “moves” to the left. Suppose that we know that the field is due to sources entirely on the LHS of the x-y plane. Then, for x>0 we must have:
Note that knowledge of f on the line x=0 (boundary condition) completes the solution since:
{tilde over (f)}(0,λ)=g(λ) 10
and so, on inverse Fourier Transforming:
In fact, for any x0>0:
The idea behind the parabolic equation method is to try to factor a general inhomogeneous (i.e. k=k(x,y)) problem into forward and backward moving (in x) factors and then solve only the forward (+x) moving part (assuming that the field source is to the left (on the x-axis) of the scatterer).
Since 13 is local to x=(n+½)Δ we consider the discretization/approximation:
Detailed Description of Advanced Parabolic Marching Method Now we desire to take the y-dependence out from under the square root in order that it may then be factored out from under the integral, which will result in the integral being an inverse Fourier Transform, yielding a substantial increase in efficiency. This goal can be achieved approximately in the following manner:
where
kn,m≅k(xn+1/2,ym)·ym=mΔ.
and kn is the wave number averaged over y. Equation 14 is then:
We have thus approximately maintained the y variation in k, while simultaneously giving a Fourier Transform formulation for the field fn+1, and maintaining the exact Fourier propagator which is exact for all angle of scattering in homogeneous medium kn.
Using the notation:
to indicate the Inverse Fourier Transform, gives:
fn+1(y)=F−1{{tilde over (f)}n(λ)Pn(λ)}(y)
where Pn is the range (x) dependent propagator def˜ned as:
Pn(λ)≅e−iΔ√k
This is our basic equation for the parabolic split step method. The “standard” form of the parabolic equation with approximate propagator is derived by using the binomial approximation:
In the above integral this approximation is valid only for angular deviations of approximately 20° from the incident propagation direction. This yields the “standard” form of the split-step parabolic equation method:
Our numerical experiments have proven the superiority of 15 over 16 due to the occurance of the exact propagator in our approach, 15. In Fourier notation, 16 is:
This is the usual form of the split-step PE method. Our more general equation is (in Fourier notation):
An interesting interpretation of our split step method is evident from this: The algorithm step consists of an exact propagation a distance of Δ in kn (which includes diffraction) followed by a phase shift correction, i.e, multiplication by
e−iΔ(k
which corrects the phase of a plane wave travelling forward.
Relation to the Generalized Born approximation The above split step Fourier method can be seen to be a generalization of the “Generalized Born” method as developed at TechniScan scientific research division in the following manner. Suppose that we assume that the f˜eld variation in y is very slow, i.e.,
F{fn}(λ)≈0 for λ≠0
so that we may replace
in the above integral, 15, which gives
Then 15 becomes:
which, in integral form, is:
This is our generalized Born approximation to the field. Thus, our PE formula 15 reduces to the GB formula if straight line propagation and no diffraction are assumed. In order to reintroduce diffraction in the GB approach, we must now “rescatter” the internal field approximation, 21, via:
where fGB is computed from 21.
This “rescattering” will also improve the calculation of backscatter in our PE method. The PE formulation has no explicit backscatter in it. Equation 22 is one way to put it back in, in a manner that gives a good approximation for weak scattering.
It is important to note that the above formulation of the parabolic equation inversion method does not require the storage of any fields (ru˜like the integral equation method of the previous patent). Of course, our PE method (as in the method described in our previous proposal) does not require the explicit storage of a large Jacobian, or its adjoint.
Inverse Problem and Implementation of the Jacobian.
A computational formula for the Jacobian is desired for the construction of the inversion algorithm
More exactly, we use the conjugate gradient algorithms in our inversion, and consequently desire only the action of the Jacobian defined above on a wavenumber perturbation δkij, i.e. the total variation of f due to perturbation δk:
The recursion formula for the action of the Jacobian on the perturbation in k is given by
It is advisable to rewrite the recursion expression for the field values fn+1(Ym) as:
is the transmission coefficient of pixel n,m. The equation for the Jacobian with repect to the new unknown variable t is:
and the total field variation in terms of a perturbation in t, δ t, is:
The recursion formula for δ f is therefor:
The Formula for the Adjoint (Conjugate Transpose) of the Jacobian:
The recursion for the total variation in the data δ fN (on a detector line at x=NΔ) can be written as:
δfn=δtN{circle around (×)}vN+WNδfN−1
where the matrix Wj is defined as:
Wj≅tj{circle around (×)}Aj=[tj]Aj
where {circle around (×)} denotes the Hadamard product, [tj] represents the diagonal matrix:
whose diagonal terms consist of the elements of the vector tj and Aj is the matrix defined as (where juxtaposition always indicates matrix multiplication):
Aj≅F−1PjF
Also the vector Vj is defined as
vj≅Ajfj−1
i.e.
vj=F−1PjFfj−1
With these notational conventions, the recursion for the total variation becomes:
δfN=[vN]δtN+WN[vN−1]δtN−1[vN−2]δtN−2+. . . +(WNWN−1. . . W1)[v0]δt0
The total variation can be written as a matrix product to facilitate the formation of the adjoins (conjugate transpose):
Defining the matux M as:
M≡([vN]WN[vN−1]. . . (WNWN−1. . . W1)[v0])
one can form the conjugate transpose, which is (using the symmetry of the diagonal matrices [vk])):
The equation for the action of the conjucate transpose of the Jacobianacting on δ fN is then:
For computational purposes it is advantageous to write this equation in conjugate form:
This reduces the number of conjugation operations needed for computer implementation. Thus the matrix required is actually MT:
The expression for WT is (as usual F will denote Fourier Transform.):
(Wj)T=AjT[tj]=FPjF−1[tj]
where use has been made of the fact that since the propagator P is a diagonal operator:
Pj=PjT
The vj=F−1PjFfj−1 computed and stored as the forward fields are computed within the “subroutine jach” which computes the action of the transpose of the jacobian on the complex conjugate of the residual vector, followed by condgugation of the result.
The updates for one view, then, are constructed in sequence using the formulae:
{overscore ([δtN])}=[vN][δfN]
{overscore ([δtN−1])}=[vN−1]WNT{overscore ([δfN])}=[vN−1]FPNF−1[tj]{overscore ([δfN])}
etc.
See FIGS. 35 and forward, for the flowohart, and the microfiche documentation for the source code.
Preconditioned Inversion
General Discussion of Conditioning
Suppose that the goal is to find mint∥At−r∥2, where A is m by n matrix (m<n), t is an n-dimensional vector, and r is m-dimensional “residual” vector. By making a change of variables BS=t, to s, where B is n by m., the system to be solved is now
ABs=r
is a square system. Therefore the BiConjugate gradient method or BiCGsquared algorithm can be applied to this system. BiCG squared has much better convergence behaviour than standard conjugate gradient methods for non-square systems [see references in previous patent]. Consequently the convergence to a solution ŝ will generally be much quicker, whereupon {circumflex over (t)}≡Bŝ computes the answer to the original problem. Due to the position of the conditioning matrix B, this procedure is hereafter referred to as ‘post conditioning’
Two special cases:
1. B=AT in which case
s is the minimum norm solution to an underdetemined problem
where
indicates the direct sum in the sense of vector spaces over all frequencies, ω, and views, φ, ie concatenation. In fact, for operators Aj, j=1, . . . , N, we have explicitly:
Of course, this is a nonlinear operator, and the corresponding Jacobian
evaluated at some iterated guess is the object that is used in the BiCG minimization routine. B in this case is the Born or Rytov reconstruction. The general procedure for the solution of the Jacobian equation:
then involves the application of said Jacobian (1), and its conjugate transpose:
gsc is the scattered field to which the Born reconstruction procedure is applied to yield γ,
γ=Bgsc
where the standard Born approximation is utilized:
gsc(x)=∫ΩG(x,x′)γ(x′)finc(x′)dx′
which upon discretization reads:
gsc=G[finc]γ
where the standard notation for the matrix whose diagonal elements are the components of the vector finc has been used:
It is important to realize that the explicit matrix inversian of B is never carried out: rather, the standard Born reconstruction based upon the Fourier Diffraction Theorem is used, for computational efficiency.
Subroutine “scatter” is called from main program in
Tv(x)≡M(θ)T(x)xεR3
Next control is moved to box 2004, where the field for the first “slice” (ie. j=1) is set to predetermined values: s(ik), and the y index, “j” is set=
1. f0(x,z)≡f(x,jδy,z)|j=0←s(z)
Control is then transferred to box 2006, where the Fourier Transform is applied to field slice
j−1 .F(fj−1)
Control is then transferred to box 2008, where the pointwise product is taken between the propagator Pj, defined as
and the result from box 2006, to get
PjF(fj−1).
Now transfer to box 2012 where we take the Inverse Fourier transform in z-direction to get
vj≡F−1(PjF(fj−1)).
Then move to box 2014, where jth field slice is given by pointwise product of transmission coefficient Tj and vj.
fj≡Tj(x)vj.
Then transfer control to decision box 2016, whereupon: if j is strictly less than Ny then control moves to box 2018, where index j is increased by 1, and control is directed along arrow 2020 to box 2006. If j is exactly Ny, control is transferred to decision box 2024. Then, if inv <Nview, control is transferred to box 2030 where inv is incremented by 1 to inv+1, and control is then transferred along arrow 2026 to box 2002. When inv=Nview, control is returned to the main program.
Referring to
f0(x,z)≡f(x,jδy,z)|j=0←s(z)
Now begin the march across the pixelated image space by transferring control to box 2112 where j (y index) is set=1. Then control moves to box 2114 where we apply Fourier Transform to field slice j−1:
F(fj−1)
Then control is transferred to box 2116 of
We now move to box 2118 where the Inverse Foufier transform in z-direction is applied to get
vj≡F−1(PjF(fj−1))≡Afj−1
(this slice vj will be used later, and so is not overwritten).
Control is then transferred to box 2120, where
Aj(δfj−1)≡F−1(PjF(δfj−1))
is computed in exactly the same manner, i.e. the Fourier transform is applied to δfj−1, then the propagator is pointwise multiplied, and finally the inverse Fourier Transform is taken.
Control is then transferred to box 2122, where the pointwise product between the vj (calculated above) and δtj, [δtj]vj is formed. Also the pointwise product involving tj.: [tj]Aj(δfj−1) is formed. The definition of the pointwise product is given explicitly by
[δtj]vj≡Σi,kj fixedδtijkvijk.
Finally form the sum of the resulting “vectors”,
δfj←[δtj]vj+[tj]Aj(δfj−1)
If control has been transferred to decision box 2130 of
Referring to
f0(x,z)≡f(x,jδy,z)|j=0←s(z)
Next control is transferred to box 2212, where j is set=1, and the march across the pixelated image space is begun.
Next control is transferred to box 2214 the Fourier transform is applied to the field slice fj−1:F(fj−1) and control is transferred to box 2216, where pointwise multiplication by the propagator Pj is carried out, where
The result, PjF(fj−1) is then transferred to box 2218 where the inverse Fourier transform in the z (vertical) direction is applied, yielding
vj≡F−1(PjF(fj−1)).
This vj will be used below, and so is saved.
From this, control moves to box 2220, where the field slice at y=jδ (recall that field values are recomputed to obviate the time consuming storage/retrieval of these on disk) is given by pointwise product of transmission coefficient Tj and vj. fj≡Tj(x)vj. Control now goes to decision box 224, whereupon if j is strictly less than Ny, control is switched to box 2222, where index j is increased by 1, and control reverts to box 2214, where the Fourier transform of the next field slice is taken. When j is equal to Ny, control is transferred to box 2226 of
fc: fc←F(r≡δfN).
Now use Vj (calculated in loop above) and fields fcj to calculate action of Jacobian on r, the error in field at the detector (far side of image space) in the following manner:
Control then moves to box 2226 where fc is redefined as fc·(tυ+1) and the pointwise product between the resulting fc, and vj is taken, and stored in vj←[vj]fc . Control is then transferred to box 2228 where we pointwise multiply fc by rotated transmission coefficient Tv, Fourier Transform the result, pointwise multiply by propagator Pj, then inverse Fourier Transform (in z direction) the result; finally store the resulting slice in fc (overwriting previous contents) fc←F−1PiF·Tv(x)fc. Next, control is transferred to box 2230, in order to test whether j=0. If not, then control is transferred to box 2238, where the y index, “j”, is decreased by one (put j←j−1), and control is transferred along arrow 2236, back to box 2226. If, however,j does=0, then the loop is done, and control moves to box 2232, where the hermitian conjugate of rotation matrix is applied to matrix of transmission coefficients tu computed in the above loop, where:
in order to rotate (estimate of) object back to original orientation
Then, control moves to box 2234, where we add contribution for this view to total transmission coefficient adjustment dt
Now control is transferred over to decision box 2240, where it is determined if index inv=Nviews. If it is, then control is transferred over to box 2242, where index inv is increased by 1: inυ←inυ+1, and control is transferred up arrow 2242 to box 2246 to do the next view. If it is determined that index inv does in fact=Nviews, the control is passed along arrow 2244, to return to the calling program, with output→(δt)
Embodiments within the scope of the present invention also include computer-readable media having computer-executable instructions or data structures stored thereon. Such computer-executable media can comprise RAM, ROM, EEPROM, CD-ROM or other optical disk storage, magnetic disk storage or other magnetic storage devices, or any other medium which can be used to store desired computer-executable instructions or data structures and which can be accessed by a general purpose or special purpose computer. When information is transferred or provided over a network or another communications connection to a computer, the computer properly views the connection as a computer-readable medium. Thus, such connection is also properly termed a computer-readable medium. Combinations of the above should also be included within the scope of computer readable media. Computer-executable instructions comprise, for example, instructions and data which cause a general purpose computer, special purpose computer, or special purpose processing device to perform a certain function or group of functions.
The following discussion is intended to provide a brief, general description of a suitable computing environment in which the invention may be implemented. Although not required, the invention can be implemented in the general context of computer-executable instructions, such as program modules, being executed by computers in network environments. Generally, program modules include routines, programs, objects, components, data structures, etc. that perform particular tasks or implement particular abstract data types. Computer-executable instructions, associated data structures, and program modules represent examples of the program code means for executing steps of the methods disclosed herein.
Those skilled in the art will appreciate that the invention may be practiced in network computing environments with many types of computer system configurations, including personal computers, hand-held devices, multi-processor systems, microprocessor-based or programmable consumer electronics, network PCs, minicomputers, mainframe computers, and the like. The invention may also be practiced in distributed computing environments where tasks are performed by local and remote processing devices that are linked through a communications network. In the distributed computing environment, program modules may be located in both local and remote memory storage devices.
Water Bath Scanner Compatible with Reflection Tomography and Large Pixel Inverse Scattering
General features of the proposed water bath scanner are shown in
A slight vacuum can be applied to the tube to make a tighter fit as needed for the latex version. This shield is disposable and contributes to the esthetics and sanitation of the scanner. A tilting cot provides a comfortable means in patient positioning, shown in
Some of the important specifications for the scanner are given in TABLE 1 below. Said table illustrates the performance of both reflection tomography and inverse scattering transmission tomography. The in-plane and normal to plane spatial resolution of reflection tomography will approach 0.2 mm and 2 to 3 mm respectively (the vertical resolution may be increased to 1 to 1.5 mm with special apodizing functions).
The number of views for reflection tomography will be determined experimentally, but best estimated as equal to be the number of depth resolution lengths per circumference of a circle of diameter equal to the lateral resolution of a single view. Thus, N=λπ[2R/A]/[c/(2Pf)]=4π λRPf/[cA]=4π RP/A where λ is wavelength, R is range to target from array, A is the active lateral aperture of the array, c is the speed of sound, P is the fractional bandwidth, and f is the center frequency of the signal. The total data collection time for one slice is 12 seconds for the system specified in TABLE 1.
The in-plane spatial resolution of inverse scattering can be varied by choice of the pixel size (this ability is not possible with reflection tomography and is one of the new developments we report). In TABLE 1 we use 2 wavelength pixels (0.6 mm at 5 MHz) as an example.
The number of views for inverse scattering tomography will be taken to be the same as for diffraction tomography which is N=πD/λ where D is the breast diameter. For inverse scattering this generalizes to N=πD/p=667 at 5 MHz for D=5 inches and for p=pixel size=2 wavelengths. Using many views at one frequency is equivalent to using fewer views with more frequencies per view. This creates a speed up factor in data collection since many frequencies can be collected at each view in the same time required to collect a single frequency after 2 wavelengths. The frequency sample interval required to reduce the view-to-view angle separation for a complete high quality image reconstruction is given by Δf=2πf/N==2πf/[πD/p]=2pf/D=0.047 MHz. The frequency bandwidth for 16 frequencies is then 0.047 MHz×16=0.752 MHz and requires only 667/16=42 views. For this example with p=2 wavelengths, the spatial resolution is 0.6 mm. If p=1 wavelength were used, then the spatial resolution would be 0.3 mm. The total single slice, data collection time for 42 views and 16 frequencies is 4.3 seconds for the system specified in TABLE 1.
The block diagram of the scanner electronics is shown in
The question of over loading the PCI bus has been examined and is not a problem since our algorithms are coarse grained and each distributed process needs minimum communication with the other processes. Thus, similar operations can run in parallel on all the G4s nodes and possibly parts of each node (each G4 has 4 parallel SIMD processors). SIMD (single instruction multiple data) computers also work well with our coarse grained algorithms. The natural process divisions are frequency, view angle and slice height.
The signals to the transmitter elements are generated by an arbitrary wave form generator card (such as the GaGe™ CompuGen 1100) that is placed in the PCI bus of one of the computers 10 that acts as the master computer. This master computer synchronizes the actions of the other computers. This is not a time consuming task since each computer's processing is largely independent of the other nodes. The output of the waveform generator is amplified by a linear amplifier and sent to the transmitter MUX. The MUX will be designed to allow future expansion for using 2 to 8 linear amplifiers to be placed after respectively 2 to 8 delay lines(connected to the 15 common wave form generator) to form transmit beams. The MUX circuits will be solid state analog switches of either the diode network type or of the Supertex™ (Sunnyvale, Calif. 94089) MOS (metal oxide semiconductor) type. We have used the diode network type with success in one of our present scanners. The diode network requires more printed circuit (PC) board space, but has lower series resistance (about 5 Ohms) and low shunt capacitance. The Supertex approach uses less PC board space (the HV20822 has 16 switches in a 0.354 inch square package) but has higher series resistance (about 25 Ohms) and shunt capacitance (20 pf off, 50 pf on). The diode network is a safe bet. However, we will look at placing the Supertex switches in the MUX circuit bays close to the arrays to reduce cable capacitance. We will also consider using preamplifiers or buffer amplifiers after the Supertex switches to reduce the effect of their higher resistance and capacitance. The Supertex is not a bad choice and is used in many commercial ultrasound scanners with normal cable lengths from probe to chassis where space-savings is important in the chassis. The MUX circuits will use two stages, for both the transmitter and receiver connections.
The tank and scan motion control will be constructed and assembled as per well known methods and components known to the art. The arrays can be built by transducer jobbers such as Blatek, State College, Penn. The engineering and tooling for the arrays has already been completed; and allows the present array we are using (8 rows and 16 columns) to be replicated 6 times per new array to make a 96 by 8 element array.
A more detailed understanding of the MUX shown in
The receiver multiplexer is shown in the bottom half of
The design and construction of multiplexer circuit boards is a process well known in the art. We have experience in designing such multiplexers for the compression plate inverse scattering scanner and other scanners in our lab.
The circuit board fabrication can be sent to a jobber. The circuit boards layout work can be done by Orcad or similar software. The design and coding of the software for data acquisition is a well known process in the art. For example, one can use C++ or the Labview™ application software from National Instruments™ (Austin, Tex. 78759-3504) for the main data acquisition program to allow the use of a virtual instrument panel (showing scanner array position, digitized waveforms, gain settings, etc.). The inner loops of the Labview program will be written in C or C++ for speed. Algorithm implementation and programming may be done in Fortran or C or C++.
Reflection Tomography and Inverse Scattering Imaging Algorithms
The algorithms used for reflection tomography are described further below. The exact algorithms for inverse scattering are given by Borup et al in [D. T. Borup, S. A. Johnson, W. W. Kim, and M. J. Berggren, “Nonperturbative diffraction tomography via Gauss-Newton iteration applied to the scattering integral equation,” Ultrasonic Imaging 14, 69-85, (1992), and Wiskin, J W, Borup, D T, and Johnson, S A, “Inverse Scattering from arbitrary two-dimensional objects in stratified environments via a Green's Operator”, J. Acoust. Soc. Am. Pt.1, August 1997.]. We show here an improved faster version of the parabolic method which is one of three faster versions of inverse scattering that uses forward propagating wave solutions. This method is an approximate method that is much more accurate than the Born or Rytov methods, and even approaches closely the accuracy of the exact solution. An outline of the theory will be given here, but a full account of several approaches using this method may be found in the literature and our patent.
We start with the Helmholtz wave equation
[∂2/∂x2+[k2(x,y)+∂2/∂y2]]f(x,y)=0.
Next we take the Fourier transform with respect to y and then assume an approximate factor of the resultant equation. Note that the convolution theorem of Fourier transforms is used, where {circle around (×)} denotes the convolution operator. Let {tilde over (f)}(x,λ) denote the Fourier transform of f(x,y) with respect to y. The approximate factored equation (by forcing commutation of all variables) is then:
We next note that the two factors represent waves moving to the left and to the right along the x axis. Taking only the wave moving from left to right (the first factor), we have
where the superscript+indicates that part of {tilde over (f)}(x,Aλmoving in the +x direction (left to right). It is a parabolic differential equation. For the special case, k2(x,y)=k02=const., the factorization is exact and convolution becomes multiplication and we can solve the equation for {tilde over (f)}+(x,λ) and then take the inverse Fourier transform to get the propagation formula:
which approximation propagation in breast tissue where contrast in k2 is small, i.e. k2216 k02.
This formula is the basis for a multitude of marching approaches for solving the direct scattering problem. The general idea is to perform the integral on a line parallel to the y-axis and to propagate the angular spectra forward by an increment of x. Then the process is repeated for additional increments of x. The method for dealing with k2(x,y)≠const. varies between methods (including a binomial expansion of the square root). Our approach follows this general pattern but adds additional refinements to improve accuracy and speed. This completes the outline of the parabolic method.
The parabolic method has allowed us to design the proposed scanner to use inverse scattering at 5 MHz. Using several tricks has accomplished this. One trick is to use larger pixels to reduce the computational load. Other tricks involve further approximations that increase speed but do not degrade performance. The basis of inverse scattering imaging is to solve by optimization methods the following least squares problem for the vector of independent variable y or (x,y)*, whose components are the image parameters. Let f(measured)(xdet) be the measured field on detectors and let f(calcutlaed)(xdet,γ,f(inc)) be the computed received field as a function of an estimate of γ and the incident field f(inc)(x) in the body and on the detectors. Then the scattering potential is found by finding the minimum value of the norm of the measurement residual R, where, R is defined as the square of the difference, in Hilbert space, between the measured and computed fields on the detector.
We have shown how this optimization problem may be solved by Gauss Newton, Fletcher Reeves, Ribiere Polak and other methods [D. T. Borup, S. A. Johnson, W. W. Kim, and M. J. Berggren, “Nonperturbative diffraction tomography via Gauss-Newton iteration applied to the scattering integral equation,” Ultrasonic Imaging 14, 69-85, (1992), J W Wiskin, D T Borup, S A Johnson, “Inverse Scattering from arbitrary two dimensional objects in Stratified Environments via a Green's Operator”, APPARATUS AND METHOD FOR IMAGING WITH WAVEFIELDS USING INVERSE SCATTERING TECHNIQUES, S. A. JOHNSON ET AL.], herein included by reference. Formerly we used integral equations or finite difference time domain (FDTD) methods to solve the forward problem (given γ and f(inc)(x), compute f(calulated)(xdet,γ,f(inc))). The Jacobian is defined as
The solution, γ, is found by the iteration
J(n)δγ(n)−R(n)
γ(n+1)=γ(n)+δγ(n)
The derivation and use of the Parabolic Jacobian is based on a recursion process and is found in our patent [APPARATUS AND METHOD FOR IMAGING WITH WAVEFIELDS USING INVERSE SCATTERING TECHNIQUES, S. A. JOHNSON ET AL.] in a subsection called “Inverse problem and construction of Jacobian” in section call “EXAMPLE 12, PARABOLIC MARCHING METHODS”.
We form a Jacobian for the parabolic method and incorporate it as part of the above iterative formula. We next show how the new parabolic method performs in terms of speed and accuracy by simulating the geometry and parameters for scanners similar to the proposed scanner specified in TABLE 1.
In this example we consider the design of a compression plate ultrasound scanner combining both reflection imaging and transmission inverse scattering imaging. This is not the geometry for the water bath scanner, yet we show that both geometries can produce images of sound speed and attenuation.
We specify two arrays facing each other, with their faces mutually parallel. Each array can be a 1-D array or a 2-D array of transducer elements. We have constructed a compression plate scanner where each array is a 1-D array with 256 elements with λ/2 element separation at 2 MHz. A drawing of this compression plate configuration is shown in
The array geometry can be extended to provide improved imaging by adding side reflecting plates or side transducer arrays which also serve as side compression plates. This extension in function is shown in
A simulation of the compression plate scanner is described next. The bandwidth for the reflection imaging mode was set equal to a Blackmann window from 0 to 2 MHz with a center frequency of 1 MHz. The reflection mode beam former used a 16-element segment of the array to create a beam perpendicular to the array face which was then translated across the array. The beam former was focused at all ranges for both the transmitter and receiver elements and a Hamming window was used to apodize both the transmitter and receiver beams
The inverse scattering data was generated by computer simulation using a direct scattering solver. This solver generated data at all elements on the receiving array for each element transmitting on the opposite transmitter array. The inverse scattering image was formed by software that inverted this simulated data. Perfectly reflecting plates (infinite impedance) were assumed to exist on the other two sides. Two frequencies, 1 MHz and 2 MHz, were used. The image array was 256 by 100, λ/2 at 2 MHz, pixels. For 30 iterations, the algorithm required 1.5 hr. of CPU time on a 400 MHz Pentium II. The model defined in
The performance of the standard parabolic method with ½ wavelength pixels is shown if
Inverse scattering images from real lab data from two ¼ inch, 5 Hz Panametrics™ video scan transducers facing each other and using parabolic inverse scattering algorithm.
This example illustrates that our breakthrough in large pixel parabolic codes is practical. We applied it to existing time of flight data (digitized waveforms) that were collected to be analyzed by time of flight tomography (a modified x-ray CT algorithm). No transducer array was used as a receiver (thus no diffraction pattern was recorded). Nevertheless, the parabolic inverse scattering method has defined the two holes in an agar phantom somewhat better than the straight line time of flight method (see
Conclusion for Example 2: The new fast algorithm has been tested in adverse conditions with poor data (incomplete angular spectrum) from the laboratory and performs better than the “gold standard” time of flight CT algorithm. Its predicted performance should be about 10 times better when using data that is designed to meet it's computational and interface requirements. This will be verified in the next two examples.
Inverse scattering images from simulated data from ¼ inch piston transducers using the parabolic inverse scattering algorithm.
For this computer simulation example we use the following parameters: Frequency=5 MHz. Number of frequencies=1. Pixel dimension=2λ Image size 100 by 100 pixels=2.36 by 2.36 inches. Element aperture=0.236 inches. 90 element translation positions for 90 view angles. Speed of sound values (m/s): cwater=1500, cskin=1505, cfat=1490, cparenchyma=1505, ctumor=1510. The image size can easily be doubled or beyond for real data. The results of this simulation are shown in
Conclusion for Example 3: Using an accurate model of the two, 5 MHz, ¼ inch diameter piston transducers rather than lab data that was not designed with the transducers accurately aligned and calibrated, the spatial resolution is about 4 times better than the lab data. In fact, the images are remarkable. This simulation experiment suggest a follow up simulation experiment where a much larger receiver aperture is used to sample a much greater part of the scattered (diffracted) waves in the forward direction. In this case an additional factor of improvement should be obtained. This experiment is performed in the next example.
Images made from simulated data for the same phantom as in EXAMPLE 3, but with the ¼ inch diameter piston transducers replaced with an transducer array of 120 elements, each 2 wavelengths in width and center to center separation.
See
Note the greater accuracy of the image from the new algorithm (right) vs. the time of flight (i.e. CT) image (center). Also note that using an array (
Conclusion for Example 4: Increasing the receiver aperture does indeed improve spatial resolution as predicted by theory. The new algorithm is stable, accurate and fast and can use this increased data set size. The results of the four (4) Examples are self consistent and a speed up factor of 256 over previous parabolic methods is verified.
Transition to 3-D Imaging
The transition from 2-D algorithms (either standard parabolic or fast multi pixel parabolic) is straight forward and only involves replacing the 1-D convolution kernel by a 2-D convolution kernel and other obvious dimensional changes. We have reprogrammed our standard 2-D parabolic algorithm to make 3-D inverse scattering images. The image pixel space for the following test case was 150 by 150 in the (x,y) topographic plane with 60 stacked planes (z axis direction).
We conclude from these images that 3-D imaging is a straight forward modification of 2-D imaging.
Value of Incorporating Reflectivity Imaging by Reflection Tomography.
We show next that incorporating reflection tomography imaging with the inverse scattering algorithm is straightforward and valuable. The speed of sound image made by inverse scattering can be used with ray tracing (either straight ray approximation or actual ray tracing) as a correction map or correction table to adjust the delay times for back propagation of data from a common transmission and reception transducer array. This process is well known to the art. The main difficulty to date has been obtaining an accurate speed of sound image to proceed with the mapping process to generate the correction table. The inverse scattering method fulfills this requirement almost perfectly. We show the result of straight line ray tracing through a speed of sound image made by time of flight tomography to correct a reflection tomography image of human breast tissue in a thin plastic cylinder.
Invention Disclosures for the Parabolic Inverse Scattering Algorithm
—Large (2λ) Pixels
In our original parabolic disclosure: example 12 of APPARATUS AND METHOD FOR IMAGING WITH WAVEFIELDS USING INVERSE SCATTERING TECHNIQUES, S. A. JOHNSON ET AL., the forward propagation of the wavefield from f(x,y) to f(x, y+Δ) where Δ is the pixel site, was performed by Fourier transforms via:
f(x,y+Δ)=Fλ→x−1{p(λ)Fx′→λ{f(x′,y)}} 101.1
where F and F−1 denote the forward and inverse Fourier transforms and the exact propagator, p, is given by:
This propagator is exact. By the convolution theorem, 101.1 can be rewritten as:
f(x,y+Δ)=∫f(x′,y)p(x−x′)dx′ 101.3
where
p(x)=Fλ→x−1{p(λ)} 101.4
In other words, the propagator can be implemented by convolution. The question is, for a discrete version of these equations, which is faster, FFT's and multiplication in the frequency domain or convolution in the spatial domain?
Notice that for Δ=λ/2, if we assume that we can cut off the propagator at i=±20 then the implementation of the propagator by a direct convolution sum would require N*41 operations. This convolution length is probably too long to be any faster than the FFT implementation. However, the Δ=λ case can be cut off at i=±6 and the Δ=2λ case can be cut off at i=±3. In particular, have found that the Δ=2λ case can be implemented with a cut off at i=±3 with no discernible error over the FFT implementation providing that the propagator is multiplied by a Blackmann window:
{tilde over (p)}(i)=p(i)(0.42+0.5 cos (πi/(nfilt+1))+0.08 cos (2πi/(nfilt+1))), i=−nfilt, . . . , nfilt
where nfilt=3 for the Δ=2λ case. Without this filter multiplication, the parabolic propagation was found to be unstable. Apparently, even though the neglected values are small, simple truncation of the propagator results in at least one eigenvalue slightly greater than I resulting in exponential growth for large numbers of parabolic steps.
We had originally thought that the parabolic algorithm required Δ≦λ/2 in order to maintain accuracy for scattering from biological tissue contrasts. This is certainly true if one requires scattering at angles out to 45 deg. from the incident field propagation direction. However, for computation of all scattering in a ±15 deg. cone, Δ=2λ sampling is sufficient. Of course, in a typical medical imaging problem, there will be scattering at large angles, however, suppose that the receivers are 2λ segments. Then the receivers do not see this large angle scattering energy anyway. Thus, the Δ=2λ algorithm will accurately compute the received data. Since the pixel size is now larger, the number of views needed for a complete dataset is also reduced by a factor of 4 over the Δ=λ/2 algorithm.
Compared with the previous Δ=λ/2 algorithm, the speedup in compute time is given by: 4 times fewer views * 16 times fewer pixels*speedup factor of short convolution vs. FFT propagator.
Even assuming that the short convolution buys only a factor of 4 over the FFT (it is likely better than this for nfilt=3), the algorithm is still 256 times faster than the previous algorithm for the same sized object at the same frequency.
Pseudo Code Definitions of the Plane Wave Parabolic Algorithms for the Frequency Domain
nx=x-axis array dimension (x is normal to the parabolic propagation direction)
ny =y-axis array dimension (y is parallel to the parabolic propagation direction)
nr=the number of receivers per view
nv=the number of plane wave views from 0 to 360 deg.
nf=the number of frequencies used
nfilt defines the width of the short convolution propagator: the values f(−nfilt+n) to f(n+nfilt) are used to propagate one step for field index f(n).
ne defines the number of samples used to define a receiver element. Each receiver is −ne to ne, Δ samples long. For a point receiver, ne is set equal to 0.
Propagator Definition
The following pseudo code defines the short convolution propagator generation.
The next step is done by the FFT algorithm:
Definition of the Propagator Operation
The Notation:
Definition of the Rotation Operator
The Notation
tr=Rot(t,φ)
will henceforth denote the rotation of the function in array t(nx,ny) by 4 radians followed by placement into array tr(nx,ny) via bilinear interpolation.
Definition of the Receiver Location Array
The nr receivers are located at the points mr(1r), 1r=1, . . . , nr where each mr(1r) is an element of [1+ne, . . . , nx−ne]
Definition of the Receiver Sensitivity Function
Receiver 1r is centered at point mr(1r) and extends from mr(1r)−ne to mr(1r)+ne. The array wr(−ne:ne,nf) is set to be the receiver sensitivity function on the support of the receiver for each frequency 1, . . . , nf.
The Enveloped Time Domain Parabolic Algorithm
The second invention disclosure for the parabolic inverse scattering algorithm is a means of avoiding local minima when imaging an object with many π phase shifts and when starting from a zero (or a somewhat poor) initial guess. If one has only one frequency and the phase shift through the object is less than 7i radians then one can converge to the global, correct solution from a starting guess of zero. If the phase shift is greater than n then the algorithm will converge to a local minimum if a zero starting guess is used. Extrapolating this, if one has a bandwidth from fmin to fmax then convergence to the correct solution will occur from a zero starting guess only if the phase shift through the object is less than π at fmin. For biological tissue at ultrasound frequencies, say 5 MHz, phase shifts of on the order of 10 π are encountered. This means that fmin needs to be on the order of 1/10 th fmax. This is not possible with the typically 50% bandwidth transducers available at present.
In order to get around this difficulty, we have examined the use of starting guesses computed by straight line time of flight imaging. In this approach the time delay of the acoustic time pulse through the body is extracted from the data. Assuming that the energy took a straight line path through the body for each receiver (not a good assumption due to ray bending (refraction) and diffraction effects) then the speed of sound image can be reconstructed by well known x-ray CT type algorithms. We have found that these starting guesses will work up to a point but as we increase the size and contrast of the body up to that of tissue, we find that the time of flight image is not sufficiently close to the truth to allow inverse scatter imaging with 50% bandwidth transducers.
One may reasonably ask the question: why does the time of flight algorithm work with 50% bandwidth transducers? The answer is that even though the time pulse that travels through the target has only 50% bandwidth, it is still possible to deduce the time delays.
Suppose now that we attempted to find the center of the waveform in
Clearly we would find the negative peak near sample 51—a local maximum. This is analogous to what happens with the parabolic inversion algorithm starting with a zero guess. The time of flight algorithm, on the other hand, finds the time delay by first taking the envelope of the waveform. The envelope squared of the signal shown in
It is now a simple matter to find the time delay to the waveform center with either an optimization algorithm or by simply detecting the index of the maximum value.
The preceding discussion suggests that the parabolic inversion algorithm might avoid local minima, even with 50% bandwidth transducers, if one operated on the envelope of the time waveform as the data. It is a simple matter to take the frequency domain output of the parabolic algorithm, transform to time, and then take the envelope. Unfortunately, the envelope operator is not differentiable and the parabolic inversion algorithm relies on gradient direction optimization (conjugate gradients). This difficulty can, however, be overcome by making the square of the envelope the time domain output data. This function is differentiable and a conjugate gradient optimization algorithm can be applied. Specifically, the new functional to be minimized is:
F[γ]=∥env(fparabolic(γ,t))2−env(fdata(t))2∥2 102.1
(with multiple receivers and views of course). Derivation of the Jacobian and its adjoint for the new functional is straightforward. The resulting forward scattering calculations and the Jacobian and its adjoint are best viewed in the following pseudo code form.
PseudoCode for Time Domain Parabolic Code Algorithm:
For the time domain code, we first set the parameter Δf=the frequency domain sample interval so that the total time period T=1./Δf is sufficient to contain the minimum and maximum time shifts in the data. We then specify the minimum and maximum frequencies by their indices nmin and nmax:
fmin=minimum frequency=nmin Δf,
fmax=maximum frequency=nmax Δf,
using the desired fmax and fmin.
We then select the number of time steps, nt>nmax and set the time domain sample interval as:
Δt=1/(nt Δf).
Then, nf=the number of frequencies, is set=nmax−nmin+1 and the frequency list is set as:
freq(1f)=(1f−1+nmin)Δf, 1f=1, . . . , nf
Now, suppose that the array fc(nr,nv,nf) contains the frequency domain total field computed by the parabolic algorithm for nr receivers and nv views and let the array fiω(nr,nf) contain the incident field for the parabolic algorithm. We then transform this frequency domain to the envelope squared of the time domain data via the operations:
Now, f(nr,nv,nt) contains the time envelope squared of the simulated signals. Note we have also computed the array fco(nr,nv,nt) which contains the complex analytic time domain waveforms. This function is needed for the Jacobian and Jacobian adjoint calculations. The frequency domain filter wt(nf) is given by:
freqc=(fmax+fmin)/2
freqb=(fmax−fmin)/2
wt(1f)=(0.42+0.5cos (π(freq(1f)−freqc)/(freqb)+0.08cos(2π(freq(1f)−freqc)/freqb))*exp(−i2πfreq(1f)Δt (nt/2)), 1f=1, . . . , nf
and is applied to prevent sidelobes in the envelope and to shift the time=0 index in the array from 1 to nt/2+1.
In order to compute the gradient (Jacobian adjoint operating on the residual), we first compute the time domain residual array:
r(1r,1v,1t)=fdata(1r,1v,1t)−f(1r,1v,1t), 1t=1, . . . ,nt, 1v=1, . . . ,1v,1r=1, . . . ,nr.
We then compute the frequency domain residual, rω(nr,nv,nf) by the calculation:
Once the frequency domain residual array is computed, we then apply the original Jacobian adjoint of the frequency domain parabolic algorithm:
∇F=J(frequency domain parabolic Jacobian)H·rω
The Jacobian operating on a perturbation to y (which is also needed to perform a conjugate gradient optimization) is similarly computed.
Note on the Jacobian implementation: observe that the operation of converting to time domain data, then taking the envelope and squaring is concatenated onto the operation of propagating the fields via the parabolic algorithm at the respective frequencies, therefore, it follows that the implementation of the Jacobian will follow from the standard chain rule of vector calculus. This implementation is in fact given explicitly in the above pseudo-code, but is restated explicitly here for further elucidation of the process.
We have implemented this enveloped time domain parabolic algorithm and have verified that it does indeed converge from a zero starting guess. The final image is somewhat degraded (lower resolution) relative to the frequency domain algorithm with a good starting guess due to the loss of some information from the envelope operation. However, this code does produce a sufficiently accurate starting guess for the frequency domain algorithm which can then refine the resolution of the image.
The present invention describes an Apparatus and Method for Imaging Objects with Wavefields as further described in U.S. patent application Ser. No. 11/223,084, filed Sep. 9, 2005; U.S. patent application No. 11/222,541, filed Sept. 9, 2005; U.S. patent application Ser. No. 10/615,569, filed Jul. 7, 2003; U.S. Pat. No. 6,636,584; U.S. Pat. No. 6,587,540; U.S. patent application Ser. No. 08/706,205, filed on Aug. 29, 1996, now abandoned; U.S. patent application Ser. No. 08/486,971, filed Jun. 22, 1995, now abandoned; and U.S. Pat. No. 5,588,032; which are all herein incorporated by reference in their entirety for all purposes.
The present invention may be embodied in other specific forms without departing from its spirit or essential characteristics. The described embodiments are to be considered in all respects only as illustrative and not restrictive. The scope of the invention is, therefore, indicated by the appended claims rather than by the foregoing description. All changes which come within the meaning and range of equivalency of the claims are to be embraced within their scope.
This is a continuation of U.S. patent application Ser. No. 10/615,569, filed Jul. 7, 2003, which is a continuation of U.S. patent application Ser. No. 10/024,035, filed on Dec. 17, 2001, now U.S. Pat. No. 6,636,584, which is a continuation of U.S. patent application Ser. No. 09/471,106, filed on Dec. 21, 1999, now U.S. Pat. No. 6,587,540, which is a continuation-in-part of U.S. patent application Ser. No. 08/706,205, filed on Aug. 29, 1996, now abandoned; which are all herein incorporated by reference.
Number | Date | Country | |
---|---|---|---|
Parent | 10615569 | Jul 2003 | US |
Child | 11437001 | May 2006 | US |
Parent | 10024035 | Dec 2001 | US |
Child | 10615569 | Jul 2003 | US |
Parent | 09471106 | Dec 1999 | US |
Child | 10024035 | Dec 2001 | US |
Number | Date | Country | |
---|---|---|---|
Parent | 08706205 | Aug 1996 | US |
Child | 09471106 | Dec 1999 | US |