This invention relates to systems and methods for crosstalk-free source encoding for ultrasound tomography.
Over the past two decades, Ultrasound Computed Tomography (USCT) has become popular for breast cancer detection. Ray-based solvers have proven to be efficient in this context, and waveform-based inversions have delivered enhanced spatial resolution. The combination of weak out-of-plane scattering, linear transducer arrays, and a large number of ultrasound transmissions has enabled successful two-dimensional (2D) inversions. Despite these advances, the detection of tumors within the entire breast volume requires scanning over numerous stages, resulting in a substantial data acquisition time. Volume reconstruction is based on stacks of 2D images, and spatial resolution is limited by the number of acquired slices and by an averaging effect over the transducer's height. Combining matrix transducer arrays to record data with three-dimensional (3D) wave modeling could mitigate these effects, and fewer data transmissions may be required to sample the entire volume.
In addition to breast cancer screening, detection of prostate cancer with USCT has been investigated, as well as bone and brain tomography. Because these applications involve hard tissues with complex 3D shapes, strong reverberations and diffraction effects are expected, which motivates the use of accurate 3D wave propagation solvers. However, it is well known that 3D waveform inversions come with a significant computational cost, because they involve iterative modeling for each ultrasound transmission. This challenge is mitigated when using frequency-domain solvers, but also becomes computationally expensive when considering large 3D problems.
Thus, there remains a pressing need for novel methods and systems for ultrasound tomography.
This disclosure addresses the need mentioned above in a number of aspects. In one aspect, this disclosure provides systems and methods for real data acquisition acceleration. In the medical imaging context, for instance, the disclosed systems and methods enhance the patient experience, who has an examination lasting a few seconds instead of ten minutes. From a clinic perspective, this allows examining more patients per day. In particular, emitting ultrasound transducers are excited simultaneously such that they each generate a monochromatic wave at a specific frequency. After enough time, this generates a steady polychromatic wavefield, which can be recorded by receiving ultrasound transducers. As a result, the observed “super” dataset of a given receiving transducer contains useful information coming from each of the emitting sources. Recording must last long enough during the steady state to make deblending possible. The deblending operation may be performed numerically (e.g., on a computer) or with a dedicated hardware system.
In another aspect, this disclosure provides systems and methods for numerical simulation acceleration (or image processing acceleration). In the medical imaging context, for instance, the disclosed systems and methods reduce the duration a radiologist must wait to obtain an image, allowing a given computing machine to process more images per day. The disclosed systems and methods can speed up the computation of simulated “super” dataset. It is the numerical equivalent of the above aspect or real data acquisition acceleration. Only one “super” forward simulation is necessary, instead of S simulations, when considering S sources. The disclosed systems and methods can also speed up the computation of a gradient that includes all S sources contributions. Only two “super” simulations (one forward, one adjoint) are necessary, instead of 2S simulations. After deducing a “super” adjoint source that depends on the cost function, a “super” adjoint simulation can be run. To obtain the gradient, it is possible to cross-correlate, in the time domain, the two super wavefields when both are in a steady state. When doing so, it is no longer necessary to use the deblending formula.
Accordingly, this disclosure provides a system for ultrasound tomography. The system comprises: a plurality of sources configured to transmit waves at one or more frequencies within a frequency bandwidth; one or more receivers configured to receive the waves transmitted by the plurality of sources; and a processor configured to: (a) use a computerized representation of physical property maps of a medium that influence wave propagation; (b) model all the sources and the receivers involved in a dataset; (c) associate to each of the sources a sinusoidal source time function with a defined frequency of a set of frequencies regularly spaced within a bandwidth of interest; and (d) perform a wave propagation simulation where sources are transmitting simultaneously for a time Tss+kΔτ, Tss is a simulation duration for simulated wavefield to reach a steady state, where k is an integer>=1, and Δτ=1/Δƒ.
In some embodiments, the system further comprises an ultrasound wave generator configured to associate each of the sources with a defined frequency ƒs of a set of frequencies regularly spaced at Δf within a [ƒmin, ƒmax] bandwidth.
In some embodiments, the processor is further configured to: (e) receive observed measurements from at least one sensor based on a wave transmitted from at least one source passing through a medium; (f) estimate a frequency-domain observed pressure {circumflex over (p)}obs(x, ƒs) for each source, where x is a spatial location of a receivers is a ƒs is a frequency, associated with source s; (g) evaluate
simultaneously for all sources at all receiver positions, where t is time of simulation; and (h) estimate a cost function based on the values of {circumflex over (p)}obsand {circumflex over (p)}syn.
In some embodiments, the processor is further configured to: (i) evaluate the gradient of the cost function for the bulk modulus κ as defined by
where Psyn † is encoded adjoint wavefield, and ∂dt
In some embodiments, the processor is further configured to repeat steps (d) and (g)-(i) to minimize the cost function.
In some embodiments, the defined frequency ƒs=ƒmin+(s−1)Δf and
where s denotes an individual source and Sis the total number of sources, ƒs is frequency associated with source s, and [ƒmin, ƒmax] is the bandwidth of interest.
In some embodiments, each source is detected by a different subset of sensors. In some embodiments, the waves transmitted from the sources are visco-acoustic waves.
In some embodiments, the simulation comprises at least one forward simulation and adjoint simulation per iteration.
In some embodiments, the processor is further configured to evaluate
where s represents a given source simulated in simulation, r represents a given receiver, that records a given source s, xr is the position of the receiver r, t is time of the simulation, τ is time after Tss, Psyn is a wavefield simulated by a solver, ƒs is a frequency associated with source s, and ε can be any value.
In some embodiments, wherein the processor is further configured to compute a gradient of the cost function represented by:
where m is a wavespeed, density, attenuation map, or a combination,
In some embodiments, the cost function comprises a source-encoded cost function that integrates contributions from all the sources. In some embodiments, the cost function comprises waveform differences, phase and amplitude measurements, double-difference phase and amplitude measurements, cross-correlation traveltime measurements, or a generic adjoint tomography cost function.
In some embodiments, the system comprises an ultrasound device. In some embodiments, the medium comprises human tissue or soil. In some embodiments, the processor is further configured to generate an image representation of the medium.
Also provided in this disclosure is a method for ultrasound tomography. The method comprises: (i) using a computerized representation of physical property maps of a medium that influence wave propagation; (ii) modeling all the sources and the receivers involved in a dataset; (iii) associating to each of the sources a sinusoidal source time function with a defined frequency of a set of frequencies regularly spaced within a bandwidth of interest; and (iv) performing a wave propagation simulation where sources are transmitting simultaneously for a time Tss+kΔτ, where Tss is a simulation duration for simulated wavefield to reach a steady state, k is an integer>=1, and ΔT=1/Δƒ.
In some embodiments, the method further comprises associating each of the sources with a defined frequency ƒs of a set of frequencies regularly spaced at Δf within a [ƒmin, ƒmax] bandwidth.
In some embodiments, the method further comprises: (v) receiving observed measurements from at least one sensor based on a wave transmitted from at least one source passing through a medium; (vi) estimating a frequency-domain observed pressure Pobs(x, ƒs) for each source, where x is a spatial location of a receiver, ƒs is a frequency associated with source s; (vii) evaluating
simultaneously for all sources at all receiver positions, where t is time of simulation; and (viii) estimating a cost function based on the values of {circumflex over (P)}obs and {circumflex over (P)}syn.
In some embodiments, the method further comprises (ix) evaluating the gradient of the cost function for the bulk modulus κ, as defined by
Psyn † is an encoded adjoint wavefield, and ∂t
In some embodiments, the method further comprises repeating steps (iv) and (vii)-ix) to minimize the cost function.
In some embodiments, the defined frequency ƒs=ƒmin+(s−1)Δf and
where s denotes an individual source and Sis the total number of sources, ƒs is frequency associated with source s, and [ƒmin, ƒmax] is the bandwidth of interest.
In some embodiments, each source is detected by a different subset of sensors. In some embodiments, the waves transmitted from the sources are visco-acoustic waves.
In some embodiments, the simulation comprises at least one forward simulation and adjoint simulation per iteration.
In some embodiments, the method further comprises evaluating
where s represents a given source simulated in simulation, r represents a given receiver, that records a given source s, xr is the position of the receiver r, t is time of the simulation, τ is time after Tss, Psyn is a wavefield simulated by a solver, ƒs is a frequency associated with source s, and ε can be any value.
In some embodiments, the method further comprises computing a gradient of the cost function represented by:
where m is a wavespeed, density, attenuation map, or a combination,
In some embodiments, the medium comprises human tissue or soil. In some embodiments, further comprising generating an image representation of the medium.
The foregoing summary is not intended to define every aspect of the disclosure, and additional aspects are described in other sections, such as the following detailed description. The entire document is intended to be related as a unified disclosure, and it should be understood that all combinations of features described herein are contemplated, even if the combination of features are not found together in the same sentence, or paragraph, or section of this document. Other features and advantages of the invention will become apparent from the following detailed description. It should be understood, however, that the detailed description and the specific examples, while indicating specific embodiments of the disclosure, are given by way of illustration only, because various changes and modifications within the spirit and scope of the disclosure will become apparent to those skilled in the art from this detailed description.
This disclosure describes crosstalk-free source encoding for ultrasound tomography. The disclosed approach allows for, for example, rapid image reconstruction using full waveform inversion (FWI) and rapid acquisition of experimental signals, an improvement of existing methods. More particularly, in one use for the disclosed approach, it should greatly speed up the computation time of ultrasound images based on FWI, and also allows for more ultrasound data to be included in the image inversion, leading to much better image quality. This use could be applied, for example, in a geoscience context. In another use for the disclosed approach, real data acquisition may be performed using source encoding, i.e., each transducer uses a specific source function in order to speed the data acquisition process. In other words, obtaining data could be done in a matter of seconds or less, instead of minutes currently.
An important feature of the disclosed approach relies on the crosstalk-free aspect of the source encoding method. Thanks to this, the disclosed encoding provides exactly the same result as conventional inversion, but using two wave simulations per iteration, instead of 2*S simulations per iteration, where S represents the number of sources that are considered for inversion. This greatly increases the speed with which inversions (image reconstructions) can be conducted. Existing source encoding methods introduce artifacts that reveal to be of major negative impact when the number of sources increases. Therefore, the disclosed method allows for the reconstruction of higher quality images.
An exemplary application of the disclosed approach is a program that implements source encoding to speed up the computational process. Another exemplary application is an ultrasound device, with transducers simultaneously firing multiple, specific frequencies, instead of a more conventional one by one firing strategy. This greatly speeds up the data acquisition process.
The disclosed approach may be employed to construct a 3D ultrasound FWI scanner that provides accurate images in an acceptable time (<30 min). Other solutions would increase errors in the resulting images, because of 1) fewer sources recorded, 2) simplified physics in the inversion process, and 3) patient movement during scanning leading to blurring.
The disclosed method is implemented in two complementary ways. In the first approach, the encoded forward and adjoint wavefields are run until they reach a steady state, at which point they are “deblended” to obtain their stationary parts: one pair associated with each source in the dataset. The stationary parts of the encoded forward and adjoint wavefields are combined to calculate the misfit gradient by summing their respective contributions. In the second approach, the steady-state encoded forward and adjoint wavefields are convolved over a time period proportional to the inverse of the frequency bandwidth of interest. Using this strategy, the encoded forward and adjoint wavefields do not need to be deblended, nor is there a need to calculate or store intermediary stationary contributions to the gradient. A wide variety of source-encoded cost functions are considered, including waveform differences, phase and amplitude measurements, “double-difference” phase and amplitude measurements, cross-correlation travel-time measurements, or a generic adjoint tomography cost function. “Super measurements” are used based on source-encoded Fourier coefficients of entire observed and simulated seismograms, as in pure FWI, one iteration requires just two numerical simulations, independent of the number of sources and receivers. In terms of device development, the disclosed device will allow a significant speed-up of acquisition of data from equipment. For example, instead of taking measurements source by source, measurements can be taken across all sources simultaneously, rapidly decreasing the amount of time it takes to acquire ultrasonic data from an object, patient etc.
Accordingly, in one aspect, this disclosure provides a system for ultrasound tomography. The system may include: a plurality of sources configured to transmit waves at one or more frequencies within a frequency bandwidth; one or more receivers configured to receive the waves transmitted by the plurality of sources; and a processor configured to: (a) use a computerized representation of physical property maps of a medium that influence wave propagation; (b) model all the sources and the receivers involved in a dataset; (c) associate to each of the sources a sinusoidal source time function with a defined frequency of a set of frequencies regularly spaced within a bandwidth of interest; and (d) perform a wave propagation simulation where sources are transmitting simultaneously for a time Tss+kΔΣ, where Tss is a simulation duration for simulated wavefield to reach a steady state, k is an integer>=1, and Δτ=1/Δƒ.
In some embodiments, the system may further include an ultrasound wave generator configured to associate each of the sources with a defined frequency ƒs of a set of frequencies regularly spaced at Δf within a [ƒmin, ƒmax] bandwidth.
In some embodiments, the processor may be further configured to: (e) receive observed measurements from at least one sensor based on a wave transmitted from at least one source passing through a medium; (f) estimate a frequency-domain observed pressure {circumflex over (p)}obs(x, ƒs) for each source, where x is a spatial location of a receiver, ƒs is a frequency associated with source; (g) evaluate
simultaneously for all sources at all receiver positions, where t is time of simulation; and (h) estimate a cost function based on the values of {circumflex over (p)}obsand {circumflex over (p)}syn.
In some embodiments, the processor may be further configured to: (i) evaluate the gradient of the cost function for the bulk modulus κ, as defined by
where Psyn † is an adjoint wavefield, and ∂t
In some embodiments, the processor may be further configured to repeat steps (d) and (g)-(i) to minimize the cost function.
In some embodiments, the defined frequency ƒs=ƒmin+(s−1)Δf and
where s denotes an individual source and Sis the total number of sources, ƒs is frequency associated with source s, and [ƒmin, ƒmax] is the bandwidth of interest.
In some embodiments, each source can be detected by a different subset of sensors. In some embodiments, the wave transmitted from the sources can be visco-acoustic waves.
In some embodiments, the simulation may include at least one forward simulation and adjoint simulation per iteration.
In some embodiments, the processor may be further configured to evaluate
where s represents a given source simulated in simulation, r represents a given receiver, that records a given source s, xr is the position of the receiver r, t is time of the simulation, τ is time after Tss, Psyn is a wavefield simulated by a solver, ƒs is a frequency associated with source s, and ε can be any value.
In some embodiments, wherein the processor may be further configured to compute a gradient of the cost function represented by:
where m is a wavespeed, density, attenuation map, or a combination,
In some embodiments, the costfunction may include a source-encoded cost function that integrates contributions from all the sources. In some embodiments, the cost function comprises waveform differences, phase and amplitude measurements, double-difference phase and amplitude measurements, cross-correlation traveltime measurements, or a generic adjoint tomography cost function.
In some embodiments, the system may include an ultrasound device. In some embodiments, the medium comprises human tissue or soil. In some embodiments, the processor is further configured to generate an image representation of the medium.
In another aspect, this disclosure further provides a method for ultrasound tomography. The method may include: (i) using a computerized representation of physical property maps of a medium that influence wave propagation; (ii) modeling all the sources and the receivers involved in a dataset; (iii) associating to each of the sources a sinusoidal source time function with a defined frequency of a set of frequencies regularly spaced within a bandwidth of interest; and (iv) performing a wave propagation simulation where sources are transmitting simultaneously for a time Tss+kΔτ, where Tss is a simulation duration for simulated wavefield to reach a steady state, k is an integer>=1, and Δτ=1/Δƒ.
In some embodiments, the method may further include associating each of the sources with a defined frequency ƒs of a set of frequencies regularly spaced at Δf within a [ƒmin, ƒmax] bandwidth.
In some embodiments, the method may further include: (v) receiving observed measurements from at least one sensor based on a wave transmitted from at least one source passing through a medium; (vi) estimating a frequency-domain observed pressure {circumflex over (P)}obs(x, ƒs) for each source, where x is a spatial location of a receiver, ƒs is a frequency associated with source s; (vii) evaluating
simultaneously for all sources at all receiver positions, where t is time of simulation; and (viii) estimating a cost function based on the values of {circumflex over (P)}obs and {circumflex over (p)}syn.
In some embodiments, the method may further include: (ix) evaluating the gradient of the cost function for the bulk modulus κ as defined by
where Psyn † is an encoded adjoint wavefield, and ∂t
In some embodiments, the method may further include repeating steps (iv) and (vii)-(ix) to minimize the cost function.
In some embodiments, the defined frequency ƒs=ƒmin+(s−1)Δf and
where s denotes an individual source and Sis the total number of sources, ƒs is frequency associated with source s, and [ƒmin, ƒmax] is the bandwidth of interest.
In some embodiments, each source can be detected by a different subset of sensors. In some embodiments, the wave transmitted from the sources may be visco-acoustic waves.
In some embodiments, the simulation may include at least one forward simulation and adjoint simulation per iteration.
In some embodiments, the method may further include evacuating
where s represents a given source simulated in simulation, r represents a given receiver, that records a given source s, xr is the position of the receiver r, t is time of the simulation, τ is time after Tss, Psyn is a wavefield simulated by a solver, ƒs is a frequency associated with source s, and ε can be any value.
In some embodiments, the method may further include computing a gradient of the cost function represented by:
where m is a wavespeed, density, attenuation map, or a combination,
The following sections describe in greater detail the disclosed methods and systems as well as their applications (e.g., ultrasound, earthquake seismology).
I. Source Encoding for Viscoacoustic Ultrasound Computed Tomography
Ultrasound computed tomography (USCT) is a noninvasive imaging modality that has shown its clinical relevance for breast cancer diagnostics. As opposed to traveltime inversions, waveform-based inversions can exploit the full content of ultrasound data, thereby providing increased resolution. However, this is only feasible when modeling the full physics of wave propagation, accounting for 3D effects such as refraction and diffraction, and this comes at a significant computational cost. To mitigate this cost, a crosstalk-free source encoding method for explicit time-domain solvers is disclosed herein, as shown in
This disclosure presents a crosstalk-free source encoding technique for explicit solvers in USCT. In the context of waveform inversion, this method takes advantage of the computational efficiency and minimal memory requirements of an explicit solver, while only two wave simulations are used to compute the misfit gradient, regardless of the number of sources and receivers. This disclosure provides the equations for acoustic and viscoacoustic problems and illustrates the approach with different configurations, including synthetic and real data experiments. Computational considerations are emphasized to highlight the relevance and practical advantages of the disclosed technique.
i. Frequency-Domain Measurements
Due to the nature of the wave equation, the observed Green function, Gobs(X, Xs, t), capturing wave propagation from a source Xs to a position X, can be seen as a linear filter. It has a finite impulse response due to geometrical divergence and/or attenuation effects; its duration is denoted by T1. In the time domain, one may express the observed pressure pobs(X, t) as a convolution of the observed Green function and the observed source time function, Sobs(Xs, t), that is,
Pobs(X,t)=∫0T1Gobs(x,xs,t−τ)Sobs(xs,τ)dτ. (1)
Let T2 denote the duration of the source sobs. It is assumed that the pressure time series is recorded over a duration T≥T1+T2, and the Fourier transform is applied:
In the frequency domain, the observed pressure pobs(x, ƒ) can be conveniently described as a product:
where Sobs(Xs, ƒ) denotes the observed source time function at frequency ƒ and Gobs(X, Xs, ƒ) the Green function of the medium between source Xs and position X at frequency ƒ.
Because the Green function Gobs is directly influenced by the mechanical properties of the medium (e.g., sound speed, attenuation.), the frequency-domain observed pressure
ii. Source Encoding
In this section, how each source is encoded in a single numerical simulation and how to decode each source contribution from this “super” wave-field are illustrated. The advantages of the disclosed method were analyzed from a computational perspective.
A. Theory
1. Encoding
The key idea is to take advantage of the fact that for any Δτ-periodic trigonometric polynomial
P(t)=Σn=−∞+∞cne2iπnt/Δτ. (4)
its Fourier coefficients cn may be extracted based on the following expression:
Numerically, it is possible to generate a wavefield be-having as an Δτ-periodic trigonometric polynomial in any point of the domain by enforcing multiple monochromatic sources with regularly spaced frequencies. Rather than modeling the full spectrum of a certain source, one (or a few) specific frequencies, which are a multiple of 1/Δτ are attributed.
Let us consider a set of S sources operating in the frequency band [ƒmin, ƒmax]. Each source s is associated with a monochromatic source time function with a frequency
Each source s, located at Xs and transmitting at frequency fs, may have a certain amplitude and phase shift, represented by a complex constant Ssyn(Xs, ƒs), which can be useful to model a source time function. As for Eq. (3), the simulated pressure psyn of a single source in the medium may be expressed as follows:
where Gsyn denotes the synthetic Green function capturing monochromatic behavior at frequency ƒs due to a source located at Xs recorded at position X.
Because an explicit time-domain solver is used, there are causality effects. To attain monochromatic behavior, the simulation must run sufficiently long to eliminate transient behavior of the wavefield, as illustrated in
When simultaneously firing multiple sources, after reaching a steady state, the resulting “super” wavefield may be expressed in the form
∀t≥Tss,Psyn(x,t)=RΣs=1Spsyn(x,ƒs)e2iπƒst (9)
For any position X, Psyn is a trigonometric polynomial of the form of Eq. (4) with a period determined by Eq. (7).
2. Decoding
An explicit time-domain solver is used to recover the individual synthetic pressures,
While Psyn is a superposition of multiple monochromatic waves, Eq. (10) enables us to “decode” the contribution of each source without crosstalk thanks to the orthogonality of the Fourier basis, as illustrated in
It is worth noting that, contrary to the approach of (Capdeville et al., 2005; Geophys. J. Int. 162, 541-554), one may consider a different set of receivers for each source. Furthermore, if the same receivers are used, it is possible to discard some receivers for each source. For instance, when evaluating Eq. (10) through a fast Fourier transform, Fourier coefficients of frequencies that are not of interest may be discarded.
B. Practical Considerations
It may be a challenge to determine the time to reach a steady state, Tss. Setting a very large value to ensure that transient effects have vanished can be time-consuming. For the breast model shown in
where s represents a given source simulated in simulation, r represents a given receiver, that records a given source s, xr is the position of the receiver r, t is time of the simulation, τ is time after Tss, Psyn is a wavefield simulated by a solver, ƒs is a frequency associated with source s, and ε can be any value.
While this approach is more accurate, evaluating Eq. (11) for multiple time windows, multiple frequencies, and multiple points can also be computationally expensive.
It is worth noting that while a single simulation suffices to encode as many sources as desired, the associated compute time still depends on the number of sources, since, according to Eq. (7), there is a linear dependence between the integration period Δτ and the number of sources S. When Tss is much smaller than Δτ, the link between the number of sources and compute time becomes nearly linear. Nevertheless, a longer simulation with an explicit solver does not impose extra RAM requirements. Recorded data can be regularly written on disk during the simulation, or the decoded Fourier coefficients can be computed on the fly to avoid I/O. In addition, the overall compute time between one encoded simulation and S non-encoded simulations still significantly decreases. As shown in the next section, while small meshes can result in a speedup of around 50, larger meshes can result in a larger speedup, which can be in the thousands because of the increase in bandwidth. For instance, the expected speedup for a [500-1,500] kHz bandwidth is 10 times bigger than for a [50-150] kHz bandwidth, even though the ratio ƒmax/ƒmin is the same for each case.
iii. Gradient Computation
In the context of waveform inversion, source encoding is used to compute the gradient of a cost function that integrates the contributions from all the sources, using only two “super” simulations.
A. Cost Function
To take full advantage of the disclosed source encoding method, frequency-domain measurements is considered. While time-domain data processing cannot be performed, the measurement computation requires just a single “super” simulation.
As an example, the classical waveform cost function is first considered
where m is a wavespeed, density, attenuation map, or a combination,
While observed measurements
In the frequency domain, observed and synthetic measurements can be interpreted as a product between a source time function and a Green function, as expressed in Eqns. (3) and (8). Thus, based on Eq. (12), matching the observed and synthetic Green functions was attempted. A challenge with this approach is that is predicated on the assumption that
Ssyn(Xs,ƒs)≈Sobs(xs,ƒs) (13)
Alternatively, the observed source time function needs to be estimated from the data.
It is possible to avoid this challenge by considering the following “double difference” measurement:
Contrary to the standard waveform cost function (12), this measurement does not depend on observed or synthetic source time functions. It only relies on matching observed and synthetic Green functions, which includes both phase and amplitude information, because the complex logarithm is defined as
log(z)=ln|z|+iArg(z) (16)
B. Adjoint Wavefield
The adjoint state method was used to determine Frechet derivative of the cost function. The adjoint wavefield is denoted by P†(x, t). The adjoint source corresponding to the waveform cost function described in Eq. (12) is given by
F†(x,t)=RΣs=1SΣr=1Rs[
and the adjoint source corresponding to the double difference, complex logarithmic cost function (14) is given by
F†DDZ(x,t)=Σs=1SΣr=1RsΣr′≠rRs[R{ΔΔlog Zrr′S}cos(2πƒst−θsyn)+{ΔΔlog Zrr′s}sin(2πƒst−θsyn)]Asyn−1δ(x−xr) (18)
where
It is worth noting in Eqns. (17) and (18) that for each receiver r the adjoint source due to source s is a superposition of monochromatic sources of frequency ƒs. Thus, by construction, the adjoint source involves for each source the same frequency content as the forward wavefield, and a single “super” adjoint wavefield suffices to compute the Frechet derivatives for all the sources.
To invert for attenuation, a second adjoint wave-field is computed to obtain the inverse bulk quality factor, Qk−1, Frechet derivative. More details about modeling attenuation are provided in the sections below. This second adjoint wavefield uses a new adjoint source, based on a transformation of the adjoint source of Eqns. (17) or (18), namely,
F†Qκ−1(x,t)=RΣs=1S[(2/π)ln(|ƒs|/ƒ0)−isgn(ƒs)]×
where ƒ0 is the frequency of reference for the bulk modulus.
C. Frechet Derivatives
The Frechet derivative expressions is adapted for acoustic media. The bulk modulus Frechet derivative, Kκ, is taken as an example:
It is interesting to note that the equalities Eqns. (21) and (22) are both of practical use, each with different advantages and disadvantages. Specifically, Eq. (22) requires decoding of both the “super” forward and the “super” adjoint wavefields using Eq. (10). To efficiently compute
iv. Synthetic Data Inversions
In this section, various numerical experiments are considered to illustrate the disclosed source encoding method and discuss strategies to maximize its benefits. All inversions are performed on a workstation with an Nvidia RTX 2080Ti GPU with 11 Gb of global memory, 16 Gb of RAM memory, 100 Gb of SSD NVMe memory, and 2Tb of HDD for data storage. As in the previous section, the SPECFEM2D and SPECFEM3D packages in single precision and the SeisFlows framework are used to perform inversions.
A. 2D Wavespeed Inversion
As a first example, the wavespeed model shown in
Tss=0.25 ms of wave propagation were simulated, and Δτ≈2.56 ms was used for the first ten iterations. Super simulations are 2.81 ms long while standard simulations are 0.25 ms long, but the gradient can be computed with only two super simulations, in contrast with 2,048 standard simulations in the usual case. The theoretical speedup is 91X. The practical speedup is even more important, because multiple solver initializations are avoided, in addition to significantly reduced I/O. When refining the mesh, the available bandwidth increases, and one can reduce the value of Δτ. During the ten last iterations, Δτ 0.34 ms was used, leading to a 863× speedup compared to a standard inversion. Taking advantage of the increase in bandwidth helps to further reduce the impact of mesh refinement over compute time. This inversion, which involves 1,024 sources, 1,024 receivers, 90 iterations, and a data content up to 3 MHz, requires a compute time of 45 minutes using a single GPU.
B. 2D Joint Inversion
The same configuration was used as in the previous test—this time including attenuation effects-using the target attenuation map shown in
As shown in
C. 3D wavespeed inversion
The target map for the 3D example is a breast model constructed with the extremely dense breast dataset (OA-Breast Database, dataset Neg 47 Left) from (Lou et al., 2017; Journal of biomedical optics 22). 2,400 transducers evenly spaced on four faces of the model (600 transducers per face) were used, which act as sources and receivers. “Observed” data are simulated with 2,400 wave simulations, with each transducer emitting one-by-one a source time function combining two Ricker wavelets with 140 kHz and a 420 kHz central frequencies to cover the [50-1,100] kHz bandwidth. The map is modeled in a 15 cm×15 cm×10 cm domain, and discretized with 281,250 spectral elements, each using 512 discretization points. The local grid involves 144 million points, resulting in a 549 Mb wavespeed map. The corresponding SPECFEM3D run uses 5.3 Gb of RAM memory and 3.7 Gb of GPU global memory, which highlights the modest memory requirements of explicit solvers.
Inversion results are presented in
Larger problems involving a bigger ROI or a higher frequency range can be handled using the multi-GPU capability of the SPECFEM3D package. Unlike implicit methods, this explicit solver achieves excellent weak and strong scaling.
v. Real Data Inversion
In this example, the disclosed source encoding method was applied to the open-source Multimodal Ultrasound Breast Imaging System (MUBI) dataset for the SPIE USCT Data Challenge 2017 (Ruiter et al., 2017; International Society for Optics and Photonics, Vol. 10139, p. 101391N). The experimental setup consists of two linear transducer arrays with 16 active elements and a central frequency of 3.5 MHz. Each array can rotate around the region of interest, as described in
To obtain the synthetic measurements within a single super simulation, the synthetic pressure was recorded at all 176×23=4, 048 positions that are used in the real experiment, even if a single transmission is recorded at only 176 different positions. When a source is not recorded by a given receiver, then—for this receiver dataset—the coefficient of the frequency associated with this source was discarded.
The initial model is homogeneous. A complex logarithmic double difference cost function was used, to avoid an estimation of the source time function, and “empty” measurements (measurements made in pure water) were not used in the analysis. The amplitude spectrum of the data is shown in
As the emitting and receiving transducer characteristics are not known, they are modeled as a point source/receiver, which largely reduces the expected resolution, in particular for a waveform-based measurement.
This point source modeling becomes less valid with increasing frequency and explains the weak resolution improvement between
A. Viscoacoustics and Constant-q Absorption Band Model
In the absence of attenuation, wave propagation may be represented by the following equation:
κ−1∂t2p−∇·(ρ−1∇p)=θ (A1)
where κ=ρVp2 denotes the bulk modulus, ρ the mass density, Vp the compression wavespeed, ƒ the source term, and p the acoustic pressure. To incorporate effects due to attenuation using a constant-Q absorption band model, the bulk modulus becomes frequency-dependent, namely,
where ƒ0 is a frequency of reference and Qκ the bulk quality factor. Alternatively, a power-law Q-model is readily accommodated. Thus, to describe attenuation, one must provide Qκ(x) and κ(x, ƒ0) maps for a chosen frequency of reference f0. Modeling of an absorption band model using a time-domain solver can be achieved using a series of standard linear solids (SLS). Based on the experiments, three SLS are enough to properly model attenuation effects in a frequency band that is two decades wide. From a computational perspective, a viscoacoustic simulation is about 70% slower than a purely acoustic simulation and uses about 35% of additional RAM memory with the SPECFEM package.
B. Orthogonality
For s≠s′, using Eq. (6),
Thus, as long as T=nΔτ, for some integer n, this yields
This orthonormality of the Fourier basis is at the heart of the source encoding algorithm.
vii. Conclusions
This disclosure demonstrated the feasibility of a crosstalk-free source encoding method for explicit solvers in the context of medical imaging. This enables waveform inversion based on full-physics modeling. The approach is compatible with modest computational resources, because RAM memory requirements do not depend on the number of sources, despite all sources being simultaneously inverted. The disclosed approach scales linearly on thousands of, thereby providing an opportunity to further reduce the computational cost.
While the compute cost of a 3D explicit solver is proportional to the fourth power of the highest modeled frequency, the cost of a 3D inversion with the disclosed method is proportional to the third power of the highest modeled frequency. Even if some challenges remain at higher frequencies, this full-physics-based approach becomes more relevant as diffraction and other 3D effects become increasingly important, including in soft tissue. These improvements, combined with future generations of GPUs, help us anticipate the possibility of performing 2 MHz and higher inversions with 3D full physics for breast imaging using reasonable computational resources within the next decade.
II. Source Encoding for Adjoint Tomography and its Applications in Geoscience
A version of source encoding is used to facilitate the calculation of the gradient of a cost function independent of the number of sources or receivers. The crosstalk-free method requires only two numerical simulations per iteration, namely, one source-encoded forward simulation and one source-encoded adjoint simulation. Importantly for practical applications, each source does not need to be recorded by all receivers. The method is implemented in two complementary ways. In the first approach, the encoded forward and adjoint wavefields are run until they reach a steady state, at which point they are ‘decoded’ to obtain their stationary parts. These stationary parts are combined to calculate the misfit gradient by summing their respective contributions. In the second approach, the steady-state encoded forward and adjoint wavefields are convolved over a time period proportional to the inverse of the encoded frequency spacing. Using this strategy, the encoded forward and adjoint wavefields do not need to be decoded, nor is there a need to calculate or store intermediary stationary contributions to the gradient. A wide variety of source-encoded cost functions are considered, including waveform differences, phase and amplitude measurements, double-difference phase and amplitude measurements, cross-correlation traveltime measurements, or a generic adjoint tomography cost function. When measurements in specific time windows are involved in the construction of the source-encoded cost function, as in adjoint tomography, the computational cost scales linearly with the number of seismic sources, because the necessary synthetic seismograms must be computed individually. In contrast, when using super measurements' based on source-encoded Fourier coefficients of entire observed and simulated seismograms, as in pure full-waveform inversion (FWI), one iteration requires just two numerical simulations, independent of the number of sources and receivers. The method is illustrated based on examples from both earthquake and exploration seismology, highlighting inversion options and strategies involving frequency—and time-domain encoding, decoding (with more than 16 000 frequencies), encoded frequency randomization, encoding multiple frequencies per source, effects of noise, a variable number of receivers per event, various measurements and related cost functions and attenuation.
In this application, the following Fourier convention is used,
s(x,ω)=∫−∞∞s(x,t)exp(−iωt)dt (23)
with inverse
Here s denotes a vector displacement field, x a position vector, t time, and ω angular frequency.
i. Forward Wavefield in Earthquake Seismology, the Seismic Source May be Represented as an Equivalent Body
force of the form:
ƒj(x,t)=−Mjk(xs)∇kδ(x−xs)S(t) (25)
where Mk denotes an element of centroid moment tensor, xs the centroid location, and S(t) the source time function. In terms of the Green's function, Gij(x, x′, t−t′), the displacement field due to a body force may be expressed in the form:
si(x,t)=∫−∞tGij(x,x′,t−t)ƒj(x′,t′)d3x′dt′ (26)
Written out explicitly, the displacement due to a point force (25) becomes
si(x,t)=∫−∞t∇kGij(x,xs;t−t′)Mkj(xs)S(t′)dt′ (27)
The goal will be to design a method for the calculation of Frechet derivatives that combines a set of earthquakes into a single distributed encoded ‘super’ source, thereby significantly reducing the numerical cost of an inversion.
ii. Source Encoding
To perform adjoint tomography with a set of S sources in an angular frequency band [ωmin, ωmax]. The goal is to perform such an inversion based on just a single combination of encoded ‘super’ forward and adjoint simulations by effectively tagging each individual source. This is accomplished by randomly assigning each source s a unique frequency, ωs, s=1, . . . , S, defined by
ωs=ωmin+(s−1)Δω (28)
where
Δω=(ωmax−ωmin)(S−1) (29)
thereby covering the frequency band of interest, [ωmin, ωmax].
The approach will be to run encoded forward and adjoint simulations driven by a superposition of monochromatic sources with frequencies (Eq. (28)) until the wavefields reach a steady state after some time Tss. The technique will take advantage of the fact that the source-encoded Fourier basis is orthogonal in the following sense:
The time interval Δτ determines a period of integration required for ‘deblending’ or ‘decoding’ the encoded forward and adjoint wavefields, as well as the length of the convolution of the encoded forward and adjoint wavefields for the calculation of Frechet derivative.
While Huang & Schuster (2018) (Geophys. Prospect., 66, 1243-1257) and Dai et al. (2013) (Geophysics, 78, S233-S242) choose Tss=Δτ, this choice leads to poor performance when many frequencies are encoded, or to strong spectral leakage in the gradient when few frequencies are encoded. On one hand, as expressed in Eq. (31), Δτ solely depends on the available bandwidth and the number of encoded frequencies. On the other hand, Tss is determined by simulation parameters, such as wave speeds and the spatial dimensions of the domain. Intuitively, a steady state is obtained once transient effects disappear. Practically, one can compute the Fourier coefficients of the encoded wavefield evaluated at the encoded frequencies (28) over a time interval [t0; t0+Δτ]. There exists a time Tss such that these coefficients remain unchanged for any t0>Tss. This evaluation may be performed prior to inversion.
In practice, the set of frequencies determined by (28) is randomly distributed over the sources at the start of every iteration, and it was demonstrated that one may choose to assign more than one frequency per source. If the number of frequencies per source is Nƒ, then there are a total of SNƒ frequencies. In that case, the bandwidth of interest is divided up accordingly:
Δω=(ωmax−ωmin)/(SNƒ−1) (32)
and the time interval for integration becomes
iii. Encoding Forward Wavefield
To encode the forward wavefield, for a specific source s, Eq. (27) can be expressed in the frequency domain as
Sis(x,ω)=∇kGij(X,xs;ω)Mkj(xS)Ss(ω) (34)
where Ss denotes the source-time function associated with source s. At source frequency ωs, the stationary field is defined as
Based on the results discussed in the above section about forward wavefield, the last equality motivates the definition of the encoded time-domain super source
Fj(x,t)=RΣs=1S(x,ωs)exp(iωst) (37)
where ƒjs(x, ωs) denotes the Fourier transform of body force (25) associated with sources. Effectively, all sources broadcast simultaneously, each at its own unique frequency ωs. Encoded super sources for an earthquake modeled by a Gaussian moment-rate tensor and a shot modeled by a Ricker wavelet are presented above.
The encoded super source (37) gives rise to the encoded forward wavefield
Si(x,t)=∫−∞t∫Gij(x,x′;t−t′)Fj(x′,t′)d3x′dt′ (38)
Thus the recipe for calculating the encoded super forward wavefield is simple: combine all earthquakes into the source-encoded super source (15) and run one forward simulation to obtain the encoded super forward wavefield (38).
iv. Decoding the Encoded Forward Wavefield
The encoded forward simulation (38) is carried out until the wavefield reaches a steady state at a time Tss, that is, until it can be expressed as a trigonometric polynomial where
S
i
s
=A
i
s(x)−iBis(x) (40)
where
Ais(x)=Tsis(x) (41)
and
Bis(x)=−sis(x) (42)
Using the orthogonality relationship (30), the stationary parts of the encoded forward wavefield Si(x, t) may be ‘deblended’ or ‘decoded’ based on the following integrations:
The time Δτ required for decoding is defined by Eq. (31). Alternatively, the stationary parts may be determined based on trigonometric interpolation of the steady-state encoded forward wavefield using a nonuniform fast Fourier transform.
The Fourier coefficients of interest, Ais(x) and Bis(x), may also be obtained based on a discrete Fourier transform of the steady-state encoded forward wavefield (39). This is possible when the sampling frequency Δƒ of the seismograms is chosen multiple of 1/Δƒ, (2) the Shannon theorem is respected, that is Δƒ>ωmax/π, and (3) ω1=ωmin must be chosen as a multiple of Δω. Under these conditions, the vector generated by the fast Fourier transform has a subset that contains exactly the Fourier coefficients of the encoded frequencies ωs, s=1, . . . , S. The delay theorem must be applied to recover the correct phase at Tss.
v. Full Waveform Inversion
To develop intuition for inversions based on the encoded forward wavefield (38), the simple case of classical FWI is first considered.
A. Classical FWI
Where dis(xr, t) denote the data recorded at receiver r for sources, and where r=1, . . . , RS labels the receivers that recorded events. It is important to recognize that in earthquake seismology, an event is typically recorded by a subset of the available receivers. The variation of the cost function (45) is
δχs=Σr=1Rs∫(xr,t)−dis(xr,t)]δssi(xr,t)dt (46)
and the associated adjoint source is
fj†s(x,t)=Σr=1Rs[sis(xr,−t)−dis(xr,−t)]δ(x−xr) (45)
This adjoint source gives rise to the FWI adjoint wavefield
si†(s,t)=∫−∞t∫Gij(x,x′,t−t′)ƒj†(x′,t′)d3x′dt′ (48)
B. Source-Encoded FWI
If the desired measurement is based on the Fourier coefficients of the full time-series of the seismograms, the observed data dis(xr, t) are Fourier transformed for all sources s=1, . . . S, recorded by a set of receivers xr, r=1, . . . , Rs, to obtain the Fourier coefficients di(xr, ωs), which are evaluated at the encoded frequencies ωs:
dis(xr)≡di(xr,ωs) (49)
To ensure that the discrete Fourier transform of the observed data contains exactly the encoded frequencies ωs, s=1, . . . , S, resampling may be used, such that the new sampling frequency respects the three conditions described above, and zero padding, such that the integral
can be computed. Here k is the smallest integer such that kΔτ>T, where T denotes the length of the unpadded seismogram. It is worth noting that T and Δτ are not correlated. To obtain a good measurement, it is desirable that T captures the full time series. It is not possible to perform any time windowing in seismograms prior to Fourier transformation, because these operations cannot be mimicked in an encoded simulation, that is, windowing requires simulations of individual events.
Next, the encoded waveform cost function is used
Effectively this cost unction is a frequency-domain version of (45) summed over all sources. The options of randomizing the frequency assigned to each source, as well as the option of assigning multiple frequencies to the same source, are discussed. The variation of this encoded cost function may be expressed as
where the orthogonality (30) in the third equality is used.
The cost function is given by
Fj†(x,t)=RΣs=1Sƒj†s(x,ωs)exp(iωst) (53)
where ƒj†s(x, ωs) denotes the Fourier transform of the classical waveform adjoint source (25) associated with source s. To avoid spectral leakage, it is critical that the Fourier coefficients dis(xr) and sis(xr) are evaluated exactly at the source encoded frequencies, not approximated via a narrow bandpass filter.
It is important to note that the encoded super adjoint source (53) may actually be constructed directly from the stationary parts of the encoded forward wavefield, sis(xr), and the data Fourier coefficients (49), dis(xr). In other words, the waveform adjoint source may be calculated based on a ‘super measurement,’ requiring just a single super adjoint calculation.
The ‘super’ adjoint wavefield Si†(x, t) generated by the encoded adjoint source (53) may be expressed in the form ƒt f
Si†(x,t)=∫−∞t∫Gij(x,x′; t−t′) Fj†s(x′, t′) d3x′dt′ (54)
Thus, like the encoded super forward wavefield (16), the recipe for calculating the encoded super adjoint wavefield is simple: combine all the usual adjoint sources into the source-encoded super adjoint source (53) and run one simulation to obtain the encoded super adjoint wavefield (54).
In anticipation of the Fréchet derivatives, the combined sums over s and r in the misfit variation (52) give rise to the encoded adjoint wavefield (54), whereas the sum over s□ gives rise to the encoded forward wavefield (38).
vi. Phase, Amplitude, Traveltime, and “Double-Difference” Cost Functions
The choice of measurement and related cost function is a critical component of a successful inversion strategy. In the interest of succinctness, discussions of source-encoded cost functions are relegated, and adjoint sources are related involving phase, amplitude, cross-correlation traveltimes and various ‘double-difference’ measurements. It was demonstrated that phase, amplitude, and even hybrid (phase+amplitude) measurements are amenable to double-differencing, as opposed to conventional waveform measurements. Double-difference measurements hold great promise for source-encoded adjoint tomography, because they do not require any knowledge of the original source-time function or the instrument response.
vii. Source-Encoded Adjoint Tomography
Suppose there are a set of adjoint sources ƒj†s(x, t), s=1, . . . , S, one for each earthquake in the database. These adjoint sources are constructed in the usual fashion based on measurements of the differences between observed and simulated seismograms. To obtain the encoded adjoint wavefield, the classical waveform adjoint sources are combined into an encoded adjoint super source
Fj†(x,t)=RΣs=1Sƒj†s(x,ωs)exp(iωst) (55)
Based on this source, the super adjoint wavefield is given by
Si†(x,t)=∫−∞t∫Gij(x,x′;t−t′)Fj†s(x′,t′)d3x′dt′ (56)
viii. Decoding the Encoded Adjoint Wavefield
The super adjoint wavefield simulation (56) is carried out until it reaches a steady state at time TSS and expressed as:
where
si†s(x)=Ai†s(x)−iBi†ts(x) (58)
And where the stationary parts
Ai†s(x)=Rsi†s(x) (59)
and
Bi†s(x)=si†s(x) (60)
May be decoded based on the expressions
ix. Fréchet Derivative
For an isotropic medium, using the adjoint state method, the variation in the encoded cost function may be expressed in the form
δχ=∫(δ ln ρKρ+δ ln κKκ+δ ln μKμ)d3x (63)
where Kρ, Kκ, and Kμ denote Frechet derivatives with respect to density ρ, bulk modulus κ, and shear modulus μ, respectively. Based on the encoded forward and adjoint wavefields Si(x, t) and Si†(x, t), these Frechet derivatives are determined as follows, using the density derivative as an example:
Note that there is no crosstalk between sources, thanks to the orthogonality (8) of the constituent monochromatic waves. It is important to recognize that one may either calculate the density Frechet derivative by convolving the encoded forward and adjoint wavefields based on the equality (64), effectively a time-domain approach, or one may choose to calculate it by summing the decoded contributions for each source based on either (65) or (66), effectively a frequency-domain approach. Two complementary recipes are presented for the calculation of Frechet derivatives based on these time- and frequency-domain approaches. Similarly, the remaining Frechet derivatives are
where Dij(x, t) and Dij†(x, t) denote the strain deviators associated with the encoded forward and adjoint wavefields, respectively, and where
are the corresponding decoded contributions. Further extensions to anisotropic media are straightforward.
x. Attenuation
Interestingly, when using decoding, the calculation of the shear Q kernel no longer requires an additional adjoint simulation. The adjoint source for attenuation may be obtained from the regular adjoint source via the transformation
ƒ†Qj(x,ω)=[(2/π)ln(|ω|/ω0)−i sgn(ω)]ƒ†j(x,ω) (75)
Consequently, the Qμ Fréchet derivative is simply a modification of the Kμ kernel (72), namely,
KQ
To avoid decoding, a second encoded adjoint wavefield, S†Qi(x, t), needs to be calculated, driven by
F†Qj(x,t)=RΣs=1S[(2/π)ln(|ωs|/ω0)−isgn(ωs)]ƒj†s(x,ωs)exp(iωst) (77)
xi. Methods
The calculation of Frechet derivatives is initiated by combining all earthquakes or shots into a single source-encoded super source (37), followed by simulating the corresponding source-encoded super forward wavefield (38) until it reaches a steady state. The next ingredient involves calculation of the source-encoded super adjoint wavefield (56) until it reaches a steady state. This requires the construction of the source-encoded super adjoint source (55). In this context, in adjoint tomography measurements are made by comparing observed and simulated seismograms in carefully selected time windows, a process that requires the calculation of individual synthetic seismograms for all sources at a cost that scales linearly with the number of sources. In contrast, in classical FWI adjoint sources may be constructed based on the source-encoded Fourier coefficients of the observed and simulated wavefields, effectively involving a single ‘super measurement.’ The latter may be obtained directly from the source-encoded super forward wavefield, therefore not requiring the calculation of individual synthetic seismograms. Thus, desirably and remarkably, one iteration in the inverse problem requires just two simulations, independent of the number of sources or receivers.
Once the super forward and adjoint wavefields have been calculated, source-encoded Frechet derivatives may be obtained based on one of two mathematically equivalent approaches, one in the time domain and the other in the frequency domain. These two approaches lead to the following options for the calculation of Frechet derivatives.
A. Option I: Frequency-Domain Approach
In the first approach, steady-state, source-encoded forward and adjoint simulations are decoded to obtain a set of stationary fields proportional to the number of sources in the dataset (Eqs. 43, 44, 61, and 62). These stationary parts are subsequently combined to calculate Frechet derivatives by summing their respective contributions (Eqs. 65, 68, and 71). An advantage of this option is that the encoded forward and adjoint wavefields may be calculated simultaneously and independently. Another advantage is that attenuation poses no challenges, because both wavefields are used only in forward time. A disadvantage of this option is that the stationary parts of the encoded wavefields must be computed simultaneously. This requires significant RAM, even assuming that Fourier transforms are performed on the fly and that no time storage of the wavefields is necessary. Each stationary part consists of a 3-D volume of displacement, and the number of such volumes scales linearly with the number of sources, which can run in the thousands. Ultimately, it is of interest to obtain Frechet derivatives, and neither the stationary parts of the wavefields nor their contributions to the Frechet derivatives are of any practical value.
B. Option II. Time-Domain Approach
In the second approach, steady-state source-encoded forward and adjoint wavefields are simply convolved over a time period proportional to the inverse of the frequency bandwidth of interest, thereby directly rendering the Frechet derivatives (Eqs. 64, 67, and 70). In this option, there is no need to decode the encoded forward and adjoint wavefields, nor do intermediary contributions to the misfit gradient need to be calculated or stored. Attractively, this option requires only minor modifications of the currently used adjoint tomography workflow. As in the current workflow, the encoded forward wavefield must be reconstructed during the calculation of the encoded adjoint wavefield so that they may be convolved, a challenge elegantly solved by the wavefield reconstruction algorithm of Komatitsch et al. ((Komatitsch et al., 2016; Geophys. J. Int. 206(3), 1467-1478). It is important to note that in the source-encoded approach, only the super forward wavefield needs to be reconstructed during the calculation of the Frechet derivatives, whereas in the classical approach, this procedure must be executed for every single forward wavefield, that is, for every event in the database. This translates into a significantly smaller amount of I/O, independent of the number of events.
xii. Comparison of Computational Costs
Traditional time-domain FWI or adjoint tomography requires one forward simulation of a chosen duration T per event to capture the seismic arrivals of interest and one adjoint calculation of the same duration to determine the contribution of this event to the gradient. Unless the forward calculation can be stored on disk, the adjoint calculation is twice as expensive as the forward simulation, because the forward wavefield needs to be reconstructed during the adjoint simulation for convolution. In this context, many industrial applications use checkpointing. Thus, for a given event, cumulative seismograms are calculated for a total duration of 3 T, and a total of
Ttraditional=3TS (78)
simulation time is required for one iteration involving S events.
In source-encoded FWI, one super forward calculation of duration Tss+Δτ is required, followed by one super adjoint calculation of the same duration. The transient part of the forward wavefield, Tss, does not need to be reconstructed to compute the sensitivity kernels. Thus,
using Eq. (31), a total simulation time of
is required per source-encoded iteration. The relative reduction in computational cost is
where the last equation holds asymptotically for large values of S, that is for a large number of events.
To make measurements in the traditional way before calculating the misfit gradient based on source encoding, then T simulation time needed to be added per event, and thus the cumulative simulation time becomes
In this case, the relative reduction in computational cost is
where the last equation holds asymptotically for large values of S. It was concluded that in this case a 3× speed-up can be achieved. With regards to read/write operations (I/O), to calculate traditional sensitivity kernels, one can (1) store the entire forward wavefield, (2) attempt to reconstruct the forward wavefield with absorbing boundary storage or (3) reconstruct the forward wavefield on the fly based on the parsimonious storage algorithm of Komatitsch et al. (Komatitsch et al., 2016; Geophys. J. Int. 206(3), 1467-1478). In any case, I/O is proportional to the cumulative forward simulation time, Ttraditional. For large meshes, the remaining I/O costs (seismogram writing, adjoint source reading, sensitivity kernel writing, etc.) become negligible. Thus, it gives
I/Otraditional=αTtraditional (83)
for some suitably chosen proportionality factor α.
For a source encoded simulation, I/O is only required for the sensitivity kernel computation, and does not involve any storage or reconstruction of the transient part of the super forward wavefield. Thus I/O for an encoded simulation is determined by
Since no forward wavefield storage is required for making measurements, I/O remains basically unchanged, and thus
I/Oencoded+measurements≈I/Oencoded (86)
even if the relative I/O cost due to writing individual seismograms may become significant for a large number of events.
Thus, given the duration of a seismogram for making measurements T, the time to reach a steady state Tss, and a simulation bandwidth ƒmax-ƒmin, Eqs. (78)-(82) and Eqs. (83)-(86) enable us to quantitatively assess the computational benefits of source encoding, as illustrated in
A case of particular interest involves global FWI. Such computationally demanding tomographic inversions are currently carried out in the 9-150 s period range using databases with hundreds to thousands of earthquakes and three-component seismograms 60-180 min in duration.
Parsimonious storage requirements for kernel calculations involve more than a petabyte of disk space. If traditional measurements are made, the best a 3× speed-up in simulation time can be achieved, as predicted by Eq. (82). The difference is much more significant for I/O, where a 700× reduction in storage requirements was observed. If global FWI could be performed without making individual measurements, which is not a simple proposition, the speed-up would be enormous, approaching factors of 700×.
xiii. Inversion Options and Strategies
In this section, the method and various inversion options and strategies are illustrated. Following Yuan et al. (Yuan, Y. O., et al., Geophys. J. Int., 206(3), 1599-1618), a global Rayleigh-wave phase speed map determined by Trampert & Woodhouse (Trampert, J. & Woodhouse, J. H., 2003. Geophys. J. Int., 154(1), 154-165) was used as the 2-D target model. Designed for didactic purposes, this simplified model is computed on a 180 m×90 m rectangular mesh, with Stacey absorbing boundaries, using the membrane wave propagation solver capability of the SPECFEM2D package (Komatitsch, D. & Tromp, J., 1999. Geophys. J. Int., 139(3), 806-822). In the experiments, a combination of 32 earthquakes and 293 seismographic stations were used. The ‘observed’ data were simulated using a Ricker wavelet with a 260 Hz central frequency for each event. Each wave simulation ran on a 80×40 spectral-element mesh divided into 8 MPI slices, totaling 52 552 integration points, and using an Intel Xeon CPU E5607 cadenced at 2.27 GHz, and running computations in single precision. To perform the inversions, the SeisFlows framework (Modrak, R. T., et al., 2018.Comput. Geosci., 115, 88-95) and a constant phase speed map of 4500 m s−1 were used as the initial model.
A. Frequency-Domain Encoding
The first experiment is designed to demonstrate the absence of crosstalk between encoded sources in the frequency domain. The result of conventional FWI, described in the section bout Classical FWI, in which sensitivity kernels or ‘event kernels’ are calculated in the frequency domain for each of the 32 sources and summed to obtain the gradient of the cost function, requiring 64 numerical simulations per iteration. In this case, the gradient is calculated in the frequency domain based on Eq. (72) using one super forward and one super adjoint simulation, that is, just two numerical simulations per iteration. The maximum discrepancy over space of the absolute difference between the two approaches is less than 0.0058 percent.
B. Time-Domain Encoding
In
C. Decoding
Contributions of individual events were compared to the overall gradient calculated individually and simultaneously, and decoded based on Eqs. (61) and (62). Each of the 32 events is encoded with 512 frequencies, for a total of 16 384 frequencies over the [200;400] Hz bandwidth, using a frequency spacing of 0.012207 Hz. Note that despite the fact that events 1 (200.0 Hz) and 2 (200.012 Hz) are extremely close in frequency, they are still perfectly decoded without crosstalk.
The absolute differences between the individual simulations (first column) and decoded super simulations (second column) are tiny (third column), further demonstrating the absence of cross talk between sources during simultaneous runs. The success of using such massive amounts of encoded frequencies also relies on the accuracy of the wave solver.
D. Comparisons Between Classical and Encoded FWI
E. Randomization
The previous test was repeated using randomization of the source frequencies used for each iteration. It was concluded that randomly changing the source frequencies eliminates model oscillations, and an increased convergence rate compared to the previous experiment was observed.
F. Multiple Frequencies Per Source
Another option is to assign more than one frequency per event. In this case, the time interval required for decoding is given by Eq. (33). The relative reduction in computational cost, given by Eq. (80), becomes
that is, the gain is reduced by a factor Nƒ. Again an increased convergence rate compared to the experiment summarized in
G. Effects of Noise
Next, the effects of noise on regular and source encoded inversions were explored. A Gaussian perturbation in phase is added to each frequency in the [50;750] Hz range, with zero mean and a standard deviation of 2Tri, where i denotes a percentage perturbation.
Traditional and source encoded inversions was tested when random noise at levels of 1, 5, 10, 20, and 40 percent is added to the data. Unlike traditional FWI, source-encoded inversions remain stable when more than 10 percent of random noise is added to the data, and both the L2 model and data misfits are reduced in a stable fashion.
H. Variable Number of Receivers
A key benefit of the approach to source encoding is that every source need not be recorded by every receiver. This is basically always the case in real inversions, because some stations may not record a given event, or the data quality of certain receivers may be considered too poor for use in the inversion. To accommodate this situation, a mask must be applied when computing the super adjoint source and the cost function, but no major changes are required. Excellent convergence was demonstrated even when each source is recorded by a different random subset of only 30 percent of the 293 available seismographic stations. The difference in terms of convergence rate relative to an inversion based on the full data set is minimal.
I. Phase, Amplitude, and Double-Difference Measurements
Next, source-encoded inversions were considered based on phase or amplitude measurements. Convergence in model space is slightly slower than source-encoded waveform inversion when using phase measurements, and convergence is even slower when using amplitude measurements. However, using a hybrid approach, combining phase and amplitude measurements, leads to a similar convergence rate as the source-encoded FWI inversion, which used a waveform misfit.
Double-difference measurements help reduce inversion problems related to source uncertainties. the results of a double-difference phase inversion, and the results of a double-difference amplitude inversion. For each event, double differences between all station pairs in the database were taken. The double-difference cost functions behave similarly to their classical counterparts in terms of L2 data and model misfit reduction as a function of iteration number.
J. Attenuation
Next, a synthetic model that includes the effects of attenuation was considered. The wave speed map, with a reference frequency of 300 Hz, is the same as in
K. Offshore Acoustic FWI
To mimick a streamer-type acquisition, 460 acoustic sources are recorded by 600 hydrophones placed to the left of each source, spaced uniformly 10 m below the top of the model. These hydrophones cover 6 km, or less when the source is located less than 6 km from the left side of the model. Unlike in the preceding surface-wave examples, the challenge in this case is recovery of the model from one-sided illumination and acquisition. Each source is encoded with one randomized frequency in the 1-5.6 Hz range, and each super shot involves 120 000 time steps. The resulting speed-up compared to the 20 000 time steps required for a regular simulation is around 77. The resulting model after 400 iterations based on a double-difference phase cost function. As the frequency content of the inverted data increases with each iteration, an increase of the evolving data misfit during the second half of the inversion was observed.
L. Offshore Coupled Acoustic-Elastic FWI
As a second exploration seismology example, the results of a source-encoded inversion for a coupled acoustic-elastic offshore version of the Marmousi model. There are 150 acoustic sources recorded by 500 hydrophones placed 10 m below the top of the water layer, providing one-sided illumination and acquisition. Each source is encoded with two randomized frequencies, using a double-difference phase cost function. The first iteration uses waves in the 1-6 Hz range. At each subsequent iteration, the range is increased by 0.1 Hz until iteration 30. Iterations above 30 are performed in the 4-9 Hz range. Each super shot involves 50 000 time steps, while a nonencoded simulation requires 10 000 time steps, leading to a speed-up of 30. Despite the fact that both the sources and receivers are located near the top of the water layer, the shear wave speed model is reasonably well recovered. Adding a few ocean bottom seismometers would undoubtedly improve the quality of the recovered shear wave speed model, but this is not the objective of this study.
A version of source encoding was developed for explicit time-domain solvers that facilitate the calculation of the gradient of a cost function independent of the number of sources or receivers. The approach does not suffer from crosstalk between different sources, and all receivers do not need to record all sources. To obtain the gradient, source-encoded forward and adjoint wavefields are run until they reach a steady state. At that point, the steady-state fields are either decoded to extract their stationary parts, which are subsequently combined to obtain the full gradient by summing their respective contributions. Alternatively, the steady-state forward and adjoint wavefields are convolved over a time period proportional to the inverse of the encoded frequency spacing, which mathematically results in the same gradient. The second approach involves only minor modifications of the currently used adjoint tomography workflow.
xiv. Conclusion
In adjoint tomography, one generally compares individually observed and simulated seismograms by making measurements in specific time windows. In this approach, the number of numerical simulations required for making measurements and constructing adjoint sources scales linearly with the number of seismic sources, S. Once the synthetics have been calculated, the gradient of the cost function may be calculated based on source encoding in just two additional numerical simulations per iteration, which means that one iteration requires S+2 simulations. The overall computational cost may be reduced to just two simulations per iteration if the measurements needed for the construction of the adjoint sources are based on the source-encoded Fourier coefficients of the observed and simulated seismograms, leading to a single ‘super measurement’. It was demonstrated that the latter approach is stable for acoustic and offshore coupled acoustic-elastic FWI.
III. Computing System for Implementing Disclosed Methods
Processor 106 is coupled bi-directionally with memory 107, which can include, for example, one or more random access memories (RAM) and/or one or more read-only memories (ROM). As is well known in the art, memory 107 can be used as a general storage area, a temporary (e.g., scratch pad) memory, and/or a cache memory. Memory 107 can also be used to store input data and processed data, as well as to store programming instructions and data, in the form of data objects and text objects, in addition to other data and instructions for processes operating on processor 106. Also as is well known in the art, memory 107 typically includes basic operating instructions, program code, data, and objects used by the processor 106 to perform its functions (e.g., programmed instructions). For example, memory 107 can include any suitable computer-readable storage media described below, depending on whether, for example, data access needs to be bi-directional or uni-directional. For example, processor 106 can also directly and very rapidly retrieve and store frequently needed data in a cache memory included in memory 107.
A removable mass storage device 108 provides additional data storage capacity for the computer system 100, and is optionally coupled either bi-directionally (read/write) or uni-directionally (read-only) to processor 106. A fixed mass storage 109 can also, for example, provide additional data storage capacity. For example, storage devices 108 and/or 109 can include computer-readable media such as magnetic tape, flash memory, PC-CARDS, portable mass storage devices such as hard drives (e.g., magnetic, optical, or solid state drives), holographic storage devices, and other storage devices. Mass storages 108 and/or 109 generally store additional programming instructions, data, and the like that typically are not in active use by the processor 106. It will be appreciated that the information retained within mass storages 108 and 109 can be incorporated, if needed, in standard fashion as part of memory 107 (e.g., RAM) as virtual memory.
In addition to providing processor 106 access to storage subsystems, bus 1010 can be used to provide access to other subsystems and devices as well. As shown, these can include a display 101, a network interface 104, an input/output (I/O) device interface 102, an image processing device 103, as well as other subsystems and devices. For example, image processing device 103 can include a camera, a scanner, etc.; I/O device interface 102 can include a device interface for interacting with a touchscreen (e.g., a capacitive touch sensitive screen that supports gesture interpretation), a microphone, a sound card, a speaker, a keyboard, a pointing device (e.g., a mouse, a stylus, a human finger), a global positioning system (GPS) receiver, a differential global positioning system (DGPS) receiver, an accelerometer, and/or any other appropriate device interface for interacting with system 100. Multiple I/O device interfaces can be used in conjunction with computer system 100. The I/O device interface can include general and customized interfaces that allow the processor 106 to send and, more typically, receive data from other devices such as keyboards, pointing devices, microphones, touchscreens, transducer card readers, tape readers, voice or handwriting recognizers, biometrics readers, cameras, portable mass storage devices, and other computers.
The network interface 104 allows processor 106 to be coupled to another computer, computer network, or telecommunications network using a network connection as shown. For example, through the network interface 104, the processor 106 can receive information (e.g., data objects or program instructions) from another network, or output information to another network in the course of performing method/process steps. Information, often represented as a sequence of instructions to be executed on a processor, can be received from and outputted to another network.
An interface card or similar device and appropriate software implemented by (e.g., executed/performed on) processor 106 can be used to connect the computer system 100 to an external network and transfer data according to standard protocols. For example, various process embodiments disclosed herein can be executed on processor 106 or can be performed across a network such as the Internet, intranet networks, or local area networks, in conjunction with a remote processor that shares a portion of the processing. Additional mass storage devices (not shown) can also be connected to processor 106 through network interface 104.
In addition, various embodiments disclosed herein further relate to computer storage products with a computer-readable medium that includes program code for performing various computer-implemented operations. The computer-readable medium includes any data storage device that can store data which can thereafter be read by a computer system. Examples of computer-readable media include, but are not limited to: magnetic media such as disks and magnetic tape; optical media such as CD-ROM disks; magneto-optical media such as optical disks; and specially configured hardware devices such as application-specific integrated circuits (ASICs), programmable logic devices (PLDs), and ROM and RAM devices. Examples of program code include both machine code as produced, for example, by a compiler, or files containing higher level code (e.g., script) that can be executed using an interpreter.
The computer system as shown in
IV. Definitions
To aid in understanding the detailed description of the compositions and methods according to the disclosure, a few express definitions are provided to facilitate an unambiguous disclosure of the various aspects of the disclosure. Unless otherwise defined, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this disclosure belongs.
It should be noted that the roles of source and receiver may be interchanged using the reciprocity theorem of acoustics, elastic wave propagation, and electricity and magnetism. It will be understood throughout, including in the claims, that whenever “source” or “receiver” are referred to, those designations will be understood to include the reverse resulting from application of reciprocity.
It is noted here that, as used in this specification and the appended claims, the singular forms “a,” “an,” and “the” include plural reference unless the context clearly dictates otherwise.
The terms “including,” “comprising,” “containing,” or “having” and variations thereof are meant to encompass the items listed thereafter and equivalents thereof as well as additional subject matter unless otherwise noted.
The phrases “in one embodiment,” “in various embodiments,” “in some embodiments,” and the like are used repeatedly. Such phrases do not necessarily refer to the same embodiment, but they may unless the context dictates otherwise.
The terms “and/or” or “/” means any one of the items, any combination of the items, or all of the items with which this term is associated.
The word “substantially” does not exclude “completely,” e.g., a composition which is “substantially free” from Y may be completely free from Y. Where necessary, the word “substantially” may be omitted from the definition of the invention.
As used herein, the term “approximately” or “about,” as applied to one or more values of interest, refers to a value that is similar to a stated reference value. In some embodiments, the term “approximately” or “about” refers to a range of values that fall within 25%, 20%, 19%, 18%, 17%, 16%, 15%, 14%, 13%, 12%1, 1%, 10%, 9%, 8%, 7%, 6%, 5%, 4%, 3%, 2%, 1%, or less in either direction (greater than or less than) of the stated reference value unless otherwise stated or otherwise evident from the context (except where such number would exceed 100% of a possible value). Unless indicated otherwise herein, the term “about” is intended to include values, e.g., weight percents, proximate to the recited range that are equivalent in terms of the functionality of the individual ingredient, the composition, or the embodiment.
It is to be understood that wherever values and ranges are provided herein, all values and ranges encompassed by these values and ranges, are meant to be encompassed within the scope of the present invention. Moreover, all values that fall within these ranges, as well as the upper or lower limits of a range of values, are also contemplated by the present application.
As used herein, the term “each,” when used in reference to a collection of items, is intended to identify an individual item in the collection but does not necessarily refer to every item in the collection. Exceptions can occur if explicit disclosure or context clearly dictates otherwise.
The use of any and all examples, or exemplary language (e.g., “such as”) provided herein, is intended merely to better illuminate the invention and does not pose a limitation on the scope of the invention unless otherwise claimed. No language in the specification should be construed as is indicating any non-claimed element as essential to the practice of the invention. When used in this document, the term “exemplary” is intended to mean “by way of example” and is not intended to indicate that a particular exemplary item is preferred or required.
All methods described herein are performed in any suitable order unless otherwise indicated herein or otherwise clearly contradicted by context. In regard to any of the methods provided, the steps of the method may occur simultaneously or sequentially. When the steps of the method occur sequentially, the steps may occur in any order, unless noted otherwise.
In cases in which a method comprises a combination of steps, each and every combination or sub-combination of the steps is encompassed within the scope of the disclosure, unless otherwise noted herein.
Each publication, patent application, patent, and other reference cited herein is incorporated by reference in its entirety to the extent that it is not inconsistent with the present disclosure. Publications disclosed herein are provided solely for their disclosure prior to the filing date of the present invention. Nothing herein is to be construed as an admission that the present invention is not entitled to antedate such publication by virtue of prior invention. Further, the dates of publication provided may be different from the actual publication dates, which may need to be independently confirmed.
It is understood that the examples and embodiments described herein are for illustrative purposes only and that various modifications or changes in light thereof will be suggested to persons skilled in the art and are to be included within the spirit and purview of this application and scope of the appended claims.
This application claims priority under 35 U.S.C. § 119(e) to U.S. Provisional Patent Application No. 63/011,369, filed Apr. 17, 2020. The foregoing application is incorporated by reference herein in its entirety.
Number | Name | Date | Kind |
---|---|---|---|
20040099815 | Sfez | May 2004 | A1 |
20140364737 | Huang | Dec 2014 | A1 |
20160045184 | Courtney | Feb 2016 | A1 |
20160066893 | Cho | Mar 2016 | A1 |
20190083059 | Byrnes | Mar 2019 | A1 |
20190206576 | Shelton, IV | Jul 2019 | A1 |
Number | Date | Country |
---|---|---|
WO-2017181201 | Oct 2017 | WO |
Entry |
---|
Jeroen Tromp “Source encoding for adjoint tomography”, pp. 2019-2044, May 15, 2019. |
“Source encoding for viscoacoustic ultrasound computed tomographya” , Etienne Bachmann; Jeroen Tromp, pp. 3321-3235 (Year: 2020). |
“Coded EXcitation with Spectrum Inversion (CEXSI) for Ultrasound Array Imaging” Yao Wang, Student Member, IEEE, Kurt Metzge, July , pp. 805-823 (Year: 2003). |
“Improved Contrast-Enhanced Ultrasound Imaging With Multiplane-Wave Imaging” Ping Gong, pp. 178-187, Feb. 2018 (Year: 2018). |
Qingchen Zhang “Hybrid-domain simultaneous-source full waveform inversion without crosstalk noise” pp. 1659-1681 (Year: 2018). |
Fabian Kuhn “Ultrasound medical imaging using 2d viscoacoustic full-waveform inversion” pp. 1-128 (Year: 2018). |
Tromp et al: “Source Encoding for Adjoint Tomography”, Geophysical Journal International, (2019) 218, 2019-2044, doi: 10.1093/gji/ggz271. |
Number | Date | Country | |
---|---|---|---|
20210330287 A1 | Oct 2021 | US |
Number | Date | Country | |
---|---|---|---|
63011369 | Apr 2020 | US |