This invention relates to acoustic imaging.
Ultrasound tomography typically employs circular or conformal transducer arrays to acquire data. In typical transmission tomography, one element transmits the sound wave and the scattered waves are received by other transducer elements facing the transmitter, resulting in large datasets and slow acquisition. This process is then rotated around the array. Computationally intensive iterative methods such as full wave inversion, Born inversion, inverse Radon transform and inverse scattering are then applied to construct the image and estimate parameters. Consequently, reconstruction of one image slice typically requires 10 seconds to several hours resulting in small patient throughput. Further, real-time techniques, including Doppler imaging, elastography and functional imaging have not been developed in the context of ultrasound tomography. Accordingly, it would be an advance in the art to provide improved acoustic tomography.
Preferably the processor is configured to provide a frame rate for the first image of 10 Hz or more. This capability is a substantial advantage of the present approach, since conventional acoustic tomography can't be done at such high frame rates.
The apparatus can be configured to provide a second image according to a second imaging modality that is co-registered with the first image.
Plane wave excitations can be provided by driving the acoustic transducer elements of the selected transducer module in phase with each other. Here the acoustic transducer elements of the selected transducer module can be driven with a linear phase gradient to provide beam steering.
Diverging wave excitations can be provided by defining a virtual source and driving the acoustic transducer elements of the selected transducer module with phases corresponding to the virtual source.
Preferably the two or more of the transducer modules does not include all of the transducer modules. This can desirably prevent direct transmission from the selected transducer module to the two or more of the transducer modules.
The data set for tomographic reconstruction can be a reflection tomography data set or a transmission tomography data set. The first image can be a B-mode image or an attenuation image.
Preferably the processor is configured to apply a coherence factor correction to the data set for tomographic reconstruction.
The processor can be configured to determine a speed of sound correction by estimating a target speed of sound in the target and an ambient speed of sound in a medium surrounding the target and between the target and the three or more transducer modules. In some cases, this medium is water.
The processor can be configured to reduce an effect of physical gaps between the transducer modules by defining two or more virtual acoustic elements (e.g., virtual elements 130a and 130b) at locations between the transducer modules. In this example, virtual elements 130a are between transducer modules 106h and 106a, and virtual elements 130b are between transducer modules 106h and 106a. As described in section C below, by estimating received acoustic signals at locations of the virtual acoustic elements it is possible to reduce the effect of the physical gaps between the transducer modules on imaging performance.
Section A describes in detail an example of improved acoustic tomography according to the above described principles. Section B relates to dual-modality imaging. Section C relates to compensating for gaps between the transducer modules.
Ultrasound (US) imaging is used throughout medicine; however, operator-dependent acquisition and poor spatial resolution have limited the utility. Conventional ultrasound imaging uses a small linear or 2D matrix probe to transmit (Tx) and receive (Rx) ultrasound signals. B-mode images are typically then reconstructed using reflected signals to provide a representation of the interrogated region. Typical limitations of conventional ultrasound imaging include limited field of view, limited penetration compared to the size of the imaged object and diffraction-limited resolution. Tomography, defined as a technique for displaying a representation of a cross section through a human body, facilitates high resolution (lambda/2) imaging by effectively rotating the US point spread function (PSF) and increasing the aperture to limit the effect of diffraction. Tomographic imaging has the potential to create an operator-independent acquisition protocol.
Ultrasound tomography typically employs circular or conformal transducer arrays to acquire data. In typical transmission tomography, one element transmits the sound wave and the scattered waves are received by other transducer elements facing the transmitter, resulting in large datasets and slow acquisition. This process is then rotated around the array. Computationally intensive iterative methods such as full wave inversion, Born inversion, inverse Radon transform and inverse scattering are then applied to construct the image and estimate parameters. Consequently, reconstruction of one image slice requires 10 seconds to several hours resulting in small patient throughput. Further, real-time techniques, including Doppler imaging, elastography and functional imaging have not been developed in the context of ultrasound tomography. Transmission tomography has been shown to facilitate tissue characterization through estimation of the speed-of-sound (SOS) and attenuation.
The recent development of high channel count ultrafast US systems offers the opportunity to capture images at a high frame rate using plane waves or diverging waves to insonify a large field-of-view. These systems have been leveraged to create vector flow imaging, super resolution imaging and functional brain imaging.
While tomographic systems have been developed in the past, here, we sought to develop a system for real-time quantitative reflection tomography. In reflection tomography, the maximum propagation path length is typically reduced, and the center frequency can be increased as compared with transmission tomography. While transmission tomography cannot yet be successfully applied to image body regions containing bone, we demonstrate that reflection tomography can produce high-quality cross-sectional images of limbs (surrounding longitudinal bones). We use plane wave acquisition to achieve a high-volume acquisition rate, facilitating spatial compounding and further increasing image quality. By combining tomographic acquisition with coherence processing and localized speed of sound correction, high resolution images are obtained. In addition, for the first time, we assess attenuation estimation using tomographic plane wave acquisition. By combining images obtained from multiple acquisition directions, regions of locally enhanced attenuation are quickly recognized, and the attenuation coefficient estimated. In summary, we exploit ultrafast tomographic acquisition to achieve nearly isotropic in-plane resolution (˜150 microns at 5 MHz) and reduce the clutter floor, thus improving image contrast in studies of phantoms, small animals and human volunteers.
We first evaluated the point spread function (PSF) for the tomographic reconstruction resulting from plane wave insonation with 1 array transmitting for each view and 1, 3 and 5 arrays used on reception, respectively (
We then further compared the PSF determined both with and without the application of coherence factor (CF) processing. For an 8-view acquisition, nearly half-wavelength isotropic spatial resolution (0.16 mm) was achieved with or without CF (
Experimental results were consistent with the simulation (
Tomographic imaging with the full aperture and CF also improved image contrast (
With the addition of CF weighting, the contrast ratio (CR) between hyperechoic and background regions and between anechoic and background regions increased by 2.2 dB (from 4.6 dB to 6.8 dB) and 30.4 dB (from −18.9 dB to −49.3 dB), respectively, for the 8-view acquisition (Tables 1 and 2). Comparing the 8-view and 1 Tx/1 Rx acquisitions, the CNR increased by 20% (from 0.55 to 0.66) between the hyperechoic and background regions, and by 85% (from −0.65 to −1.20) between anechoic and background regions (Table 2). The combinational use of CF and tomographic acquisition (
Applying CF weighting increases the dynamic range of the image which is evidenced by the comparison of the images with and without CF both displayed with 90 dB dynamic range (
Consistent with the results in phantom studies, CF weighting improved the contrast of tomographic images of the rat abdomen (
A2c) Image Fidelity is Improved with Large Imaging Coverage Angle
Applying the dual SOS beamformer, we evaluated the impact of the number of segments and views used in the tomographic reconstruction of the rat model. We tested whether the improvement in the tomographic view resulted from the use of three arrays vs one array on reception of a single view and found that resolution and field of view were improved by using three arrays in reception (
Using 8 arrays, dual SOS beamformer and CF, high fidelity images of the anatomical structures in thoracic (
We also investigated the limited view effect in imaging of the human hand, wrist and forearm (
Finally, the tomographic system with CF and dual SOS provides the opportunity to quantify tissue properties in vivo. We therefore evaluated tomographic quantification of local attenuation as a demonstration of the feasibility of quantitative imaging with this system. The transition region where the image intensity decreases distal to the attenuating inclusion (creating a shadow artifact) was obvious in each view of the B-mode phantom images (
We further quantified the ultrasound attenuation coefficients in vivo in a transverse section of the human palm (
We set out to develop a system for real-time quantitative reflection tomography and to apply this system for imaging phantom, small animal and human tissue. Programmable, high frame rate scanners with a high channel count provide an unprecedented opportunity to optimize tomographic acquisition for tissues for which through transmission is not optimal. Here, we first optimized acquisition using a subset of arrays for each view. The entire object was imaged by coherently compounding data from eight views. By combining a 3-array receive aperture with the acquisition of eight views, we achieved isotropic in plane resolution on the order of half wavelength. We found that the spatial resolution and contrast could be further improved by coherence factor weighting and by considering different speed of sound values within and outside the imaged object. We achieved a high-volume acquisition rate through the use of plane wave imaging, facilitating coherent compounded imaging and further increasing image quality. Taking advantage of ultrafast imaging with plane wave sequences and a large aperture in ultrasound tomography, anatomical information was acquired with sub-wavelength in-plane resolution in a large field of view in real-time.
Although our system can also be applied in transmission tomography mode, we use pulse-echo (reflection) signals from plane wave excitation in order to preserve real-time imaging. Consistent with previous work on improving image quality by increasing effective transducer aperture size, we also showed improvements in key image quality metrics including spatial resolution, sSNR, CR and CNR with tomographic imaging. Adding to the panoramic imaging capability, high resolution tomographic images in the rat were obtained. We further demonstrated the capability of our system by imaging the hand and wrist of a healthy volunteer. A fully tomographic view of anatomical features was obtained. These promising results show the potential of our system for orthopedic and myopathic imaging. With plane wave or diverging wave-based methods developed for Doppler, color flow, ultrasound attenuation and SOS imaging, and elastography, we foresee expanding ultrasound tomography to include these features.
The full aperture ring allows fast in-plane image acquisition within a few milliseconds with the 2-way travel of ultrasound waves ultimately limiting the acquisition rate. Here, tomographic imaging was carried out with an effective reconstruction frame rate of 10 Hz using a GPU. The imaging speed can be further improved using partial beamforming, i.e., beamform the images on each secondary system, then send the images to the primary system for final image formation. This scheme should improve the frame rate by at least a factor of 2 through relieving the computational burden and reducing the amount of data that needs to transfer between secondary and primary systems. With real-time imaging, tomographic functional imaging can be further developed.
In this work, we utilized a low-cost method of extending the transducer aperture using 3D printing. With this technology, assembly geometries can be designed to accommodate the requirements of a specific application and for an individual patient. However, one important limitation of this approach is that grating lobes caused by the missing transducer elements (in the gaps between the arrays) can degrade image quality, as has been shown in. A promising method for filling in missing data has been shown to be successful in geophysics using deep learning. For specific clinical applications, dedicated arrays will also be designed that will not involve gaps between elements. Section C below considers a different approach for gap compensation.
In addition, estimates of attenuation were achieved using tomographic plane wave acquisition. By combining images obtained from multiple acquisition directions, regions of locally enhanced attenuation were quickly recognized, and the attenuation coefficient estimated. Using the shadow-like artifact in the B-mode images from different views, high attenuation region in the phantom can be localized in real-time in a large field of view. The spectral-log-difference method applied here can apply a calibration procedure using a homogeneous phantom with known ultrasound attenuation coefficient to compensate for factors affecting the estimation, such as diffraction effects and backscattering. Nevertheless, without calibration, we were able to estimate the ultrasound attenuation coefficient of specific regions of interest and the estimated values reasonably differentiate tissues including the muscle, connective tissue and metacarpal bone in the human palm. To improve the quantitation, further work with a range of known materials is required.
A real-time, programmable 1024-channel ultrasound platform was used to drive a customized tomographic transducer with 1024 elements. The primary system (Vantage 256, Verasonics Inc., Kirkland, Wash.) sends clock signals to synchronize three secondary Vantage 256 systems. A GPU card (RTX Titan, Nvidia, Santa Clara, USA) in the primary system speeds processing and the data transfer rate between the primary and each secondary system is 100 Gb/s. The tomographic transducer consists of 8 L7-4 linear arrays (ATL/Philips, Amsterdam, Netherlands) hosted in 8 3D printed sub-assembly pieces forming an octagonal manifold. Each array consists of 128 elements with a lateral pitch of 300 μm and an elevation width of 7.5 mm. The linear arrays were carefully positioned to align their respective elevation plane. The edge length of each sub-assembly (i.e. for 1 linear array) is about 6.2 cm, creating a large field of view of ˜13×13 cm2. The lateral and axial directions with respect to array 1 were denoted as the x-direction and z-direction in the image, respectively (
A customized water tank having plastic membranes and 3D printed tank walls with acoustic windows opened to couple with the arrays was designed. Water was employed as the coupling agent between the arrays and the object.
For one imaging view, 3 linear arrays (
The image from one view (view j) was beamformed by delay-and-sum (DAS) method combined with coherence factor (CF) as,
r
j(x,z)=Σn=1NΣm=1Mαmnsmn(x,z(CFm(x,z), (1)
where n indexes the Rx channels and the total Rx channel number is N=384; M is the number of plane waves transmitted by each linear array and for the mth Tx angle, smn(x,z) is the Hilbert Transform applied and delayed RF signal from each Rx channel; αmn is the apodization coefficient which is set to 1 if the pixel position falls inside the propagation path of the plane wave and 25° acceptance angle of the Rx channel, or 0 otherwise; rj is the beamformed image for view j; x,z are the spatial location of the image pixel and CF is expressed as,
The active arrays were rotated to acquire images from 8 different views which were then coherently compounded to form the final tomographic images according to,
r(x,z)=Σj=18rj(x,z), (3)
Then, the envelope of the beamformed data was detected and log compressed.
The secondary systems transfer the RF signals to the primary system for image reconstruction performed on the GPU card which allows real-time imaging capability. A frame rate of 10 frames/second was achieved for a 10×10 cm2 image with an isotropic pixel size of one wavelength (0.3 mm) and 3 plane waves transmission per array setting, accounting for data transfer, processing and display.
As water is employed to couple the 8 arrays to the object, delay errors can occur in the presence of SOS mismatch between water and the object. Considering a single SOS to reconstruct the image, as commonly used in ultrasound imaging, would thus results in artifacts and degrades the image quality. For small animal and human hand imaging, we used a dual SOS beamformer following the procedure described hereafter.
We first reconstructed the image with single SOS. The image resolution can be set coarsely for a fast reconstruction, e.g. pixel size=1.2 mm. As the object was surrounded by water, its contour has a good contrast separation with the background and therefore can be selected manually or detected automatically using classic image processing methods, such as Grabcut. The image field was then partitioned to two domains with two SOS values. With the assumption that ultrasound rays travel straight from the source to the detectors, the paths along which the rays travel inside the subject were calculated to correct for the delay errors caused by the SOS difference in the object. With the corrected delays, the image was reconstructed with a finer pixel size of 0.15 mm.
As is shown in
We used the following metrics to evaluate the image quality of our platform: 1) Spatial resolution, defined as the full-width-half-maximum (FWHM) of the PSF of the system; 2) Contrast ratio (CR)=20 log 10(μ0/μ1), where μ0 and μ1 are the mean values of the envelope signal in region 0 and 1, respectively; 3) Contrast-to-noise ratio (CNR)=(μ0−μ1)/√{square root over (σ02+σ12)}, where σ0 and σ1 are the standard deviations of the envelope signal in region 0 and 1, respectively; and 4) Speckle signal-to-noise ratio (sSNR)=μ0/σ0, for region 0. Spatial resolution of our system was evaluated by imaging a point target at the center of the image filed in simulation and experiments (see section A4e for more details). CR, CNR, sSNR were evaluated by imaging an agarose-based tissue mimicking phantom fabricated following the procedure described in section A4f.
Attenuation imaging was implemented using the spectral-log-difference method. Briefly, B-mode images from the 8 views were displayed in real-time. An ROI covering the high attenuating region was defined for each view based on shadowing distal to the inclusion in each direction (
log10(S(f,z
where zp and zd are the center depth of the proximal and distal windows, respectively; R is a constant related to the backscatter coefficients of the windows, assuming that the material in the windows has the same effective scatterer size. The block dimension was 7.5×7.5 mm2 with a 90% overlap between successive estimates used in both the lateral and depth directions. Each block included 200 scanlines with 200 time samples in each scanline. The ultrasound attenuation coefficient was estimated from the slope of the spectral difference in the 2-5 MHz bandwidth. As the intensity in the proximal window is expected to be greater than that in the distal window and the high frequency components in the signals are expected to be more attenuated as compared with the low frequency components, two constraints were applied to remove invalid estimates. They were (1) α must be positive and (2) the spectral difference at 2 MHz estimated on the linear fitted slope must be positive. If the two conditions were not satisfied, the estimate for that data block was not retained. The ultrasound attenuation images from views 1 to 8 were compounded to create the final attenuation image which was then smoothed with a Gaussian filter of 0.3×0.3 mm2 window size.
A point target was imaged in experiments and simulations to evaluate the spatial resolution of the system. The simulations were carried out using the Verasonics Research Ultrasound Simulator, with a point target of strong reflectivity defined at the center of the arrays. An ideal eight arrays geometry without gaps between arrays was first simulated and thereby the distance between the point target to the arrays was 4.6 cm (
Image quality metrics including sSNR, CR and CNR were evaluated on an agarose-based tissue-mimicking phantom with two cylindrical inclusions (one hyperechoic and one anechoic). The background substrate of the phantom was prepared by dissolving 1.5% (w/v) agarose powder (A10752, Alfa Aesar) in degassed water at 80° C. and mixed with 1% (w/v) silicon carbide (SiC) powder (A16601, Alfa Aesar) homogeneously using a magnetic stirrer (SH-2, Faithful) prior to solidification. The hyperechoic inclusion (1.5% agar w/v, 2% SiC w/v) was prepared following the same procedure as for preparing the background substrate. The anechoic inclusion was filled with water (
An agarose-based (1.5% w/v agar, 0.5% w/v SiC) tissue mimicking phantom with one high ultrasound attenuation inclusion in the center was prepared following the same procedure described above. The inclusion was obtained by adding an additional 13% w/v aluminum oxide powder (#3 Micron, Beta Diamond Products) to the agarose solution. The ultrasound attenuation coefficient of the inclusion and the background substrate measured by insertion loss techniques were 2.3 and 0.1 dB/(MHz·cm), respectively. We then imaged this inclusion phantom with 56 plane waves (8-view acquisition, 7 plane waves per array, −13° to 13).
All animal experimental procedures were performed in accordance with protocols approved by the local Institutional Animal Care and Use Committee. A 7-week old female rat (200 g body weight) was anesthetized with vaporized isoflurane (1 L/min of oxygen and 2% isoflurane) gas system and then, the hair was removed using clippers and depilatory cream. The rat was humanely euthanized and placed in the water tank for imaging. The rat was secured to a 3D printed mold holding its head and weight was applied to the tail to ensure an upright position during imaging. For 3D tomographic imaging, the rat was mounted on two linear translation stages which were motorized by a motion controller (ESP 300, Newport, Irvine, Calif., USA). The region between the base of the neck to the base of the tail was scanned in the transverse plane with 1 mm interval between scans. In total, 130 slices were acquired (covering 130 mm) and 56 plane waves (8-view acquisition, 7 plane waves per array, −13° to 13) were used for imaging each slice.
A healthy female volunteer (31 years old) was recruited for the in vivo tomographic imaging of the forearm, wrist and hand. All imaging procedures followed the protocol approved by local Review Board. Informed consents were received from the volunteer after explaining the protocol. Imaging was performed with the same sequence used for the small animal scan described in the previous paragraph. During the experiments, the volunteer was instructed to immerse the left hand and forearm vertically in the water tank. Tomographic images were acquired on the fly while the volunteer moved the hand and forearm freely.
To calibrate the location of the transducer elements, each L7-4 array transmitted a plane wave of 0° steering angle to insonify two static targets (40 μm wires spaced by ˜15 mm) and recorded the backscattered echoes. The targets were immersed in water, placed at the center of the field-of-view and oriented to avoid overlap between the echo traces. The delay associated to each target was estimated with cross-correlation between the channels. The relative position (x, z) of the two targets with respect to the transmitting array was then recovered by fitting the delay trace associated with the target position (x, z) to the measured delays. The water speed-of-sound was set based on the measured temperature. After determining the target's location with respect to each of the 8 arrays, translation and rotation were applied to each array location to obtain their absolute position with respect to array 1.
A metal wire (40 μm thickness) suspended vertically at the center of the commercial arrays with the inter-array gap was imaged to show the PSFs.
The present approach can be used in dual-modality configurations, where any compatible imaging modality is combined with improved acoustic imaging as described above.
High throughput ultrasound systems allowing full control over a high number of channels (>256) are emerging as a powerful tool to improve imaging. This new technology development driven by volumetric imaging enables real-time control of a high number of elements. For 1D-array geometries this translates to the use of large apertures which improves image quality and offers a wider field-of-view suited for whole organ imaging.
In this section, we explore such configurations by combining three commercial phased arrays and test the improvements achieved by a large aperture with 384 elements and 10 cm aperture. Employing phased arrays, with a wide acceptance angle, offers an efficient way to interrogate the tissue with a limited number of transmit events using diverging waves (DW). Moreover, an auto-regressive filter is applied on virtual receive elements filling the inter-array gaps to mitigate the associated grating lobes.
The multi-array assembly is composed of three P6-3 (ATL/Philips, Amsterdam, Netherlands), each having 128 elements (0.22 mm pitch). The arrays are held together by a stackable 3D-printed manifold designed in-house. The aperture extends over 98.5 mm laterally with inter-array gaps of 9.3 mm resulting in visible grating lobes on bright reflectors. The arrays are interfaced to 2 Vantage 256 system (Verasonics, Kirkland, USA) which are part of the volume imaging package and allowing real-time control and processing. The position of the arrays was first calibrated with wire targets in water. Imaging was then performed at 4.5 MHz using diverging waves. Delay-and-sum beamforming was implemented on a GPU (Titan RTX, Nvidia, Santa Clara, USA) for real-time imaging.
To reduce the grating lobes generated by the physical gaps, 84 virtual receiving elements were created on receive and their associated signals estimated with an auto-regressive filter (
S
f(n+1)=αf(1)Sf(n)+αf(2)Sf(n−1)+ . . . +αf(p)Sf(n−p+1)
with af being the coefficients of the filter. In this work, we chose p=8. The coefficients af are first estimated over the bandwidth of the transducer from the time delayed radiofrequency signals. Then the filter is used to predict signals on the virtual elements.
The improvements in term of lateral resolution at different depths were investigated with wire targets (40 μm diameter) placed at different depth and immersed in degassed water (
The lateral resolution was calculated from the experimental point spread function as the full width half maximum (Table 3). The lateral resolution was determined to improve by a factor of 3 at a depth of 50 mm and a factor of 2 at a depth of 125 mm. At 50 mm and 125 mm, the equivalent f-number of the multi-array aperture is 0.5 (1.8 with 1 array) and 1.25 (4.5 with 1 array) respectively.
As expected, the inter-array gaps induced grating lobes which can be seen in the PSF cross-sections in
Imaging was performed on a multi-purpose ultrasound phantom containing various targets (model 040GSE, CIRS, USA; speed of sound 1540 m/s, attenuation 0.5 dB/MHz/cm). 30 DW were used for imaging with either the central array or all 3 arrays (10 DW per array) (
The array assembly was tested on a healthy volunteer (34 years old) following the local protocol approved by the local Institutional Review Board. Informed consent was received from the volunteer after explaining the protocol. The array was positioned under the ribcage to image the liver. Acquisition was performed on the fly during the scan (real-time beamforming). For comparison, both the 1 array (30 DW) and 3-array (also 30 DW) sequences were acquired at the same time.
The multi-array acquisitions show here more defined structures (mostly vessels) compared to the single array acquisitions.
The array assembly presented in this work allowed imaging of a wide field of view with improved resolution as demonstrated both in vitro and in vivo. Evaluation of the point-spread function on wire targets showed a lateral resolution improvement of a factor of 2 and above compared to using a single array. The use of virtual elements on receive for which signals are predicted with an auto-regressive filter yielded a significant reduction of the gap-related grating lobes. This modular configuration facilitates imaging of entire organs with improved metrics which is here particularly noticeable for deeper targets as seen on the commercial phantom. Initial evaluation of the multi-array configuration on the liver showed enhanced diagnostic capabilities.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/US2020/054009 | 10/2/2020 | WO |
Number | Date | Country | |
---|---|---|---|
62910875 | Oct 2019 | US | |
63074813 | Sep 2020 | US |