A problem in the volumetric medical image analysis is to characterize the 3D local structure of tumors across various scales, because the size and shape of tumors varies largely in practice. Such underlying scales of tumors also provide useful clinical information, correlating highly with probability of malignancy. There are a number of previously proposed approaches addressing this problem. However, these prior art approaches are prone to be sensitive to signal noises and their accuracy degrades when the target shapes differ largely from an isolated Gaussian. In the medical domain, these constraints are too strong since many tumors appear as irregular shapes within noisy background signals.
This invention is directed to methods for the robust estimation of the covariance matrix that describes the spread and 3-dimensional (3D) orientation of the structure of interest. Given an input signal, a mean shift-based gradient ascent is performed for an extended mean-shift vector using all the available data points for each analysis bandwidth. The data points that converge to the same point are grouped together, forming a set of local structure candidates. These convergence candidates can be interpreted as spatial extrema of the signal. Then, for each candidate, the underlying scale is determined by estimating a covariance matrix by a constrained least-squares method. Finally, for each candidate, a stability test is performed across the analysis scales, resulting in an optimal scale estimate for each local target. As a result, one can find a signal's local scales that can vary spatially.
These methods utilize an algorithm, referred to herein as a mean shift-based bandwidth selection algorithm, for analyzing general discretized signals and estimating fully parameterized covariance matrices. With the methods of the invention, it becomes possible to address the problem of representing local structures in images. The robust mode and scale selection of the mean shift-based bandwidth selection algorithm together with the consideration of the fully parameterized covariance matrix also enables one to estimate a tumor's scale in more flexible and robust manner, mitigating the aforementioned shortcomings of the previous methods. Since many target objects in the medical domain possess complex 3D structures, the methods of the invention can be deployed for a number of different application scenarios.
FIGS. 3-4- are images depicting results of analyses performed using a preferred method of the invention.
1. Fixed-Bandwidth Mean Shift Vector for Continuous Signals
A medical image can be represented by a D-dimensional continuous signal ƒ:RD→R evaluated at n d-dimensional points xi, and the uncertainty associated with each point xi can be represented by a D×D matrix Hi, for an iε1, . . . n. The matrices Hi are referred to as bandwidth matrices. The signal can have one or more extrema. An extrema of the signal can be associated with a location of a tumor or other target object. Referring now to
where a Gaussian kernel Φ(x−μ; H) can be defined as
with D2(x,μ; H)=(x−μ)T H−1(x−μ) and H−1 is a weighted harmonic mean of the bandwidth matrices,
The weights can be defined as
and can be normalized to unity.
Eq. (3) can be used to locate spatial extrema μ of ƒ given a fixed analysis bandwidth H as follows. First, make an estimate of an extrema, μ1, and then evaluate m1 (x; H) for this extrema from Eq. (3). If y1 is used to denote the result of the first term of Eq. (3) for the initial estimate of μ1, then, for the next iteration of Eq. (3), x is replaced with y1 and μ1 is replaced with m1 (x; H), denoted as μ2. This process can be repeated, each time replacing the second term of Eq. (3) with the result of the first term from the previous iteration, and evaluating the first term on the previous evaluation of m (x; H). For each iteration k of Eq. (3), the resulting difference will converge to zero. The value of μk for which the extended mean shift vector m (x; H) is sufficiently close to zero can be taken as an extrema of the signal ƒ. The data space of the signal can be partitioned by grouping data points that converge into the same extrema.
2. Constrained Least-Squares Solution of Covariance Matrices
The next step, step 12 of
The covariance Σ can be defined by the equation m(x)≈−H(Σ+H)−1(x−μ) when ƒ can be approximated by a Gaussian. This can be rewritten in the following simple form,
ΣH−1mi≈bi≡μ−xi−mi (4)
Considering all the trajectory points {xi: i=1, . . . , tu} that converge to an extremum μ, an over-complete set of linear equations can be contructed:
AΣ≈B, (5)
where
A=(m1; . . . ; mt
B=(bi; . . . ; bt
and where Σ is a symmetric, positive definite matrix in RD×D. The covariance can be estimated by a constrained least-squares solution of Eq. (5). This solution yields the following closed form,
Σ*=UpΣP−1U{tilde over (Q)}Σ{tilde over (Q)}U{tilde over (Q)}TΣP−1UPT; (6)
where the solution involves the following symmetric Schur decompositions: ATA=UPΣP2UPT and BTB≡Q with {tilde over (Q)}=ΣPUPTQUPΣP=U{tilde over (Q)}Σ{tilde over (Q)}2U{tilde over (Q)}T. This closed form can be found by determining the unique minimizer for an area, g(Y)≡˜AY−BY−T∥F2, where Σ=YYT.
3. Scale Selection
The above two steps can result in pairs of center location and covariance estimates {μh; Σh} for each analysis bandwidth H. The next step, step 13, concerns finding the optimal estimate of the target structures analyzed across a range of bandwidths. This optimal estimate can be found by using a form of the Jensen-Shannon divergence,
Given a neighborhood parameter a, the divergence can be computed for each analysis bandwidth h. The extremum of the divergences JS(h) across the bandwidths can provide a final scale estimate that is most stable over a range of scales.
The stability test described requires the set of analysis bandwidths a priori. In one embodiment of the invention, H=hI and h is varied with a constant step. In order to achieve higher performance for the scale selection, it is preferred to have more densely distributed analysis bandwidths. However, such dense sampling can prohibitively enlarge the search space, especially when a fully parameterized H is considered.
4. Algorithm for Local Multi-Scale Analysis
In some application scenarios, the task to be solved is to represent the scale of local structure whose rough location is provided by another means. An example is the structural analysis of tumors whose locations in a volumetric image are provided manually by radiologist. The simplest strategy in such a case is to perform the mean shift iteration only from the given marker point. The convergence point serves as the tumor center estimate and all the trajectory points from the marker are used to estimate the scale. This naive strategy can fail when the provided locations are contaminated by uncertainties and when the iteration converges too soon, forcing the Eq.(4) to be under-complete and rank-deficient. These issues can be addressed by the following steps, depicted in
A 3D domain implementation of the local multi-scale analysis algorithm described in Section 5 is evaluated with high-resolution computerized tomography (HRCT) images of 14 patients displaying pulmonary tumors. A total of 44 analysis scales with 0.25 interval h=(0.252; . . . ; 112) and a=1 were used. The rough location of the tumors were provided. As a pre-process, volumes of interest of size 32×32×32 are extracted using the markers.
It is to be understood that the present invention can be implemented in various forms of hardware, software, firmware, special purpose processes, or a combination thereof. In one embodiment, the present invention can be imnplemented in software as an application program tangible embodied on a computer readable program storage device. The application program can be uploaded to, and executed by, a machine comprising any suitable architecture.
Referring now to
The computer system 101 also includes an operating system and micro instruction code. The various processes and functions described herein can either be part of the micro instruction code or part of the application program (or combination thereof) which is executed via the operating system. In addition, various other peripheral devices can be connected to the computer platform such as an additional data storage device and a printing device.
It is to be further understood that, because some of the constituent system components and method steps depicted in the accompanying figures can be implemented in software, the actual connections between the systems components (or the process steps) may differ depending upon the manner in which the present invention is programmed. Given the teachings of the present invention provided herein, one of ordinary skill in the related art will be able to contemplate these and similar implementations or configurations of the present invention.
While the present invention has been described in detail with reference to a preferred embodiment, those skilled in the art will appreciate that various modifications and substitutions can be made thereto without departing from the spirit and scope of the invention as set forth in the appended claims.
This application claims priority from “Robust Scale-Space Analysis of 3D Local Structures in medical Images”, U.S. Provisional Application No. 60/488,603 of Okada, et al, filed Jul. 18, 2003, the contents of which are incorporated herein by reference.
Number | Date | Country | |
---|---|---|---|
60488603 | Jul 2003 | US |