The present application claims priority from Japanese Patent Application JP 2021-064617 filed on Apr. 6, 2021, the content of which is hereby incorporated by reference into this application.
The present invention relates to an apparatus and a method for processing medical images obtained by a medical imaging apparatus such as an X-ray CT (Computed Tomography) apparatus or the like and, more particularly, to technologies for reducing metal artifacts which may be introduced by a metal if included in an object under examination.
The X-ray CT apparatus which is an example of medical imaging apparatus irradiates an object under examination with X rays from surroundings of the object under examination to acquire projection data at a plurality of projection angles, and projects back the projection data in order to reconstruct a tomographic image of the object under examination for use in diagnostic imaging. A metal, e.g., a plate for bone fixation or the like, if included within the object under examination, introduces metal artifacts which are artifacts affected by the metal in a medical image to interfere with the diagnostic imaging. The technology to reduce the metal artifacts is referred to as MAR (Metal Artifact Reduction), and various techniques have been developed such as a beam hardening correction technique, a linear interpolation technique, a deep learning technique and the like, but each technique has its advantages and disadvantages.
For example, the literature, Y. Zhang and H. Yu, “Convolutional Neural Network Based Metal Artifact Reduction in X-Ray Computed Tomography,” in IEEE Transactions on Medical Imaging, vol. 37, no. 6, pp. 1370-1381, June 2018, discloses a combination of the advantages of the techniques by performing the beam hardening correction technique and the linear interpolations technique to obtain images with metal artifacts reduced, and by applying an original image and the obtained images to the deep learning technique as input images.
However, in the above literature, although the metal artifacts are reduced, a degradation in image quality may be caused in a region less affected by the metal artifacts, for example, in a region away from the metal.
It is accordingly an object of the invention to provide a medical image processing apparatus and a medical image processing method which are capable of reducing metal artifacts and preserving the image quality even in a region less affected by the metal artifacts.
To achieve the object, an aspect of the present invention provides a medical image processing apparatus including an arithmetic section that reconstructs a tomographic image from projection data of an object under examination including a metal. The arithmetic section acquires a machine learning output image that is output when the tomographic image is input to a machine learning engine that machine-learns to reduce metal artifacts, and composites the machine learning output image and the tomographic image to generate a composite image.
Another aspect of the present invention provides a medical image processing method to reconstruct a tomographic image from projection data of an object under examination including a metal, which includes the steps of: acquiring a machine learning output image that is output when the tomographic image is input to a machine learning engine that machine-learns to reduce metal artifacts; and compositing the machine learning output image and the tomographic image to generate a composite image.
According to the present invention, a medical image processing apparatus and a medical image processing method can be provided to reduce metal artifacts without losing detailed anatomy in a metal region.
Embodiments of a medical image processing apparatus and a medical image processing method according to the present invention will now be described with reference to the accompanying drawings. It is noted that throughout the following description and the accompanying drawings, like reference signs are used to indicate components/elements having like functional configurations for the purpose of avoiding repeated description.
The arithmetic section 2 controls operation of each component, which specifically is CPU (Central Processing Unit), MPU (Micro Processor Unit) and/or the like. The arithmetic section 2 loads and executes programs and data required to execute the programs which are stored in the storage device 4, into the memory 3 in order to perform various types of image processing on a medical image. The memory 3 stores the progress of a program and/or arithmetic processing which are executed by the arithmetic section 2. The storage device 4 stores programs executed by the arithmetic section 2 and data required to execute the programs, which specifically is HDD (Hard Disk Drive), SSD (Solid State Drive) and/or the like. The network adapter 5 connects the medical image processing apparatus 1 to the network 9 such as LAN, telephone lines, the Internet and/or the like. Various data handled by the arithmetic section 2 may be transmitted to and received from the exterior of the medical image processing apparatus 1 via the network 9 such as LAN (Local Area Network) or the like.
The display apparatus 7 displays processing results of the medical image processing apparatus 1, and the like, which specifically is a liquid crystal display and/or the like. The input apparatus 8 is an operation device through which an operator provides operation instructions to the medical image processing apparatus 1, which specifically is a keyboard, a mouse, a touch panel, and/or the like. The mouse may be replaced with another pointing device such as a track pad, a track ball, and the like.
The medical imaging apparatus 10 is an X-ray CT (Computed Tomography) apparatus that acquires, for example, projection data of an object under examination and reconstructs a tomographic image from the projection data, which will be described later with reference to
The machine learning engine 12 is generated by machine-learning to reduce the metal artifacts included in the tomographic image, and is configured using, for example, CNN (Convolutional Neural Network). For the generation of the machine learning engine 12, for example, a tomographic image without metal is used as a teacher image. For the input image, images obtained by adding a metal region to the teacher image in question are sequentially projected to create projection data including the metal, and thus a tomographic image including metal artifacts obtained by projecting back the projection data in question is used.
An overall configuration diagram of an X-ray CT apparatus 100 which is an example of the medical imaging apparatus 10 is described with reference to
The X-ray tube 211 is an apparatus that irradiates with X rays the object under examination 210 placed on the bed 240. The high voltage generator section 217 generates a high voltage in accordance with a control signal transmitted from the X-ray control section 216, and the high voltage is applied to the X-ray tube 211, so that the object under examination is irradiated with X rays from the X-ray tube 211.
The collimator 213 is apparatus that limits the irradiation range of the X rays emitted from the X-ray tube 211. The X-ray irradiation range is set in accordance with a control signal transmitted from the collimator control section 221.
The detector 212 detects the X rays passing through the object under examination 210 to measure spatial distribution of the passing X rays. The detector 212 is disposed on the opposite side from the X-ray tube 211, and a plurality of detecting elements is two-dimensionally arranged in a plane facing the X-ray tube 211. A signal measured by the detector 212 is amplified at the preamplifier 222, and then converted to a digital signal at the A/D convertor 223. Then, various types of correction processing are performed on the digital signal in order to acquire projection data.
The drive section 214 rotates the X-ray tube 211 and the detector 212 around the object under examination 210 in accordance with a control signal provided from the scanner control section 218. The projection data from a plurality of projection angles is acquired through the rotation of the X-ray tube 211 and the detector 212 and the irradiation and detection of the X rays. A unit of data collection at each projection angle is referred to as a “view”. In the array of the two-dimensionally arranged detecting elements of the detector 212, the rotation direction of the detector 212 are referred to as a “channel” and a direction perpendicular to a channel is referred to as a “row”. The projection data is identified by a view, a channel, and a row.
The bed control section 219 controls the operation of the bed 240 such that the bed 240 remains at rest while X-ray irradiation and detection occur, and moves with constant velocity in the Z axis direction which is the direction of the body axis of the object under examination 210. Scanning performed while the bed 240 remains at rest is referred to as an axial scan, and scanning performed while the bed 240 is moving is referred to as a helical scan.
The central control section 215 controls the abovementioned operation of the scanner 200 in accordance with instructions from an operation unit 250 which is described as follows. The operation unit 250 has a reconstruction processing section 251, an image processing section 252, a storage section 254, a display section 256, an input section 258, and the like.
The reconstruction processing section 251 reconstructs a tomographic image by projecting back the projection data acquired by the scanner 200. The image processing section 252 performs various types of image processing on the tomographic image in order to obtain an image suitable for diagnosis. The storage section 254 stores projection data, tomographic images, and images after the image processing. The display section 256 displays tomographic images and images after the image processing. The input section 258 is used by the operator setting acquisition conditions of projection data (a tube voltage, a tube current, a scan speed, and/or the like) and reconstruction conditions of a tomographic image (reconstruction filter, FOV size, and/or the like).
The operation unit 250 may be the medical imaging processing apparatus 1 illustrated in
An example of the flow of processing executed in a first embodiment is described for each step with reference to
The arithmetic section 2 acquires a tomographic image I_ORG of the object under examination including a metal. Due to the metal included in the object under examination, the tomographic image I_ORG includes metal artifacts. An example of the metal artifacts is shown in
The arithmetic section 2 acquires a machine learning output image I_MAR to be output when the tomographic image I_ORG is input to the machine learning engine 12 that has machine-learned to reduce metal artifacts. In the machine learning output image I_MAR, the metal artifacts are reduced, but the image quality may be degraded in a region less affected by the metal artifacts, for example, in a region away from the metal.
The arithmetic section 2 composites the machine learning output image I_MAR acquired in S302 and the tomographic image I_ORG acquired in S301. In the machine learning output image I_MAR, the image quality may be degraded in the region less affected by the metal artifacts, whereas in the tomographic image I_ORG, a degradation in image quality is not caused in the region less affected by the metal artifacts. Therefore, the machine learning output image I_MAR and the tomographic image I_ORG are composited together to generate a composite image so that the metal artifacts are reduced and the image quality is preserved in the region less affected by the metal artifacts. The generated composite image is displayed on the display apparatus 7 or stored in the storage device 4.
An example of the flow of processing in S303 is described for each step with reference to
The arithmetic section 2 acquires a weight map in which weight coefficients w which are real numbers between zero and one (inclusive) are mapped. The weight map I_w is generated from, for example, the following equation.
I_w=|I_ORG−I_BHC| (Eq. 1)
where I_BHC is a beam hardening correction image that is obtained by applying the beam hardening correction technique to the tomographic image I_ORG.
The beam hardening correction image I_BHC is obtained by, for example, using the following procedure. Initially, a metal pixel is extracted from the tomographic image I_ORG. Then, the projection value corresponding to the metal pixel is corrected in projection data P ORG used for generation of the tomographic image I_ORG, so that projection data P_BHC is obtained. A projection value corresponding to the metal pixel is corrected by using the projection value in question and a length of the metal pixel on a projection line associated with the projection value in question. Specifically, the longer the length of the metal pixel on the projection line or the larger the projection value, the greater the correction strength becomes. And the projection data P_BHC is projected back and then is added to or subtracted from the tomographic image I_ORG in order to obtain the beam hardening correction image I_BHC.
The weight map I_w may be generated from the following equation.
I_w=|I_ORG−I_LI| (Eq. 2)
where I_LI is a linear interpolation image that is obtained by applying the linear interpolations technique to the tomographic image I_ORG.
The linear interpolation image I_LI is obtained by, for example, using the following procedure. Initially, a metal pixel is extracted from the tomographic image I_ORG. Then, in the projection data P ORG used to generate the tomographic image I_ORG, a projection value corresponding to the metal pixel is linearly interpolated with a projection value adjacent thereto to generate a projection value which is substituted, thereby obtaining projection data P_LI. Then, the projection data P_LI is back-projected and the extracted metal pixels are composited, so that the linear interpolation image I_LI is obtained.
Because the beam hardening correction image I_BHC and the linear interpolation image I_LI are images with reduced metal artifacts, the weight map I_w generated from Equation 1, 2 is also an artifact map indicating a probability distribution of presence of metal artifacts.
The arithmetic section 2 uses the weight coefficient w in the weight map I_w acquired at S501 to composite the machine learning output image I_MAR and the tomographic image I_ORG to create a composite image I_CMP. For example, the following equation is used for generation of the composite image I_CMP.
I_CMP=w·I_MAR+(1−w)·I_ORG (Eq. 3)
According to Equation 3, the product of each pixel value of the machine learning output image I_MAR multiplied by the weight coefficient w which is each pixel value in the weight map I_w is added to the product of each pixel value of the tomographic image I_ORG multiplied by (1−w). Stated another way, in a region with many metal artifacts, the ratio of machine learning output image I_MAR is higher, and in a region with fewer metal artifacts, the ratio of the tomographic image I_ORG is higher. As a result, in the composite image I_CMP, the metal artifacts are reduced and the image quality is preserved in the region less affected by the metal artifacts.
The beam hardening correction image I_BHC is an image obtained based on correction of a projection value corresponding to a metal pixel. Therefore, if the weight map I_w of Equation 1 is used, the artifacts in the region more affected by the metal pixel can be further reduced. The linear interpolation image I_LI is an image obtained by linearly interpolating the projection value corresponding to the metal pixel with a projection value adjacent thereto. Therefore, if the weight map I_w of Equation 2 is used, the artifacts directly introduced by the metal can be further reduced.
Because the metal artifacts become smaller in size with an increasing distance from the metal pixel extracted from the tomographic image I_ORG, the weight coefficient w becomes smaller with an increasing distance from the metal pixel. Also, the larger the pixel value of the metal pixel, the larger the metal artifact becomes. Therefore, the larger the pixel value of the metal pixel, the higher the weight coefficient w becomes.
For the weight coefficient w, by using any of the tomographic image I_ORG, the machine learning output image I_MAR, the beam hardening correction image I_BHC, and the linear interpolation image I_LI, the weight map I_w may be adjusted depending on tissue in the object under examination and/or tissue such as air and/or the like. For example, well-known thresholding-based segmentation is used to divide the tomographic image I_ORG into regions of metal, the object under examination other than metal, and air, and the weight coefficient w may be adjusted using prior information on images such as the machine learning output image I_MAR (w=1) for the metal, the weight map I_w for the object under examination other than the metal, and the tomographic image I_ORG (w=0) for air.
Also, the weight coefficient w may be adjusted as appropriate by the operator. For example, an adjustment coefficient set by the operator may be used to adjust the weight coefficient w. The adjustment coefficient is a real number between zero and one (inclusive), and all the weight coefficients w are simultaneously adjusted by multiplying the weight map I_w by the adjustment coefficient. In short, all the weight coefficients w included in the weight map I_w are multiplied by the same adjustment coefficient.
An example of manipulation windows used for setting of the adjustment coefficient is described with reference to
The operator can use the manipulation screen illustrated in
It is noted that, without limitation to the above artifacts introduced by a metal, similar means can be used to reduce artifacts that are introduced by a high absorber having a high X-ray absorption coefficient such as bone, a contrast medium, and the like other than a metal, and artifacts that are introduced by a low absorber having an extremely lower X-ray absorption coefficient, such as a lung field, an intestinal canal, and the like, in comparison with tissues of an object under examination.
By the flow of processing described above, a composite image can be provided in which the metal artifacts are reduced and the image quality can be preserved even in a region less affected by the metal artifacts.
The composite image I_CMP generated by compositing the tomographic image I_ORG and the machine learning output image I_MAR output from the machine learning engine 12 has been described in the first embodiment. In a second embodiment, a correction image with reduced metal artifacts is described which is acquired by inputting, to the machine learning engine 12, the tomographic image I_ORG and an artifact map indicating a probability distribution of presence of metal artifacts. The hardware configuration of the medical image processing apparatus 1 in the second embodiment is the same as that in the first embodiment and a description is omitted.
An example of the flow of processing executed in the second embodiment is described for each step with reference to
In like manner with S301, the arithmetic section 2 acquires a tomographic image I_ORG of an object under examination including a metal.
The arithmetic section 2 acquires an artifact map indicating a probability distribution of presence of metal artifacts. The artifact map may be generated using, for example, Equations 1 and 2.
The arithmetic section 2 inputs, to the machine learning engine 12, the artifact map acquired in S702 and the tomographic image I_ORG acquired in S701. The machine learning engine 12 receives the tomographic image I_ORG and the artifact map, and outputs a correction image in which the metal artifacts are reduced and the image quality is preserved even in the region less affected by the metal artifacts.
The arithmetic section 2 acquires the correction image output from the machine learning engine 12 in S703. The acquired correction image is displayed by the display apparatus 7 and/or stored in the storage device 4.
By the flow of processing described above, a correction image can be obtained in which the metal artifacts are reduced and the image quality is preserved even in a region less affected by the metal artifacts. It is to be understood that, in S703, the beam hardening correction image I_BHC and/or the linear interpolation image I_LI may be additionally input to the machine learning engine 12. Additionally inputting the beam hardening correction image I_BHC and/or the linear interpolation image I_LI allows the machine learning engine 12 to output a correction image with further reduced metal artifacts.
A plurality of example embodiments according to the present invention has been described. It is to be understood that the present invention is not limited to the above examples and may be embodied by modifying components thereof without departing from the spirit or scope of the present invention. Further, a plurality of components disclosed in the above examples may be combined as appropriate. Further, several components of all the components described in the above examples may be omitted.
Number | Date | Country | Kind |
---|---|---|---|
2021-064617 | Apr 2021 | JP | national |