|
1.INTRODUCTIONDual-energy computed tomography (DECT) has been widely used in clinical diagnosis, including kidney stone characterization, iodine quantitative examination, and many other applications. Compared to traditional CT, DECT provides two sets of attenuation measurements by exploiting two different energy spectra to achieve the material decomposition and energy-selective imaging, which opens up new diagnosis possibilities. However, the direct decomposed material images from DECT images would suffer from severe noise and noise-induced artifacts, especially under low-dose scan protocols. The main reason is that the material decomposition process suffers an ill-conditioned problem that leads to severe noise in the decomposed material images.1 To tackle this problem, many advanced algorithms have been proposed to improve the signal-to-noise ratio of decomposed material images. These material decomposition algorithms can be generally divided into two categories: model-driven material decomposition methods and data-driven material decomposition methods. The model-driven methods2 generally build a model on the physical properties and the attenuation characteristics, then optimize the objective function to get the decomposed material images. These methods also can be characterized into one-step inversion, projection-domain, and image-domain decomposition methods. One-step inversion is mathematically the most elegant with one-step matrix inversion, but they are very computationally expensive. Projection-domain methods directly decompose the projection data into the basis material sinograms and then reconstruct them with the filter back-projection algorithm. Although the decomposition performance is significant, they require accurate system calibrations that use nonlinear models. Image-domain methods reconstruct each energy bin image and then directly perform material decomposition on image data, which may lead to beam hardening artifacts. On the other hand, the data-driven methods3 train the convolutional neural network (CNN) in an end-to-end manner to obtain accurate decomposed material images. However, these methods have two main problems. First, these methods do not take into account the inherent physics mechanism, their unexplained property obstacles deep learning techniques to be widely applied in clinical. Second, most of these existing algorithms are supervised learning methods. The decomposition performance depends on the quality and quantity of training data pairs. However, high-quality training data pairs are difficult to acquire in clinic, which limits the accuracy of decomposed material images and the generalization performance of models. To address these problems, we propose a nonlocal spectral similarity-induced material decomposition network (NSSD-Net) for DECT. The NSSD-Net couples the model-based material decomposition model with the deep network and optimizes by the iterative shrinkage-thresholding algorithm (ISTA). Considering the structural similarity and spectral correlation underlying DECT images, the nonlocal spectral similarity-based cost function is designed to guarantee network convergence in a self-supervised manner. The main contributions of the proposed NSSD-Net can be summarized as follows: the first one is that the proposed method fully considers the material decomposition physician mechanism and combines the benefits of model-driven methods and data-driven methods. The second one is that the proposed methods use the nonlocal spectral similarity features as prior information and optimize hyper-parameters by self-supervised learning strategy with no ground-truth data, which enhances the generalization of the proposed model. Experiments on clinical data have confirmed the significant decomposition performance of the NSSD-Net. 2.METHODS2.1Model-driven material decomposition modelAccording to image-based decomposition theory, the linear combination of pixel values in the basis material images can represent the linear attenuation coefficient of each pixel in the input images. Based on the above assumption, the formulation of the material composition for DECT can be written as: where U = [μH, μL]T denotes the linear attenuation coefficient of high and low energy spectrum, X = [x1,x2]T denotes the normalized densities of the basis materials, A is the decomposition matrix and can be represented as follows: where μMij denotes the mass attenuation coefficient of the basis materials, I is an identity matrix with the dimension of 2N-by-2N, and N is the total number of pixels in one CT image. Direct decomposition generating decomposed material images with severely degraded SNR, especially at low-dose levels. This is because there is a large condition number on the matrix A, and this can be treated as an ill-posed problem, which makes the decomposition process is sensitive to the noise on the raw CT images. To alleviate this situation, least-square estimation model with regularization R(X) is introduced to suppress decomposed material images noise, which can be expressed as follows: where R(X) = ||X||1 and λ is the hyper-parameters of regularization term. ISTA is a prevailing framework to optimize the decomposition solvation process with non-smooth regularizers. Each iteration of ISTA involves gradient descent update the decomposed material images followed by a shrinkage-threshold step: where ρ is the step size, X(k) and R(k) denote the intermediate variables of the decomposed results, and λ is the hyper-parameter of the regularization term. However, the model-based material decomposition methods are computationally expensive, and it is difficult to select the optimal parameters (i.e., ρ and λ) and the global optimal solution in practical applications. 2.2The proposed NSSD-NetTo address these problems, we propose a self-supervised nonlocal spectral similarity-induced material decomposition network. As shown in Fig. 1, the proposed NSSD-Net consists of two parts: (a) ISTA-based decomposition network and (b) NSS feature generation.
2.3Loss FunctionNSSD-Net takes the noisy DECT image and NSS feature-induced decomposed material image as input, and generates the decomposed material image, denoted by Xi(Nt) as output. We optimize the proposed network by the self-supervised learning strategy and the total loss function can be expressed as: where τ is a constant and we set τ = 1 in this work. 2.4Experimental Dataset and SettingsThe clinical patient study was conducted to assess the proposed NSSD-Net performance. After writing the informed consent from 29 volunteer patients, clinical data were acquired from the GE Discovery CT750 HD scanner with 140kVp and 80kVp. The virtual monochromatic spectral (VMS) images were generated using the GE commercial software. In this work, the VMS images at 90kev and 140kev were selected as the DECT images. In addition, the linear attenuation coefficient of basis materials according to the VMS images reported by the National Institute of Standards and Technology (NIST). Then, we simulated the noisy DECT data by the previous work5 to further analyze the noise suppression performance among different material decomposition methods. In the experiment, 1000 simulated noisy VMS image pairs (i.e., 90kev and 140kev) from 22 patients were used for training and 188 simulated noisy VMS images pairs from the remaining patients were used for testing. The network was implemented on an NVIDIA Tesla P40 GPU based on the PyTorch framework and using the Adam optimizer. The initial learning rate, batch size, epoch, and iterations were set to 1e−3, 1, 50, and 100, respectively. During training, the number of CNN basic module was set to 8. According to experience, the initial gradient descent step ρ = 0.5, the hyper-parameter τ =1, and the threshold θ = 0.1. 3.RESULTSFig. 2 shows the decomposed material images of different methods for two test cases. The first column is the ground-truth decomposed material images. It can be observed that both direct matrix inversion method (DIMD) and statistic iterative based decomposition method (Iterative MD) failed to properly distinguish basis materials, and the decomposed material images were heavily corrupted by the noise and noise-induced artifacts. Compared to the DIMD and Iterative MD, the data-based material decomposition methods (i.e., CD-Convnet and ISTA-Net) remove the noise and artifacts further and provide accurately estimates of different materials. However, compared to NSSD-Net, the data-based material decomposition methods tend to over-smoothen the images, especially in the structural edge and soft tissue area. The results demonstrate that the self-supervised nonlocal spectral similarity prior can provide the spectral data structure feature information and effectively guide the material decomposition process. Fig. 3 shows the profile comparison indicated by the orange lines in Fig. 2. It can be judged that the NSSD-Net achieves a good balance between bias control and noise suppression. In order to quantitatively validate the decomposed performance of the proposed NSSD-Net, the peak signal-to-noise ratio (PSNR), structure similarity index measure (SSIM) and root mean square error (RMSE) metrics are calculated. As shown in Tab. 1, both NSSD-Net results (i.e. water and bone equivalent fractions), achieve relative superior quantitative metrics than the other decomposition methods expect for the ISTA-Net. The possible reason is that the ISTA-Net utilize the fully supervised manner to train the network, while the proposed NSSD-Net trained without the labeled data. It is worth noting that the proposed method outperforms the ISTA-Net in terms of the SSIM and confirm that the NSSD-Net can provide the global and local structure information from DECT images. Table 1.Quantitative results of different material decomposition methods.
4.CONCLUSIONIn this paper, we propose a novel material decomposition framework for DECT based on the self-supervised nonlocal spectral similarity prior. Specifically, the proposed NSSD-Net combines the traditional model-based decomposition model with the data-based method and optimizes the objective function by the nonlocal spectral similarity prior. The experimental results with clinical data show that the NSSD-Net can obtain accurate decomposed material images of dual-energy. ACKNOWLEDGMENTSThis work was supported in part by the National Natural Science Foundation of China under Grant U21A6005 and Grant U1708261, the National Key R&D Program of China under Grant No.2020YFA0712200, and Young Talent Support Project of Guangzhou Association for Science and Technology. REFERENCESW. A. Kalender, E. Klotz, and L. Kostaridou,
“An algorithm for noise suppression in dual energy CT material density images,”
IEEE Transactions on Medical Imaging, 7
(3), 218
–224
(1988). https://doi.org/10.1109/42.7785 Google Scholar
N. Ducros, J. Abascal, and B. Sixou,
“Regularization of nonlinear decomposition of spectral x-ray projection images,”
Medical Physics, 44
(9), e174
–e187
(2017). https://doi.org/10.1002/mp.12283 Google Scholar
Z. Shi, H. Li, and Q. Cao,
“A material decomposition method for dual-energy CT via dual interactive Wasserstein generative adversarial networks,”
Medical Physics, 48
(6), 2891
–2905
(2021). https://doi.org/10.1002/mp.v48.6 Google Scholar
H. Zhang, D. Zeng, J. Lin, and J. Ma,
“Iterative reconstruction for dual energy CT with an average image-induced nonlocal means regularization,”
Physics in Medicine and Biology, 62
(13), 5556
–5574
(2017). https://doi.org/10.1088/1361-6560/aa7122 Google Scholar
D. Zeng, J. Huang, Z. Bian, and J. Ma,
“A Simple Low-Dose X-Ray CT Simulation From High-Dose Scan,”
IEEE Transactions on Nuclear Science, 62
(5), 2226
–2233
(2015). https://doi.org/10.1109/TNS.2015.2467219 Google Scholar
|