Open Access
15 February 2024 Toward the use of diffuse reflection spectroscopy for intra-operative tissue discrimination during sarcoma surgery
Author Affiliations +
Abstract

Significance

Accurately distinguishing tumor tissue from normal tissue is crucial to achieve complete resections during soft tissue sarcoma (STS) surgery while preserving critical structures. Incomplete tumor resections are associated with an increased risk of local recurrence and worse patient prognosis.

Aim

We evaluate the performance of diffuse reflectance spectroscopy (DRS) to distinguish tumor tissue from healthy tissue in STSs.

Approach

DRS spectra were acquired from different tissue types on multiple locations in 20 freshly excised sarcoma specimens. A k-nearest neighbors classification model was trained to predict the tissue types of the measured locations, using binary and multiclass approaches.

Results

Tumor tissue could be distinguished from healthy tissue with a classification accuracy of 0.90, sensitivity of 0.88, and specificity of 0.93 when well-differentiated liposarcomas were included. Excluding this subtype, the classification performance increased to an accuracy of 0.93, sensitivity of 0.94, and specificity of 0.93. The developed model showed a consistent performance over different histological subtypes and tumor locations.

Conclusions

Automatic tissue discrimination using DRS enables real-time intra-operative guidance, contributing to more accurate STS resections.

1.

Introduction

Soft tissue sarcoma (STS) is a rare form of cancer that accounts for <1% of all cancer cases worldwide.1 More than 50 histological subtypes have been identified arising from various mesenchymal tissue types, with the majority of cases located in the extremities, trunk, and retroperitoneum.2 Patients can be asymptomatic for a long time, and as a result, the tumor has usually grown significantly by the time it is diagnosed.

The primary treatment for sarcomas is surgery. Achieving a complete resection of the tumor with negative resection margins (no tumor cells at the resection edge) is crucial for patient outcomes. Positive resection margins are associated with an increased risk of local recurrence and worse prognosis.35 Unfortunately, identifying the tumor boundaries during surgery can be challenging and the removal of too much healthy tissue or surrounding critical structures may cause complications or difficulties with reconstruction. The incidence of positive resection margins varies per sarcoma location and subtype. Incomplete resection rates of 11 to 33% are reported for STS in general,616 which rises up to 47% for retroperitoneal located sarcomas.6,7 In addition, myxofibrosarcoma is known for its particularly high local recurrence rate, due to an infiltrative growth pattern, which makes it difficult for surgeons to identify the tumor borders.1720 Therefore, additional therapies, such as (post-operative) radiotherapy or re-excisions, may be required to improve patient prognosis. Overall, high-grade sarcoma is associated with a relatively low 5-year survival rate of 60%.

Current margin assessment techniques for sarcoma surgery are limited. Frozen sections can be obtained during surgery for a simple histopathological examination of specific locations, but this process takes 30 to 60 min and the accuracy is not optimal, especially for adipocytic and (myo)fibroblastic sarcomas.21 The definitive resection margin status is determined by sectioning the complete excised specimen and applying the full-range of histopathological techniques. However, it may take up to 10 days before these results are available. This highlights an unmet need for an intra-operative tissue discrimination technique that can accurately and in real-time determine the resection margin status during sarcoma surgery, to ensure complete resections.

Optical techniques have emerged as a potentially valuable tool for tissue sensing purposes in the medical field. For STS, a couple of optical techniques have been studied in recent years, including near-infrared autofluorescence,22 Raman spectroscopy,23,24 mid-infrared spectroscopy,25 and indocyanine green fluorescence imaging (ICG).26 While some of these methods have demonstrated promising outcomes, not every approach is readily available, has a sufficient measurement volume, is offered as an automatic tumor detection model, or achieved an adequate level of accuracy for clinical application.

Diffuse reflectance spectroscopy (DRS) is another optical technique, which is non-invasive, real-time, easy to use, and does not require administration of contrast agents. It provides unique information about the tissue composition by illuminating the tissue with a broadband light source via an optical fiber, and subsequently measuring the reflectance spectrum after the light underwent multiple scattering and absorption interactions. Previous studies have successfully used DRS for tumor classification and surgical margin assessment in other cancer types, including breast,2729 colorectal,3032 head and neck,33 and lung tumors.34 In this study, we investigate the potential of DRS to distinguish tumor tissue from healthy tissue during sarcoma surgery. The tissue classification performance is evaluated in freshly excised sarcoma specimens from multiple tumor locations and various histological subtypes. To the best of our knowledge, this is the first study on the use of broadband DRS for margin assessment in STS surgery.

2.

Materials and Methods

2.1.

Diffuse Reflectance Spectroscopy System

The DRS system used in this study contains a halogen broadband light source (Avantes, AvaLight-HAL, 360 to 2500 nm) and two spectrometers. The first spectrometer covers the visible wavelength range (Avantes, AVASPEC-HS2048XL-EVO, 200 to 1160 nm) and the second spectrometer covers the near-infrared range (Avantes, AVASPECNIR256-1.7-RS, 900 to 1750 nm). The handheld probe consists of two fibers (400  μm diameter), with a source-to-detector distance of 2 mm. All components were controlled by in-house developed MATLAB software, which was also used to process the data.

2.2.

Patient Inclusion

Measurements were performed on freshly excised specimens from patients who were diagnosed with sarcoma and underwent a surgical resection at the Antoni van Leeuwenhoek Hospital - Netherlands Cancer Institute. This ex vivo study has been performed under the approval of the Hospital Ethics Review Board (IRBm23-074), and all patients have given permission for the further use of their data and biological materials for scientific research.

2.3.

Measurement Protocol

The sarcoma specimens were collected from the operating room directly after surgical resection. Before performing the DRS measurements, the tumor region was identified using palpation and ultrasound imaging with a 15 MHz transducer, interpreted by an expert technical physician. Next, DRS measurements were performed successively in the tumor region and locations with different types of healthy tissue. For every measured location, a corresponding US image was saved for further review by an expert. Ground truth labels were assigned based on the expert US annotations. The number of measurement locations per tissue type depended on the tumor and specimen size. To prevent potential patient-specific biases, the measurements within each patient were acquired at locations with a minimum separation of at least 2 cm. This measurement workflow is summarized in Fig. 1, in which also two example US images are shown with their corresponding tissue labels.

Fig. 1

Measurement workflow. First, the location of the tumor area was determined by an expert using palpation and ultrasound imaging (shown by the dashed black line in the illustrations). Next, DRS measurements were performed at multiple locations in this area. Subsequently, surrounding locations with healthy tissue types (e.g., fat, muscle, skin) were selected by an expert using palpation and ultrasound imaging. Here, DRS measurements were performed as well. For each measured location, a corresponding US image was saved for further review by an expert. The US images in this figure show examples of sarcoma and healthy (fat) measurement locations of a leiomyosarcoma resection specimen.

JBO_29_2_027001_f001.png

2.4.

Data Preprocessing

Several preprocessing steps were employed to optimize the raw DRS data for accurate tissue discrimination analysis. The DRS data were calibrated to correct for the sensitivity of the system and the ambient light, by measuring a white reference (Spectralon Avantes WS-2, Avantes, Apeldoorn, the Netherlands) and a dark reference (switching off the light source), as described in detail in a previous study.35 To create one broad spectrum per measurement location, the results from the visible and near-infrared spectrometers were stitched together in the DRS software by using the shared wavelength overlap. The spectral regions before 400 nm and after 1600 nm were removed due to low signal-to-noise ratio, leaving a wavelength range of 400 to 1600 nm for further analysis.

All DRS spectra were normalized using standard normal variate (SNV) normalization, to reduce non-tissue-related variations in the reflectance intensities.36 SNV normalization involves subtracting the mean of the spectrum and dividing it by the standard deviation of the spectrum.

2.5.

Data Analysis

To predict the tissue types of the measured locations, we trained a supervised classification model. The input to the model consisted of the SNV-normalized broadband DRS spectra, obtained from the resection specimens, together with the tissue type labels. We used a weighted k-nearest neighbors (KNN) model with a 10-fold cross-validation approach. In this approach, the dataset was randomly divided into 10 subsets. Each subset was used as a validation set once while the remaining nine subset partitions were used as a training set. This process was repeated 10 times, until the performance had been assessed for all samples in the dataset.

To assess the tissue classification performance of the model, the accuracy, area under the receiver operating characteristic curve (AUC), sensitivity, specificity, and Matthews correlation coefficient (MCC) were calculated. The MCC accounts for any class imbalance in the dataset. The first four metrics have an outcome range from 0 (indicating poor prediction) to 1 (indicating perfect prediction), whereas the MCC can range from 1 to 1.

Well-differentiated liposarcomas (WDLS, also called atypical lipomatous tumors when present in the extremities) are currently being considered as borderline malignant or non-malignant. In terms of clinical relevance, this subtype might therefore be less important to detect with DRS. On the other hand, these tumors are originating from fat tissue, which may complicate the discrimination from healthy fat tissue. While similar studies22,23 have excluded WDLS during data analysis for this reason, in this study, we performed multiple experiments in which WDLS was considered as a separate tumor class (in a multiclass model), included in the sarcoma class (in binary model) and excluded (in both a multiclass and binary model). This approach allowed us to evaluate the performance of DRS in differentiating between WDLS and healthy fat tissue as well.

For all analyses, the results when using a binary classification model that distinguishes tumor tissue from all healthy tissues were compared to the results when using a multiclass classification model that differentiates between all different tissue types present in the resection specimens. A Chi-squared test with a significance level of 0.05 was used to assess differences in tumor classification accuracy between patients who received neoadjuvant therapy and those who did not.

3.

Results

3.1.

Patient and Tumor Characteristics

Freshly excised sarcoma specimens from 20 patients were included in this study. The patient and tumor characteristics are summarized in Table 1. The patient population consisted of 9 females (45%) and 11 males (55%), with a mean age of 58.4±15.4 years. A majority of the patients did not receive any neoadjuvant therapy (65%), whereas neoadjuvant chemotherapy or radiotherapy was received by 10% and 25% of the patients, respectively. Tumors were mainly located in the extremities (55%).

Table 1

Patient and tumor characteristics.

Number of patients (%)
Age (years)58.4±15.4
Gender
Female9 (45%)
Male11 (55%)
Neoadjuvant therapy
Chemotherapy2 (10%)
Radiotherapy5 (25%)
None13 (65%)
Tumor location
Extremities11 (55%)
Retroperitoneum6 (30%)
Trunk3 (15%)
Tumor type
Myxofibrosarcoma4 (20%)
Well differentiated liposarcoma3 (15%)
Leiomyosarcoma3 (15%)
Dedifferentiated liposarcoma2 (10%)
Myxoid liposarcoma2 (10%)
Undifferentiated pleiomorphic sarcoma2 (10%)
Pleiomorphic liposarcoma1 (5%)
Undifferentiated spindel cell sarcoma1 (5%)
Chondrosarcoma1 (5%)
Malignant peripheral nerve sheeth tumor1 (5%)
Maximum tumor size (cm)14.5±10.0
Resection margin (mm)0.73±1.48
Resection margin status
R013 (65%)
R0 (close)4 (20%)
R13 (15%)

DRS measurements were performed on 150 locations in total (seven to eight measurements per specimen). Locations for which the spectra showed poor tissue contact (n=19) and the US images showed inconclusive labels (n=17) were removed from the dataset, to ensure that only high-quality spectra representing the tissue of interest were used for the analysis. Finally, 114 locations remained for further analysis: 55 locations on healthy tissue and 59 locations on sarcoma tissue. A breakdown of the spectral labels by tissue type and number of patients is shown in Fig. 2. The average distance to the tumor over all measured tumor locations was equal to 1.08±0.85  mm.

Fig. 2

(a) Mean DRS spectra for different tissue types; fat, muscle, skin, tumor (sarcoma excluding WDLS), and WDLS. The shaded areas show the corresponding standard deviations; (b) t-SNE plot visualizing clusters in the spectral dataset after dimensionality reduction. The color of each point refers to the actual tissue type corresponding to the measured location (this information was not used by the t-SNE algorithm). WDLS, well-differentiated liposarcoma.

JBO_29_2_027001_f002.png

3.2.

Tissue Type Classification

The average DRS spectra of the different healthy and tumorous tissue types are shown in Fig. 2(a). The spectra of sarcoma tissue differ from the spectra of healthy tissue types in multiple wavelength regions. In the wavelength ranges of 670 to 675 nm and 1085 to 1135 nm, the standard deviation of sarcoma tissue does not overlap with any other tissue type. The spectra of fat tissue and WDLS show the most similarities, with both clearly showing the reflectance dip around 1200 nm, which is characteristic of adipose tissues.

To identify any patterns or clusters with similar characteristics in our dataset, t-distributed stochastic neighbor embedding (t-SNE) was used. This nonlinear dimensionality reduction method is able to visualize the high-dimensional dataset (1200 features per DRS spectrum for 114 measured samples) in a two-dimensional space. Figure 2(b) shows the resulting data points for all measurement locations. Three distinctive clusters can be seen: one corresponding to the spectra from tumor locations, one corresponding to the spectra from muscle and skin locations, and one corresponding to the spectra from fat and WDLS locations. WDLS, however, does seem to have its own cluster within the other fat locations. The fact that t-SNE is able to separate the data into natural clusters indicates that automatic tissue classification based on DRS spectra may be feasible (Table 2).

Table 2

Summary of measurement locations.

CharacteristicNumber of spectraNumber of patients
Fat3515
Muscle136
Skin73
Sarcoma (excluding WDLS)4715
WDLS123
The number of spectra and patients in this table is after excluding inconclusive US images. WDLS, well-differentiated liposarcoma.

3.2.1.

Performance including well-differentiated liposarcomas

The labeled DRS spectra were used to train a tissue classification model, both using a binary approach (sarcoma versus healthy) and a multiclass approach (different tissue types in five separate classes). It should be noted that in the binary approach, the healthy class consists of fat, muscle, and skin. The resulting classification performances are summarized in Table 3. Using the binary method, it was possible to discriminate sarcoma tissue from healthy tissue types with an accuracy of 0.85, sensitivity of 0.85, and specificity of 0.85. When training a multiclass kNN model, the classification performance increased to a tumor classification accuracy of 0.90, sensitivity of 0.88, and specificity of 0.93. Figure 3 shows the confusion matrices with all individual predictions, for both the binary and the multiclass classification approach.

Table 3

Tissue classification performance including WDLSs.

MethodAccuracy [95% CI]Sensitivity [95% CI]Specificity [95% CI]AUCMCC
Binary0.85 [0.77; 0.91]0.85 [0.73; 0.93]0.85 [0.73; 0.94]0.910.70
Multiclass0.90 [0.84; 0.95]0.88 [0.77; 0.95]0.93 [0.82; 0.98]0.960.81

Fig. 3

Confusion matrices showing the predicted tissue types versus the true tissue types with WDLS included in a separate class for (a) the multiclass approach and included in the tumor class for (b) the binary approach. The correctly classified measurement locations are found on the diagonal. The percentages in the last row and column specify the positive predicted values and sensitivities for the individual classes, respectively. The green color shade represents the number of locations, with darker green representing larger numbers.

JBO_29_2_027001_f003.png

The ROC curves corresponding to the trained models are shown in Fig. 4. The AUC for tumor tissue is equal to 0.91 in case of the binary model and 0.96 for the multiclass model. In addition, healthy tissue types can be distinguished better using a multiclass approach. AUC values of 0.97, 0.95, and 1 were achieved when using separate classes of fat, muscle, and skin, compared to an AUC of 0.91 when using one general healthy class. The circular markers in the figure indicate the selected operating points of the trained model for each tissue type, for which the accuracies are reported in Table 3 and Fig. 3.

Fig. 4

ROC curves of the tissue classification models with WDLS included in a separate class for (a) the multiclass approach and included in the tumor class for (b) the binary approach. (a) The multiclass plot shows one-versus-all curves for each class. The circular markers indicate the selected operating points of the trained model for each tissue type, for which the accuracies are reported in Table 3 and Fig. 3.

JBO_29_2_027001_f004.png

3.2.2.

Performance excluding well-differentiated liposarcomas

As described in Sec. 2.5, we also trained and evaluated a tissue classification model when excluding WDLS locations from the dataset (n=12). In this case, sarcoma tissue could be discriminated from healthy tissue types with an accuracy of 0.92, sensitivity of 0.89, and specificity of 0.95 when using a binary classification approach (see Table 4). When training a multiclass model, the performance increased to an accuracy of 0.93, sensitivity of 0.94, and specificity of 0.93. Figure 5 shows the corresponding confusion matrices with all individual predictions, for both classification approaches.

Table 4

Tissue classification performance excluding WDLSs.

MethodAccuracy [95% CI]Sensitivity [95% CI]Specificity [95% CI]AUCMCC
Binary0.92 [0.85; 0.97]0.89 [0.77; 0.96]0.95 [0.85; 0.99]0.940.84
Multiclass0.93 [0.86; 0.97]0.94 [0.83; 0.99]0.93 [0.82; 0.98]0.970.86

Fig. 5

Confusion matrices showing the predicted tissue types versus the true tissue types when excluding WDLSs, for both (a) a multiclass approach and (b) binary approach. The correctly classified measurement locations are found on the diagonal. The percentages in the last row and column specify the positive predicted values and sensitivities for the individual classes, respectively. The green color shade represents the number of locations, with darker green representing larger numbers.

JBO_29_2_027001_f005.png

Figure 6 shows the ROC curves for every tissue class. The AUC for tumor tissue is equal to 0.97 in the case of a multiclass approach, compared to an AUC of 0.94 in the case of a binary approach. The AUC values for healthy tissue types are higher using the multiclass approach as well. The circular markers in the figure indicate the selected operating points of the trained model for each tissue type, for which the accuracies are reported in Table 4 and Fig. 5.

Fig. 6

ROC curves of the tissue classification models excluding WDLSs, for both (a) a multiclass approach and (b) binary approach. (a) The multiclass plot shows one-versus-all curves for each class. The circular markers in the figure indicate the selected operating points of the trained model for each tissue type, for which the accuracies are reported in Table 4 and Fig. 5.

JBO_29_2_027001_f006.png

3.3.

Classification Performance Over Different Tumor Locations and Subtypes

To investigate the influence of different sarcoma locations (extremities, retroperitoneum, and trunk) and histological subtypes on the model performance, the tissue classification accuracy was calculated for each subgroup separately using the best performing classifier (multiclass) including WDLS. Figure 7 shows the number of correctly and incorrectly classified measurement locations for each subgroup, with corresponding accuracies. Good accuracy values were obtained for almost all sarcoma locations and subtypes. When looking at the locations, the highest accuracy was achieved for retroperitoneal tumors (96%) and the lowest with sarcomas located in the trunk (79%). Looking at different histological subtypes, we were able to correctly detect (accuracy: 100%) all chondrosarcoma, leiomyosarcoma, malignant peripheral nerve sheath tumors, myxoid liposarcoma, and pleiomorphic liposarcoma. Classifying dedifferentiated liposarcoma and undifferentiated spindle cell sarcoma were the most challenging ones.

Fig. 7

Number of correctly and incorrectly classified measurement locations for patients with (a) a specific tumor location and (b) histological subtype, with corresponding classification accuracies. These results are based on the multiclass classifier, including WDLSs. MPNST, malignant peripheral nerve sheath tumor.

JBO_29_2_027001_f007.png

3.4.

Classification Performance by Neoadjuvant Therapy Status

To evaluate the impact of neoadjuvant therapy on the model performance, we investigated whether there are significant differences in tissue classification accuracy between patients who received neoadjuvant therapy and those who did not, using the best performing classifier (multiclass) and considering both the inclusion and exclusion of WDLS. Figure 8 shows the classification accuracy for both patient groups. The results show comparable accuracies between the two groups. When including WDLS in the analysis, the classification accuracies of 91% and 89% were achieved for patients who did and did not receive neoadjuvant therapy, respectively. Statistical analyses revealed no significant difference between the two groups (p=0.8204). When excluding WDLS, classification accuracies of 91% and 94% were achieved for patients who did and did not receive neoadjuvant therapy, respectively. Once again, statistical analyses revealed no significant difference between the two groups (p=0.4974). These findings further emphasize the robustness of the classification method across patient groups with different treatment histories.

Fig. 8

Classification accuracy for patients who received neoadjuvant therapy and patients who did not receive any neoadjuvant therapy. These results are based on the multiclass classifier, including (a) WDLSs and (b) excluding WDLSs.

JBO_29_2_027001_f008.png

4.

Discussion

In this study, we investigated the potential of DRS for surgical margin assessment during STS surgery. To this end, we have collected a DRS dataset by performing measurements on both malignant and healthy locations on sarcoma specimens. Subsequently, we have developed a machine-learning model for tissue discrimination. To the best of our knowledge, this is the first study in which the DRS technique has been used for STSs. Our results demonstrated that DRS can distinguish between healthy and sarcoma tissues with good sensitivity and specificity, including different subtypes of sarcomas.

When looking at the mean DRS spectrum for different healthy and sarcoma tissues [Fig. 2(a)], it can be observed that the spectrum of sarcoma locations (excluding WDLS) differs from the spectra of healthy tissue locations in multiple wavelength regions. On the other hand, WDLS shares stronger spectral similarities to healthy fat tissue than to other sarcoma subtypes (including dedifferentiated liposarcoma). This is comparable to previous research in the literature into the correlation between lipid content and histological grade in liposarcomas.37 Nevertheless, t-SNE analysis seemed to reveal a subcluster of WDLS within the cluster of fat locations [Fig. 2(b)], indicating that it could still be possible to distinguish WDLS from fat tissue.

After training a tissue classification model on our acquired DRS dataset, tumor tissue could be classified with an accuracy of 0.93, sensitivity of 0.94, specificity of 0.93, and MCC of 0.86 when excluding WDLS. Including the WDLS class during classification training affects the classification performance, with a lower sensitivity of 0.88. However, lower DRS performance in the differentiation between WDLS and fat tissue is less critical from a clinical point of view since these sarcomas have a low recurrence rate and an excellent prognosis when located in the extremities.38 However, when present in the retroperitoneum local recurrence is seen more often and can eventually lead to mortality in these patients. Therefore, it might still be of added value to provide guidance to the surgeon for these cases as well. When taking WDLS into account as a separate class, the model was able to discriminate between fat and WDLS with reasonable accuracy. The confusion matrix in Fig. 3 shows that 75% of the WDLS locations were classified correctly, whereas 25% of the locations were misclassified as fat. This accuracy might even increase with a larger sample size and a better balance between these classes.

For all analyses, the results of a binary classification approach that distinguishes tumor tissue from all healthy tissues were compared to the results of a multiclass approach that differentiates between all different tissue types. Tables 3 and 4 showed that the highest tumor classification performance was achieved when a multiclass approach was used, both including and excluding WDLS. Combining fat, muscle, and skin tissue together in one healthy class resulted in slightly more false negative tumor predictions than when these healthy classes were treated separately, as can be seen in the confusion matrices in Figs. 3 and 5. However, it is worth noting that the binary classification results benefit from a more balanced class distribution compared to the multiclass approach, enhancing the reliability and robustness of these findings. Another observation in these confusion matrices is that muscle, skin, and WDLS are the tissue types that are most difficult to discriminate, with slightly lower sensitivity and specificity values. However, these groups are also the tissue types with the smallest number of samples per class, compared to fat and tumor classes. For these groups, it may be beneficial to acquire more data samples and examine if the classification performance for these tissue types can be improved. When specifically focusing on all misclassified locations, it was observed that it mainly concerned the same locations across all classification approaches. In addition, these locations also appeared to be in the wrong cluster in the t-SNE analysis [Fig. 2(b)]. There might be a chance that these locations are actually mislabeled in the original dataset, for example, due to a small displacement between the locations of the DRS measurement and the corresponding US image interpreted by the expert.

The developed tumor classification model showed a robust and consistent performance over all different histological subtypes and tumor locations that were included in this study, see Fig. 7. For retroperitoneal sarcomas, in which positive resection margin rates of around 45% are reported,6 an excellent classification accuracy of 96% was achieved. For myxofibrosarcomas, which are known for their high local recurrence rate of 20% to 30%,1720 a promising accuracy of 86% was achieved as well. Detecting dedifferentiated liposarcomas, which originated from fatty cells but differentiated into another type, are challenging. Undifferentiated spindle cell sarcoma showed the lowest performance, but with only four examples of this class, these results should be taken with caution. For some histological subtypes and locations, classification performance may benefit from an increased number of samples. Furthermore, the developed tumor classification model showed no significant differences in performance between patients who received neoadjuvant therapy and those who did not, confirming previous conclusions in the literature39 and further emphasizing the robustness of the classification method across different patient groups.

A few studies can be found in the literature in which alternative optical techniques have been explored for tissue discrimination in STS specimens; see Table 5. Nguyen et al. used near-infrared autofluorescence for tissue classification, achieving an accuracy of 56% for sarcoma tissue.22 When excluding WDLS, the accuracy increased to 88%. In a subsequent study, Nguyen et al. explored Raman spectroscopy for sarcoma differentiation, which resulted initially in a sensitivity of 60% and specificity of 92%.23 When excluding WDLS, these metrics increased to 89% and 96%, respectively. Li et al. also classified STS specimens with Raman spectroscopy using both a quantitative method and a machine learning method, which resulted in a classification accuracy of 85%.24 Larson et al. used mid-infrared spectroscopy to show differences in absorption between sarcoma tissue and healthy tissue for laser ablation of sarcoma, but no classification was performed.25 Finally, Gong et al. implemented intraoperative margin assessment based on ICG in patients with STSs.26 The ICG margins matched the permanent pathology margins in 56% of the patients. Larger sample sizes are needed to better quantify optimal dosage, timing, and the effect of histological subtypes. In conclusion, with DRS we achieved a better performance than previously achieved with other optical techniques, especially when the WDLS subtype is included. Previous studies have repeatedly excluded WDLS during part of the analysis, resulting in increased sarcoma classification accuracies, similarly as showed in this study. However, in this study, we also examined the distinctiveness between WDLSs and healthy fat tissue specifically and showed promising results using DRS despite the small number of examples. Furthermore, from a data analysis point of view, skin has also been taken into account as an additional healthy class in this study, which has not been done before.

Table 5

Overview of previous studies in literature in which optical techniques were used for tissue discrimination in STSs.

StudyOptical techniqueWDLSAccuracySensitivitySpecificity
Nguyen et al.22NIR autofluorescenceIncl.56%
Excl.88%
Nguyen et al.23Raman spectroscopyIncl.60%92%
Excl.89%96%
Li et al.24Raman spectroscopy85%
Larson et al.25MIR spectroscopyN/A (only qualitative absorption properties)
Gong et al.26ICG56%22%89%
Current studyDRSIncl.90%88%93%
Excl.93%94%93%

Some advantages and limitations of DRS compared to these alternative optical techniques are worth discussing. Raman spectroscopy presents the advantage of being a label-free technique without the need for contrast agents and provides detailed cellular-level information. However, it does require relatively complex instrumentation and data analysis and is limited by its shallow measurement volume and penetration depth, restricting its application to surface analysis. Fluorescence and ICG offer real-time imaging of larger surface areas, but require the administration of contrast agents and may experience limited penetration depths. On the other hand, DRS does not require contrast agents, is relatively easy to use, and provides a larger measurement volume compared to Raman, making it well-suited for examining a resection margin in real-time and identifying subsurface tumors. Moreover, the tissue discrimination performance of DRS for STS that has been achieved in this study surpasses the previously reported performances of alternative techniques in the literature. However, it is essential to acknowledge that DRS is a point-measurement technique, with limited resolution compared to Raman, and requires direct contact with the tissue. In addition, the visible part of DRS spectra can be affected by blood in in-vivo settings. For this reason, we have used a broad wavelength range extending up to 1600 nm, as the near-infrared region is not affected. Despite the aforementioned limitations, the practical advantages and superior tissue discrimination performance of DRS make it a promising solution for oncological surgical applications.

DRS has shown to be a promising technique for intraoperative guidance and resection margin assessment during sarcoma surgery. However, there are some points that can be addressed in future research before translation into clinical practice. First, as discussed before, the dataset used in this study had a relatively small number of samples, especially for some tissue types and histological subtypes. Future studies should aim to increase the dataset with a balanced number of samples per subtype and class, which may improve the classification performance for tissue types that were not well-represented in this study, such as muscle, skin, and WDLS. While our study employed a cross-validation approach due to the limited sample size per class, in which we took measures to reduce potential patient-specific biases through spatial separation of measurements within each patient, it is important to acknowledge the inherent limitation of this method. Future studies incorporating patient-wise cross-validation will be essential to enhance the robustness and applicability of our methodology in new clinical scenarios. Second, it would be interesting to investigate the discrimination of sarcoma from additional tissue types that surgeons would benefit from, such as blood vessels or other critical structures where sarcoma can grow close to. Third, future research should focus on the validation with histopathology results. Although the histopathology process is known to cause significant tissue deformation40 and the annotated ultrasound images were acquired under exactly the same circumstances as the DRS spectra, this is currently the gold standard for resection margin assessment. This will also enhance our understanding of the capabilities and limitations of DRS in critical margin or border areas, which often pose the greatest clinical challenge. The final step would be to perform an in-vivo study to evaluate the practical feasibility and validate the performance of DRS in the surgical setting. This technique could then even be integrated directly into surgical tools.

5.

Conclusion

Accurately distinguishing tumor tissue from normal tissue is crucial to achieve complete resections during STS surgery while preserving critical structures. Incomplete excisions can result in local recurrences and decreased survival rates, thereby requiring subsequent surgeries or additional therapies for the patient. Noninvasive DRS technology could be used in the operating room to evaluate tissue locations in real-time. Our results showed that tumor tissue can be distinguished from different types of healthy tissue with high sensitivity and specificity using DRS, even when including WDLSs. This may improve the accuracy of tumor border detection and help surgeons achieve a better surgical outcome and improve patient prognosis.

Disclosures

The authors declare no conflicts of interest.

Code and Data Availability

Data underlying the results presented in this paper are not publicly available at this time but may be obtained from the authors upon reasonable request.

Acknowledgments

The authors thank all surgeons and nurses from the Department of Surgery at the Netherlands Cancer Institute for their assistance in collecting the specimens. We would also like to thank Fleur Mook and Jikkie Smilde for their assistance with the data acquisition. Research at the Netherlands Cancer Institute is supported by institutional grants of the Dutch Cancer Society and of the Dutch Ministry of Health, Welfare and Sport.

References

1. 

G. Gatta et al., “Rare cancers are not so rare: the rare cancer burden in Europe,” Eur. J. Cancer, 47 2493 –2511 https://doi.org/10.1016/j.ejca.2011.08.008 (2011). Google Scholar

2. 

G. D. Demetri et al., “Soft tissue sarcoma,” J. Natl. Comprehensive Cancer Network, 8 630 –674 https://doi.org/10.6004/jnccn.2010.0049 (2010). Google Scholar

3. 

S. Byerly et al., “The role of margins in extremity soft tissue sarcoma,” J. Surg. Oncol., 113 333 –338 https://doi.org/10.1002/jso.24112 JSONAU 0022-4790 (2016). Google Scholar

4. 

A. Sambri et al., “Margin assessment in soft tissue sarcomas: review of the literature,” Cancers, 13 (7), 1687 https://doi.org/10.3390/cancers13071687 (2021). Google Scholar

5. 

E. A. Huis in ’t Veld et al., “Adequate surgical margins for dermatofibrosarcoma protuberans - a multi-centre analysis,” Eur. J. Surg. Oncol., 47 (2), 436 –442 https://doi.org/10.1016/j.ejso.2020.06.022 (2021). Google Scholar

6. 

A. Stojadinovic et al., “Analysis of the prognostic significance of microscopic margins in 2,084 localized primary adult soft tissue sarcomas,” Ann. Surg., 235 (3), 424 –434 https://doi.org/10.1097/00000658-200203000-00015 ANSUA5 0003-4932 (2002). Google Scholar

7. 

M. T. Ballo et al., “Retroperitoneal soft tissue sarcoma: an analysis of radiation and surgical treatment,” Int. J. Radiat. Oncol. Biol. Phys., 67 158 –163 https://doi.org/10.1016/j.ijrobp.2006.08.025 IOBPD3 0360-3016 (2007). Google Scholar

8. 

C. S. Trovik et al., “Surgical margins, local recurrence and metastasis in soft tissue sarcomas: 559 surgically-treated patients from the Scandinavian Sarcoma Group Register,” Eur. J. Cancer, 36 (6), 710 –716 https://doi.org/10.1016/S0959-8049(99)00287-7 (2000). Google Scholar

9. 

T. Fujiwara et al., “The role of surgical margin quality in myxofibrosarcoma and undifferentiated pleomorphic sarcoma,” Eur. J. Surg. Oncol., 47 (7), 1756 –1762 https://doi.org/10.1016/j.ejso.2020.11.144 (2021). Google Scholar

10. 

M. M. Cates and J. M. Cates, “Surgical resection margin classifications for high-grade pleomorphic soft tissue sarcomas of the extremity or trunk: definitions of adequate resection margins and recommendations for sampling margins from primary resection specimens,” Mod. Pathol., 32 (10), 1421 –1433 https://doi.org/10.1038/s41379-019-0278-9 MODPEO 0893-3952 (2019). Google Scholar

11. 

P. W. O’Donnell et al., “The effect of the setting of a positive surgical margin in soft tissue sarcoma,” Cancer, 120 (18), 2866 –2875 https://doi.org/10.1002/cncr.28793 CANCAR 0008-543X (2014). Google Scholar

12. 

F. Lintz et al., “Critical study of resection margins in adult soft-tissue sarcoma surgery,” Orthopaed. Traumatol.: Surg. Res., 98 S9 –S18 https://doi.org/10.1016/j.otsr.2012.04.006 (2012). Google Scholar

13. 

P. W. Pisters et al., “Analysis of prognostic factors in 1,041 patients with localized soft tissue sarcomas of the extremities,” J. Clin. Oncol., 14 1679 –1689 https://doi.org/10.1200/JCO.1996.14.5.1679 JCONDN 0732-183X (1996). Google Scholar

14. 

G. K. Zagars et al., “Surgical margins and reresection in the management of patients with soft tissue sarcoma using conservative surgery and radiation therapy,” Cancer, 97 (10), 2544 –2553 https://doi.org/10.1002/cncr.11367 CANCAR 0008-543X (2003). Google Scholar

15. 

E. Stoeckle et al., “Prospective evaluation of quality of surgery in soft tissue sarcoma,” Eur. J. Surg. Oncol., 32 (10), 1242 –1248 https://doi.org/10.1016/j.ejso.2006.05.005 (2006). Google Scholar

16. 

G. K. Zagars et al., “Prognostic factors for patients with localized soft-tissue sarcoma treated with conservation surgery and radiation therapy: an analysis of 1225 patients,” Cancer, 97 2530 –2543 https://doi.org/10.1002/cncr.11365 CANCAR 0008-543X (2003). Google Scholar

17. 

R. Sanfilippo et al., “Myxofibrosarcoma: prognostic factors and survival in a series of patients treated at a single institution,” Ann. Surg. Oncol., 18 720 –725 https://doi.org/10.1245/s10434-010-1341-4 (2011). Google Scholar

18. 

B. Odei et al., “Predictors of local recurrence in patients with myxofibrosarcoma,” Am. J. Clin. Oncol.: Cancer Clin. Trials, 41 (9), 827 –831 https://doi.org/10.1097/COC.0000000000000382 (2018). Google Scholar

19. 

N. J. Look Hong et al., “Prognostic factors and outcomes of patients with myxofibrosarcoma,” Ann. Surg. Oncol., 20 80 –86 https://doi.org/10.1245/s10434-012-2572-3 (2013). Google Scholar

20. 

A. Gronchi et al., “Extremity soft tissue sarcoma in a series of patients treated at a single institution: local control directly impacts survival,” Ann. Surg., 251 (3), 506 –511 https://doi.org/10.1097/SLA.0b013e3181cf87fa ANSUA5 0003-4932 (2010). Google Scholar

21. 

S. Miwa et al., “Accuracy of histological grades from intraoperative frozen-section diagnoses of soft-tissue tumors,” Int. J. Clin. Oncol., 25 2158 –2165 https://doi.org/10.1007/s10147-020-01762-2 (2020). Google Scholar

22. 

J. Q. Nguyen et al., “Near-infrared autofluorescence spectroscopy of in vivo soft tissue sarcomas,” Opt. Lett., 40 (23), 5498 –5501 https://doi.org/10.1364/OL.40.005498 OPLEDP 0146-9592 (2015). Google Scholar

23. 

J. Q. Nguyen et al., “Intraoperative Raman spectroscopy of soft tissue sarcomas,” Lasers Surg. Med., 48 774 –781 https://doi.org/10.1002/lsm.22564 LSMEDI 0196-8092 (2016). Google Scholar

24. 

L. Li et al., “Classification of soft tissue sarcoma specimens with raman spectroscopy as smart sensing technology,” Cyborg Bionic Syst., 2021 9816913 https://doi.org/10.34133/2021/9816913 (2021). Google Scholar

25. 

E. Larson et al., “Mid-infrared absorption by soft tissue sarcoma and cell ablation utilizing a mid-infrared interband cascade laser,” J. Biomed. Opt., 26 043012 https://doi.org/10.1117/1.JBO.26.4.043012 JBOPFO 1083-3668 (2021). Google Scholar

26. 

M. F. Gong et al., “Intraoperative evaluation of soft tissue sarcoma surgical margins with indocyanine green fluorescence imaging,” Cancers, 15 582 https://doi.org/10.3390/cancers15030582 (2023). Google Scholar

27. 

Z. Volynskaya et al., “Diagnosing breast cancer using diffuse reflectance spectroscopy and intrinsic fluorescence spectroscopy,” J. Biomed. Opt., 13 024012 https://doi.org/10.1117/1.2909672 JBOPFO 1083-3668 (2008). Google Scholar

28. 

M. D. Keller et al., “Autofluorescence and diffuse reflectance spectroscopy and spectral imaging for breast surgical margin analysis,” Lasers Surg. Med., 42 15 –23 https://doi.org/10.1002/lsm.20865 LSMEDI 0196-8092 (2010). Google Scholar

29. 

L. L. de Boer et al., “Fat/water ratios measured with diffuse reflectance spectroscopy to detect breast tumor boundaries,” Breast Cancer Res. Treat., 152 (3), 509 –518 https://doi.org/10.1007/s10549-015-3487-z BCTRD6 (2015). Google Scholar

30. 

E. J. M. Baltussen et al., “Tissue diagnosis during colorectal cancer surgery using optical sensing: an in vivo study,” J. Transl. Med., 17 333 https://doi.org/10.1186/s12967-019-2083-0 (2019). Google Scholar

31. 

E. J. M. Baltussen et al., “Optimizing algorithm development for tissue classification in colorectal cancer based on diffuse reflectance spectra,” Biomed. Opt. Express, 10 (12), 6096 https://doi.org/10.1364/BOE.10.006096 BOEICL 2156-7085 (2019). Google Scholar

32. 

F. Geldof et al., “Diffuse reflection spectroscopy at the fingertip: design and performance of a compact side-firing probe for tissue discrimination during colorectal cancer surgery,” Biomed. Opt. Express, 14 128 –147 https://doi.org/10.1364/BOE.476242 BOEICL 2156-7085 (2023). Google Scholar

33. 

S. Brouwer De Koning et al., “Toward complete oral cavity cancer resection using a handheld diffuse reflectance spectroscopy probe,” J. Biomed. Opt., 23 121611 https://doi.org/10.1117/1.JBO.23.12.121611 JBOPFO 1083-3668 (2018). Google Scholar

34. 

J. W. Spliethoff et al., “Improved identification of peripheral lung tumors by using diffuse reflectance and fluorescence spectroscopy,” Lung Cancer, 80 165 –171 https://doi.org/10.1016/j.lungcan.2013.01.016 (2013). Google Scholar

35. 

F. Geldof et al., “Layer thickness prediction and tissue classification in two-layered tissue structures using diffuse reflectance spectroscopy,” Sci. Rep., 12 1698 https://doi.org/10.1038/s41598-022-05751-5 (2022). Google Scholar

36. 

M. Witteveen et al., “Comparison of preprocessing techniques to reduce nontissue-related variations in hyperspectral reflectance imaging,” J. Biomed. Opt., 27 106003 https://doi.org/10.1117/1.JBO.27.10.106003 JBOPFO 1083-3668 (2022). Google Scholar

37. 

S. Singer et al., “Correlation of lipid content and composition with liposarcoma histology and grade,” Ann. Surg. Oncol., 4 (7), 557 –563 https://doi.org/10.1007/BF02305536 (1997). Google Scholar

38. 

K. M. Dalal et al., “Subtype specific prognostic nomogram for patients with primary liposarcoma of the retroperitoneum, extremity, or trunk,” Ann. Surg., 244 381 –391 https://doi.org/10.1097/01.sla.0000234795.98607.00 ANSUA5 0003-4932 (2006). Google Scholar

39. 

L. L. de Boer et al., “Influence of neoadjuvant chemotherapy on diffuse reflectance spectra of tissue in breast surgery specimens,” J. Biomed. Opt., 24 115004 https://doi.org/10.1117/1.JBO.24.11.115004 JBOPFO 1083-3668 (2019). Google Scholar

40. 

N. S. Goldstein, A. Soman and J. Sacksner, “Disparate surgical margin lengths of colorectal resection specimens between in vivo and in vitro measurements: the effects of surgical resection and formalin fixation on organ shrinkage,” Am. J. Clin. Pathol., 111 (3), 349 –351 https://doi.org/10.1093/ajcp/111.3.349 AJCPAI 0002-9173 (1999). Google Scholar

Biographies of the authors are not available.

CC BY: © The Authors. Published by SPIE under a Creative Commons Attribution 4.0 International License. Distribution or reproduction of this work in whole or in part requires full attribution of the original publication, including its DOI.
Freija Geldof, Yvonne M. Schrage, Winan J. van Houdt, Henricus J. C. M. Sterenborg, Behdad Dashtbozorg, and Theo J. M. Ruers "Toward the use of diffuse reflection spectroscopy for intra-operative tissue discrimination during sarcoma surgery," Journal of Biomedical Optics 29(2), 027001 (15 February 2024). https://doi.org/10.1117/1.JBO.29.2.027001
Received: 6 July 2023; Accepted: 24 January 2024; Published: 15 February 2024
Advertisement
Advertisement
KEYWORDS
Tissues

Tumors

Reflectance spectroscopy

Resection

Surgery

Diffuse reflectance spectroscopy

Binary data

Back to Top