Original paper

Deep learning-based two-step organs at risk auto-segmentation model for brachytherapy planning in parotid gland carcinoma

Zhen-Yu Li¹, Jing-hua Yue², Wei Wang¹, Wen-Jie Wu¹, Fu-gen Zhou²,³, Jie Zhang¹, Bo Liu²,³

1. Department of Oral and Maxillofacial Surgery, Peking University School of Stomatology, Haidian District, Beijing, P.R. China
2. Image Processing Center, Beihang University, Beijing, P.R. China
3. Beijing Advanced Innovation Center for Biomedical Engineering, Beihang University, Beijing, P.R. China
J Contemp Brachytherapy 2022; 14, 6: 527–535
Online publish date: 2022/12/30

Purpose

Accurate delineation of both target organs and organs at risk (OARs) is crucial for the planning of modern radiotherapy [1]. As a form of precision treatment, low-dose-rate (LDR) brachytherapy delivers high radiation doses to the target volume while sparing surrounding normal tissues. Interstitial brachytherapy after surgical resection of parotid cancer can deliver highly conformal radiation doses to target volumes, and provides a high local control rate with few side effects [2]. OARs segmentation is currently performed manually, and involves screening hundreds of computed tomography (CT) imaging slices. As such, this process is not only tedious and time-consuming, but also prone to subjectivity and inter-operator variability.

Automated segmentation has been developed to address these challenges. Various methods have been derived from earlier statistical models or atlas-based models [3]; however, the presence of post-surgical adhesions, which can obscure soft tissue boundaries on imaging, remains a challenge in adjuvant radiotherapy. Currently available algorithm-derived volumetric segmentation techniques tend to require substantial manual editing, and fail to provide meaningful improvement in clinical workflow [4]. Deep learning technologies, particularly convolutional neural networks (CNNs), have gained substantial traction and success in medical image classification and segmentation in recent years [5]. Multiple CNN-based models have been proposed in the past decade, such as fully convolutional networks (FCNs) [6], the DeepLab system [7], and U-net [8].

With the adoption of a symmetrical expansion path and skip connections, U-net fuses low- and high-level feature maps, and effectively overcomes imaging noise and blurred boundaries. U-net and its variants have thus been widely used for medical image segmentation, especially in the field of radiotherapy. Recently, the nnU-Net method, which builds on the U-net architecture, has been considered the best baseline deep learning-based model for medical segmentation, owing to its automation of all stages, from pre- to post-processing of a given image dataset, removing the need for manual tuning [9]. Given the scarcity of studies on automated segmentation models in parotid cancer, we therefore proposed a two-step 3D nnU-Net-based automated OARs segmentation model for brachytherapy in parotid gland cancer, and evaluated its accuracy against gold standard expert manual segmentation.

Material and methods

Clinical imaging data

A total of 200 patients with parotid gland carcinoma, who underwent surgical resection followed by adjuvant iodine-125 (125I) seed implantation brachytherapy [10] at the Peking University Stomatology Hospital between 2017 and 2021, were included in the study. The implanted seeds (model 6711; Beijing Atom and High Technique Industries Inc., Beijing, China) had a surface radioactivity of 18.5 MBq and an air-kerma strength of 0.635 μGy·m²·h⁻¹ per seed, with an activity of 22.2 to 29.6 MBq (range, 0.6-0.8 mCi), a half-life of 59.6 days, an energy level of 27.4-31.4 keV, and a tissue penetration capacity of 1.7 cm. For each patient, treatment planning was performed using a brachytherapy planning system (BTPS; Beijing Astro Technology Ltd. Co., Beijing, China), based on CT images taken within 4 weeks post-surgery.

Images were acquired with a 512 × 512 matrix size and 2 mm slice thickness. All images were split along the sagittal midline, and only data on the side of the cancer were retained for the study. Six OARs were defined: the auricula, condyle process, skin, mastoid process, external auditory canal, and mandibular ramus. It should be noted that some OARs in parotid brachytherapy, such as the skin and mandible, span a large range. However, because collateral damage to normal tissues in brachytherapy is likely to be confined to the vicinity of the clinical target volume (CTV), delineating the whole organ was not necessary. Regions of interest (ROIs) were therefore limited to 17 mm from the residual parotid tissues, in accordance with the inherent irradiation range of 125I seeds. The collected datasets were randomly divided into a training set of 70% of cases (n = 140) to establish and train the proposed model, a validation set of 15% of cases (n = 30) for held-out validation, and a test set of 15% of cases (n = 30) to evaluate the performance of the final prediction models.

Two-stage diagnostic labeling was performed in consensus by two experienced radiation oncologists (reader 1 and reader 2, with 5 and 10 years' experience in parotid cancer treatment planning, respectively). Prior to this, training was performed on five cases unrelated to the study to allow for discussion of segmentation procedures and development of consensus criteria.

In the first stage, the radiation oncologist with 5 years of experience manually labeled the six organs using an open-source software package (ITK-SNAP version 3.4.0; http://www.itksnap.org) [11]. All 200 patients' OARs were re-delineated for the purpose of this research. In the second stage, contour sets were reviewed by the senior radiation oncologist and modified as necessary to maintain consistency. The manual delineations were used as the gold standard for training and testing.

The present study was approved by the Institutional Review Board. All image data were obtained retrospectively, and were anonymized and de-identified prior to analysis.

Deep learning two-step segmentation model

The architecture of the 3D nnU-Net model is shown in Figure 1. As an out-of-the-box tool, its pre-processing strategies and training hyperparameters were adapted automatically. The model comprised an encoder and a decoder path, with skip connections in between. Both paths consisted of repeated blocks of two convolutional layers, each followed by an instance normalization layer and a leaky rectified linear unit. In the encoder, down-sampling was performed by strided convolution, while in the decoder path, up-sampling was performed via transposed convolution. In the decoder, the feature maps after each convolutional block were mapped to output channels by convolution with a 1 × 1 × 1 kernel, and a SoftMax function was applied to the output for deep supervision. The number of initial feature maps was set to 32, and was doubled at each down-sampling step to a maximum of 320.
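To make the block structure concrete, below is a minimal PyTorch sketch of one such convolutional block and a deep-supervision head, consistent with the description above; the class names, feature-width list, and class count are illustrative assumptions, not taken from the authors' code.

```python
import torch
import torch.nn as nn

class ConvBlock(nn.Module):
    """Two 3D convolutions, each followed by instance normalization and
    a leaky ReLU, as described for the encoder/decoder paths above."""
    def __init__(self, in_ch: int, out_ch: int, stride: int = 1):
        super().__init__()
        self.block = nn.Sequential(
            # stride = 2 in the first convolution performs the encoder down-sampling
            nn.Conv3d(in_ch, out_ch, kernel_size=3, stride=stride, padding=1),
            nn.InstanceNorm3d(out_ch, affine=True),
            nn.LeakyReLU(0.01, inplace=True),
            nn.Conv3d(out_ch, out_ch, kernel_size=3, padding=1),
            nn.InstanceNorm3d(out_ch, affine=True),
            nn.LeakyReLU(0.01, inplace=True),
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return self.block(x)

# Feature maps start at 32 and double at each down-sampling, capped at 320
widths = [32, 64, 128, 256, 320, 320]

# Deep-supervision head: 1 x 1 x 1 convolution to the class channels,
# followed by SoftMax along the channel dimension
num_classes = 7  # assumed: six OARs + background
seg_head = nn.Sequential(nn.Conv3d(widths[0], num_classes, kernel_size=1),
                         nn.Softmax(dim=1))
```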

Fig. 1

Architecture of 3D nnU-Net-based model


A two-step approach was designed to allow coarse-to-fine segmentation, as shown in Figure 2. The first model was trained to segment the parotid gland only, while the second was trained to segment the OARs within the pre-defined ROIs. As mentioned, the input of the second model was created by expanding the parotid gland segmented in the first step by 17 mm. The test dataset was used as the input of each model; a sketch of the cropping step is shown below.
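A minimal sketch of this ROI-cropping step, assuming NumPy volumes and a known voxel spacing in millimeters; the function name and signature are illustrative.

```python
import numpy as np

def crop_roi(image: np.ndarray, parotid_mask: np.ndarray,
             spacing_mm: tuple, margin_mm: float = 17.0) -> np.ndarray:
    """Crop the CT volume to a bounding box extending margin_mm beyond the
    first-stage parotid segmentation; the crop is the second-stage input."""
    coords = np.argwhere(parotid_mask > 0)
    lo, hi = coords.min(axis=0), coords.max(axis=0) + 1
    # convert the physical margin to whole voxels along each axis
    margin_vox = np.ceil(margin_mm / np.asarray(spacing_mm)).astype(int)
    lo = np.maximum(lo - margin_vox, 0)
    hi = np.minimum(hi + margin_vox, image.shape)
    return image[lo[0]:hi[0], lo[1]:hi[1], lo[2]:hi[2]]
```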

Fig. 2

Two-step model based on nnU-Net framework


A total of 170 patients' CT scans were used for training and validation. Based on the self-configuring strategy of nnU-Net, all images were automatically pre-processed by re-sampling, normalization, data augmentation (mirroring, rotation, and scaling), and cropping prior to network input. The input volumes of the two models were resized to 56 × 256 × 128 and 32 × 256 × 160 voxels, respectively.
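For illustration, the sketch below shows what this style of pre-processing and augmentation could look like in Python with SciPy; the clipping percentiles, rotation angles, and scaling range are assumptions in the spirit of nnU-Net's defaults, not values reported in the paper.

```python
import numpy as np
from scipy.ndimage import rotate, zoom

def normalize(volume: np.ndarray) -> np.ndarray:
    """Clip intensity outliers and z-score normalize the CT volume."""
    lo, hi = np.percentile(volume, [0.5, 99.5])
    volume = np.clip(volume, lo, hi)
    return (volume - volume.mean()) / (volume.std() + 1e-8)

def augment(volume: np.ndarray, rng: np.random.Generator) -> np.ndarray:
    """Random mirroring, rotation, and scaling, matching the augmentation
    types listed above."""
    if rng.random() < 0.5:
        volume = volume[:, :, ::-1].copy()                  # in-plane mirroring
    angle = rng.uniform(-15.0, 15.0)                        # degrees, assumed range
    volume = rotate(volume, angle, axes=(1, 2), reshape=False, order=1)
    scale = rng.uniform(0.85, 1.15)                         # assumed range
    # isotropic re-scaling; crop/pad back to the network input size omitted here
    return zoom(volume, scale, order=1)
```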

Loss and training details

The model was trained with deep supervision [12]. For each output, a corresponding down-sampled ground-truth segmentation mask was used for loss computation. Loss (L) was defined as follows (Equation 1):

$$L = \sum_{d=1}^{5} \frac{1}{2^{d-1}} L_d, \qquad L_d = \alpha_d L_{dice} + \beta_d L_{ce} \qquad (1)$$

where $\alpha_d$ and $\beta_d$ are weight coefficients, both set to 1, and $L_d$ represents the loss of the $d$-th deep supervision level, $d \in \{1, 2, 3, 4, 5\}$.

Loss function (Ld) was defined as the sum of dice loss (Ldice) [13] and cross entropy loss (Lce) [14] as follows (Equation 2):

$$L_{dice} = -\frac{2}{N} \sum_{n}^{N} \frac{\sum_{i}^{I} y_i^n p_i^n}{\sum_{i}^{I} y_i^n + \sum_{i}^{I} p_i^n}, \qquad L_{ce} = -\frac{1}{I} \sum_{i}^{I} \left[ p_i \ln(y_i) + (1 - p_i) \ln(1 - y_i) \right] \qquad (2)$$

where $y_i^n$ and $p_i^n$ represent the ground truth label and the predicted probability of the $i$-th pixel for the $n$-th class, respectively; $I$ and $N$ denote the numbers of pixels and classes, respectively.
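A sketch of how Equations 1 and 2 could be implemented in PyTorch, with the ground truth down-sampled to each deep-supervision level as described; the function names are illustrative, and the dice term is written in the negative-dice form of the reconstructed equation.

```python
import torch
import torch.nn.functional as F

def dice_loss(probs, onehot, eps=1e-5):
    """Negative soft dice averaged over the N classes (dice term of Equation 2)."""
    dims = (0, 2, 3, 4)                              # batch + spatial dimensions
    inter = (probs * onehot).sum(dims)
    denom = probs.sum(dims) + onehot.sum(dims)
    return -(2.0 * inter / (denom + eps)).mean()

def deep_supervision_loss(logits_per_level, target, alpha=1.0, beta=1.0):
    """Equation 1: L = sum_d 2^-(d-1) * (alpha * L_dice + beta * L_ce)
    over the five decoder levels, finest (d = 1) first."""
    total = 0.0
    for d, logits in enumerate(logits_per_level, start=1):
        # down-sample the ground truth to this level's resolution
        tgt = F.interpolate(target.unsqueeze(1).float(),
                            size=logits.shape[2:], mode="nearest").squeeze(1).long()
        probs = torch.softmax(logits, dim=1)
        onehot = F.one_hot(tgt, num_classes=logits.shape[1]).permute(0, 4, 1, 2, 3).float()
        ld = alpha * dice_loss(probs, onehot) + beta * F.cross_entropy(logits, tgt)
        total = total + ld / 2 ** (d - 1)
    return total
```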

Training was performed using a stochastic gradient descent (SGD) optimizer with Nesterov momentum. The initial learning rate was set to 1e-3, and the batch size was set to 2, with each epoch defined as 250 training iterations. The maximum number of epochs was set to 1,000. The proposed model was implemented on a single GeForce RTX 3090 (NVIDIA) graphics processing unit with 24 GB of memory. Code was developed in Python version 3.7.9, and the model was implemented with PyTorch version 1.8.0.
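Putting the stated settings together, a minimal training-loop sketch follows; the momentum value (0.99) and the polynomial learning-rate decay are nnU-Net defaults assumed here, as the paper does not state them.

```python
import torch

def train(model, loader, max_epochs=1000, iters_per_epoch=250, base_lr=1e-3):
    """SGD with Nesterov momentum, batch size 2, 250 iterations per epoch."""
    opt = torch.optim.SGD(model.parameters(), lr=base_lr,
                          momentum=0.99, nesterov=True)   # momentum assumed
    for epoch in range(max_epochs):
        for g in opt.param_groups:                        # poly decay (assumed)
            g["lr"] = base_lr * (1 - epoch / max_epochs) ** 0.9
        data_iter = iter(loader)
        for _ in range(iters_per_epoch):
            images, targets = next(data_iter)             # batches of 2 volumes
            loss = deep_supervision_loss(model(images), targets)
            opt.zero_grad()
            loss.backward()
            opt.step()
```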

Quantitative evaluation metrics

The accuracy of the model was quantitatively evaluated against gold standard manual contours in terms of the dice similarity coefficient (DSC), Jaccard index, 95th-percentile Hausdorff distance (95HD), precision, and recall.

DSC describes the degree of volumetric overlap between the ground truth (A) and the deep learning-based segmentation (B) (Equation 3). Values of 0 and 1 correspond to no overlap and complete overlap, respectively [15].

$$DSC(A, B) = \frac{2\left|A \cap B\right|}{\left|A\right| + \left|B\right|} \qquad (3)$$

The Jaccard index is a performance metric for measuring similarity between sets. It is defined as the cardinality of the intersection divided by the cardinality of the union of sets A and B (Equation 4) [16].

$$J(A, B) = \frac{\left|A \cap B\right|}{\left|A \cup B\right|} \qquad (4)$$

The Hausdorff distance is a distance-based metric that measures dissimilarity between two sets [17]. It is defined as the largest of all distances from a point in one set to the closest point in the other. Lower HD values imply higher segmentation accuracy. 95HD was applied in our study because of the high susceptibility of HD to noise and local outliers.

Precision and recall were computed from true positives (TP), false negatives (FN), and false positives (FP) [18].

Precision was defined as the ratio of true positives to all predicted positives (true positives + false positives), as presented in Equation (5).

$$Precision = \frac{TP}{TP + FP} \qquad (5)$$

Recall was defined as the ratio of true positives to all actual positives (true positives + false negatives), as presented in Equation (6).

$$Recall = \frac{TP}{TP + FN} \qquad (6)$$
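The four overlap metrics (Equations 3-6) and the 95HD can be computed directly from binary masks; the sketch below, using NumPy and SciPy, is one straightforward implementation (the surface-based, pooled-percentile 95HD convention shown is a common choice, not necessarily the exact one used by the authors).

```python
import numpy as np
from scipy.ndimage import binary_erosion, distance_transform_edt

def overlap_metrics(gt: np.ndarray, pred: np.ndarray):
    """DSC, Jaccard index, precision, and recall (Equations 3-6)."""
    gt, pred = gt.astype(bool), pred.astype(bool)
    tp = np.logical_and(gt, pred).sum()
    fp = np.logical_and(~gt, pred).sum()
    fn = np.logical_and(gt, ~pred).sum()
    dsc = 2.0 * tp / (gt.sum() + pred.sum())
    jaccard = tp / np.logical_or(gt, pred).sum()
    return dsc, jaccard, tp / (tp + fp), tp / (tp + fn)

def hd95(gt: np.ndarray, pred: np.ndarray, spacing) -> float:
    """95th-percentile symmetric Hausdorff distance between mask surfaces."""
    gt, pred = gt.astype(bool), pred.astype(bool)
    gt_surf = gt & ~binary_erosion(gt)          # boundary voxels only
    pred_surf = pred & ~binary_erosion(pred)
    d_to_gt = distance_transform_edt(~gt_surf, sampling=spacing)
    d_to_pred = distance_transform_edt(~pred_surf, sampling=spacing)
    # pool both directions before taking the 95th percentile
    dists = np.concatenate([d_to_gt[pred_surf], d_to_pred[gt_surf]])
    return float(np.percentile(dists, 95))
```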

Data were verified with the Kolmogorov-Smirnov test to determine whether they were approximately normally distributed. For normally distributed data, the paired Student’s t-test was used to compare the two-step and segmentation-only models. For non-normally distributed data, the Wilcoxon test was used. Statistical significance was set at two-tailed p < 0.05.
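A sketch of this test-selection scheme with SciPy; applying the Kolmogorov-Smirnov check to the standardized paired differences is an assumption about the exact procedure.

```python
from scipy import stats

def paired_comparison(model_a, model_b, alpha=0.05):
    """Normality check, then paired t-test or Wilcoxon, as described above."""
    diffs = [a - b for a, b in zip(model_a, model_b)]
    _, p_norm = stats.kstest(stats.zscore(diffs), "norm")
    if p_norm > alpha:                                # approximately normal
        _, p = stats.ttest_rel(model_a, model_b)      # paired Student's t-test
    else:
        _, p = stats.wilcoxon(model_a, model_b)       # Wilcoxon signed-rank test
    return p, p < alpha                               # two-tailed p, significance
```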

Subjective validation

For qualitative analysis, all test datasets were blindly reviewed by two senior oncologists with > 20 years of experience. Segmentation quality was graded using a 4-point scoring system: 0 points, severe defect (large and obvious errors); 1 point, moderate defect (minor correctable errors); 2 points, mild defect (clinically insignificant errors); and 3 points, precise (no editing required).

Dosimetric evaluation

To assess the dosimetric impact of our proposed model, brachytherapy plans were generated for all test cases. Dose-volume histograms (DVHs) of the OARs were then calculated for the two-step model and for manual expert segmentation. The difference in Dmean was determined to assess the dosimetric effect of each segmentation method. OARs dosimetric metrics were analyzed using Spearman's rank tests (significance level, p < 0.05).
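As a sketch of this evaluation, the helper below computes the per-OAR Dmean difference statistics and a Spearman rank test over the test plans; the function name and inputs are illustrative.

```python
import numpy as np
from scipy import stats

def dmean_agreement(dmean_manual, dmean_auto):
    """Mean and SD of per-plan Dmean differences (Gy) for one OAR, plus a
    Spearman rank test between the two segmentation methods."""
    manual = np.asarray(dmean_manual, dtype=float)
    auto = np.asarray(dmean_auto, dtype=float)
    diff = np.abs(manual - auto)             # over the 30 test plans
    rho, p = stats.spearmanr(manual, auto)
    return diff.mean(), diff.std(ddof=1), rho, p
```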

Results

Accuracy of residual parotid tissue segmentation

In the first segmentation stage of our proposed model, residual parotid tissue was segmented accurately, with a DSC of 0.87 on the validation dataset.

Accuracy of OARs’ segmentation

Figure 3 shows the best and most challenging scenarios obtained with manual expert segmentation (B and F), the segmentation-only model (C and G), and the proposed two-step segmentation model (D and H) in two representative patients. Successful delineation of the six OARs was observed, with good visual agreement across all three methods.

Fig. 3

The best and most challenging scenarios in the model. Figures (A-D) show the best scenario. A) A slice obtained from a 3D parotid CT image; B) Manual expert segmentation; C) Segmentation-only model; D) Two-step segmentation model. Figures (E-H) show the challenging scenario. E) A slice obtained from another 3D parotid CT image; F) Manual expert segmentation; G) Segmentation-only model; H) Two-step segmentation model


Statistical results of the two-step segmentation and segmentation-only models are presented in Table 1. The proposed two-step model showed lower 95HD and higher DSC values.

Table 1

Statistical evaluation results of four auto-segmentation models

DSC
OAR                       3D U-Net   3D V-Net   SM        TSM       p
Auricula                  0.8468     0.7285     0.8731    0.8836    0.075
Condyle process           0.9067     0.9081     0.9139    0.9148    0.034*
Skin                      0.6787     0.6859     0.7249    0.7484    < 0.001*
Mastoid process           0.8523     0.8772     0.8835    0.8863    0.047*
External auditory canal   0.7164     0.7169     0.7379    0.7400    0.004*
Mandibular ramus          0.8936     0.6470     0.9162    0.9319    < 0.001*

95HD
OAR                       3D U-Net   3D V-Net   SM        TSM       p
Auricula                  15.0672    7.1394     1.6362    1.3712    0.046*
Condyle process           1.7019     1.9366     1.3791    1.3626    0.161
Skin                      12.4923    5.2427     2.8091    1.8215    < 0.001*
Mastoid process           1.8509     1.4978     1.2364    1.1629    0.152
External auditory canal   2.4721     2.2832     1.8318    1.8010    0.091
Mandibular ramus          3.7431     14.2690    2.5544    1.9688    < 0.001*

Jaccard
OAR                       3D U-Net   3D V-Net   SM        TSM       p
Auricula                  0.7377     0.5781     0.7771    0.7925    0.125
Condyle process           0.8307     0.8341     0.8460    0.8475    0.041*
Skin                      0.5164     0.5239     0.5702    0.5994    < 0.001*
Mastoid process           0.7515     0.7860     0.8009    0.8056    0.029*
External auditory canal   0.5640     0.5646     0.5874    0.5901    0.004*
Mandibular ramus          0.8110     0.4845     0.8492    0.8759    < 0.001*

Precision
OAR                       3D U-Net   3D V-Net   SM        TSM       p
Auricula                  0.8527     0.6266     0.8898    0.8703    0.143
Condyle process           0.9220     0.9052     0.9330    0.9332    0.023*
Skin                      0.7370     0.6782     0.7475    0.7689    < 0.001*
Mastoid process           0.9080     0.9029     0.9317    0.9328    0.077
External auditory canal   0.7070     0.7119     0.7238    0.7237    0.129
Mandibular ramus          0.9347     0.5131     0.9468    0.9535    < 0.001*

Recall
OAR                       3D U-Net   3D V-Net   SM        TSM       p
Auricula                  0.8454     0.8929     0.8719    0.8992    0.039*
Condyle process           0.8950     0.9141     0.8996    0.9011    0.008*
Skin                      0.6384     0.7062     0.7174    0.7381    < 0.001*
Mastoid process           0.8218     0.8645     0.8545    0.8586    0.022*
External auditory canal   0.7459     0.7434     0.7646    0.7686    0.003*
Mandibular ramus          0.8633     0.9052     0.8924    0.9156    < 0.001*

[i] OARs – organs at risk, SM – segmentation-only model, TSM – two-step segmentation model; *p < 0.05 indicates a statistically significant difference

Mean DSC values > 0.80 were observed for all OARs except the skin and external auditory canal, suggesting the reliability and applicability of our proposed model for automated OARs segmentation in parotid cancer. Among all structures, the best segmentation results were obtained for the mandibular ramus and condyle process, with mean DSC values of 0.9319 and 0.9148, respectively, reflecting their relatively high contrast and clear boundaries on post-operative CT images.

Quantitative data of the two-step and segmentation-only models were compared using the paired Student's t-test or Wilcoxon test, with statistical significance set at two-tailed p < 0.05. As shown in Table 1, the two models differed, and most of the comparisons were statistically significant (p < 0.05) in favor of the two-step model.

Oncologist evaluation

The results of the qualitative analysis are presented in Table 2. Automated delineation using the two-step model was deemed clinically acceptable by the two senior oncologists.

Table 2

The mean results of the clinical acceptability analysis of the organs at risk (OARs)

OAR                       Oncologist 1   Oncologist 2
Auricula                  2.63           2.69
Condyle process           2.74           2.78
Skin                      2.55           2.41
Mastoid process           2.84           2.86
External auditory canal   2.47           2.41
Mandibular ramus          2.93           2.88

[i] A score of ≥ 2 was defined as clinically acceptable.

Segmentation time

The average time for OARs segmentation with our two-step model was 50.90 s, of which 4.47 s was spent on slice classification. In contrast, expert manual segmentation required over 20 minutes.

Dosimetric impact

We performed experiments to examine the dosimetric impact of contours obtained with the proposed automatic segmentation method. Figure 4 shows DVHs for all six OARs of an exemplary patient. For all 30 test plans, we calculated the difference in Dmean between OARs segmented manually by expert oncologists and those segmented by the two-step automated approach. The mean and standard deviation of the dose differences between the two groups are presented in Table 3, along with the corresponding p-values. All dosimetric metrics showed p-values larger than 0.05, indicating no significant differences between the two segmentation methods for any OAR.

Table 3

The mean and standard deviation (SD) of Dmean differences, and p-values, between organs at risk segmented by manual expert and two-step segmentation model for all 30 plans

OAR                       Mean (Gy)   SD (Gy)   p-value
Auricula                  5.52        12.14     0.266
Condyle process           2.69        4.31      0.547
Skin                      9.71        17.55     0.062
Mastoid process           3.37        3.81      0.054
External auditory canal   8.92        17.10     0.120
Mandibular ramus          2.53        3.34      0.619
Fig. 4

Dose distribution in one patient with (A) two-step segmentation model contours and (B) corresponding DVHs. Dose distribution in the same patient with (C) manual expert segmentation contours and (D) corresponding DVHs


Comparison with other methods

We compared our proposed method with several deep learning-based 3D segmentation methods, including 3D U-Net [19], V-Net [20], and nnU-Net (the segmentation-only model). To ensure objectivity, we used the same training framework for the compared methods as for our proposed method, including the same data augmentation, pre-processing, and post-processing. The data were analyzed in terms of DSC, Jaccard index, 95HD, precision, and recall.

As shown in Table 1, the proposed two-step approach outperformed the other methods on every metric. In particular, for the skin, which spans a large range, our method was markedly superior: its mean DSC was 7% higher than that of 3D U-Net and 2% higher than that of the one-step nnU-Net.

Discussion

The parotid gland is considered an important organ at risk in head and neck cancer and whole brain radiotherapy [21]; however, studies on OARs in brachytherapy for parotid gland cancer are currently lacking. OARs for parotid cancer patients treated with external beam radiation therapy have included the contralateral parotid, eyes, lenses, optic nerve, and spinal cord [22]. Owing to the steep dose gradient in brachytherapy, the small size of OARs and the similar density of surrounding tissues render segmentation difficult. Moreover, tissue adhesions and subsequent blurring of soft tissue boundaries following surgical resection can further challenge the segmentation process for adjuvant brachytherapy in parotid gland cancer.

The present work proposed a deep learning-based two-step automated OARs segmentation technique and evaluated its performance for brachytherapy planning in parotid gland cancer. Six OARs were identified: the auricula, condyle process, skin, mastoid process, external auditory canal, and mandibular ramus, with mean DSC values of 0.88, 0.91, 0.74, 0.89, 0.74, and 0.93, respectively, in close agreement with gold standard manual segmentation. The entire segmentation process with our model took approximately 50.90 s, while manual segmentation by experienced oncologists required over 20 minutes, demonstrating the potential of our model to improve the efficiency of the segmentation process. While DSC is a widely adopted metric for assessing segmentation quality, it is highly sensitive to the size of the evaluated structure [23]. For adequate evaluation, qualitative validation was therefore performed by experts in the field, who deemed the two-step automated segmentation approach clinically acceptable. Overall, our results demonstrate the potential of the proposed model as an accurate and efficient segmentation method in clinical practice.

Ranked in descending order, the segmentation results of the six OARs were the mandibular ramus, condyle process, mastoid process, auricula, external auditory canal, and skin, a ranking that appears related to the degree of structural complexity and variation. The mandibular ramus and condyle process have relatively simple shapes, and their clean boundaries and strong contrast allow easy segmentation from the image background. By contrast, automatic segmentation of the external auditory canal and skin is challenging due to complex background texture and large variation in size, shape, and intensity. Occasionally, it is difficult to distinguish the boundaries between residual gland and OARs, depending on the initial tumor extent and gross residual disease. Blurred, low-contrast surgical areas may lead to fuzzy output from automatic segmentation models, and in these cases, OARs segmented by the model had relatively poor consistency with those of the radiation oncologists. However, as intra- and inter-observer variability occurs in the quantification of the same structure, it was necessary to conduct subjective validation in addition to quantitative evaluation; in numerous cases, segmentations with lower quantitative agreement were still judged clinically acceptable.

The high post-operative variability in the architecture and morphology of the parotid gland often results in heterogeneity in the range of ROIs among individuals, which can be a major challenge in manual segmentation. As such, automated segmentation methods have received great attention as a means of enabling personalized ROI contouring. Our two-step deep learning approach focuses on the ROI, and achieved greater accuracy, even when the salient region was small, than methods that extract global features from the entire image. As irradiation in brachytherapy affects only a very localized area around the radiation sources [24], delineation was not necessary for tissues outside the region of interest, which would only add noise and artifacts and potentially contribute the greatest uncertainty to radiotherapy planning [25]. Although the region of interest was delimited at 1.7 cm from the edge of the post-operative residual parotid gland, radiation oncologists who delineate OARs tend to expand the scope to avoid potential omission. If the ROI is localized accurately in the first step, that accuracy is reflected in the final segmentation. Moreover, as the original CT images contain a large background carrying irrelevant, noisy, and redundant features, the two-step method not only improves segmentation performance by removing irrelevant (or less relevant) features, but also reduces computational complexity by decreasing the spatial size of the input volume. In this study, ROIs were limited to 17 mm from the residual parotid tissues, in accordance with the 17 mm penetration depth of the selected 125I seeds. Higher consistency can be achieved by selecting smaller ROIs.

Due to its influence on tumor control and the risk of radiation-induced toxicity, OARs contouring is crucial for radiotherapy planning. The current gold standard, manual segmentation, is however laborious and time-consuming [26]. Increasing attention has been paid to deep learning-assisted models for automated segmentation in brachytherapy [27, 28]; however, to our knowledge, such models have not been applied in the head and neck region. Precise delineation and prevention of severe radiotherapy-related complications are particularly crucial in head and neck cancers, given the complexity of anatomical structures and the clinical emphasis on maintaining aesthetics [29]. At present, most research on automated delineation in the head and neck region has revolved around external beam radiation therapy. In a study by van Dijk et al., a 2D U-Net model trained on CT images of 589 patients was used to segment the parotid gland, obtaining a DSC of 0.81 ±0.08 [30]. Dai et al. proposed a deep learning-based head and neck multi-organ segmentation method using magnetic resonance images of 60 patients, and obtained DSC values of 0.85 ±0.06 and 0.86 ±0.05 for the left and right parotid glands, respectively [31]. Our results were in line with those of the aforementioned studies. In the current study, surrounding tissues, such as the mandible and skin, were also delineated as OARs, but comparable published data for these structures are scarce.

There were several limitations to our study. First, the six OARs were selected based on our institutional protocol for parotid brachytherapy, which overlooked other organs, including the masseter and sternocleidomastoid muscles; future research including these organs is warranted. Second, due to limited computing power, images with 2 mm slice thickness were used. Given that thinner slices would provide more detailed information, better segmentation results may be achievable with thinner CT slices, although at the cost of longer segmentation times; we will consider this in follow-up research. Third, this was a single-center study, which leaves the model's accuracy on data beyond our patient population open to question.

Conclusions

The high post-operative variability in the architecture and morphology of the parotid gland often results in heterogeneity in the range of ROIs across individuals, which can be a major challenge in manual segmentation. With our deep learning-based two-step technique, the ROI is first extracted and then fed into the second network, allowing a focus on anatomically relevant regions and yielding segmentation of greater accuracy. This approach thereby carries the potential to expedite the treatment planning process of brachytherapy for parotid gland cancers.

Funding

This research was funded in part by the National Key Research and Development Program of China (grant number 2019YFB1311304) and the National Key R&D Program of China (grant numbers 2018YFA0704100 and 2018YFA0704101).

Disclosure

All authors report no conflict of interest.

References

1 

Tian YM, Huang WZ, Yuan X et al. The challenge in treating locally recurrent T3-4 nasopharyngeal carcinoma: the survival benefit and severe late toxicities of re-irradiation with intensity-modulated radiotherapy. Oncotarget 2017; 8: 43450-43457.

2 

Takácsi-Nagy Z, Martínez-Mongue R, Mazeron JJ et al. American Brachytherapy Society Task Group Report: Combined external beam irradiation and interstitial brachytherapy for base of tongue tumors and other head and neck sites in the era of new technologies. Brachytherapy 2017; 16: 44-58.

3 

Raudaschl PF, Zaffino P, Sharp GC et al. Evaluation of segmentation methods on head and neck CT: Auto-segmentation challenge 2015. Med Phys 2017; 44: 2020-2036.

4 

Nikolov S, Blackwell S, Zverovitch A et al. Clinically applicable segmentation of head and neck anatomy for radiotherapy: deep learning algorithm development and validation study. J Med Internet Res 2021; 23: e26151.

5 

Rezvantalab A, Safigholi H, Karimijeshni S. Dermatologist level dermoscopy skin cancer classification using different deep learning convolutional neural networks algorithms. arXiv preprint 2018; arXiv:1810.10348.

6 

Long J, Shelhamer E, Darrell T. Fully convolutional networks for semantic segmentation. In Proceedings of the IEEE conference on computer vision and pattern recognition. 2015: 3431-3440.

7 

Chen LC, Papandreou G, Schroff F et al. Rethinking atrous convolution for semantic image segmentation. arXiv preprint 2017; arXiv:1706.05587.

8 

Ronneberger O, Fischer P, Brox T. U-net: Convolutional networks for biomedical image segmentation. In International Conference on Medical image computing and computer-assisted intervention. Springer, Cham, 2015; 234-241.

9 

Isensee F, Jaeger PF, Kohl SAA et al. nnU-Net: a self-configuring method for deep learning-based biomedical image segmentation. Nat Methods 2021; 18: 203-211.

10 

Safigholi H, Chamberland MJP, Taylor REP et al. Update of the CLRP TG-43 parameter database for low-energy brachytherapy sources. Med Phys 2020; 47: 4656-4669.

11 

Yushkevich PA, Piven J, Hazlett HC et al. User-guided 3D active contour segmentation of anatomical structures: significantly improved efficiency and reliability. Neuroimage 2006; 31: 1116-1128.

12 

Lee CY, Xie S, Gallagher P et al. Deeply-supervised nets. In Artificial intelligence and statistics. PMLR 2015; 562-570.

13 

Sudre CH, Li W, Vercauteren T et al. Generalised dice overlap as a deep learning loss function for highly unbalanced segmentations. Deep Learn Med Image Anal Multimodal Learn Clin Decis Support 2017; 2017: 240-248.

14 

Sukhbaatar S, Fergus R. Learning from noisy labels with deep neural networks. arXiv preprint 2014; arXiv:1406.2080.

15 

Zou KH, Warfield SK, Bharatha A et al. Statistical validation of image segmentation quality based on a spatial overlap index. Acad Radiol 2004; 11: 178-189.

16 

Jaccard P. Nouvelles recherches sur la distribution florale. Bull Soc Vaud Sci Nat 1908; 44: 223-270.

17 

Huttenlocher DP, Klanderman GA, Rucklidge WJ. Comparing images using the Hausdorff distance. IEEE Trans Pattern Anal Mach Intell 1993; 15: 850-863.

18 

Hosny KM, Kassem MA, Foaud MM. Skin melanoma classification using ROI and data augmentation with deep convolutional neural networks. Multimed Tool Appl 2020; 79: 24029-24055.

19 

Çiçek Ö, Abdulkadir A, Lienkamp SS et al. 3D U-Net: learning dense volumetric segmentation from sparse annotation. In International conference on medical image computing and computer-assisted intervention. Springer, Cham, 2016; 424-432.

20 

Milletari F, Navab N, Ahmadi SA. V-net: Fully convolutional neural networks for volumetric medical image segmentation. In 2016 fourth international conference on 3D vision (3DV). IEEE 2016; 565-571.

21 

Cho O, Chun M, Park SH et al. Parotid gland sparing effect by computed tomography-based modified lower field margin in whole brain radiotherapy. Radiat Oncol J 2013; 31: 12-17.

22 

Blasi O, Fontenot JD, Fields RS et al. Preliminary comparison of helical tomotherapy and mixed beams of unmodulated electrons and intensity modulated radiation therapy for treating superficial cancers of the parotid gland and nasal cavity. Radiat Oncol 2011; 6: 178.

23 

Schmidt P, Gaser C, Arsic M et al. An automated tool for detection of FLAIR-hyperintense white-matter lesions in multiple sclerosis. Neuroimage 2012; 59: 3774-3783.

24 

Chargari C, Deutsch E, Blanchard P et al. Brachytherapy: an overview for clinicians. CA Cancer J Clin 2019; 69: 386-401.

25 

Walker GV, Awan M, Tao R et al. Prospective randomized double-blind study of atlas-based organ-at-risk autosegmentation-assisted radiation planning in head and neck cancer. Radiother Oncol 2014; 112: 321-325.

26 

Liu Y, Lei Y, Wang Y et al. MRI-based treatment planning for proton radiotherapy: dosimetric validation of a deep learning-based liver synthetic CT generation method. Phys Med Biol 2019; 64: 145015.

27 

Zabihollahy F, Viswanathan AN, Schmidt EJ et al. Fully automated multiorgan segmentation of female pelvic magnetic resonance images with coarse-to-fine convolutional neural network. Med Phys 2021; 48: 7028-7042.

28 

Lei Y, Wang T, Roper J et al. Male pelvic multi-organ segmentation on transrectal ultrasound using anchor-free mask CNN. Med Phys 2021; 48: 3055-3064.

29 

Murakami N, Yoshimoto S, Nakamura S et al. Per-oral interstitial brachytherapy catheter insertion for boost in case of recurrent tonsillar carcinoma: dosimetry and clinical outcome. BJR Case Rep 2020; 6: 20190059.

30 

van Dijk LV, Van den Bosch L, Aljabar P et al. Improving automatic delineation for head and neck organs at risk by deep learning contouring. Radiother Oncol 2020; 142: 115-123.

31 

Dai X, Lei Y, Wang T et al. Multi-organ auto-delineation in head-and-neck MRI for radiation therapy using regional convolutional neural network. Phys Med Biol 2022; 67: 10.1088/1361-6560/ac3b34.

Copyright: © 2022 Termedia Sp. z o. o. This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0) License (http://creativecommons.org/licenses/by-nc-sa/4.0/), allowing third parties to copy and redistribute the material in any medium or format and to remix, transform, and build upon the material, provided the original work is properly cited and states its license.
 