Fig. 2: Comparison of target contouring performance based on varying training dataset sizes.
From: LLM-driven multimodal target volume contouring in radiation oncology
a Quantitative comparison for all the validation sets. The Dice metric for each trial is presented as mean values (center lines) with 95th percentile of confidence intervals calculated with the non-parametric bootstrap method (shaded areas). n denotes the number of patients. b Visual comparison for external validation #1. Source data are provided as a Source Data file.