Publication Cover
Expert Review of Precision Medicine and Drug Development
Personalized medicine in drug development and clinical practice
Volume 9, 2024 - Issue 1
312
Views
0
CrossRef citations to date
0
Altmetric
Review

Advances in the field of developing biomarkers for re-irradiation: a how-to guide to small, powerful data sets and artificial intelligence

, , , , &
Pages 3-16 | Received 20 Jul 2023, Accepted 28 Feb 2024, Published online: 11 Mar 2024

ABSTRACT

Introduction

Patient selection remains challenging as the clinical use of re-irradiation (re-RT) increases. Re-RT data are limited to retrospective studies and small prospective single-institution reports, resulting in small, heterogenous data sets. Validated prognostic and predictive biomarkers are derived from large-volume studies with long-term follow-up. This review aims to examine existing re-RT publications and available data sets and discuss strategies using artificial intelligence (AI) to approach small data sets to optimize the use of re-RT data.

Methods

Re-RT publications were identified where associated public data were present. The existing literature on small data sets to identify biomarkers was also explored.

Results

Publications with associated public data were identified, with glioma and nasopharyngeal cancers emerging as the most common tumor sites where the use of re-RT was the primary management approach. Existing and emerging AI strategies have been used to approach small data sets including data generation, augmentation, discovery, and transfer learning.

Conclusions

Further data is needed to generate adaptive frameworks, improve the collection of specimens for molecular analysis, and improve the interpretability of results in re-RT data.

1. Introduction

Publications on re-irradiation (re-RT) have been increasing in oncology literature; however, data sets for use in analysis and validation and biomarkers that can guide the decision-making, administration, and prognosticate or predict response to re-RT have been slow to identify and none are currently used in the clinic [Citation1–3]. Reasons for this include data heterogeneity and inherent biases concerning re-RT. There is also inclusion bias of patients with superior performance status, lower disease burden, and, thus, potentially superior biological tumor behavior. Inclusion bias may also be characterized by the ability to travel or seek second opinions and undergo management at centers with inherent re-RT expertise and trials. As a result, data on re-RT that emerges from these clinical scenarios is limited by significant bias and is defined as ‘small’ by data standards. Data in re-RT suffers from a lack of robust acquisition and curation outside of clinical trials, with small data defining this clinical space and limited computational analysis due to data volume, format type, and data siloes [Citation4]. However, the definition of small data carries specific definitions depending on the context, i.e. it can be defined in terms of storage size or the number of instances, as will be discussed here for using small data in artificial intelligence (AI) approaches. Small data sets are necessary to study rare histologies or malignancies that disproportionally affect specific demographics or gender. In addition to being small in volume, small data may also refer to data that is narrow in variety and has faster data velocity than big data [Citation5]. In the era of personalized medicine, small data complements big data in advancing healthcare as it examines individual data to identify causation and develop predictive models [Citation6]. Notably, missing data or values are likely to affect small data sets more significantly than big data sets. A 2021 study examining data originating from registries [Citation7] identified the percentage of missing data for variables of interest as 71% in lung cancer patients and 55% in breast cancer patients, with a discrepancy in survival between patients with missing data and those with complete data. Evolving treatment paradigms with heterogeneity in previous management and the overlapping administration of novel agents in conjunction with re-RT limit the number of instances with homogenous management. It is unlikely that the data landscape in re-RT will undergo significant alteration to improve data capture or homogeneity. The ability to harness existing imperfect data and mitigate shortcomings with computational analysis may help advance re-RT data toward identifying actionable biomarkers. In this review, we will examine existing re-RT publications and available data sets, and discuss AI strategies to approach small data sets that can be employed to maximize the use of re-RT data.

2. Publicly available data in re-RT

2.1. Trends in re-RT publications

Re-RT has grown in importance as some malignancies increasingly benefit from more prolonged survival, and the ability to administer re-RT safely is more feasible given technological improvements. Publications on re-RT have been steadily increasing, significantly growing in quantity between 2000 and the present (). However, there is limited to no published literature on publicly available datasets in studies utilizing re-RT. According to clinicaltrials.gov as of June 2023, 29 clinical trials assessing re-RT in various disease sites and applications are recruiting. Approximately one-third (34.4%) are of Phase II design, with no Phase III studies currently recruiting. Of the 29 trials, they are predominately localized (78%) to the United States and Europe, with 13.7% recruiting in East Asia.

Figure 1. Publications on re-RT based on the Web of Science search for the terms “re-irradiation” and “reirradiation” [Citation1].

Figure 1. Publications on re-RT based on the Web of Science search for the terms “re-irradiation” and “reirradiation” [Citation1].

Some tumor sites, such as CNS and head and neck, benefit from the bulk of the data in this space. Other sites, such as bladder, have relatively little data, and in many malignancies, much of the data is limited to reviews. In the last 5 years, data linkages are emerging in tumor sites and radiation techniques ().

Figure 2. Sankey Chart evaluating the publication landscape in re-RT [Citation1]. Head and neck and glioma feature most prominently in re-RT publications, while other cancer sites benefit from fewer articles and more reviews addressing re-RT. Linkages between technical approaches and their implementation in various oncologic sites are increasing but are concentrated in some oncologic sites and are relatively scarce in others, such as bladder cancer.

Figure 2. Sankey Chart evaluating the publication landscape in re-RT [Citation1]. Head and neck and glioma feature most prominently in re-RT publications, while other cancer sites benefit from fewer articles and more reviews addressing re-RT. Linkages between technical approaches and their implementation in various oncologic sites are increasing but are concentrated in some oncologic sites and are relatively scarce in others, such as bladder cancer.

2.2. Available re-RT data with public data

To conduct a more in-depth evaluation of the landscape of existing re-RT data, we conducted a literature search using the National Institute of Health (NIH) institutionally available repositories: Web of Science, and Google Dataset (). Search terms included: ‘re-irradiation,’ ‘reirradiation,’ and ‘reirradi*.’ Inclusion criteria was publications with associated data, involving human subjects and available in the English language, published until June 2023.

Figure 3. Diagram of the literature review process.

Figure 3. Diagram of the literature review process.

As of June 2023, 17 published articles on re-RT with associated data were identified using these repositories (). Two of the four prospective studies are of Phase II design [Citation2,Citation3]. The remainder are retrospective studies. The most common disease sites include CNS (6) and head and neck (5) consistent with re-RT data trends. The associated data were usually found within the published manuscript or supplemental material. Two datasets were conditionally available only upon request. Raw data were not readily available in many situations. As expected, the data were highly heterogenous, with a variety of re-RT doses and techniques. One study utilized brachytherapy [Citation4], and four utilized proton or carbon ion therapy [Citation5–7,Citation13]. The use of these techniques in the re-RT setting is important but explored mostly in head and neck cancer and glioma, which are the sites with the most re-RT data overall (). The included studies ranged from 22 to 1399 patients, the latter being a systematic review [Citation11]. Most of the studies have endpoints that include local control (LC) and survival, with multivariate analyses to identify predictors, many of which include re-RT tumor volume, re-RT dose, patient age, and Karnofsky Performance Score (KPS). We note that most tumor sites carry little representation in re-RT, including common cancers such as prostate, breast, or brain metastases, as >50% of re-RT data is in head and neck and glioma (). Overall, the studies identified a variety of variables associated with LC, progression-free survival (PFS), and overall survival (OS). In the studies on head and neck, in particular nasopharyngeal cancer, predictors of LC included tumor volume and re-RT dose. A study examining carbon ions showed that ≥81.9 GyE was associated with improved LC and age <60 years old had superior OS [Citation13]. Huang et al. examined a comorbidity-based nomogram to predict survival in patients with recurrent nasopharyngeal carcinoma to aid in re-RT selection [Citation20]. The nomogram included eight prognostic factors: comorbidity, recurrent tumor stage, patient age, prior radiation-induced toxicity, tumor volume, disease-free interval, and KPS. The authors suggest comprehensive nomograms may aid treatment selection between ideal candidates for re-RT vs. those better suited for palliative systemic therapy. This approach balances presumed survival outcomes with treatment tolerability. In the studies on re-RT in glioma patients, survival predictors included patient factors such as age and KPS, wherein younger age and higher KPS corresponded to increased survival. Tumor variables impacted survival, such as tumor volume, histology, grade, and IDH mutation status. Treatment parameters such as re-RT dose and longer time interval between first radiation to re-RT (>12–24 months) also impacted survival [Citation3,Citation8]. BED10 >43 Gy correlated with increased survival following re-RT in glioma patients [Citation8]. Additionally, concomitant receipt of chemotherapy or bevacizumab suggested increased survival [Citation11].

Table 1. Existing re-irradiation datasets based on literature search. Glioma and nasopharyngeal cancer are the most common tumor sites represented.

As stated previously, the most common disease sites where re-RT was examined in the repositories identified were glioma and nasopharyngeal carcinoma (NPC) (, ). These cancers have the potential to be aggressive and highly invasive depending on molecular features, grade, and stage. Both NPC and high-grade gliomas are believed to have intratumoral heterogeneity, potentially impacting prognosis, and therapy response [Citation21–23]. With increasing molecular features guiding further subclassifications of glioma and NPC, there is a growing need and interest in identifying appropriate and tailored therapy [Citation24,Citation25]. Given that re-RT has an established role in select cases of recurrent glioma and NPC, molecular classifications may need to be considered in predicting response to re-RT [Citation26,Citation27]. Current published re-RT literature that looks at re-RT in gliomas is limited to single-institution retrospective or prospective phase II trials. To identify biomarkers predicting response to re-RT, larger datasets powered for such endpoints that are publicly available are needed for analysis.

3. Computational learning to harness small data

3.1. Strategies employed to approach small data sets

Given the limitations of small data sets and the lack of availability of larger detailed datasets for re-RT patients, computational methods must be harnessed to enhance the use of existing data () to arrive at clinically actionable biomarkers that can be validated as data sets emerge. Clinically actionable biomarkers of early detection, treatment selection, and prognosis currently in clinical use are based on larger data sets, including HER2 in breast cancer and PSA in prostate cancer [Citation28–30]. Evidence-based clinical practice guidelines in oncology are derived from multiple trials and, when feasible, metanalyses with data acquired over several decades [Citation31]. Data originating over the course of longer time periods may thus give the illusion of large data sets, though they may harbor heterogeneity and missing data. The reality is that there are limited published clinical trials investigating re-RT, and much re-RT is performed in clinical settings without data being robustly acquired or curated for later analysis. Furthermore, molecular data that could generate biomarkers is unavailable in most data sets involving re-RT. However, given examples in systemic management that include the addition of immunotherapy or radiopharmaceutical agents to the standard of care management where small data sets have been practice changing [Citation32], there is reason to remain hopeful that small data sets in re-RT can also be practice altering if they are robust and generate validated markers. There is a realistic possibility that promising biomarkers can be identified via small data sets. For example, CA 19–9, a potential serum maker of early breast, and pancreatic cancer detection, was first established in a relatively small sample size (n = 37) [Citation33,Citation34]. Given this clinical need, approaches to small data sets () have increased in prevalence, particularly in tumors such as bladder cancer, where the administration of RT with curative intent is less common and re-RT extremely rare. Markers of radiation response such as PD-L1, HiF-1a, and TP53 May 2001potentially be used in re-RT but data are limited, and inclusion of these in clinical trials would require further validation which is challenging [Citation44].

Table 2. Strategies employed to approach small data sets.

Clinical learning and decision-making in medicine is a byproduct of pattern recognition with added evidence, both existing and emerging, and can be based on small data sets. The dataset size deemed small in the context of AI is subject to the question posed. The number of patients (‘instances’), and observations available for each instance (‘features’), has been described as subject to the rule of 10 [Citation45], wherein a minimum of 10 events per predictor variable (EPV) should be used to optimize model convergence. Thus, when AI methods that aim to foster improvement in oncologic outcomes are considered, the need to examine large data sets or a greater number of instances to achieve stable results becomes a barrier to solution-driven discussion. However, small data sets in oncology are common, with major conclusions driven by data originating from data sets comprising less than 1000 instances. The typical number of patients in a clinical trial may be 10 to 30 patients (Phase I), 20–50 patients (Phase II), and 100 to 1000 patients (Phase III) [Citation46]. Small data sets, defined as less than 10 observations per predictor variable [Citation47], are thus common in oncology and the norm in re-RT (). Strategies employed to approach small data sets () are important to consider and evaluate in re-RT datasets as they can be used as templates to create a dynamic framework for harnessing small data types in oncology to improve patient outcomes using AI.

3.2. Strategies to handle limited data scenarios for machine learning problems

There are various approaches to training machine learning models with small data sets. Some important strategies to tackle limited data problems for machine learning tasks are illustrated in . These strategies are categorized into four groups that can be applied separately or as hybrids of several methods. Broadly, the strategies consist of data acquisition, cross-validation, feature selection, and learning methods [Citation48–52]. Data acquisition aims to gather or generate data that can be used to train computational learning models by various means or combinations depending on the data type and clinical context [Citation51]. Data acquisition includes discovery, augmentation, and generation [Citation51]. Data discovery is concerned with searching for new related datasets to address a specific problem; this can include the curation of data sets by mining electronic health records and various data streams that may be siloed, such as radiation dosimetry data and quality of life or toxicity data. Data augmentation and generation strategies serve to maximize or to increase the amount of training data by generating virtual samples or augmenting data that already exists. Data generation was used, for example, by Tsai et al., who overcame insufficient protein data by analyzing the available dataset with modified extreme value theory (MEVT) and using the population domain to generate virtual samples [Citation35]. The original and virtual samples were then used to train an artificial neural network. Another method used to generate virtual samples is distance-based mega-trend-diffusion (DB-MTD), which generates virtual samples while taking extreme values into consideration. In small data sets, the interpretation of extreme values is significant; thus, Lin et al. proposed the DB-MTD method to minimize the influence of extreme values while extending the range of possible virtual values [Citation53]. The data generation process is generally employed when no available dataset is generated by synthetic data [Citation51]. Data augmentation utilizes small sets by exploiting specific features of available data sets to maximize the amount of training data. For example, quantitative features can be extracted from medical images through radiomics and be used for tumor detection and classification. Ubaldi et al. used this technique and extracted radiomic features of computed tomography (CT) scans of non-small cell lung cancer patients to create a larger data set. These features were then used to train a machine-learning model to predict tumor histology and stage [Citation38].

Figure 4. Strategies to handle limited data scenarios for machine learning problems.

Figure 4. Strategies to handle limited data scenarios for machine learning problems.

Another approach, cross-validation (), is a commonly used technique in machine learning and model evaluation for limited data scenarios [Citation49]. It provides a robust and unbiased estimate of a model’s performance by partitioning the available data into multiple subsets to train and test machine learning models. Cross-validation helps mitigate issues such as overfitting and data bias [Citation54,Citation55]. K-fold, leave-one-out, or nested cross-validation can be used for limited dataset size-related machine learning problems [Citation52]. Feature selection is crucial in machine learning, pattern recognition, and data analysis to identify the most relevant features or variables from a given dataset. The goal is to eliminate irrelevant or redundant features, which can improve model performance, reduce overfitting, and enhance interpretability. Several feature selection approaches, including filter, wrapper, and embedded methods, have been developed and applied to accomplish this task [Citation48,Citation56,Citation57] (). Filter feature selection methods aim to evaluate the relevance of features independently of any specific machine learning algorithm [Citation58]. These methods assess the characteristics of individual features and their relationship to the target variable. Mutual information, chi-square, and minimum redundancy-maximum relevance (mRMR) are examples of filter feature selection methods. Wrapper feature selection methods aim to find the optimal feature subset that maximizes the performance (e.g. accuracy rate) of the specific model being used [Citation58,Citation59]. Recursive feature elimination (RFE) and genetic algorithms (GA) are examples of this method. Embedded feature selection methods integrate the feature selection process into the model training. These methods, such as Least Absolute Shrinkage and Selection Operator (LASSO) [Citation60], simultaneously optimize both the feature selection and model parameters, resulting in a more efficient and effective feature subset.

Another strategy for tackling limited data problems is to use various learning strategies such as ensemble learning, transfer learning, semi-supervised learning, few-shot learning, and federated learning (). Ensemble learning methods are a type of machine learning algorithms that improve the prediction performance by combining multiple model predictions [Citation61]. They provide robustness, reducing bias, and better results than a single model. Transfer learning (e.g. fine-tuning) is a machine learning technique that leverages knowledge gained from one task/domain to improve learning and performance on a different and related task/domain [Citation49,Citation62]. Transfer learning is especially applied for medical images and deep learning-based classification/recognition tasks by employing pre-trained models [Citation49]. Transfer learning offers several advantages, including reduced training time and improved generalization [Citation63]. Semi-supervised learning is a machine learning paradigm that leverages labeled and unlabeled data to build predictive models and various learning tasks [Citation64]. It aims to overcome the limitations of supervised learning, where labeled data may be difficult or expensive to obtain. Few-shot learning is a subfield of machine learning that deals with the challenge of learning new concepts or tasks with very limited labeled data [Citation65]. In traditional machine learning, models require a substantial amount of labeled data for training, but in few-shot learning, the goal is to enable models to learn from only a few examples of each new class or task. Federated learning is a decentralized machine learning approach where models are trained collaboratively across multiple edge devices or servers without sharing the raw data [Citation66]. This allows privacy-preserving and efficient model training by keeping the data local and distributed. The updates of local models are aggregated to create a global model, enabling learning from diverse strategies employed to approach small data sets and geographically dispersed data sources. Federated learning improves the generalizability of the model [Citation49].

Learning methods such as transfer learning (), in which machine learning models that have already been trained are finetuned for a different task, are crucial in working with re-RT data. This technique is demonstrated by Rahul et al. and Usman et al., who used machine learning models trained on ImageNet images to analyze medical images [Citation42,Citation43]. The medical image data sets are small; thus, they cannot be used to fully train machine learning models. However, millions of ImageNet images can be used to train the models. Although ImageNet only contains natural images that are not necessarily medical, the models serve as a basis that can be further improved to analyze medical images. Another technique in training machine learning models with small data sets is refining a specific portion of the training process. For example, Tappeiner et al. improved the quality of segmentation by using a stacked convolutional autoencoder as a shape prior to regularizing the network training. The shape prior guided the segmentation process based on the shapes of available medical images [Citation67].

Finally, another approach to small data sets is to relax the rule of 10, wherein a minimum of 10 events per predictor variable (EPV) are necessary to lead to meaningful results [Citation68]. This concept was tested using data from a cardiac trial of 673 patients in which 252 deaths had occurred, and seven variables were predictors of mortality [Citation69], resulting in an EPV of (252/7 =) 36. This study evaluated EPV values of 2, 5, 10, 15, 20, and 25 and found no major problems for EPV of greater than 10. However, several method and simulation papers have examined minimum sample size, determining that the minimum number of events depends on the study and prediction model to be investigated with the number of patients (n) and the number of predictor parameters (p) subject to several criteria [Citation70]. Per Riley et al., these criteria are comprised of predictor effect estimates as defined by a global shrinkage factor of ≥0.9, absolute difference of ≤0.05 in the apparent and adjusted R2, precise estimation of a margin of error ≤10% of the true value of the model’s residual standard deviation, and an estimation of the mean predicted outcome value (model intercept) [Citation70]. Riley et al., describe an example model for lung function prediction in African American women; 25 predictor parameters would require 918 patients for 36.7 subjects per parameter. With respect to analyzing specific outcomes (time-to-event), Riley et al. explain that certain clinical scenarios require an EPV ranging from 4.8 to 23 depending on the clinical context [Citation71]. This is particularly relevant in re-RT scenarios, given that the ability to leverage small data sets ultimately determines data separation and implementing penalties based on the strength of association [Citation72].

3.3. Dimensionality of data

Re-RT data is defined by wide dimensionality, which is described in the context of the EPV, and this can pose significant difficulties. In many re-RT sets, not many patients are available for analysis, but the features known about each patient are many and, given emerging molecular data, growing rapidly. As dimensionality, or number of variables, increases, the number of samples necessary to accurately estimate an output also needs to increase [Citation73,Citation74]. However, as dimensionality increases, the differences between variables may become negligible, which is considered a ‘blessing of dimensionality’ [Citation75,Citation76] (). The potential blessing of dimensionality needs to be maximized in oncology, particularly in the re-RT scenario, as data of increasingly larger dimensionality may continue to emerge. At the same time, the number of instances or events from where it will originate will remain small. The dimensionality that can be mined in re-RT scenarios is particularly relevant considering that re-RT is often implemented in superior biological tumor behaviors, i.e. the patient must be alive and clinically well enough to be offered and to be able to receive re-RT. This clinical scenario thus allows more aspects to be ‘known’ about the patient, the natural history of the disease, and biological alteration over time in response to management, which can mitigate the curse of dimensionality encountered in AI [Citation80] since it is inevitably the other side of the coin of the blessing of dimensionality (). This is the case since novel interventions in re-RT will result in large-scale omic panels and dosimetry parameters that will be based on very small patient cohorts, as is the case with digital medicine in general [Citation81].

Figure 5. The blessing of dimensionality in small data sets [Citation77–79]. In small data sets, the wide dimensionality of data, when available, can be leveraged using machine learning and deep learning to classify patients based on clinical, imaging, and molecular characteristics to arrive at features that can be employed for prediction and subsequently retraining. Conclusions can be employed to optimize data acquisition (e.g. on study protocols and in clinic on the standard of care, e.g. post-re-RT imaging fusion and analysis for response), as well as data curation (e.g. dose to organs at risk in the RT field).

Figure 5. The blessing of dimensionality in small data sets [Citation77–79]. In small data sets, the wide dimensionality of data, when available, can be leveraged using machine learning and deep learning to classify patients based on clinical, imaging, and molecular characteristics to arrive at features that can be employed for prediction and subsequently retraining. Conclusions can be employed to optimize data acquisition (e.g. on study protocols and in clinic on the standard of care, e.g. post-re-RT imaging fusion and analysis for response), as well as data curation (e.g. dose to organs at risk in the RT field).

3.4. Radiomics in re-RT

Imaging and biological data and dosimetric variables remain critical tools in the work-up and treatment of patients undergoing re-RT. There is limited published literature on radiomics in re-RT ([Citation82,Citation83]).

Magnetic resonance imaging linear accelerators (MRI-linac) may offer potential data acquisition for radiomic investigation particularly in re-RT [Citation84,Citation85]. Re-RT using SBRT may be employed for recurrent prostate cancer, wherein it is crucial to deliver ablative radiation doses to the target while limiting the dose to OARs, with the goals of tumor control and minimizing toxicity [Citation84]. MRI-linac may improve disease detection, better visualization of pelvic anatomy and adaptive planning accounting for target size and shape, as well as organs at risk (OARs) [Citation85]. MRI as an imaging modality may provide biologic data to predict tumor response [Citation86].

It is anticipated that these features will increase in utilization with the aim of becoming routine in clinical practice and clinical trials [Citation87]. To take advantage of emerging re-RT data, feature extraction across data sets must be possible to allow for predictive power [Citation88].

3.5. Limitations for the acquisition and validation of biospecimens

Patients who receive re-RT often do so in the context of palliative care after many interventions, therefore tissue and biospecimens are rarely available for molecular analysis [Citation89–95]. The ability to measure markers in serum as opposed to tissue in such scenarios would allow for realistic data acquisition leading to more robust computational studies and should become the norm in re-RT trials. As re-RT becomes more common and more data emerges, we anticipate that potential biomarkers identified in small data sets can attempt to be validated via computational analyses of open-access, publicly available platforms such as The Cancer Genome Atlas Program (TCGA),Chinese Glioma Genome Atlas (CGGA) and The International Cancer Genome Consortium (ICGC) for more meaningful results [Citation96–98]. For such attempts to be successful, biomarkers found in re-RT small data sets can be contextualized with and compared to data from other radiation datasets and the administration of radiation in general, including in upfront management. Gene Vector for Each Sample (GVES) has been shown to be a reliable machine learning model to identify prognostic genes in small data sets by limiting the effect of sample size, especially for pancreatic cancer [Citation99]. Computational methods such as GVES are being developed to limit the effects of small sample sizes for the development of biomarkers, but many more approaches noted in need to be explored and compared in individual re-RT sets. An ongoing challenge with AI systems in clinical care relates to limitations of its interpretability, potentially improved with robust peer-reviewed metrics [Citation100]. Given that novel treatments in systemic and radiation therapy are likely to be based on small data sets, computational learning needs to be leveraged and vetted for quality and interpretability in this setting.

4. Conclusions

Our study demonstrates a paucity of re-RT datasets overall. Given that there is no anticipated improvement in re-RT clinical scenarios with respect to robustly collected data on larger numbers of patients, we understand that data sets will continue to reflect heterogeneity, data missingness, and bias. However, we also anticipate a rapid growth in the number of features or variables that will become available in re-RT datasets, including with respect to omic data and radiation therapy dosimetry data complemented by imaging. As a result, we identified three areas of need: 1) It is critical to generate adaptive frameworks for working with imperfect data and small data sets in re-RT, and these need to be available for analysis in public repositories; 2) Biospecimen, such as serum, needs to be collected systematically on patients undergoing re-RT via clinical trials, and when possible, natural history protocols that gather, annotate, and collect specimens on patients receiving re-RT need wide implementation across all tumor sites; 3) The ability to aggregate data to analyze predictors of tumor response to re-RT will help to better identify the ideal patient populations suitable for re-RT in varied disease sites and AI methods need to be continuously explored, compared and vetted for interpretability in re-RT. Addressing these will further help oncologists and patients to make informed decisions about treatment approaches that balance potential toxicity from treatment with improvements in outcome and will lead to superior understanding of radiation biology and signaling pathways that can be targeted to improve outcomes.

5. Expert opinion

Despite increasing indications for re-RT, available data in this space remain small and heterogeneous rendering patient selection challenging. Identifying predictive and prognostic biomarkers for treatment response may aid in ideal patient selection and computational methods may be leveraged to maximally harness small data sets. The implementation of computational tools in this space is realistic given evolving methods for the organizational structure of RT data and an onus on secure data sharing and federated learning.

Key challenges in this area continue to be identifying optimal methods for data collection and curation, a lack of widespread tissue and biospecimen acquisition, limited phase III clinical trials, access to publicly available data, and innovative computational approaches that allow the generation of transferable clinical conclusions. Small data sets will continue to play an important role, particularly in rare diseases or conditions that disproportionately impact-specific patient populations based on demographics and thus both the collection of robust data sets and machine learning methods will need to evolve to ensure breakthroughs in evaluating small data sets.

Data can be augmented with enhanced use of objective treatment responses to re-RT, patient-reported outcomes (PRO) and health-related quality of life (HRQOL) which are essential components of oncologic care, particularly in the re-RT setting. Methods will need to be introduced to capture and mine data from patient reports, ideally digitizing paper questionnaires, and moving away from siloed data. The use of imaging, biological, and dosimetric data remains a critical tool in the work-up and treatment of patients undergoing re-RT. Methods to improve serum, as opposed to tissue markers, may provide more accurate information in the setting of re-RT. Ideally, serum collection will become routine as re-RT clinical trials increase. Health inequities exist within clinical trial enrollment, with certain demographics less represented than others. It is imperative to improve participant recruitment and enrollment in clinical trials. In the setting of re-RT, avenues to enhance data sharing may mitigate limitations present with retrospective reviews and single-institution experiences.

Advances have been made in the most common re-RT disease entities: head and neck and CNS, and in particular, glioma. Validated biomarkers include HPV and PD-L1 for select head and neck squamous cell carcinomas. Several efforts have been made to identify various biomarkers, radiologic features, and dosimetric predictors with research ongoing.

Re-RT data is defined by wide dimensionality which can be limiting. However, this aspect can also be favorably harnessed using computational approaches that aim at leveraging data acquisition, cross-validation, feature selection, and learning methods.

As the use of re-RT continues to grow, enhanced use of computational aids by collecting and structuring data for analysis is at the forefront of progress. It is anticipated that federated learning and innovation in this space as the field advances may provide a template for transferable advancement in other areas of medicine. There should be continued encouragement for multidisciplinary review to homogenize management and identify optimal patient selection with the byproduct of robust structured data.

Article highlights

  • Given prolonged survival for certain malignancies and technological improvements of radiation delivery, the indications for re-RT have been increasing.

  • Most re-RT data are limited to retrospective studies or single-institution experiences.

  • Given the limitations of small data sets for re-RT patients, computational methods may be used to enhance existing data to arrive at clinically actionable biomarkers.

  • Limitations to clinically use biomarkers include validation and identifying the optimal method to collect specimens for molecular analysis.

Abbreviations

CNS=

Central Nervous System

CGGA=

Chinese Glioma Genome Atlas

CV=

Cross-Validation

DB-MTD=

Distance-Based Mega-Trend-Diffusion

EPV=

Events per Predictor Variable

GA=

Genetic Algorithms

GVES=

Gene Vector for Each Sample

GyE=

Gray equivalent

HSRT=

Hypofractionated stereotactic radiation therapy

IMIT=

Intensity Modulated Ion Therapy

IMPT=

Intensity Modulated Proton Therapy

IMRT=

Intensity Modulated Radiation Therapy

KPS=

Karnofsky Performance Score

LASSO=

Least Absolute Shrinkage and Selection Operator

MEVT=

- Modified Extreme Value Theory

mRMR=

Maximum Relevance Minimum Redundancy

OS=

Overall Survival

P=

- Prospective

PFS=

Progression-Free Survival

R=

Retrospective

RFE=

Recursive Feature Elimination

RBE=

Relative Biological Effectiveness

Re-RT=

Reirradiation

S=

Systematic review

SRS=

Stereotactic radiosurgery

TCGA=

The Cancer Genome Atlas

Declaration of interests

The authors have no relevant affiliations or financial involvement with any organization or entity with a financial interest in or financial conflict with the subject matter or materials discussed in the manuscript. This includes employment, consultancies, honoraria, stock ownership or options, expert testimony, grants or patents received or pending, or royalties.

Reviewer disclosures

Peer reviewers on this manuscript have no relevant financial or other relationships to disclose.

Author contributions

All authors have substantially contributed to the conception and design of the review article and interpreting the relevant literature, and have been involved in writing the review article or revised it for intellectual content. All authors have read and agreed to the published version of the manuscript.

Additional information

Funding

This paper was funded in part by the NCI NIH intramural program (ZID BC 010990).

References

  • K AV. Web of science search. 2023 [cited 2023 Jul 10]; search engine. Available from: https://www.webofscience.com/
  • Jiang H, Yu K, Cui Y, et al. Combination of immunotherapy and radiotherapy for recurrent malignant gliomas: results from a prospective study. Front Immunol. 2021;12. doi: 10.3389/fimmu.2021.632547
  • Navarria P, Pessina F, Clerici E, et al. Re-irradiation for recurrent high grade glioma (HGG) patients: results of a single arm prospective phase 2 study. Radiother Oncol. 2022;167:89–96. doi: 10.1016/j.radonc.2021.12.019
  • Ren X, Fu Y, Liu Z, et al. Image-guided interstitial brachytherapy for recurrent cervical cancer after radiotherapy: a single institution experience. Front Oncol. 2022;12:943703. doi: 10.3389/fonc.2022.943703
  • Eekers DBP, Roelofs E, Jelen U, et al. Benefit of particle therapy in re-irradiation of head and neck patients. Results of a multicentric in silico ROCOCO trial. Radiother Oncol. 2016;121(3):387–394. doi: 10.1016/j.radonc.2016.08.020
  • Shirai K, Ohno T, Saitoh J-I, et al. Prospective study of Isolated recurrent tumor re-irradiation with carbon-Ion beams. Front Oncol. 2019;9:9. doi: 10.3389/fonc.2019.00181
  • Held T, Tessonnier T, Franke H, et al. Ways to unravel the clinical potential of carbon ions for head and neck cancer reirradiation: dosimetric comparison and local failure pattern analysis as part of the prospective randomized CARE trial. Radiat Oncol. 2022;17(1):121. doi: 10.1186/s13014-022-02093-4
  • Navarria P, Minniti G, Clerici E, et al. Re-irradiation for recurrent glioma: outcome evaluation, toxicity and prognostic factors assessment. A multicenter study of the Radiation Oncology Italian Association (AIRO). J Neurooncol. 2019;142(1):59–67. doi: 10.1007/s11060-018-03059-x
  • Kessel KA, Hesse J, Straube C, et al. Validation of an established prognostic score after re-irradiation of recurrent glioma. Acta Oncol. 2017;56(3):422–426. doi: 10.1080/0284186X.2016.1276621
  • Kessel KA, Hesse J, Straube C, et al. Modification and optimization of an established prognostic score after re-irradiation of recurrent glioma. PLoS One. 2017;12(7):e0180457. doi: 10.1371/journal.pone.0180457
  • Kulinich DP, Sheppard JP, Nguyen T, et al. Radiotherapy versus combination radiotherapy-bevacizumab for the treatment of recurrent high-grade glioma: a systematic review. Acta Neurochir (Wien). 2021;163(7):1921–1934. doi: 10.1007/s00701-021-04794-3
  • Huang RD, Sun Z, Wang X-H, et al. Development of a Comorbidity-Based Nomogram to predict survival after salvage reirradiation of locally recurrent nasopharyngeal carcinoma in the Intensity-Modulated Radiotherapy Era. Front Oncol. 2020;10:625184. doi: 10.3389/fonc.2020.625184
  • Hu J, Huang Q, Gao J, et al. Clinical outcomes of carbon-ion radiotherapy for patients with locoregionally recurrent nasopharyngeal carcinoma. Cancer. 2020 Dec 1;126(23):5173–5183. doi: 10.1002/cncr.33197. Epub 2020 Sep 15. PMID: 32931035; PMCID: PMC7693227.
  • Liu L-T, Chen Q-Y, Tang L-Q, et al. With or without reirradiation in advanced local recurrent nasopharyngeal carcinoma: a case–control study. BMC Cancer. 2016;16(1):774. doi: 10.1186/s12885-016-2803-2
  • D’Agostino GR, Di Brina L, Mancosu P, et al. Reirradiation of locally recurrent prostate cancer with volumetric modulated arc therapy. Int J Radiat Oncol Biol Phys. 2019;104(3):614–621. doi: 10.1016/j.ijrobp.2019.02.041
  • Taillez A, Bimbai A-M, Lacornerie T, et al. Studies of intra-fraction prostate motion during stereotactic irradiation in first irradiation and Re-irradiation. Front Oncol. 2021;11:690422. doi: 10.3389/fonc.2021.690422
  • Chung SY, Koom WS, Keum KC, et al. Treatment outcomes of Re-irradiation in locoregionally recurrent rectal cancer and clinical significance of proper patient selection. Front Oncol. 2019;9:9. doi: 10.3389/fonc.2019.00529
  • Shirai K, Ohno T, Saitoh J-I, et al. Prospective study of Isolated Recurrent tumor Re-irradiation with Carbon-Ion Beams. Front Oncol. 2019;9:181. doi: 10.3389/fonc.2019.00181
  • Sutera P, Bernard ME, Wang H, et al. Stereotactic body radiation therapy for locally progressive and recurrent pancreatic cancer after prior radiation. Front Oncol. 2018;8:52. doi: 10.3389/fonc.2018.00052
  • Huang R-D, Sun Z, Wang X-H, et al. Development of a Comorbidity-Based Nomogram to predict survival after salvage reirradiation of locally recurrent nasopharyngeal carcinoma in the Intensity-Modulated Radiotherapy Era. Front Oncol. 2021;10:10. doi: 10.3389/fonc.2020.625184
  • Zhao L, Fong AHW, Liu N, et al. Molecular subtyping of nasopharyngeal carcinoma (NPC) and a microRNA-based prognostic model for distant metastasis. J Biomed Sci. 2018;25(1):16. doi: 10.1186/s12929-018-0417-5
  • Chen Y-P, Chan ATC, Le Q-T, et al. Nasopharyngeal carcinoma. Lancet. 2019;394(10192):64–80. doi: 10.1016/S0140-6736(19)30956-0
  • Lauko A, Lo A, Ahluwalia MS, et al. Cancer cell heterogeneity & plasticity in glioblastoma and brain tumors. Semin Cancer Biol. 2022;82:162–175. doi: 10.1016/j.semcancer.2021.02.014
  • Ding R-B, Chen P, Rajendran BK, et al. Molecular landscape and subtype-specific therapeutic response of nasopharyngeal carcinoma revealed by integrative pharmacogenomics. Nat Commun. 2021;12(1):3046. doi: 10.1038/s41467-021-23379-3
  • Whitfield BT, Huse JT. Classification of adult-type diffuse gliomas: impact of the World Health Organization 2021 update. Brain Pathol. 2022;32(4):e13062. doi: 10.1111/bpa.13062
  • Minniti G, Niyazi M, Alongi F, et al. Current status and recent advances in reirradiation of glioblastoma. Radiat Oncol. 2021;16(1):36. doi: 10.1186/s13014-021-01767-9
  • Ng WT, Soong YL, Ahn YC, et al. International recommendations on reirradiation by intensity modulated radiation therapy for locally recurrent nasopharyngeal carcinoma. Int J Radiat Oncol Biol Phys. 2021;110(3):682–695. doi: 10.1016/j.ijrobp.2021.01.041
  • Chia S, Norris B, Speers C, et al. Human epidermal growth factor receptor 2 overexpression as a prognostic factor in a large tissue microarray series of node-negative breast cancers. J Clin Oncol. 2008;26(35):5697–704. doi: 10.1200/JCO.2007.15.8659
  • Patani N, Martin LA, Dowsett M. Biomarkers for the clinical management of breast cancer: international perspective. Int J Cancer. 2013;133(1):1–13. doi: 10.1002/ijc.27997
  • Schröder FH, Hugosson J, Roobol MJ, et al. Screening and prostate-cancer mortality in a randomized European study. N Engl J Med. 2009;360(13):1320–8. doi: 10.1056/NEJMoa0810084
  • Wayant C, Page MJ, Vassar M. Evaluation of Reproducible Research Practices in oncology systematic reviews with meta-analyses referenced by national comprehensive cancer network guidelines. JAMA Oncol. 2019;5(11):1550–1555.
  • Chen EY, Raghunathan V, Prasad V. An overview of cancer drugs approved by the US Food and Drug Administration Based on the surrogate end point of response rate. JAMA Intern Med. 2019;179(7):915–921. doi: 10.1001/jamainternmed.2019.0583
  • Bhasin MK, Ndebele K, Bucur O, et al. Meta-analysis of transcriptome data identifies a novel 5-gene pancreatic adenocarcinoma classifier. Oncotarget. 2016;7(17):23263–23281. doi: 10.18632/oncotarget.8139
  • Huang B, Huang H, Zhang S, et al. Artificial intelligence in pancreatic cancer. Theranostics. 2022;12(16):6931–6954. doi: 10.7150/thno.77949
  • Tsai TI, Zhang Y, Zhang Z, et al. Considering relationship of proteins for radiotherapy prognosis of bladder cancer cells in small data set. Methods Inf Med. 2018;57(4):220–229. doi: 10.3414/ME17-02-0003
  • Chao GY, Tsai T-I, Lu T-J, et al. A new approach to prediction of radiotherapy of bladder cancer cells in small dataset analysis. Expert Syst Appl. 2011;38(7):7963–7969. doi: 10.1016/j.eswa.2010.12.035
  • Krause J, Grabsch HI, Kloor M, et al. Deep learning detects genetic alterations in cancer histology generated by adversarial networks. J Pathol. 2021;254(1):70–79. doi: 10.1002/path.5638
  • Ubaldi L, Valenti V, Borgese RF, et al. Strategies to develop radiomics and machine learning models for lung cancer stage and histology prediction using small data samples. Phys Med. 2021;90:13–22. doi: 10.1016/j.ejmp.2021.08.015
  • Wodzinski M, Banzato T, Atzori M, et al. Training deep neural networks for small and highly heterogeneous MRI datasets for cancer grading. Annu Int Conf IEEE Eng Med Biol Soc. 2020;2020:1758–1761.
  • Zhu XL, Shen H-B, Sun H, et al. Improving segmentation and classification of renal tumors in small sample 3D CT images using transfer learning with convolutional neural networks. Int J Comput Assist Radiol Surg. 2022;17(7):1303–1311. doi: 10.1007/s11548-022-02587-2
  • Li DC, Liu CW, Hu SC. A fuzzy-based data transformation for feature extraction to increase classification performance with small medical data sets. Artif Intell Med. 2011;52(1):45–52. doi: 10.1016/j.artmed.2011.02.001
  • Paul R, Hawkins S, Balagurunathan Y, et al. Deep feature transfer learning in combination with traditional features predicts survival among patients with lung adenocarcinoma. Tomography. 2016;2(4):388–395. doi: 10.18383/j.tom.2016.00211
  • Usman M, Zia T, Tariq A. Analyzing transfer learning of vision transformers for interpreting chest radiography. J Digit Imaging. 2022;35(6):1445–1462. doi: 10.1007/s10278-022-00666-z
  • Huang R-X, Zhou P-K. DNA damage response signaling pathways and targets for radiotherapy sensitization in cancer. Signal Transduct Ther. 2020;5(1):60. doi: 10.1038/s41392-020-0150-x
  • Concato J, Peduzzi P, Holford TR, et al. Importance of events per independent variable in proportional hazards analysis. I. Background, goals, and general strategy. J Clin Epidemiol. 1995;48(12):1495–1501. doi: 10.1016/0895-4356(95)00510-2
  • Stanley K. Design of randomized controlled trials. Circulation. 2007;115(9):1164–1169. doi: 10.1161/CIRCULATIONAHA.105.594945
  • Shaikhina T, Khovanova NA. Handling limited datasets with neural networks in medical applications: a small-data approach. Artif Intell Med. 2017;75:51–63. doi: 10.1016/j.artmed.2016.12.003
  • Tasci E, Zhuge Y, Camphausen K, et al. Bias and class imbalance in oncologic data—towards inclusive and transferrable AI in large scale oncology data sets. Cancers (Basel). 2022;14(12):2897.
  • Candemir S, Nguyen XV, Folio LR, et al. Training strategies for radiology deep learning models in data-limited scenarios. Radiol. 2021;3(6):e210014. doi: 10.1148/ryai.2021210014
  • Lu J, Gong P, Ye J, et al. A survey on machine learning from few samples. Pattern Recognition. 2023;139:109480. doi: 10.1016/j.patcog.2023.109480
  • Roh Y, Heo G, Whang SE. A survey on data collection for machine learning: a big data-ai integration perspective. IEEE Trans Knowledge Data Eng. 2019;33(4):1328–1347. doi: 10.1109/TKDE.2019.2946162
  • Vabalas A, Gowen E, Poliakoff E, et al. Machine learning algorithm validation with a limited sample size. PLoS One. 2019;14(11):e0224365. doi: 10.1371/journal.pone.0224365
  • Lin LS, Hu SC, Lin YS, et al. A new approach to generating virtual samples to enhance classification accuracy with small data-a case of bladder cancer. Math Biosci Eng. 2022;19(6):6204–6233. doi: 10.3934/mbe.2022290
  • Hastie T, Tibshirani R, Friedman JH, et al. The elements of statistical learning: data mining, inference, and prediction. Vol. 2. Stanford, California: Springer; 2009.
  • Arlot S, Celisse A. A survey of cross-validation procedures for model selection. Statistics Surveys. 2010;4(none). doi: 10.1214/09-SS054
  • Tasci E, Jagasia S, Zhuge Y, et al. RadWise: a rank-based hybrid feature weighting and selection method for proteomic categorization of chemoirradiation in patients with Glioblastoma. Cancers (Basel). 2023;15(10):2672. doi: 10.3390/cancers15102672
  • Tasci E, Zhuge Y, Kaur H, et al. Hierarchical voting-based feature selection and ensemble learning model scheme for glioma grading with clinical and molecular characteristics. Int J Mol Sci. 2022;23(22):14155. doi: 10.3390/ijms232214155
  • Guyon I, Elisseeff A. An introduction to variable and feature selection. J Mach Learn Res. 2003;3(Mar):1157–1182.
  • Gokalp O, Tasci E, Ugur A. A novel wrapper feature selection algorithm based on iterated greedy metaheuristic for sentiment classification. Expert Syst Appl. 2020;146:113176. doi: 10.1016/j.eswa.2020.113176
  • Muthukrishnan R, Rohini R. LASSO: a feature selection technique in predictive modeling for machine learning. In 2016 IEEE international conference on advances in computer applications (ICACA); Coimbatore, India. 2016. IEEE.
  • Sagi O, Rokach L. Ensemble learning: a survey. Wiley Interdisciplinary Reviews: data mining and knowledge discovery. 2018;8(4):e1249.
  • Tasci E, Uluturk C, Ugur A. A voting-based ensemble deep learning method focusing on image augmentation and preprocessing variations for tuberculosis detection. Neural Comput Appl. 2021;33(22):15541–15555. doi: 10.1007/s00521-021-06177-2
  • Pan SJ, Yang Q. A survey on transfer learning. IEEE Trans Knowledge Data Eng. 2009;22(10):1345–1359. doi: 10.1109/TKDE.2009.191
  • Van Engelen JE, Hoos HH. A survey on semi-supervised learning. Mach Learn. 2020;109(2):373–440. doi: 10.1007/s10994-019-05855-6
  • Wang Y, Yao Q, Kwok JT, et al. Generalizing from a few examples: a survey on few-shot learning. ACM Comput Surveys. 2020;53(3):1–34. doi: 10.1145/3386252
  • Kairouz P, McMahan HB, Avent B, et al. Advances and open problems in federated learning. Found Trends Mach Learn. 2021;14(1–2):1–210. doi: 10.1561/2200000083
  • Ozaki S, Kaji S, Nawa K, et al. Training of deep cross-modality conversion models with a small data set, and their application in megavoltage CT to kilovoltage CT conversion. Med Phys. 2022;49(6):3769–3782. doi: 10.1002/mp.15626
  • Vittinghoff E, McCulloch CE. Relaxing the Rule of Ten Events per variable in logistic and cox regression. Am J Epidemiol. 2006;165(6):710–718. doi: 10.1093/aje/kwk052
  • Peduzzi P, Concato J, Kemper E, et al. A simulation study of the number of events per variable in logistic regression analysis. J Clin Epidemiol. 1996;49(12):1373–1379. doi: 10.1016/S0895-4356(96)00236-3
  • Riley RD, Snell KIE, Ensor J, et al. Minimum sample size for developing a multivariable prediction model: part I – Continuous outcomes. Stat Med. 2019;38(7):1262–1275. doi: 10.1002/sim.7993
  • Riley RD, Snell KI, Ensor J, et al. Minimum sample size for developing a multivariable prediction model: part II - binary and time-to-event outcomes. Stat Med. 2019;38(7):1276–1296. doi: 10.1002/sim.7992
  • van Smeden M, de Groot JAH, Moons KGM, et al. No rationale for 1 variable per 10 events criterion for binary logistic regression analysis. BMC Med Res Methodol. 2016;16(1):163. doi: 10.1186/s12874-016-0267-3
  • Bellman RE. Adaptive control processes. Princeton: Princeton University Press; 1961.
  • Chen L. Curse of dimensionality. In: Liu L ÖZsu MT, editors. Encyclopedia of Database Systems. Boston (MA): Springer US; 2009. p. 545–546.
  • Gorban AN, Tyukin IY. Blessing of dimensionality: mathematical foundations of the statistical physics of data. Philos Trans A Math Phys Eng Sci. 2018;376(2118):20170237. doi: 10.1098/rsta.2017.0237
  • Anderson J, Goyal N, Rademacher L, et al. The more, the merrier: the blessing of dimensionality for learning large gaussian mixtures. J Mach Learn Res. 2013;35.
  • Mariano D. Classificação de indivíduos com base em duas características. 2022 Jun 3 [cited 2023 Jul 11]; Feature learning for classification of groups. Available from: https://commons.wikimedia.org/wiki/File:Aprendizagem_n%C3%A3o_supervisionada.png
  • Kathuria A. Intro to optimization in deep learning: gradient descent. 2023. 2018 [cited 2023 Jul 11]; Blessing of dimentionality. Goal of neural networks to reach global minima. Available from: https://blog.paperspace.com/intro-to-optimization-in-deep-learning-gradient-descent/
  • Fgpacini. Diagram Of The Feature Learning Paradigm In Machine Learning For Application To Downstream Tasks. 2022 Oct 8 [cited 2023 Jul 11]; Diagram which explains the motivation and use of feature learning. Available from: https://commons.wikimedia.org/wiki/File:Feature_Learning_Diagram.png
  • Pestov V. Is the k-NN classifier in high dimensions affected by the curse of dimensionality? Comput Math Appl. 2013;65(10):1427–1437. doi: 10.1016/j.camwa.2012.09.011
  • Berisha V, Krantsevich C, Hahn PR, et al. Digital medicine and the curse of dimensionality. NPJ Digit Med. 2021;4(1):153. doi: 10.1038/s41746-021-00521-5
  • Pirrone G, Matrone F, Chiovati P, et al. Predicting local failure after partial prostate Re-irradiation using a dosiomic-based machine learning Model. JPM. 2022;12(9):1491. doi: 10.3390/jpm12091491
  • Carles M, Popp I, Starke MM, et al. FET-PET radiomics in recurrent glioblastoma: prognostic value for outcome after re-irradiation? Radiat Oncol. 2021;16(1):46. doi: 10.1186/s13014-020-01744-8
  • Boldrini L, Romano A, Chiloiro G, et al. Magnetic resonance guided SBRT reirradiation in locally recurrent prostate cancer: a multicentric retrospective analysis. (1748-717X (electronic)). 2023.
  • Cuccia F, Rigo M, Figlia V, et al. 1.5T MR-Guided Daily Adaptive Stereotactic Body Radiotherapy for Prostate Re-Irradiation: A Preliminary Report of Toxicity and Clinical Outcomes. Front Oncol. 2022 Apr 13;12:858740.
  • Ng J, Gregucci F, Pennell RT, et al. MRI-LINAC: A transformative technology in radiation oncology. Front Oncol. 2023 Jan 27;13:1117874
  • Lohmann P, Franceschi E, Vollmuth P, et al. Radiomics in neuro-oncological clinical trials. Lancet Digital Health. 2022;4(11):e841–e849. doi: 10.1016/S2589-7500(22)00144-3
  • Tran T, Luo W, Phung D, et al. A framework for feature extraction from hospital medical data with applications in risk prediction. BMC Bioinf. 2014;15(1):425. doi: 10.1186/s12859-014-0425-8
  • Lok BH, Jiang G, Gutiontov S, et al. Palliative head and neck radiotherapy with the RTOG 8502 regimen for incurable primary or metastatic cancers. Oral Oncol. 2015;51(10):957–62. doi: 10.1016/j.oraloncology.2015.07.011
  • Corry J, Peters LJ, Costa ID, et al. The ‘QUAD SHOT’—a phase II study of palliative radiotherapy for incurable head and neck cancer. Radiother Oncol. 2005;77(2):137–142. doi: 10.1016/j.radonc.2005.10.008
  • Gamez ME, AGARWAL M, HU KS, et al. Hypofractionated palliative radiotherapy with concurrent radiosensitizing chemotherapy for advanced head and neck cancer using the “QUAD-SHOT regimen”. Anticancer Res. 2017;37(2):685–691. doi: 10.21873/anticanres.11364
  • Toya R, Saito T, Yamaguchi K, et al. Hypofractionated palliative volumetric modulated arc radiotherapy with the Radiation Oncology study group 8502 “QUAD shot” regimen for incurable head and neck cancer. Radiat Oncol. 2020;15(1):123. doi: 10.1186/s13014-020-01548-w
  • Kamran SC, Harshman LC, Bhagwat MS, et al. Characterization of efficacy and toxicity after high-dose pelvic reirradiation with palliative intent for genitourinary second malignant neoplasms or local recurrences after full-dose radiation therapy in the pelvis: a high-volume cancer center experience. Adv Radiat Oncol. 2017;2(2):140–147. doi: 10.1016/j.adro.2017.01.001
  • Guren MG, Undseth C, Rekstad BL, et al. Reirradiation of locally recurrent rectal cancer: a systematic review. Radiother Oncol. 2014;113(2):151–157. doi: 10.1016/j.radonc.2014.11.021
  • Drodge CS, Ghosh S, Fairchild A. Thoracic reirradiation for lung cancer: a literature review and practical guide. Ann Palliat Med. 2014;3(2):75–91. doi: 10.3978/j.issn.2224-5820.2014.03.04
  • Zhao Z, Zhang K-N, Wang Q, et al. Chinese glioma genome atlas (CGGA): a comprehensive resource with functional genomic data from Chinese glioma patients. Int J Genomics Proteomics. 2021;19(1):1–12. doi: 10.1016/j.gpb.2020.10.005
  • Hudson TJ, Anderson W, Aretz A, et al. International network of cancer genome projects. Nature. 2010;464(7291):993–998.
  • Hutter C, Zenklusen JC. The cancer genome atlas: creating lasting value beyond its data. Cell. 2018;173(2):283–285. doi: 10.1016/j.cell.2018.03.042
  • Ko S, Choi J, Ahn J. GVES: machine learning model for identification of prognostic genes with a small dataset. Sci Rep. 2021;11(1):439. doi: 10.1038/s41598-020-79889-5
  • Kelly CJ, Karthikesalingam A, Suleyman M, et al. Key challenges for delivering clinical impact with artificial intelligence. BMC Med. 2019;17(1):195. doi: 10.1186/s12916-019-1426-2