270
Views
0
CrossRef citations to date
0
Altmetric
Clinical Research Article

Sudden gains in routine clinical care: application of a permutation test for trauma-focused cognitive behavioural therapy

Progresos súbitos en la atención clínica de rutina: la aplicación de una prueba de permutación para la terapia cognitiva conductual enfocada en trauma

ORCID Icon, , , , , , & show all
Article: 2335796 | Received 20 Oct 2023, Accepted 16 Mar 2024, Published online: 17 Apr 2024

ABSTRACT

Background: Sudden gains, defined as large and stable improvements of psychopathological symptoms, are a ubiquitous phenomenon in psychotherapy. They have been shown to occur across several clinical contexts and to be associated with better short-term and long-term treatment outcome. However, the approach of sudden gains has been criticized for its tautological character: sudden gains are included in the computation of treatment outcomes, ultimately resulting in a circular conclusion. Furthermore, some authors criticize sudden gains as merely being random fluctuations.

Objective: Use of efficient methods to evaluate whether the amount of sudden gains in a given sample lies above chance level.

Method: We used permutation tests in a sample of 85 patients with posttraumatic stress disorder (PTSD) treated with trauma-focused cognitive behaviour therapy in routine clinical care. Scores of self-reported PTSD symptom severity were permuted 10.000 times within sessions and between participants to receive a random distribution.

Results: Altogether, 18 participants showed a total of 24 sudden gains within the first 20 sessions. The permutation test yielded that the frequency of sudden gains was not beyond chance level. No significant predictors of sudden gains were identified and sudden gains in general were not predictive of treatment outcome. However, subjects with early sudden gains had a significantly lower symptom severity after treatment.

Conclusions: Our data suggest that a significant proportion of sudden gains are due to chance. Further research is needed on the differential effects of early and late sudden gains.

HIGHLIGHTS

  • Treatment-related sudden gains exhibit clinical significance when their manifestation is above chance level.

  • We used permutation tests to examine their occurrence in trauma-focused cognitive behaviour therapy as applied in a naturalistic treatment setting.

  • The occurrence of sudden gains in general was not significantly higher than chance, yet early sudden gains were associated with improved treatment outcome.

Antecedentes: Los progresos súbitos, definidos como mejorías amplias y estables en los síntomas psicopatológicos, son un fenómeno ubicuo en las psicoterapias. Se ha demostrado que suceden en diferentes contextos clínicos y que están asociados a mejores respuestas al tratamiento al corto y al largo plazo. Sin embargo, el abordaje de los progresos súbitos ha sido criticado por su carácter tautológico: Los progresos súbitos están incluidos en el cálculo de los resultados de los tratamientos, lo que resulta en una conclusión circular. Además, algunos autores realizan la crítica que los progresos súbitos no son sino meras fluctuaciones debidas al azar.

Objetivo: Usar métodos eficientes para evaluar si la cantidad de progresión súbita en una muestra determinada está por encima del punto de corte atribuido al azar.

Método: Empleamos pruebas de permutación en una muestra de 85 pacientes con el diagnóstico del trastorno de estrés postraumático (TEPT) en tratamiento con terapia cognitiva conductual centrada en el trauma en la atención clínica de rutina. Los puntajes de severidad de las escalas clínicas autorreportadas para TEPT fueron permutadas 10.000 veces entre las sesiones y entre los participantes para recibir una distribución aleatoria.

Resultados: En conjunto, 18 participantes mostraron un total de 24 progresos súbitos dentro de las primeras 20 sesiones. Las pruebas de permutación mostraron que la frecuencia de los progresos súbitos no se encontraba más allá del punto de corte atribuido al azar. Se identificaron predictores no significativos de progreso súbito, y el progreso súbito en general no fue predictor del resultado del tratamiento. Sin embargo, los sujetos con progresos súbitos tempranos obtuvieron significativamente una menor severidad en los síntomas luego del tratamiento.

Conclusiones: La información sugiere que una proporción significativa de progresos súbitos son debidas al azar. Se necesita más investigación en la diferenciación de los efectos de los progresos súbitos tempranos y tardíos.

1. Introduction

A wealth of clinical research has focused on sudden gains (SGs) that have been defined as large and stable reductions of psychopathological symptoms (Shalom & Aderka, Citation2020). Initially, SGs were defined by Tang and DeRubeis (Citation1999) as large reductions in symptoms, which fulfil the criteria of being: 1) large in absolute magnitude, 2) large in relation to the previous symptom score (reduction of at least 25%) and 3) stable in relation to symptom fluctuation (scores in the three sessions after the gain should be lower than the three before the gain). Over several years, clinical research has investigated SGs in several treatment contexts. They have been reported to be a ubiquitous phenomenon in psychotherapy that is significantly associated with treatment outcome (Shalom & Aderka, Citation2020). However, the mechanisms of its occurrence still remain poorly understood. A better understanding of the nature of this phenomenon and its relationship to change during treatment may help us improve psychological interventions.

Initially, and following Tang and DeRubeis (Citation1999), SGs in cognitive behavioural therapy (CBT) were assumed to be the result of cognitive change. The authors found significant cognitive changes preceding SGs and postulated that cognitive changes triggered SGs (Tang & DeRubeis, Citation1999) This finding was replicated by Tang et al. (Citation2005) and complemented by a more recent study showing decreases of negative appraisals shortly before SGs in patients with posttraumatic stress disorder(PTSD) (Wiedemann et al., Citation2020). However, several studies seem to contradict the cognitive change hypothesis of sudden gains (Bohn et al., Citation2013; Hofmann et al., Citation2006; Hunnicutt-Ferguson et al., Citation2012; Vincent & Norton, Citation2019; Vittengl et al., Citation2005). Importantly, if cognitive changes were the main mechanism leading to SGs, one would expect treatments like CBT to produce more SGs relative to treatments not directly focusing on modifying cognitions. Although an earlier meta-analysis had indicated higher rates of SGs in CBT compared to other treatments (Aderka et al., Citation2012), a more recent meta-analysis (Shalom & Aderka, Citation2020) showed that the type of treatment had no effect on the occurrence of SGs. Furthermore, SGs do not seem to be easily explainable by extratherapeutic factors (e.g. positive and negative life events) or as a result of regression to the mean (see Shalom et al., Citation2018, for an overview).

Importantly, some authors have argued that SGs are the result of random processes rather than significant mechanisms of change and therefore do not mark relevant points in the therapeutic process. With the use of Monte Carlo data simulation techniques, Thomas and Persons (Citation2013) and Vittengl et al. (Citation2015) concluded that the amount of SGs reported in the literature was comparable to the one in simulated data sets in which random errors were combined with either gradual linear or gradual curvilinear decreases of symptoms. Similarly, Persons (Citation2022) argued as a result of simulation studies that SGs also occur under gradual patterns of change. Other approaches have proposed explanations for SGs in terms of extratherapeutic factors or regression to the mean, neither of which seems promising (see Shalom et al., Citation2018, for an overview).

Furthermore, SGs appear to have significant effects on other relevant factors including increases in therapeutic alliance (Lutz et al., Citation2013; Wucherpfennig et al., Citation2017), changes in coping skills (e.g. patients’ experiences of clarification of meaning and problem solving; Wucherpfennig et al., Citation2017), as well as changes in negative emotional states associated with PTSD (Kuck et al., Citation2023).

Recently, Aderka and Shalom (Citation2021) proposed a revised theory of sudden gains based on the notion that SGs can be predicted by a measure of intra-individual variability (IV; Shalom et al., Citation2018; Shalom et al., Citation2020). The authors defined IV as the variability of a subject’s weekly symptom score, computed an index of this fluctuation and found it to correlate with the occurrence of SGs, independently of the change during treatment. However, this finding has not been replicated in another study and the described findings on intra-individual variability do not appear to generalize easily across treatments and disorders (Kuck et al., Citation2023). Explanations may be that more chronic psychopathology (as found in the study by Kuck et al., Citation2023) leads to less spontaneous variability or that there are other moderators to consider that exert an influence on variability.

In summary, there is no strong empirically supported explanation for the phenomenon of SGs or what predicts them (see Aderka & Shalom, Citation2021; Persons, Citation2022; Thomas & Persons, Citation2013; Vittengl et al., Citation2015). One possible way to advance the current understanding is to consider that a certain amount of SGs marks meaningful points of change during treatment while others might appear at random. This probabilistic focus promises to resolve some inconsistencies observed to date. In this context, reliable methods need to be tested and established to distinguish true and meaningful effects from randomly occurring SGs. In a recent approach by Lorenzo-Luaces et al. (Citation2020), permutation tests were used to address this issue. By re-analysing previous data from cognitive and interpersonal therapy for depression, the authors showed that SGs occurred above and beyond chance level and that the relationship between SGs and better treatment outcomes was unlikely to occur by chance (Lemmens et al., Citation2016). The conducted permutation test is a robust non-parametric approach to evaluate the significance of SGs: Observed scores are shuffled (or ‘permuted’) within treatment sessions across 10,000 iterations while preserving original means and standard deviations. By comparing SG counts in the simulated datasets to those in the observed data, a p-value can be calculated that indicates the likelihood of the observed amount of SGs under the null hypothesis. For instance, in a hypothetical sample with 40% of subjects having an SG, if in 6,000 random re-samples among 10,000 permutations 40% of subjects show a sudden gain, the resulting p-value is .60 (6,000/10,000). This would suggest that SGs occurred at a chance level in the observed data. In sum, the permutation test provides us with a concise and statistically informed evaluation of an observed frequency of SGs in a treatment sample.

The findings by Lorenzo-Luaces et al. (Citation2020) need to be replicated in populations with different disorders. As described, we still lack a coherent explanation for SGs and it remains to be examined whether the mechanisms of SGs are similar and the proposed method is applicable to individuals with other disorders or in a more naturalistic context. To the best of our knowledge, the work at hand is the first application of this form of permutation test in trauma-focused CBT and the first replication of the work by Lorenzo-Luaces et al. (Citation2020). By demonstrating what proportion of SGs is to be expected by chance in the given data set, findings can guide further research on the use of adequate tests for the frequency of SGs. Importantly, this can shed light on the meaning of SGs for psychological treatments.

More recently, emphasis has also been placed on the distinction between ‘early’ and ‘late’ SGs in terms of when they occur in the therapeutic process. Sudden gains occurring earlier in treatment (mostly defined as occurring before the first third to half of a treatment) appear to be more closely connected to treatment outcomes than late gains (Busch et al., Citation2006) and subjects with early gains seem to have larger improvements and shorter treatment lengths (Lutz et al., Citation2013; Stiles et al., Citation2003). Interestingly, this parallels findings of early treatment responders having better treatment outcomes than late responders in general (Haas et al., Citation2002; Santor & Segal, Citation2001). This focus on the timing of SGs also appears promising for a more advanced understanding of the phenomenon and therefore constitutes a second aim of our study. Specifically, we expect that patients with early SGs show a lower symptom severity after the treatment. In addition, in an exploratory analysis, we will test whether SGs occurring in the trauma-focused phase of the treatment (which is expected to contain the specific mechanism of the treatment) are associated with a larger treatment effect.

2. Method

2.1. Participants

Participants were recruited from February 2014 to April 2016 at the outpatient centres of the University of Münster, Germany, and the Otto Selz Institute at the University of Mannheim, Germany. The data was collected for an effectiveness-study for trauma focused CBT in routine clinical care (for details see Krüger-Gottschalk et al., Citationin preparation). A screening was conducted for all patients referred to the outpatient clinics and eligibility for the study was assessed via the Structured Clinical Interview for DSM-IV [SCID-IV, Wittchen et al., Citation1997]. Inclusion criteria were a primary diagnosis of PTSD (assessed with the CAPS-5; Weathers et al., Citation2018) and an age above 17 years. Exclusion criteria comprised acute suicidality, psychotic disorders, current substance dependence and a BMI < 17.5. Eligible patients provided written informed consent and the study was approved by the local ethics committees. In total, N = 85 patients (68 female; mean age = 35.84, SD = 12.85) were included, see for demographic and clinical characteristics.

Table 1. Demographic and clinical characteristics of the sample at baseline.

2.2. Treatment

The applied treatment was a modularized trauma-focused cognitive behavioural therapy, consisting of a preparation phase, a trauma-focused phase (comprising imaginal exposure, discrimination training, changing dysfunctional appraisals) and a phase of reclaiming-your-life assignments and relapse prevention. Prior to treatment, screening sessions were held in which the structured clinical interviews were conducted (see Krüger-Gottschalk et al., Citationin preparation, for more information). Treatment was conducted by registered CBT therapists or therapists in advanced postgraduate training under regular supervision. To reflect representative routine care conditions, treatment length was not determined a priori and therapists were allowed to apply the manual in a flexible and personalized way. On average, patients received M = 37.22 (SD = 20.15; range: 1–80) sessions (duration per session: 50 min) over a mean number of weeks of M = 56.04 (SD = 26.92), which is representative for the German health care system (granting up to 80 sessions for CBT treatment). A frequency of one session per week was aspired (mean length of between-session interval = 8.3 days, SD = 27.72) with longer intervals at the end of treatments to grant successful application of strategies for relapse prevention. Shortly before each treatment session, the process measures were completed. The main assessments were conducted before treatment, after completion and in follow-up measurements of 3, 6 and 12, and 24 months.

2.3. Measures

The main assessments were conducted prior to and following treatment and at follow-up assessments 3, 6, 12, and 24 months after the intervention. Before each weekly treatment session and at the main assessments, patients completed the self-report questionnaires (i.e. ‘process measures’).

The severity of PTSD symptoms in the last four weeks was measured with the German version (Müller-Engelmann et al., Citation2020) of the Clinician-Administered PTSD scale for DSM-5 (CAPS-5) at every main assessment. This structured clinical interview shows excellent reliability and validity and is widely used in PTSD-research (Weathers et al., Citation2018). Assessments were conducted by the therapists at pre, post- and the follow-up assessments.

At each treatment session and at all main assessments, the severity of post-traumatic symptoms was measured with the self-report PTSD Checklist for DSM-5 (PCL-5; Weathers et al., Citation2013; German version: Krüger-Gottschalk et al., Citation2017). In this 20-item self-report-instrument patients rate the distress caused by PTSD symptoms (from 0 – ‘not at all’ to 4 – ‘extremely’) in the past month (adapted here for the last seven days for the session-by-session assessments). The PCL-5 shows good psychometric properties and is a widely used measure in clinical studies of PTSD (Morrison et al., Citation2021).

For the assessment of depressive symptoms, the German version of the Beck Depression-Inventory-II (Beck et al., Citation1996; Hautzinger et al., Citation2006) was conducted, which has shown acceptable to strong psychometric properties (Kühner et al., Citation2007).

2.4. Statistical analysis

All data analysis was performed in RStudio (RStudio Team, Citation2020) using R (Version 4.1.1; R Core Team, Citation2020). Detection and processing of SGs was done with the R package ‘sudden gains’ (Wiedemann et al., Citation2019). Linear mixed models (LMM) were computed with the R Package lme (Pinheiro et al., Citation2017) employing the maximum-likelihood estimator. The significance criterion was set at α = .05 for all analyses.

Identification of sudden gains and sudden losses. For the detection of SGs and sudden losses in the weekly PCL-5-scores, the three criteria proposed by Tang and DeRubeis (Citation1999) were considered and applied in analogy to Krüger et al. (Citation2014) and Wiedemann et al. (Citation2020). Firstly, a change between session N (pre-gain session) and N + 1 (post-gain session) had to be large in absolute terms. This was defined as exceeding the standard error of the difference SED=2SEM2 from the reliable change index (RCI; Jacobson & Truax, Citation1991), which is based on the standard error of measurement SEM=SD1rxx. Values for the standard deviation (SD = 19.99) and the re-test reliability (rxx = .91) were drawn from a German clinical sample of trauma-exposed adults (Krüger-Gottschalk et al., Citation2017). This resulted in an SED of 8.5 points on the PCL-5, meaning a session-to-session improvement had to exceed this value to be considered as a sudden gain/ sudden loss. Secondly, the between session change had to be large relative to the symptom severity before the gain, defined as reaching at least 25% of the pregain symptom severity. Third, changes had to be stable, meaning being large relative to symptom fluctuation before and after the gain. This was fulfilled when the mean of the three PCL-5-scores before the gain was significantly smaller/greater than the mean of the three sessions following the gain/loss. Specifically, the difference between the means had to exceed the two-tailed t statistic (which used the pooled standard deviation of the three sessions before and after the gain). The critical values were adjusted for missing values. If only two sessions were available, these were taken for the computation.

As is common practice in the field, only the first sudden gain/loss per subject was included in the further analyses (on sg-prediction and the outcome-sg interaction). This ensures statistical independence of the SGs and allows the focus of analyses to be on the first major change in symptoms. The occurrence of SGs was exploratively examined in the entire sample and the frequency of SGs after session 20 will also be reported. Between-session intervals of more than 13 days were excluded for this analysis and only the first 20 sessions were analysed. This provided an adequate data quality for the permutation test and grants comparability to the method applied by Lorenzo-Luaces et al. (Citation2020) who used data from a study with a protocol of 16–20 therapy sessions. Although analysing only the first 20 sessions in our sample (total range: 1–80) can be considered as a limitation, the mean number of conducted sessions was considerably lower than the possible maximum of sessions (M = 37.22; SD = 20.15), meaning that a considerable amount of therapeutic changes can be assumed to have taken place within the analysed sessions. The outcome was defined as the score at the end of treatment in the naturalistic setting.

Prediction of sudden gains, sudden losses and relationship with treatment outcome. For the prediction of sudden gains status (yes vs. no) in the weekly PCL-5-scores, univariate logistic regression analyses were conducted with demographic variables (age, gender) and pre-treatment clinical variables (total number of axis I diagnoses, comorbid anxiety disorder, comorbid mood disorder, months since main trauma, type of trauma, BDI-II score, comorbid personality disorder, history of substance dependence, suicidal ideation, past suicide attempt, childhood abuse, pre-treatment CAPS-5 score) as predictors.

To test whether subjects with SGs showed a lower symptom severity after the treatment, a linear mixed model [lmm] for the CAPS-5 sum scores was fitted with random intercepts for patients and time (pretreatment/ post-treatment), sudden gain status (yes/ no) and their interaction as fixed effects (model 1). To control for pre-treatment symptom severity, this was added to the model as a covariate. To gain insights on the timing of SGs with relation to the treatment phase, the first model was repeated considering only SGs occurring in the trauma focused phase (model 2). Another modified version of the first lmm was conducted to test whether early SGs differ from all SGs with respect to their effect on the treatment outcome (model 3). The model contained random intercepts for patients and fixed effects for an adopted sudden gain status variable (1 = early gain/ 0 = no early gain), the effect of time (pretreatment/ post-treatment) and their interaction on the CAPS-5-sum-scores. The median session of the occurrence of all SGs was computed and SGs were classified as early SGs if they occurred before this median session. The effect sizes for the effect of SGs on the treatment outcome were computed using the Cohen's d statistic (Cohen, Citation1988).

The described logistic regression models for the prediction of SGs and the lmm for the analysis of the relationship with the treatment outcome (model 1) were repeated in the same way for sudden losses.

Permutation tests. To assess whether the observed frequency of SGs in the weekly PCL-5-sum-scores in the current sample was higher than expected by chance, permutation tests were performed as suggested in Lorenzo-Luaces et al. (Citation2020). The R-script used by the authors is accessible online (https://osf.io/d7rg2/). First, 10.000 data sets were created by randomly shuffling (or ‘randomizing’) the order of the original PCL-5 scores from the first 22 weeks of the treatment within each session (for exemplary symptom trajectories see figure B2). Thus, the data input for the permutation test yielded two additional sessions (21 and 22) to allow the detection of SGs within the first 20 sessions, as the third SG criterion (stability) always considers the two previous sessions before a potential SG. In this way, the SG detection in the original data (based on 20 sessions) was paralleled. With this permutation procedure, the means and standard deviations per session (across all participants) and the total symptom improvement across all participants was kept the same as in the original data. No autocorrelation between symptoms in the permuted data sets was expected, meaning that the session to session pattern of change was random for these scores.

Second, for each of the resulting 10.000 data sets, a sudden gains analysis was performed as described above for the original data. This gives a frequency distribution of SGs (under randomness) against which the found frequency of the real data set can be tested.

Third, the permutation test was conducted by comparing the SGs frequency of the original data against the obtained distribution from the permuted values. The achieved p-value is the proportion of the 10.000 permuted data sets, which contain equal or less SGs than the original data set.

3. Results

Information on the efficacy of the treatment and descriptive statistics will be reported in detail elsewhere (see Krüger-Gottschalk et al., Citationin preparation).

Characteristics and timing of sudden gains and sudden losses. After having applied the criteria for sudden gains and losses, the analysis was based on 1616 between-session intervals for the whole treatment and 772 intervals within the first 20 treatment sessions (for complete data see ). Twenty-four SGs were detected in total within the first 20 sessions.Footnote1 Additional explorative analyses revealed five SGs and two sudden losses (both reversed later) during the screening sessions and five SGs after session twenty. There was no significant difference between study completers and dropouts concerning the number of experienced SGs, χ2(1) = 0.42, p = .52. The average symptom change around SGs can be found in (and see Figure B3 for example trajectories).

Figure 1. Average symptom change around sudden gains on the PCL-5. Note. The sudden gain takes place between the pregain session N and the postgain session N + 1.

Figure 1. Average symptom change around sudden gains on the PCL-5. Note. The sudden gain takes place between the pregain session N and the postgain session N + 1.

Table 2. Characteristics of sudden gains and sudden losses.

Prediction of sudden gains, sudden losses and relationship with treatment outcome. None of the investigated predictors of sudden gain or sudden loss status (yes vs. no) reached significance in the univariate logistic regression models (all ps > .05; see ). Subjects showed large effect sizes (according to Cohen, Citation1988) on the change of CAPS-5 scores from pre- to post-treatment (d = 2.23; see and Krüger-Gottschalk et al., Citationin preparation, for details on the treatment’s effectiveness).

In all three linear mixed models investigating differences between subjects with and without SGs on the CAPS-5 scores, there was a significant main effect of time, with lower scores after the treatment (all ps < .05, for complete values see ). In the first lmm (model 1), the interaction of time*sg-status was non-significant, F(1,53) = 0.36, d = 0.6, p = .55. This interaction remained non-significant in model 2, when considering only SGs occurring in the trauma-focused phase, F(1,53) = 0.03, d = 0.19, p = .86. However, when considering only early SGs (model 3), the time*sg-status-interaction was significant, F(1,53) = 4.97, d = 2.23, p = .03, showing that subjects with early SGs (M = 4.33, SD = 2.16) showed lower CAPS-5-scores at post-treatment than all other subjects (M = 12.37, SD = 11.76), t(45.43) = −4.3034, p < .001. To test for possible effects of SGs at later time points, model 1 was repeated with CAPS-5 scores at pre and follow-up 1. Again, the time*sg-status-interaction was non-significant, F(1,8) = 0.13, d = 0.36, p = .73. Similarly, the occurrence of sudden losses was not associated with treatment outcome, as the time*sl-status-interaction was non-significant, F(1,53) = 0.65, d = 0.81, p = .42. In means and standard deviations before and after treatment are reported separately for subjects with and without SGs .

Figure 2. Means of PTSD symptom scores in the ITT sample at pre, post, follow-up 1, and follow-up 2. Note. N = 85. Cohen’s d: dCAPS-5 = 2.23, dPCL-5 = 1.86.

Figure 2. Means of PTSD symptom scores in the ITT sample at pre, post, follow-up 1, and follow-up 2. Note. N = 85. Cohen’s d: dCAPS-5 = 2.23, dPCL-5 = 1.86.

Table 3. Fixed effects of linear mixed models with time, sudden gains, and time*sudden gain interaction as predictors of CAPS-5 score.

Table 4. Mean (SD) of PTSD symptom severity of the intent-to-treat-sample on the CAPS-5 before and after treatment for patients with and without sudden gains.

Permutation test of sudden gains. The application of the SG criteria on the 10.000 permuted data sets revealed an average of 17.75 (SD = 3.93; range: 4–35) SGs across an average of 16.37 (SD = 3.49; range: 4–31) patients. In a considerable amount of the permuted data sets (36.60%) equal or more patients than in the original data set experienced SGs (see ), suggesting that the number of patients with SGs in the original data is not greater than expected by chance (p = .37). Noteworthy, the total amount of SGs (across patients) in the original data showed a trend to be higher than chance level, as only 7.54% of the permuted data sets contain the same or a higher number of SGs (p = .08; see ). Three example trajectories each of the original and permuted data sets can be found in figure B2. Replicating the pattern in the original data, subjects with SGs (M = 11.55, SD = 3.04) did not differ from subjects without SGs (M = 11.52, SD = 0.79) with respect to the treatment outcome on the CAPS-5 (t(11348) = −0.91, p = .36).

Figure 3. Distribution of number of patients with sudden gains in 10,000 permuted datasets with randomized PCL-5 scores within sessions. Note. Black line: Mode of the distribution. Red dotted line: Number of patients with sudden gains in the original dataset.

Figure 3. Distribution of number of patients with sudden gains in 10,000 permuted datasets with randomized PCL-5 scores within sessions. Note. Black line: Mode of the distribution. Red dotted line: Number of patients with sudden gains in the original dataset.

Figure 4. Distribution of total numbers of sudden gains in 10,000 permuted datasets with randomized PCL-5 scores within session. Note. Black line: Mode of the distribution. Red dotted line: Number of sudden gains in the original dataset.

Figure 4. Distribution of total numbers of sudden gains in 10,000 permuted datasets with randomized PCL-5 scores within session. Note. Black line: Mode of the distribution. Red dotted line: Number of sudden gains in the original dataset.

4. Discussion

We examined SGs in routine clinical care in a PTSD sample and provided an example of a successful application of a permutation test to evaluate the frequency of SGs. Less than one fourth of the subjects reported a sudden gain in the examined treatment period. As judged by the permutation test, this frequency was not higher than would be expected by chance.

A total of 29 SGs were reported by 18 participants (out of n = 85), with five SGs after session 20. The majority of SGs occurred during the phase of active trauma processing. Together with other evidence on the timing of SGs in relation to treatment manuals (Kuck et al., Citation2023), this suggests that SGs are connected to specific treatment mechanisms. In addition, no predictors of SGs or sudden losses were identified. This finding is in line with the literature on SGs (see Shalom & Aderka, Citation2020) and suggests that SGs are not connected in a simple fashion to pre-treatment patient characteristics.

Importantly, as judged by the permutation test the number of patients with SGs was not higher than expected by chance, while we found a trend for significance for the sum of SGs observed across participants. This result has to be integrated with the finding of an increased incidence of SGs during the trauma-focused treatment phase (vs. in the phases of preparation and reclaiming-your-life assignments) and findings from other studies (e.g. Kuck et al., Citation2023; Wiedemann et al., Citation2020), which connect SGs to specific behaviour change processes. We posit that during behaviour change processes, a certain amount of symptom reduction is categorized as SGs, with a portion of these categorizations occurring randomly.

Other possible explanations for the results of the permutation test are that the data or the methods used in our study differed significantly from other trials (resulting in a lower frequency of SGs or different mechanisms leading to them). Several aspects might have contributed to the relatively low frequency of SGs in our sample: The frequency of sudden changes in routine care settings has been found to be typically lower (around 23%) than in more controlled RCTs (around 40%, see Shalom & Aderka, Citation2020). Thus, although the number of SGs found in the current study deviates from that in earlier RCTs, it is within the range of what is typically found in routine clinical care. Differences between the study designs (RCT vs. more pragmatic studies) include less standardization, longer treatments and more individualized therapies as well as higher attention to comorbid disorders in routine care. Further, the intervals between sessions also show a higher variability, which might have affected the sg frequency. Notably, the found frequency in the current study still lies within the range of 14.30–62.20% found across studies by Shalom and Aderka (Citation2020). Interestingly, the data from our sample resemble the findings on interpersonal therapy (IPT) in Lorenzo-Luaces et al. (Citation2020), where the frequency was also not higher than expected by chance. Another possible explanation is that the patterns of change or mechanisms in the occurrence of SGs differ between disorders or treatments. Of note, the permutation test represents a very conservative test for the detection of SGs and it still remains open, whether it is easily applicable to naturalististic samples. We tested this method for the first time on trauma-focused CBT for patients with PTSD (vs. CT and IPT for depression in Lorenzo-Luaces et al., Citation2020), which means that replication with other clinical samples or more controlled study designs (e.g. in RCTs) is warranted.

Contrary to a larger body of research (Shalom & Aderka, Citation2020), we found no interaction of sudden gain status with the treatment outcome, which may be due to the specific study design: The treatment manual allowed therapists, in consultation with the supervisor, to switch between therapy phases in order to adapt to the patient's current symptoms. This means that both sudden gainers and non-sudden gainers received a well-matched and individualized treatment (also reflected in the large effect size) which might have reduced the variability of usual symptom trajectories and hereby influenced the association which is usually found. Additionally, the treatment length was not determined a priori, leading to good outcomes also for the non-sg-subjects. Together with the above-mentioned low frequency of SGs might be responsible for a missing association. Note, however, that some other studies also have found no relationship between SGs and treatment outcomes (Aderka et al., Citation2012). Furthermore, lack of statistical power may explain the lack of association between SG and outcome and the failure to predict SGs in our data.

Finally, a significant association between SGs and treatment outcome was found for early SGs in our sample. To date, differences between late and early gains are not well understood yet, and hence further research to understand their implications is required. Importantly, a more consistent operationalization of ‘early’ and ‘late gains’ is needed. As for the timing of SGs, in our sample, the majority of SGs occurred relatively late in the therapeutic process (median session: 10, vs. around 5 in most studies, see Aderka et al., Citation2012). We attribute this discrepancy to the fact that RCTs often implement treatment specific interventions earlier in treatment than in our study. In contrast, in our study the most potent (trauma-focused) intervention occured on average in later sessions, starting around session 11.

The primary strength of our study is that it was the first study to date to provide an example of a permutation test on SGs in routine clinical care. Findings inform the field on characteristics of SGs in this setting and provide an example for evaluating whether an observed frequency of SGs is higher than expected by chance. The naturalistic design results in a high external validity of the findings and allows conclusions to be drawn for applied psychological treatments. Yet, the naturalistic design also represents a potential limitation of the work at hand. Particularly, the flexibility of the therapists to being able to apply important therapeutic tools at different times during treatment makes the investigation of sg more difficult. Additionally, the generalization of our findings to other countries could also be questioned, as in the German health care system a larger number of therapy sessions is granted and the length of therapies was not determined a priori. Accordingly, the number of sessions might have been higher than in many other countries. The permutation test also has important limitations. The analysis is time-consuming and requires more computing power than other methods currently used in the field (see Lorenzo-Luaces et al., Citation2020). Further, the method does not explicitly account for temporal dependencies or autocorrelation in investigated time-series data. On the other hand, it involves fewer assumptions than other methods (see Berry et al., Citation2011).

The results of this study should be interpreted with caution when drawing clinical implications. However, our findings add to the body of literature indicating that SGs are a ubiquitous phenomenon and they might be connected to the effective mechanisms of treatments when happening early in treatment. Further, the absence of a correlation between sudden losses and therapy outcomes indicates that therapists should not be concerned by a temporary worsening of symptoms during treatment. Finally, the low number of sudden losses (around 5%) and the fact that they are mostly reversed during therapy encourages an optimistic approach to the treatment manual used.

Altogether, the approach we applied in this study seems promising for enhancing our understanding of the phenomenon of SGs. Future studies should examine whether the permutation test yields different results when applied in other forms of treatment and for other psychological disorders. Our results provide evidence for the practicality of permutation tests in the field and show that caution should be exercised in interpreting SGs, as a significant proportion of findings could be due to chance. Future research on the differential effects of early and late SGs in different clinical contexts is needed.

Disclosure statement

No potential conflict of interest was reported by the author(s).

Data availability statement

We are unable to share the data publicly due to restrictions in the informed consent and ethical approval obtained for this study, which did not include the option of making the anonymised data publicly available.

Notes

1 A separate analysis of the SG-criteria showed that when only applying criterion 1 (and not two and three), 48 subjects showed a SG and when only applying criterion 1 together with criterion two or criterion three separately, 22 subjects showed a SG respectively.

References

  • Aderka, I. M., Nickerson, A., Bøe, H. J., & Hofmann, S. G. (2012). Sudden gains during psychological treatments of anxiety and depression: A meta-analysis. Journal of Consulting and Clinical Psychology, 80(1), 93–101. https://doi.org/10.1037/a0026455
  • Aderka, I. M., & Shalom, J. G. (2021). A revised theory of sudden gains in psychological treatments. Behaviour Research and Therapy, 139, 103830. https://doi.org/10.1016/j.brat.2021.103830
  • Beck, A. T., Steer, R. A., & Brown, G. K. (1996). Beck depression inventory: BDI-II (2nd ed.). Pearson. https://doi.org/10.1037/t00742-000.
  • Berry, K. J., Johnston, J. E., & Mielke, P. W. (2011). Permutation methods. Wiley Interdisciplinary Reviews: Computational Statistics, 3(6), 527–542. https://doi.org/10.1002/wics.177
  • Bohn, C., Aderka, I. M., Schreiber, F., Stangier, U., & Hofmann, S. G. (2013). Sudden gains in cognitive therapy and interpersonal therapy for social anxiety disorder. Journal of Consulting and Clinical Psychology, 81(1), 177–182. https://doi.org/10.1037/a0031198
  • Busch, A. M., Kanter, J. W., Landes, S. J., & Kohlenberg, R. J. (2006). Sudden gains and outcome: A broader temporal analysis of cognitive therapy for depression. Behavior Therapy, 37(1), 61–68. https://doi.org/10.1016/j.beth.2005.04.002
  • Cohen, J. (1988). Statistical power analysis for the behavioral sciences. Hillsdale, NJ: Erlbaum.
  • Haas, E., Hill, R. D., Lambert, M. J., & Morrell, B. (2002). Do early responders to psychotherapy maintain treatment gains? Journal of Clinical Psychology, 58(9), 1157–1172. https://doi.org/10.1002/jclp.10044
  • Hautzinger, M., Keller, F., & Kühner, C. (2006). Beck depressions-inventar (BDI-II) (Harcourt Test Services.).
  • Hofmann, S. G., Schulz, S. M., Meuret, A. E., Moscovitch, D. A., & Suvak, M. (2006). Sudden gains during therapy of social phobia. Journal of Consulting and Clinical Psychology, 74(4), 687–697. https://doi.org/10.1037/0022-006X.74.4.687
  • Hunnicutt-Ferguson, K., Hoxha, D., & Gollan, J. (2012). Exploring sudden gains in behavioral activation therapy for major depressive disorder. Behaviour Research and Therapy, 50(3), 223–230. https://doi.org/10.1016/j.brat.2012.01.005
  • Jacobson, N. S., & Truax, P. (1991). Clinical significance: A statistical approach to defining meaningful change in psychotherapy research. Journal of Consulting and Clinical Psychology, 59(1), 12–19. https://doi.org/10.1037//0022-006x.59.1.12
  • Krüger, A., Ehring, T., Priebe, K., Dyer, A. S., Steil, R., & Bohus, M. (2014). Sudden losses and sudden gains during a DBT-PTSD treatment for posttraumatic stress disorder following childhood sexual abuse. European Journal of Psychotraumatology, 5(1). https://doi.org/10.3402/ejpt.v5.24470
  • Krüger-Gottschalk, A., Knaevelsrud, C., Rau, H., Dyer, A., Schäfer, I., Schellong, J., & Ehring, T. (2017). The German version of the posttraumatic stress disorder checklist for DSM-5 (PCL-5): psychometric properties and diagnostic utility. BMC Psychiatry, 17(1), 379. https://doi.org/10.1186/s12888-017-1541-6
  • Krüger-Gottschalk, A., Kuck, S., Alpers, G., Pittig, A., Dyer, A., Morina, N., & Ehring, T. (in preparation). Effectiveness of phase-based trauma-focused cognitive behavioural therapy in a routine clinical setting [Unpublished manuscript].
  • Kuck, S., Arntz, A., Rameckers, S. A., Lee, C. W., Boterhoven de Haan, K. L., Fassbinder, E., & Morina, N. (2023). Intraindividual variability and emotional change as predictors of sudden gains in imagery rescripting and EMDR for PTSD in adult survivors of childhood abuse. Clinical Psychology & Psychotherapy. 30(5), 1029–1046. https://doi.org/10.1002/cpp.2855
  • Kühner, C., Bürger, C., Keller, F., & Hautzinger, M. (2007). Reliabilität und Validität des revidierten Beck-Depressionsinventars (BDI-II). Der Nervenarzt, 78(6), 651–656. https://doi.org/10.1007/s00115-006-2098-7
  • Lemmens, L. H., DeRubeis, R. J., Arntz, A., Peeters, F. P., & Huibers, M. J. H. (2016). Sudden gains in cognitive therapy and interpersonal psychotherapy for adult depression. Behaviour Research and Therapy, 77, 170–176. https://doi.org/10.1016/j.brat.2015.12.014
  • Lorenzo-Luaces, L., Wiedemann, M., Huibers, M. J. H., & Lemmens, L. H. (2020). A permutation test to probe the statistical significance of sudden gain frequency: An application to patterns of change in cognitive and interpersonal therapy for depression [Manuscript submitted for publication]: Department of Psychological and Brain Sciences, Indiana University Bloomington. https://doi.org/10.31234/osf.io/jbzw3.
  • Lutz, W., Ehrlich, T., Rubel, J., Hallwachs, N., Röttger, M-A, Jorasz, C., Mocanu, S., Vocks, S., Schulte, D., & Tschitsaz-Stucki, A. (2013). The ups and downs of psychotherapy: Sudden gains and sudden losses identified with session reports. Psychotherapy Research, 23(1), 14–24. https://doi.org/10.1080/10503307.2012.693837
  • Morrison, K., Su, S., Keck, M., & Beidel, D. C. (2021). Psychometric properties of the PCL-5 in a sample of first responders. Journal of Anxiety Disorders, 77, 102339. https://doi.org/10.1016/j.janxdis.2020.102339
  • Müller-Engelmann, M., Schnyder, U., Dittmann, C., Priebe, K., Bohus, M., Thome, J., Fydrich, T., Pfaltz, M. C., & Steil, R. (2020). Psychometric properties and factor structure of the German version of the clinician-administered PTSD Scale for DSM-5. Assessment, 27(6), 1128–1138. https://doi.org/10.1177/1073191118774840
  • Persons, J. (2022). Sudden Gains Can Occur in Psychotherapy Even When the Pattern of Change Is Gradual. https://doi.org/10.31234/osf.io/gchf7.
  • Pinheiro, J., Bates, D., DebRoy, S., Sarkar, D., Heisterkamp, S., van Willigen, B., & Maintainer, R. (2017). Package ‘nlme’. Linear and nonlinear mixed effects models, version, 3(1). http://cran.rapporter.net/web/packages/nlme/nlme.pdf.
  • R Core Team. (2020). R: A language and environment for statistical computing [Manual]. https://www.R-project.org/.
  • RStudio Team. (2020). RStudio: Integrated development environment for R [Manual]. http://www.rstudio.com/.
  • Santor, D. A., & Segal, Z. V. (2001). Predicting symptom return from rate of symptom reduction in cognitive–behavior therapy for depression. Cognitive Therapy and Research, 25(2), 117–135. https://doi.org/10.1023/A:1026435001271
  • Shalom, J. G., & Aderka, I. M. (2020). A meta-analysis of sudden gains in psychotherapy: Outcome and moderators. Clinical Psychology Review, 76, 101827. https://doi.org/10.1016/j.cpr.2020.101827
  • Shalom, J. G., Gilboa-Schechtman, E., Atzil-Slonim, D., Bar-Kalifa, E., Hasson-Ohayon, I., van Oppen, P., van Balkom, A. J. L. M., & Aderka, I. M. (2018). Intraindividual variability in symptoms consistently predicts sudden gains: An examination of three independent datasets. Journal of Consulting and Clinical Psychology, 86(11), 892–902. https://doi.org/10.1037/ccp0000344
  • Shalom, J. G., Strauss, A. Y., Huppert, J. D., Andersson, G., Agassi, O. D., & Aderka, I. M. (2020). Predicting sudden gains before treatment begins: An examination of pretreatment intraindividual variability in symptoms. Journal of Consulting and Clinical Psychology, 88(9), 809–817. https://doi.org/10.1037/ccp0000587
  • Stiles, W. B., Leach, C., Barkham, M., Lucock, M., Iveson, S., Shapiro, D. A., Iveson, M., & Hardy, G. E. (2003). Early sudden gains in psychotherapy under routine clinic conditions: Practice-based evidence. Journal of Consulting and Clinical Psychology, 71(1), 14–21. doi:10.1037/0022-006X.71.1.14
  • Tang, T. Z., & DeRubeis, R. J. (1999). Sudden gains and critical sessions in cognitive-behavioral therapy for depression. Journal of Consulting and Clinical Psychology, 67(6), 894–904. https://doi.org/10.1037/0022-006X.67.6.894
  • Tang, T. Z., DeRubeis, R. J., Beberman, R., & Pham, T. (2005). Cognitive changes, critical sessions, and sudden gains in cognitive-behavioral therapy for depression. Journal of Consulting and Clinical Psychology, 73(1), 168–172. https://doi.org/10.1037/0022-006X.73.1.168
  • Thomas, C., & Persons, J. B. (2013). Sudden gains can occur in psychotherapy even when the pattern of change is gradual. Clinical Psychology: Science and Practice, 20(2), 127–142. https://doi.org/10.1111/cpsp.12029
  • Vincent, L., & Norton, P. J. (2019). Predictors and consequences of sudden gains in transdiagnostic cognitive-behavioural therapy for anxiety disorders. Cognitive Behaviour Therapy, 48(4), 265–284. https://doi.org/10.1080/16506073.2018.1513557
  • Vittengl, J. R., Clark, L. A., & Jarrett, R. B. (2005). Validity of sudden gains in acute phase treatment of depression. Journal of Consulting and Clinical Psychology, 73(1), 173–182. https://doi.org/10.1037/0022-006x.73.1.173
  • Vittengl, J. R., Clark, L. A., Thase, M. E., & Jarrett, R. B. (2015). Detecting sudden gains during treatment of major depressive disorder: Cautions from a Monte Carlo Analysis. Current Psychiatry Reviews, 11(1), 19–31. https://doi.org/10.2174/1573400510666140929195441
  • Weathers, F. W., Bovin, M. J., Lee, D. J., Sloan, D. M., Schnurr, P. P., Kaloupek, D. G., Keane, T. M., & Marx, B. P. (2018). The Clinician-Administered PTSD Scale for DSM–5 (CAPS-5): Development and initial psychometric evaluation in military veterans. Psychological Assessment, 30(3), 383–395. https://doi.org/10.1037/pas0000486
  • Weathers, F. W., Litz, B. T., Keane, T. M., Palmieri, P. A., Marx, B. P., & Schnurr, P. P. (2013). The PTSD Checklist for DSM-5 (PCL-5) - LEC-5 and extended criterion A. (https://www.ptsd.va.gov/professional/assessment/documents/PCL-5_LEC_criterionA.pdf).
  • Wiedemann, M., Stott, R., Nickless, A., Beierl, E. T., Wild, J., Warnock-Parkes, E., Grey, N., Clark, D. M., & Ehlers, A. (2020). Cognitive processes associated with sudden gains in cognitive therapy for posttraumatic stress disorder in routine care. Journal of Consulting and Clinical Psychology, 88(5), 455–469. https://doi.org/10.1037/ccp0000488
  • Wiedemann, M., Thew, G. R., Stott, R., & Ehlers, A. (2019). Suddengains: An R package to identify sudden gains in longitudinal data. PLoS One, 15(3), e0230276. https://doi.org/10.1371/journal.pone.0230276
  • Wittchen, H-U, Zaudig, M., & Fydrich, T. (1997). Skid. Strukturiertes klinisches interview für DSM-IV. Achse I und II. Handanweisung. Hogrefe.
  • Wucherpfennig, F., Rubel, J. A., Hofmann, S. G., & Lutz, W. (2017). Processes of change after a sudden gain and relation to treatment outcome-evidence for an upward spiral. Journal of Consulting and Clinical Psychology, 85(12), 1199–1210. https://doi.org/10.1037/ccp0000263

Appendix

Appendix A

Table A1. Logistic regression analysis with demographic and baseline variables as predictors for sudden gain status (yes vs. no).

Appendix B

Figure B1. Timing of sudden gains: distribution of pregain sessions.

Figure B1. Timing of sudden gains: distribution of pregain sessions.

Figure B2. PTSD symptom trajectories (PCL-5) of three example participants in the original and permuted data. Note. Pregain sessions are accentuated as squares.

Figure B2. PTSD symptom trajectories (PCL-5) of three example participants in the original and permuted data. Note. Pregain sessions are accentuated as squares.

Figure B3. Distribution of missing PCL-5 values within the first 20 therapy sessions for six example PTSD symptom trajectories. Note. Red bars represent missing PCL-5 values. The upper row shows examples for patients without sudden gains whereas in the lower row, examples for patients with sudden gains are presented.

Figure B3. Distribution of missing PCL-5 values within the first 20 therapy sessions for six example PTSD symptom trajectories. Note. Red bars represent missing PCL-5 values. The upper row shows examples for patients without sudden gains whereas in the lower row, examples for patients with sudden gains are presented.