506
Views
0
CrossRef citations to date
0
Altmetric
Clinical Research Article

The German PCL-5: evaluating structural validity in a large-scale sample of the general German population

El PCL-5 alemán: evaluación de la validez estructural en una muestra a gran escala de la población general alemana

ORCID Icon, ORCID Icon, ORCID Icon & ORCID Icon
Article: 2317055 | Received 13 Jul 2023, Accepted 19 Jan 2024, Published online: 21 Feb 2024

ABSTRACT

Background: In attempts to elucidate PTSD, recent factor analytic studies resulted in complex models with a proliferating number of factors that lack psychometrical and clinical utility. Recently, suggestions have been made to optimize factor analytic practices to meet a refined set of statistical and psychometric criteria.

Objective: This study aims to assess the factorial structure of the German version of the PCL-5, implementing recent methodological advancements to address the risk of overfitting models. In doing so we diverge from traditional factor analytical research on PTSD.

Method: On a large-scale sample of the German general population (n = 1625), exploratory factor analyses were run to investigate the dimensionality found within the data. Subsequently, we validated and compared all model suggestions from our preliminary analyses plus all standard and common alternative PTSD factor models (including the ICD-11 model) from previous literature with confirmatory factor analyses. We not only consider model fit indices based on WLSMV estimation but also deploy criteria such as favouring less complex models with a parsimonious number of factors, sufficient items per factor, low inter-factor correlations and number of model misspecifications.

Results: All tested models showed adequate to excellent fit in respect to traditional model fit indices; however, models with two or more factors increasingly failed to meet other statistical and psychometric criteria.

Conclusion: Based on the results we favour a two-factor bifactor model with a strong general PTSD factor and two less dominant specific factors – one factor with trauma-related symptoms (re-experiencing and avoidance) and one factor with global psychological symptoms (describing the trauma’s higher-order impact on mood, cognition, behaviour and arousal).

From the perspective of clinical utility, we recommend the cut-off scoring method for the German version of the PCL-5. Basic psychometric properties and scale characteristics are provided.

HIGHLIGHTS

  • We contribute new insights to the debate on the factor structure of the PTSD Checklist (PCL-5) based on a large German general population sample deploying the newest methodological developments in a revised factor-analytical approach.

  • Combining theoretical, statistical and practical considerations, we favour a two-factor bifactor model with a strong general PTSD factor and two less dominant specific factors – one factor with trauma-related symptoms and one factor with global psychological symptoms.

  • For clinical practitioners, we recommend using the cut-off scoring method.

Antecedentes: En un intento por dilucidar el Trastorno de Estrés Postraumático (TEPT), estudios analíticos factoriales recientes dieron como resultado modelos complejos con un número creciente de factores carentes de utilidad psicométrica y clínica. Recientemente, se han hecho sugerencias para optimizar las prácticas de análisis factorial para cumplir con un conjunto depurado de criterios estadísticos y psicométricos.

Objetivo: Este estudio apunta a evaluar la estructura factorial de la versión alemana del PCL-5, implementando avances metodológicos recientes para abordar el riesgo de sobreajuste de modelos. Al hacerlo, nos apartamos de la investigación analítica factorial tradicional sobre el TEPT.

Método: En una muestra a gran escala de la población general alemana (n = 1.625), se realizaron análisis factoriales exploratorios para investigar la dimensionalidad encontrada en los datos. Posteriormente, validamos y comparamos todas las sugerencias de modelos de nuestros análisis preliminares más todos los modelos factoriales de TEPT alternativos estándares y comunes (incluido el modelo CIE-11) de la literatura previa con análisis factoriales confirmatorios. No sólo consideramos índices de ajuste de modelos basados en la estimación WLSMV (mínimos cuadrados ponderados robustos con media y varianza ajustada), sino que también implementamos criterios como favorecer modelos menos complejos con un número parsimonioso de factores, suficientes elementos por factor, bajas correlaciones entre factores y un número de especificaciones erróneas del modelo.

Resultados: Todos los modelos probados mostraron un ajuste de adecuado a excelente con respecto a los índices de ajuste de modelos tradicionales; sin embargo, los modelos con dos o más factores fallaron progresivamente en cumplir con otros criterios estadísticos y psicométricos.

Conclusión: En base a los resultados, favorecemos un modelo bifactorial de dos factores con un fuerte factor de TEPT general y dos factores específicos menos dominantes: un factor con síntomas relacionados con el trauma (reexperimentación y evitación) y un factor con síntomas psicológicos globales (que describen el mayor impacto del trauma en el estado de ánimo, la cognición, el comportamiento y en el estado de alerta). Desde la perspectiva de la utilidad clínica, recomendamos el método de puntuación de corte para la versión alemana del PCL-5. Se proporcionan propiedades psicométricas básicas y características de la escala.

1. Introduction

Post-traumatic stress disorder (PTSD) is one of the most common consequences of trauma exposure, with prevalence rates in the German general population of 1–3% (Glaesmer et al., Citation2015; Hauffa et al., Citation2011; Maercker et al., Citation2008). The long-term social, physical health and economic implications of PTSD are considerable. Therefore, appropriate treatment should be administered to those with PTSD to mitigate this individual and societal cost, beginning with an accurate diagnosis. In this paper, we evaluate the psychometric properties of the German version of the PTSD Checklist (PCL) developed by Weathers et al. (Citation2013) to facilitate the diagnosis of PTSD in German-speaking areas. Specifically, the instrument's factorial structure is evaluated using the factor analytical framework by Schmitt and colleagues (Citation2018). This approach entails a combination of common confirmatory factor analytical practice (CFA) (testing all relevant PTSD models proposed in previous literature) and additional exploratory factor analyses (EFA) paired with refined strict statistical evaluation criteria that move beyond traditional model fit indices.

The PCL (Weathers et al., Citation2013) is one of the most widely used self-report questionnaires for assessing symptoms of PTSD. It was initially based on the DSM-IV criteria and has since been revised to reflect the DSM-5 criteria (PCL-5), given the existence of traumatic experience (Criterion A). Now, the 20 Items of the PCL-5 correspond to the four DSM-5 symptom clusters (American Psychiatric Association, Citation2013): re-experiencing (R; Criterion B), avoidance of trauma reminders (AV; Criterion C), negative alterations in cognitions and mood (NACM; Criterion D), and alterations in arousal and reactivity (AAR; Criterion E). The revision to the PCL-5 includes the addition of three items to capture newly added PTSD symptoms, changes to the language of certain items, a change in the response scale from 1–5 to 0–4 and providing only one general version instead of three specific ones (military members, civilians and specific events). The PCL-5 can be used in multiple ways. First, PTSD can be provisionally diagnosed based on an adequate cut-off score (cut-off scoring method). Second, given that at least one B item, one C item, two D items, and two E items at a rating of 2 (moderately) or above are endorsed, provisional PTSD can be diagnosed following the DSM-5 diagnostic rules (symptom scoring method). Third, the PCL-5 can measure symptom severity (sum score) to monitor symptom change across time and in response to interventions (Weathers et al., Citation2013).

1.1. Psychometric properties and diagnostic utilities of the PCL-5 scale

The revised PCL-5 has undergone comprehensive psychometric evaluation across different languages. A meta-analysis of Forkus et al. (Citation2022) showed satisfactory reliability across all reviewed studies (n = 64). Internal consistencies for the overall scale were high (r = .83–.97) and acceptable to good for the subscales (R: r = .57–.93; AV: r = .69–.91; NACM: r = .74–.94; AAR: r = .71–.90). Likewise retest-reliability ranged from acceptable to high (r = .58–.91). Regarding the convergent validity they found moderate to strong correlations of the PCL-5 total score with other measures of PTSD (r = .44–.89). Correlations of the total score with measures of associated symptoms and comorbid disorders were moderate: traumatic stressors (r = .12–.46), depression (r = .54–.81), anxiety (r = .56–.74), substance use (r = .12–.26), somatic symptoms (r = .50–.61), sleep (r = .51–.62), negative cognitions (r = .47–.61), and dissociation (r = .53–.72). In addition, Forkus et al. found that recommended cut-off scores in most cases ranged between 31 and 33, with a minimum of 22 and a maximum of 49. As for other diagnostic utility estimates, they reported sensitivity (.50 to 1.00), specificity (.35 to .97), positive predictive power (.38–.97), and negative predictive power (.63–1.00), also known as positive/negative predictive value.

Specifically for the German PCL-5, Krüger-Gottschalk et al. (Citation2017) conducted a first psychometric evaluation in a German clinical sample (n = 352). They found high internal consistency (α = .95), high re-test reliability (r = .91) and a high correlation with the total severity score of the Clinical-Administered PTSD Scale for DSM-5 (CAPS-5) of r = .77 – satisfying both evaluations of reliability and validity. They recommend a cut-off score between 31–33 for clinical populations in Germany. Both evaluation methods resulted in acceptable diagnostic utility estimates, with the cut-off scoring method outperforming the symptom score method (sensitivity = .84–.88, specificity = .66–.68, positive predictive power = .81, negative predictive power = .73–.77, overall efficiency = .78–.79).

1.2. Structural modelling of PTSD and the PCL-5

The factor structure of PTSD has substantial implications on the diagnostic algorithms (e.g. the symptom scoring method). Ultimately, diagnoses derived from these algorithms directly inform the treatment pathways of patients in both clinical and research settings. Subsequently, it is essential to have a good understanding of the structural validity of the PCL-5 (Forkus et al., Citation2022).

Over the years, several studies on military and civilian samples could not support the initially intended four-factor DSM-5 model (Armour et al., Citation2015; Blevins et al., Citation2015; Wortmann et al., Citation2016). Instead, various alternate models have evolved (for an overview, see ). The 4-factor Dysphoria Model (Simms et al., Citation2002) kept the re-experiencing and avoidance factors and restructured the remaining AAR and NACM cluster items to favour a more prominent dysphoria factor. This conceptualization might explain the comorbidity with distress-based disorders (Contractor et al., Citation2018). The 5-factor Dysphoric Arousal Model (Elhai et al., Citation2011) kept the first three factors but differentiated between dysphoric and anxious arousal symptoms in the AAR cluster. The 6-factor Anhedonia Model (Liu et al., Citation2014) additionally splits the NACM cluster into anhedonia and negative affect. In contrast, the Externalizing Behavior Model (Tsai et al., Citation2015) focuses on emotion dysregulation difficulties by suggesting an additional externalizing behaviours factor based on the AAR cluster. The Hybrid Model (Armour et al., Citation2015) integrates previously postulated factors within the DSM-5 conceptualization in a 7-factor model. In the aforementioned recent review (Forkus et al., Citation2022) of existing factor analytical literature, the Hybrid Model found solid empirical support. It has been shown that of 34 studies that tested the Hybrid Model, 82.3% found it to be optimal compared to alternate models. The Anhedonia and Hybrid Models performed equally well in six studies.

Table 1. Overview of Standard and Common Alternative DSM-5 Plus ICD-11 Derived PTSD Models With Symptom Mapping.

However, empirical support for the Hybrid Model remains debated due to substantial statistical and practical limitations of contemporary factor analytical approaches to PTSD. The theory-testing and post-hoc adjustment approach tends to favour increasingly complex models, raising concerns about statistical model robustness because of (1) diminished suitability of model fit indices for performance evaluation, (2) a greater prevalence of under-identified factors with fewer than three items per factor, (3) increased inter-factor correlation leading to decreased construct validity (Schmitt et al., Citation2018). Practically, the heightened complexity of the model’s structure and diagnostic algorithm diminishes its clinical utility (Silverstein et al., Citation2018). Furthermore, previous work emphasized the relatively nonspecific PTSD factors (‘negative alterations in cognitions and mood’ and ‘hyperarousal’), neglecting unique factors such as ‘re-experiencing’ and ‘avoidance’ (Krüger-Gottschalk et al., Citation2022; Rasmussen et al., Citation2019). As a solution, Schmitt et al. (Citation2018) suggest a combination of EFAs and CFAs on the same data set as a viable solution to create a model that is both theory-informed and reflective of the underlying factorial structure. They also provide detailed statistical methods and a framework to evaluate factor structures and address any potential limitations (see Method section).

Efforts to simplify the PTSD construct include reducing the number of factors, eliminating items and adopting the streamlined ICD-10 conceptualization. For instance, Ferrie et al. (Citation2022) identified a three-factor model (AN, R, NACM) as the best-fitting model through exploratory factor analysis on a general British population sample, excluding items with poor factor loadings. Brewin et al. (Citation2009) and Schellong et al. (Citation2019) suggest a three factor solution (re-experiencing, avoidance and perceived sense of threat) based on the ICD-11 through post-hoc item allocation (see ). This approach, applied to various instruments, including the Harvard Trauma Questionnaire (Hansen et al., Citation2015), the Impact of Event Scale-Revised (Hyland et al., Citation2017) and the Clinician-Administered PTSD Scale (CAPS-5) (Krüger-Gottschalk et al., Citation2022) has shown some success. Additionally, evidence of the specifically developed International Trauma Questionnaire (Ben-Ezra et al., Citation2018; Cloitre et al., Citation2013; Ho et al., Citation2020; Redican et al., Citation2021) supports the factorial validity of the ICD-11 model.

American veteran samples exhibit evidence for both a unidimensional structure (Jenkins-Guarnieri et al., Citation2023) and two- and four-bifactor models with one decisive general and several weaker specific factors (Schmitt et al., Citation2018).

In a German clinical sample (Krüger-Gottschalk et al., Citation2017) evidence for the latent factor structure of the PCL-5 remained inconclusive. Furthermore, the methodological suggestions for DSM models and the application of the ICD-11 model, as put forth by Schmitt and colleagues, have yet to be tested.

In summary, critique on contemporary factor analytical procedure questions the clarity of traditionally identified PCL-5 factorial structures. Newer attempts, exhibiting heterogeneity, alter the questionnaire or adopt different theoretical frameworks without consistent use of updated statistical methods. Previous studies often involved small, homogeneous samples—primarily white, female college students, veterans, or treatment-seeking individuals—limiting generalizability.

1.3. Research objective

To establish robust and generalizable psychometric evidence, we propose evaluating the optimal PCL-5 factor model by (1) deploying statistically rigorous methods, (2) using a large-scale, general-population sample, and (3) testing and comparing theory and empirically-derived models within a clinical utility framework. Our study aims to execute this by implementing Schmitt et al. (Citation2018) statistical algorithm in a substantial, non-clinical, population-based sample in Germany.

2. Method

2.1. Subjects

The present cross-sectional study is part of a representative survey of the general German population conducted in 2016 with the help of an independent institute for market and social research (USUMA, Berlin). Germany was separated into 258 sample areas representing the country's different regions. Within these regions, 4902 households were selected using the ‘random-route-procedure.’ A participating member of each household fulfilling the inclusion criteria (age at or above 14, able to read and understand the German language) was selected randomly via a Kish selection grid. A maximum of three attempts were made to contact the selected person in their private homes. Subjects were visited by a trained study assistant of USUMA and informed about the investigation. When they provided written informed consent the study assistant then administered self-rating paper-pencil questionnaires which were filled out privately and offered help where needed. A total of 2510 people between the ages of 14 and 91 agreed to participate, completing the self-rating questionnaires between September and November 2016 (participation rate: 52.0%, refusal to be interviewed: 30.0%, target person repeatedly unavailable: 18.0%). The final sample consisted of 1625 participants (excluding 86 underage participants, 20 subjects with missing data on all items of the PCL-5 and 779 participants without trauma exposure). Characteristics of the study sample are presented in . In our pursuit to derive scientific inferences concerning the factorial structure of the PCL-5, we used the unweighted sample, acknowledging the trade-off in sample representativeness and the consequent limitations in making statistical inferences, such as the determination of psychometric norms and prevalences. Yet the analysis sample was approximately representative regarding age (M = 48.36; SD = 18.2; Range: 18–94) and sex for the Federal Republic of Germany when compared to the microcensus of the Federal Office of Statistics. The Ethics Review Committee of the Medical Faculty of the University of Leipzig determined the proposed study to be outside their scope but expressed no ethical concerns for the survey.

Table 2. Distribution of Sociodemographic Characteristics in the Sample.

2.2. Instruments

Life-Events-Checklist 5 (LEC-5). The LEC-5 (Krüger-Gottschalk et al., Citation2017; Weathers et al., Citation2013) is a self-report measure assessing the lifetime prevalence of trauma. The German version of the LEC-5 and PCL-5 (Krüger-Gottschalk et al., Citation2017) has been computed by a professional translation and back-translation process from and to the English version. Sixteen predefined traumatic events and the option to describe any other very stressful event or experience not listed before are presented. The A-criterion of the PTSD diagnosis is endorsed when at least one of the events is answered by ‘Happened to me’, ‘Witnessed it’, ‘Learned about it’, and ‘Part of my job’. The answer options ‘Not sure’ and ‘Doesn't apply’ are not considered affirmative. Based on this metric, prevalence rates of traumatic events in the sample (including individuals without traumatic exposure, n = 2,404) are reported in the lower part of .

Post-Traumatic Disorder Checklist-5 (PCL-5). The Posttraumatic-Disorder-Checklist 5 (PCL-5; Weathers et al., Citation2013) consists of 20 items covering the DSM-5 PTSD symptoms. Symptom severity in the past month can be rated on a 5-point Likert scale from 0 (‘Not at all’), 1 (‘A little bit’), 2 (‘Moderately’), 3 (‘Quite a bit’) to 4 (‘Extremely’). Following the DSM-5 diagnostic criteria, a preliminary diagnosis can be given.

2.3. Statistical procedure and considerations

For our analyses we applied the detailed recommendations of Schmitt et al. (Citation2018) regarding the selection of appropriate estimators, the factor analysis framework, and the use of a comprehensive set of indicators to assess model fit (including model fit indices, misspecification indices, interfactor correlations, factor identification, and the number of factors). In their work, they advocate selecting the best model based on a combination of theoretical and empirical considerations and have defined an improved standard for factor structure validation.

1.

Factor Analysis Framework

We chose a combined approach of EFA, CFA and Bifactor Modelling for our factor analysis framework. We ran EFAs to gain a comprehensive understanding of the empirical data in order to determine the appropriate number of factors to extract. Supplementing CFAs with EFA helps to detect and correct misspecifications in CFAs, thereby avoiding overly complex and overfit models that may result from relying solely on confirmatory models and assuming a priori theory. We ran CFAs to test and compare the most common theoretically postulated models, the ICD-11 model and the models derived from EFAs. We then ran a bifactor model analysis based on the outcomes to test whether a unidimensional construct with facets (interpretation of the sum score with merely descriptive facets / cut-off scoring method) or a multidimensional construct (reliable and interpretable subscales / symptom scoring method) is more appropriate.

2.

Model Estimation

We used the Weighted Least Square Mean and Variance Adjusted estimation (WLSMV) and a polychoric correlation matrix if not stated otherwise in the result section. The WLSMV estimator has been proven to be the most efficient and robust estimation method for coarse-ordered categorical scaled and skewed data. Since all of the PCL-5′s items are based on an ordinal Likert scale and were highly skewed (skew statistics range between 1.57 (Item 1) and 3.02 (Item 14)), we deemed the WLSMV the most appropriate. In CFAs, we fixed all variances of latent variables to one. In EFAs, the mean and the variance were fixed at 0 and 1, respectively. For the bifactor model, we set the variances of the latent variables to 1 and the covariances between latent variables to 0 (i.e. standard bifactor model practice, Reise et al., Citation2007). Preliminary analyses to determine the appropriate number of factors to extract for the EFAs contained an evaluation of eigenvalue magnitudes, parallel analysis and model fit statistics. We used an oblique Geomin rotation to obtain a simple approximate structure with the EFA. The programme used for all statistical analyses was R (R Core Team, Citation2021). The CFAs, EFAs and the bifactor model were conducted with the lavaan package (Rosseel, Citation2012). Parallel analyses were run with the psych package (Revelle, Citation2022). Missing data have been imputed by the mice package (Buuren & Groothuis-Oudshoorn, Citation2011).

3.

Model Fit and Factor Model Selection

We conducted a comprehensive, balanced evaluation of the models that incorporated multiple model fit indices and other criteria to assess the amount of variance explained (good model fit from a statistical perspective) and the validity of the factorial structure (good model fit from a practical standpoint).

Since the χ2 statistic is known to produce statistically significant values for good fitting models when the factor structure is complex and the sample size is large, we additionally evaluated the following Approximate Fit Indices (AFI). According to Hu and Bentler (Citation1999), the Comparative Fit Index (CFI) and Tucker-Lewis Index (TLI) statistics greater than 0.90 are deemed adequate, and values greater than 0.95 are good. Root Mean Square Error of Approximation (RMSEA) values less than 0.10 and 0.06 are considered mediocre and good, respectively. Both fit indices are commonly used for maximum likelihood estimated models and must be interpreted carefully with WLSMV estimation (Sass et al., Citation2014). Robust to the estimation method is the Standardized Root Mean square Residual (SRMR), whose values are deemed adequate if less than 0.08 (Savalei, Citation2021). We employed maximum likelihood estimation to calculate the Akaike Information Criterion (AIC; Akaike, Citation1974) and Bayesian Information Criterion (BIC; Schwarz, Citation1978) to compare the relative fit of nested and non-nested models to the same data. Several indices assess the reliability, stability and dimensionality of bifactor models. Although there is no universal agreement on the standards for these bifactor indices, we provide an overview of the most common indices (Reise et al., Citation2013; Rodriguez et al., Citation2016; Schmitt et al., Citation2018). Omega (ω), as a measurement for reliability, provides coefficients for the general/hierarchical factor (ω), specific factors (ωS) and after accounting and isolating the specific factors, only the general factor (ωH) and vice versa only the specific factors (ωHS). When the ωH is large (e.g. > .80) and the ωHS are small (e.g. < .50), this indicates that the general factor is more reliable than specific factors. Explained Common Variance (ECV) values greater than .85 on the general factor (and smaller specific factor ECVs) suggest the measure is sufficiently unidimensional. Values of the Item-level Explained Common Variance (IECV) measuring between 0.5 and 1 indicate that the item reflects more of the general dimension and less of the specific dimension. If most items have IECV values above .80, this commonly implies that a unidimensional measure is preferable. Further evidence of unidimensionality is if the Percentage of Uncontaminated Correlations (PUC) is greater than .70 and the Average Relative Bias (ARB) between the general and unidimensional factor loadings is small. The Factor Determinacy index (FD) and the construct replicability (H, Hancock & Mueller, Citation2001) should be closely monitored to ensure stability; values of FD below 0.90 and H above 0.80 are ideal.

We sought to identify the most economical model that best fits the data by comparing the number of factors extracted from the models to those suggested by the combined results of parallel analysis and eigenvalues. Additionally, we monitored inter-factor correlation magnitudes to avoid collinearity, taking factor correlations greater than .85 (Carragher et al., Citation2016; Rasmussen et al., Citation2019) as a reference point for high collinearity and low discriminant validity of factors. Furthermore, we ensured that each factor contained sufficient items, with a minimum of three items per factor, to not only ensure model identification (Little et al., Citation1999) but also avoid construct underrepresentation (Kaplan & Saccuzzo, Citation2009).

4.

Missing data

After removing 20 cases in which the PCL-5 was skipped entirely, the sample consisted of n = 2404. In the remaining data set, in 4.83% of the subjects, the answers were incomplete (with a total of 0.6% missing data points throughout the whole data set). Little's missing completely at random test indicated that data was missing systematically (χ2(874) = 1263.7, p < .001). Therefore, the missing data treatment used the predictive mean matching method. After, we excluded non-trauma-exposed individuals arriving at our final sample size of n = 1625.

3. Results

3.1. Exploratory factor analysis

Preliminary analyses regarding the number of factors to extract resulted in multiple solutions. The analysis of eigenvalues based on an Oblimin rotation revealed the first three eigenvalues to be 12.278, 1.367 and 0.804  – hinting at one strong factor and potentially a second weak one. Parallel analysis based on the less robust WLS estimation suggests four factors based on the estimated (and simulated) eigenvalues of 11.89 (0.35), 0.97 (0.24), 0.38 (0.21), 0.30 (0.18), 0.17 (0.16). We interpreted the latter cautiously since parallel analysis based on WLSMV estimation is not available in R as there is a debate on its appropriateness with categorical data (yet Garrido et al., Citation2016; Weng & Cheng, Citation2017 make a case for applicability).

Hence, we ran and compared EFAs from a one- up to a four-factor solution. Comparing the resulting four different models (see ), a four-factor solution seems to result in the best fit indices. When examining the four-factor solution closely, it is evident that items loading high on factors one and three are in fact more strongly associated with factors two and four. Similarly, in the three-factor model, several items associated with factor three have high cross-loadings (> 0.30) with factors one and two. Furthermore, all multiple-factor models show high covariances (> 0.7), suggesting more than 50% of shared variance between those factors, indicating poor discriminant validity. From a practical point of view, a two-factor model with one factor reflecting trauma-related symptoms (re-experiencing and avoidance, items 1–7) and one factor reflecting the global psychological symptoms (the trauma's higher order impact on mood, cognition, behaviour and arousal, items 8–20) seems most feasible, economical, and in line with outcomes of the Eigenvalue analyses. An agreeable fit index (CFI > .90) further supports a two-dimensional solution. Yet, high inter-factor correlations and a few high cross-loadings between the two factors impose limitations.

Table 3. Weighted Least Squares Mean- and Variance Adjusted EFA and Standardized CFA Factor Loadings and Fit Indices for Multiple Factor Solutions.

It is not yet possible to rule out the possibility of a unidimensional construct. Subsequently, a CFA was conducted on the one-factor and two-factor models to examine the above findings.

3.2. Confirmatory factor analysis

A summary of model fit parameters across the five traditional models based on literature, the ICD-11 model and the two models derived from our exploratory factor analyses is displayed in . Overall, the ICD-11 model (χ2(6) = 26.79, p < .05, RMSEA = .046, CFI = .998, TLI = .995, SRMR = .015) has the best fit and has a number of factors that is both small and is closest to the indicated two to four recommended factors. Nonetheless, note that it is only composed of a subset of the PCL-5 items having less than three items per factor. The model using all of the items with the best model fit parameters is the Hybrid Model (χ2(149) = 839.21, p < .05, RMSEA = .053, CFI = .982, TLI = .977, SRMR = .032). Yet, we would like to note, the incremental improvement of the AFI in contrast to more parsimonious models (e.g. the classic DSM four-factor model) is relatively small. Instead, we find high factor covariances (>.85 for NA  – EB, AN  – EB, AN  – DA, NA  – AN, EB  – AA, EB  – DA, AA  – DA) which suggests collinearity and low discriminant validity between the factors. Additionally, four out of the seven factors are under-identified (< 3 items per factor), and the construct is quite complex and exceeds the recommended number of factors to extract by far. Based on fit indices, the two-factor model already shows a perfect fit (χ2(169) = 1676.23, p < .05, RMSEA = .074, CFI = .961, TLI = .956, SRMR = .048) while also satisfying the criteria of sufficient items per factor and a lower, more economic number of factors.

Table 4. CFA Summary  – Overview of Model Fit Indices for Each of the Previously Theorized Models and Models Derived from EFAs.

3.3. Bifactor analysis

We added a bi-factor model approach to our analysis to further explore the appropriateness of a unidimensional construct in light of (1) weak evidence for a second factor in the eigenvalue analysis, (2) high inter-factor correlations in the two-factor model and (3) consistent high item loadings of the one factor EFA solution.

The two-factor bifactor model fit the data well (χ2(150) = 1206.21, p < .001, CFI = .973, TLI = .965, RMSEA = .066, SRMR = .037) with the standardized factor loadings on the specific factors being smaller than the general factor, particularly for specific factor 1 (see ). These factor-loading results suggest that while the items relate consistently to the general model, this is not necessarily the case for the specific factors. To further evaluate the bifactor model, various bifactor indices were examined based on the standards outlined earlier (see ). The ECV (.79) met the minimum criteria (ECV > .70; Stucky & Edelen, Citation2014), with the specific factor ECVs being small. The ARB (.111) indicated a relatively small difference between the unidimensional factor solution and the general factor, yet slightly above the suggested cut-off (ARB < .10; Stucky & Edelen, Citation2014). The average IECV was .79 (SD = 0.15, minimum = 0.50, maximum = 1.00), with 65% of the IECV greater than .80, providing further support for the bifactor model. For the general and the second factor, the FD statistic exceeded .90 and the H statistic .70 (Rodriguez et al., Citation2016), which indicates trustworthiness and acceptable to perfect construct replicability. The PUC for this model was .51. The ωH and ωHS revealed high reliability for the general factor (ωH = .84) but low reliability for the specific factors ωHS. In summary, a bifactor model is more statistically appropriate than a model with two specific factors; the general factor is dominant over the specific ones.

Table 5. Bifactor Models and their Indices.

3.4. Model misspecification

An alternative to the goodness-of-fit test is to investigate whether misspecifications are present in the model using the method suggested by Saris et al. (Citation2009, p. 571). To implement the Saris-Satorra-van der Veld procedure, Jrule was used to examine the power and significance of potential cross-loadings by setting the misspecification cut-off, λ, to .20 and .40, alpha level rate to .05, and power to .80. We tested the PCL-5 models from for model misspecification at the factor loading and residual covariance levels. Results from show that an increase in the number of factors leads to a rise in model misspecifications. Our findings indicate that the one-factor and two-bifactor models are probably the most suitable. Only zero or two modifications may be necessary depending on the magnitude of λ to optimize the models. As Schmitt et al. (Citation2018) found, probably, the second dimension (related to the second eigenvalue) is likely a consequence of an item pair (items 6 and 7) exhibiting shared variance that is not associated with the factor, which is essentially a manifestation of dimensionality.

Table 6. Number of Model Misspecifications at the Factor Loading, Correlated Residual and Overall (Factor Loading plus Correlated Residuals) Levels.

Based on the results of confirmatory factor analyses and misspecification indices ( and ), a two-factor bifactor model seems statistically and practically most appropriate. Therefore, research should consider using the total PCL-5 score .

Figure 1. The Factor Model and Standardized Factor Loadings for the Two-Factor Bifactor Model.

Figure 1. The Factor Model and Standardized Factor Loadings for the Two-Factor Bifactor Model.

3.5. General psychometric properties of the PCL-5

Participants reported an average sum score of 8.19 (SD = 11.37) on the PCL-5. According to the cut-off scoring method, 115 individuals, constituting 4.78% of the total sample and 7.07% of the trauma-exposed individuals, met the DSM-5 criteria for PTSD. Further, they reported an average of 1.56 traumatic events experienced (SD = 0.49). We evaluated the psychometric properties of the PCL-5 item and scale characteristics of the uni-, two- and four-dimensional PCL-5-conception based on the total trauma-exposed sample (n = 1625). As shown in , item-total correlations are in the upper range. Internal consistency as a measure of the reliability of the scale can be considered to be very good (Cronbach's α = .94 for the total scale, α = .91 for both two-factor model subscales, α = .9 for the re-experiencing cluster, α = .82 for the avoidance cluster, α = .87 for the NACM cluster and α = .83 for the AAR cluster). Results indicate better internal consistencies for a one- or two-dimensional model.

Table 7. Psychometric Properties of the German Adaptation of the PCL-5.

4. Discussion

Our study investigates the dimensionality, psychometric properties, and conceptual determination of the PCL-5 in a large-scale population-based sample, addressing the ongoing debate about the PCL-5′s dimensionality. Regarding the general factor analytical procedure, all tested models (standard and common alternative DSM-5 models, the ICD-11 model, a unidimensional model, a two-dimensional model and the 2-bifactorial model) showed at least satisfactory to good fit indices. In particular, we found the ICD-11 and Hybrid models to fit excellently with the data. However, only the one-, two-factor models and the two-factor bifactor model included all items and additionally (1) had a number of factors supported by the combined outcomes of parallel analysis and eigenvalues, (2) showed sufficient discriminant validity between factors, (3) had sufficient items per factor and (4) a low number of model misspecifications. Based on our data, we find a 2-bifactor model to have the best overall fit. It consists of a solid general PTSD factor and two weaker rather descriptive specific factors – one containing traumatic stimuli-related behaviour (re-experiencing and avoidance) and one describing the trauma's higher-order impact on mood, cognition and arousal. It provides a model that fits well, has easily interpretable parameters and approximates reality parsimoniously (as recommended by Preacher & Merkle, Citation2012). This result advocates for the use of the cut-off scoring method. Not least, the internal consistencies of the total scale and two-factorial subscales were higher than those of the DSM model's subscales, providing further evidence to support the reduced dimensional structure of the PCL-5.

Our findings align with Schmitt et al. (Citation2018), endorsing a bifactorial model characterized by a robust general factor and two subordinate specific factors. Despite the excellent model fit exhibited by the traditional models (Hybrid (Armour et al., Citation2015), DSM Dysphoria (Simms et al., Citation2002), Dysphoric Arousal (Elhai et al., Citation2011), Anhedonia (Liu et al., Citation2014), Externalizing Behaviours (Tsai et al., Citation2015)) on our PCL-5 data, the increase in the number of factors leads to higher inter-factor correlations, increased model-misspecifications, and more underidentified factors. This undermines the models’ statistical robustness and diminishes their clinical utility through overly intricate diagnostic algorithms and less distinguishable factors. In summary, our research challenges the prevalent trend of factor over-extraction, demonstrating that simpler models provide good fit, statistical robustness, and enhanced clinical utility. In the conceptualization of the two facets, we find a resemblance to the differentiation between PTSD (core symptoms) and complex PTSD (general dysphoria and anxiety symptoms) of the ICD-11.

Even though our study is based on a large-scale population-based sample, and we have implemented a thorough and precise statistical and psychometric methodology, some shortcomings and future challenges must be mentioned. Noteably, this study refrains from reporting prevalences. We acknowledge minor deviations in the analysis sample from expected sociodemographic group norms, such as the underrepresentation of individuals older than 70 and a gender bias favouring more females in the working-age sample. However, exact representativeness is not essential for structural modelling (Rothman et al., Citation2013). Additionally, individuals residing in institutions, including older individuals with World War-related traumas and those with severe trauma leading to mental and physical impairment, are not represented in our study since data collection took place in private households. The broad definition of the A-criterion, encompassing indirect exposure (‘learned about it’), was chosen to construct a sizable sample for factor structure validation analyses, potentially inflating the base rate of trauma. The observed low rate of PTSD in the analysis sample poses a limitation to the interpretation of our findings on the factorial structure of PTSD. Yet this corresponds to the rarity of this mental disorder in the general population (Hauffa et al., Citation2011; Maercker et al., Citation2008).

The presentation of questions in questionnaire form by non-medical/psychological interviewers from a survey institute could lead to uncertain compliance from participants in providing information about their psychological experiences. Likewise, it is imperative to consider memory bias when attempting to capture events from the distant past. In standard statistical procedures, caution is recommended when conducting both exploratory and confirmatory factor analyses on a single dataset, as this practice may introduce bias by inflating fit statistics through chance discoveries in the exploratory phase. However, adhering to Schmitt et al. (Citation2018) structured approach, we advocate a judicious combination of CFAs with EFAs to pinpoint model misspecifications and gain a comprehensive understanding of empirical data. We recommend implementing the proposed two-bifactor model on independent samples in subsequent studies. Due to the lack of access to other instruments and a parallel clinical interview, general psychometric properties in this study are limited to internal consistencies. We cannot provide a validated cut-off score for the general German population. However, Krüger-Gottschalk et al. (Citation2017) describe the psychometric properties in a German clinical sample. Additionally, no statements can be made about the impact of trauma type, frequency of exposure and time passed since exposure on the factorial structure, as they have not been included in the present analysis. The factorial structure of the measure is contingent on the sample population; our findings can be generalized to the German general population and the German version of the PCL-5. However, further research is required to determine the stability of this factorial structure in independent samples, in clinical samples with confirmed PTSD diagnosis, across diverse cultural backgrounds and other language versions of the PCL-5.

In conclusion, our research demonstrates that the PCL-5 is a reliable and effective screening tool for assessing PTSD in clinical and research settings. Its economic structure with a general factor and two specific facets (trauma related symptoms and global psychological symptoms) renders the diagnostical algorithm unnecessary. Given these findings we suggest using the cut-off scoring method.

Disclosure statement

During the revision of this work, the author used ChatGPT-3.5 in order to refine language. After using this tool, the authors reviewed and edited the content as needed and take full responsibility for the content of the publication. The authors reported no potential conflict of interest.

Data availability statement

The data that support the findings of this study are available on request from the corresponding author. The data are not publicly available due to the informed consent given by the participants.

Additional information

Funding

This work is funded by the Open Access Publishing Fund of Leipzig University supported by the German Research Foundation within the program Open Access Publication Funding.

References

  • Akaike, H. (1974). A new look at the statistical model identification. IEEE Transactions on Automatic Control, 19(6), 716–723. https://doi.org/10.1109/TAC.1974.1100705
  • American Psychiatric Association. (2013). Diagnostic and statistical manual of mental disorders (5th ed.). American Psychiatric Publishing.
  • Armour, C., Tsai, J., Durham, T. A., Charak, R., Biehn, T. L., Elhai, J. D., & Pietrzak, R. H. (2015). Dimensional structure of DSM-5 posttraumatic stress symptoms: Support for a hybrid Anhedonia and Externalizing Behaviors model. Journal of Psychiatric Research, 61, 106–113. https://doi.org/10.1016/j.jpsychires.2014.10.012
  • Ben-Ezra, M., Karatzias, T., Hyland, P., Brewin, C. R., Cloitre, M., Bisson, J. I., Roberts, N. P., Lueger-Schuster, B., & Shevlin, M. (2018). Posttraumatic stress disorder (PTSD) and complex PTSD (CPTSD) as per ICD-11 proposals: A population study in Israel. Depression and Anxiety, 35(3), 264–274. https://doi.org/10.1002/da.22723
  • Blevins, C. A., Weathers, F. W., Davis, M. T., Witte, T. K., & Domino, J. L. (2015). The Posttraumatic Stress Disorder Checklist for DSM-5 (PCL-5): Development and initial psychometric evaluation. Journal of Traumatic Stress, 28(6), 489–498. https://doi.org/10.1002/jts.22059
  • Brewin, C. R., Lanius, R. A., Novac, A., Schnyder, U., & Galea, S. (2009). Reformulating PTSD for DSM-V: Life after Criterion A. Journal of Traumatic Stress, 22(5), 366–373. https://doi.org/10.1002/jts.20443
  • Buuren, S. van, & Groothuis-Oudshoorn, K. (2011). Mice: Multivariate imputation by chained equations in R. Journal of Statistical Software, 45(3), https://doi.org/10.18637/jss.v045.i03
  • Carragher, N., Sunderland, M., Batterham, P. J., Calear, A. L., Elhai, J. D., Chapman, C., & Mills, K. (2016). Discriminant validity and gender differences in DSM-5 posttraumatic stress disorder symptoms. Journal of Affective Disorders, 190, 56–67. https://doi.org/10.1016/j.jad.2015.09.071
  • Cloitre, M., Garvert, D. W., Brewin, C. R., Bryant, R. A., & Maercker, A. (2013). Evidence for proposed ICD-11 PTSD and complex PTSD: A latent profile analysis. European Journal of Psychotraumatology, 4(1), https://doi.org/10.3402/ejpt.v4i0.20706
  • Contractor, A. A., Greene, T., Dolan, M., & Elhai, J. D. (2018). Relations between PTSD and depression symptom clusters in samples differentiated by PTSD diagnostic status. Journal of Anxiety Disorders, 59, 17–26. https://doi.org/10.1016/j.janxdis.2018.08.004
  • Elhai, J. D., Biehn, T. L., Armour, C., Klopper, J. J., Frueh, B. C., & Palmieri, P. A. (2011). Evidence for a unique PTSD construct represented by PTSD’s D1-D3 symptoms. Journal of Anxiety Disorders, 25(3), 340–345. https://doi.org/10.1016/j.janxdis.2010.10.007
  • Ferrie, O., Richardson, T., Smart, T., & Ellis-Nee, C. (2022). A validation of the PCL-5 questionnaire for PTSD in primary and secondary care. Psychological Trauma: Theory, Research, Practice, and Policy, https://doi.org/10.1037/tra0001354
  • Forkus, S. R., Raudales, A. M., Rafiuddin, H. S., Weiss, N. H., Messman, B. A., & Contractor, A. A. (2022). The Posttraumatic Stress Disorder (PTSD) Checklist for DSM-5: A systematic review of existing psychometric evidence. Clinical Psychology: Science and Practice, https://doi.org/10.1037/cps0000111
  • Garrido, L. E., Abad, F. J., & Ponsoda, V. (2016). Are fit indices really fit to estimate the number of factors with categorical variables? Some cautionary findings via Monte Carlo simulation. Psychological Methods, 21(1), 93–111. https://doi.org/10.1037/met0000064
  • Glaesmer, H., Matern, B., Rief, W., Kuwert, P., & Braehler, E. (2015). Traumatisierung und posttraumatische Belastungsstörungen: Auswirkung von Art und Anzahl traumatischer Erfahrung. Nervenarzt, 86(7), 800–806. https://doi.org/10.1007/s00115-014-4235-z
  • Hancock, G. R., & Mueller, R. O. (2001). Rethinking construct reliability within latent variable systems. In R. Cudeck, S. du Toit, & D. Sörbom (Eds.), Structural equation modeling: Present and future - A Festschrift in honor of Karl Jöreskog (pp. 195–216). Scientific Software International.
  • Hansen, M., Hyland, P., Armour, C., Shevlin, M., & Elklit, A. (2015). Less is more? Assessing the validity of the ICD-11 model of PTSD across multiple trauma samples. European Journal of Psychotraumatology, 6(1), https://doi.org/10.3402/ejpt.v6.28766
  • Hauffa, R., Rief, W., Brähler, E., Martin, A., Mewes, R., & Glaesmer, H. (2011). Lifetime traumatic experiences and posttraumatic stress disorder in the German population: Results of a representative population survey. Journal of Nervous & Mental Disease, 199(12), https://doi.org/10.1097/NMD.0b013e3182392c0d
  • Ho, G. W. K., Hyland, P., Shevlin, M., Chien, W. T., Inoue, S., Yang, P. J., Chen, F. H., Chan, A. C. Y., & Karatzias, T. (2020). The validity of ICD-11 PTSD and complex PTSD in East Asian cultures: Findings with young adults from China, Hong Kong, Japan, and Taiwan. European Journal of Psychotraumatology, 11(1), https://doi.org/10.1080/20008198.2020.1717826
  • Hu, L., & Bentler, P. M. (1999). Cutoff criteria for fit indexes in covariance structure analysis: Conventional criteria versus new alternatives. Structural Equation Modeling: A Multidisciplinary Journal, 6(1), 1–55. https://doi.org/10.1080/10705519909540118
  • Hyland, P., Brewin, C. R., & Maercker, A. (2017). Predictive validity of ICD-11 PTSD as measured by the Impact of Event Scale-Revised: A 15-year prospective study of political prisoners. Journal of Traumatic Stress, 30(2), 125–132. https://doi.org/10.1002/jts.22171
  • Jenkins-Guarnieri, M., McEuin, C., Smolenski, D., Hosey, R., Macobin, B., & Prins, A. (2023). Factor Structure and psychometric performance of the PCL-5 in a clinical sample of veterans seeking treatment in a VA PTSD outpatient clinic. Psychological Assessment, 35(4), 325–338. https://doi.org/10.1037/pas0001208
  • Kaplan, R. M., & Saccuzzo, D. P. (2009). Psychological testing: Principles, applications, and issues (7th ed.). Wadsworth Cengage Learning.
  • Krüger-Gottschalk, A., Ehring, T., Knaevelsrud, C., Dyer, A., Schäfer, I., Schellong, J., Rau, H., & Köhler, K. (2022). Confirmatory factor analysis of the Clinician-Administered PTSD Scale (CAPS-5) based on DSM-5 vs. ICD-11 criteria. European Journal of Psychotraumatology, 13(1), https://doi.org/10.1080/20008198.2021.2010995
  • Krüger-Gottschalk, A., Knaevelsrud, C., Rau, H., Dyer, A., Schäfer, I., Schellong, J., & Ehring, T. (2017). The German version of the posttraumatic stress disorder checklist for DSM-5 (PCL-5): Psychometric properties and diagnostic utility. BMC Psychiatry, 17(1), 379. https://doi.org/10.1186/s12888-017-1541-6
  • Little, T. D., Lindenberger, U., & Nesselroade, J. R. (1999). On selecting indicators for multivariate measurement and modeling with latent variables: When “good” indicators are bad and “bad” indicators are good. Psychological Methods, 4(2), 192–211. https://doi.org/10.1037/1082-989X.4.2.192
  • Liu, P., Wang, L., Cao, C., Wang, R., Zhang, J., Zhang, B., Wu, Q., Zhang, H., Zhao, Z., Fan, G., & Elhai, J. D. (2014). The underlying dimensions of DSM-5 posttraumatic stress disorder symptoms in an epidemiological sample of Chinese earthquake survivors. Journal of Anxiety Disorders, 28(4), 345–351. https://doi.org/10.1016/j.janxdis.2014.03.008
  • Maercker, A., Forstmeier, S., Wagner, B., Glaesmer, H., & Brähler, E. (2008). Posttraumatische Belastungsstörungen in Deutschland. Ergebnisse einer gesamtdeutschen epidemiologischen Untersuchung. Nervenarzt, 79(5), 577–586. https://doi.org/10.1007/s00115-008-2467-5
  • Preacher, K. J., & Merkle, E. C. (2012). The problem of model selection uncertainty in structural equation modeling. Psychological Methods, 17(1), 1–14. https://doi.org/10.1037/a0026804
  • R Core Team. (2021). R: A language and environment for statistical computing (R version 4.1.1 (2021-08-10)). R Foundation for Statistical Computing. https://www.R-project.org/.
  • Rasmussen, A., Verkuilen, J., Jayawickreme, N., Wu, Z., & McCluskey, S. T. (2019). When did posttraumatic stress disorder get so many factors? Confirmatory factor models since DSM-5. Clinical Psychological Science, 7(2), 234–248. https://doi.org/10.1177/2167702618809370
  • Redican, E., Nolan, E., Hyland, P., Cloitre, M., McBride, O., Karatzias, T., Murphy, J., & Shevlin, M. (2021). A systematic literature review of factor analytic and mixture models of ICD-11 PTSD and CPTSD using the International Trauma Questionnaire. Journal of Anxiety Disorders, 79, 102381. https://doi.org/10.1016/j.janxdis.2021.102381
  • Reise, S. P., Morizot, J., & Hays, R. D. (2007). The role of the bifactor model in resolving dimensionality issues in health outcomes measures. Quality of Life Research, 16(S1), 19–31. https://doi.org/10.1007/s11136-007-9183-7
  • Reise, S. P., Scheines, R., Widaman, K. F., & Haviland, M. G. (2013). Multidimensionality and structural coefficient bias in structural equation modeling. Educational and Psychological Measurement, 73(1), 5–26. https://doi.org/10.1177/0013164412449831
  • Revelle, W. (2022). Psych: Procedures for psychological, psychometric, and personality research (R package version 2.2.9). Northwestern University. https://CRAN.R-project.org/package = psych.
  • Rodriguez, A., Reise, S. P., & Haviland, M. G. (2016). Applying bifactor statistical indices in the evaluation of psychological measures. Journal of Personality Assessment, 98(3), 223–237. https://doi.org/10.1080/00223891.2015.1089249
  • Rosseel, Y. (2012). Lavaan: An R package for structural equation modeling. Journal of Statistical Software, 48(2), https://doi.org/10.18637/jss.v048.i02
  • Rothman, K. J., Gallacher, J. E., & Hatch, E. E. (2013). Why representativeness should be avoided. International Journal of Epidemiology, 42(4), 1012–1014. https://doi.org/10.1093/ije/dys223
  • Saris, W. E., Satorra, A., & van der Veld, W. M. (2009). Testing structural equation models or detection of misspecifications? Structural Equation Modeling: A Multidisciplinary Journal, 16(4), 561–582. https://doi.org/10.1080/10705510903203433
  • Sass, D. A., Schmitt, T. A., & Marsh, H. W. (2014). Evaluating model fit with ordered categorical data within a measurement invariance framework: A comparison of estimators. Structural Equation Modeling: A Multidisciplinary Journal, 21(2), 167–180. https://doi.org/10.1080/10705511.2014.882658
  • Savalei, V. (2021). Improving fit indices in structural equation modeling with categorical data. Multivariate Behavioral Research, 56(3), 390–407. https://doi.org/10.1080/00273171.2020.1717922
  • Schellong, J., Hanschmidt, F., Ehring, T., Knaevelsrud, C., Schäfer, I., Rau, H., Dyer, A., & Krüger-Gottschalk, A. (2019). Diagnostik der PTBS im Spannungsfeld von DSM-5 und ICD-11. Der Nervenarzt, 90(7), 733–739. https://doi.org/10.1007/s00115-018-0668-0
  • Schmitt, T. A., Sass, D. A., Chappelle, W., & Thompson, W. (2018). Selecting the “best” factor structure and moving measurement validation forward: An illustration. Journal of Personality Assessment, 100(4), 345–362. https://doi.org/10.1080/00223891.2018.1449116
  • Schwarz, G. (1978). Estimating the dimension of a model. The Annals of Statistics, 6(2), https://doi.org/10.1214/aos/1176344136
  • Silverstein, M. W., Dieujuste, N., Kramer, L. B., Lee, D. J., & Weathers, F. W. (2018). Construct validation of the hybrid model of posttraumatic stress disorder: Distinctiveness of the new symptom clusters. Journal of Anxiety Disorders, 54, 17–23. https://doi.org/10.1016/j.janxdis.2017.12.003
  • Simms, L. J., Watson, D., & Doebbeling, B. N. (2002). Confirmatory factor analyses of posttraumatic stress symptoms in deployed and nondeployed veterans of the Gulf war. Journal of Abnormal Psychology, 111(4), 637–647. https://doi.org/10.1037/0021-843X.111.4.637
  • Stucky, B. D., & Edelen, M. O. (2014). Using hierarchical IRT models to create unidimensional measures from multidimensional data. In S. P. Reise & D. A. Revicki (Eds.), Handbook of item response theory modeling: Applications to typical performance assessment (pp. 183–206). Taylor & Francis.
  • Tsai, J., Harpaz-Rotem, I., Armour, C., Southwick, S. M., Krystal, J. H., & Pietrzak, R. H. (2015). Dimensional structure of DSM-5 posttraumatic stress disorder symptoms: Results from the national health and resilience in veterans study. The Journal of Clinical Psychiatry, 76(5), 546–553. https://doi.org/10.4088/JCP.14m09091
  • Weathers, F. W., Litz, B. T., Keane, T. M., Palmieri, P. A., Marx, B. P., & Schnurr, P. P. (2013). The PTSD Checklist for DSM-5 (PCL-5). https://www.ptsd.va.gov/professional/assessment/adult-sr/ptsd-checklist.asp.
  • Weng, L.-J., & Cheng, C.-P. (2017). Is categorization of random data necessary for parallel analysis on Likert-type data? Communications in Statistics - Simulation and Computation, 46(7), 5367–5377. https://doi.org/10.1080/03610918.2016.1154154
  • Wortmann, J. H., Jordan, A. H., Weathers, F. W., Resick, P. A., Dondanville, K. A., Hall-Clark, B., Foa, E. B., Young-McCaughan, S., Yarvis, J. S., Hembree, E. A., Mintz, J., Peterson, A. L., & Litz, B. T. (2016). Psychometric analysis of the PTSD checklist-5 (PCL-5) among treatment-seeking military service members. Psychological Assessment, 28(11), 1392–1403. https://doi.org/10.1037/pas0000260