key: cord-0966862-dcbe0xsi authors: Zhang, Chi; Wang, Tingting; Zeng, Ping; Zhao, Minghao; Zhang, Guifang; Zhai, Shuo; Meng, Lingbing; Wang, Yuanyuan; Liu, Deping title: Reliability, Validity, and Measurement Invariance of the General Anxiety Disorder Scale Among Chinese Medical University Students date: 2021-05-19 journal: Front Psychiatry DOI: 10.3389/fpsyt.2021.648755 sha: 81927e026dcaadf2028d988c788554158daeeb94 doc_id: 966862 cord_uid: dcbe0xsi Background: Medical students are affected by high levels of general anxiety disorder. However, few studies have specifically focused on the applicability of universal anxiety screening tools in this sample. This study was aimed to evaluate the psychometric property of the 7-item Generalized Anxiety Disorder Scale (GAD-7) among Chinese medical university students. Methods: A questionnaire survey was conducted among 1,021 medical postgraduates from six polyclinic hospitals. Internal consistency and convergent validity of the GAD-7 were evaluated. Factor analyses were used to test the construct validity of the scale. An item response theory (IRT) framework was used to estimate the parameters of each item. Multi-group confirmatory analyses and differential item function analyses were used to evaluate the measurement equivalence of the GAD-7 across age, gender, educational status, and residence. Results: Cronbach's α coefficient was 0.93 and the intraclass correlation coefficients ranged from 0.71 to 0.87. The GAD-7 summed score was significantly correlated with measures of depression symptoms, perceived stress, sleep disorders, and life satisfaction. Parallel analysis and confirmatory factor analysis supported the one-factor structure of the GAD-7. Seven items showed appropriate discrimination and difficulty parameters. The GAD-7 showed good measurement equivalence across demographic characteristics. The total test information of the scale was 22.85, but the test information within the range of mild symptoms was relatively low. Conclusions: The GAD-7 has good reliability, validity, and measurement invariance among Chinese medical postgraduate students, but its measurement precision for mild anxiety symptoms is insufficient. The prevalence of mental health disorders has increased considerably among medical students including postgraduates (1) . These students are affected by higher levels of anxiety than students who major in other disciplines (2) (3) (4) (5) as well as the general population (6, 7) . Anxiety has garnered little attention and is often undetected or undertreated in the general population. In particular, only a small number of college students undergo timely screening (8) . Generalized anxiety disorder (GAD) is the most common form of anxiety, which is characterized by excessive and persistent worry (9, 10) . Studies have shown that GAD was correlated with academic performance (11) , depression symptoms (12, 13) , sleep problems (14) , and adverse events (15) . Several systematic reviews have described high levels of general anxiety disorder among medical students in the US (16) , Canada (3), Brazil (17) , and China (18) . Anxiety is most prevalent among medical students from the Middle East and Asian countries (19) . A recent review including 10 investigation studies showed that the prevalence of anxiety among Chinese medical students is 21%, which is higher than that of students majoring in other subjects, as well as medical students from other Asian countries (20) . A cross sectional study showed that 11% of postgraduate medical residents in Bangladesh had anxiety disorders (21) . Medical university students are affected by various sources of stress, such as academia, employment, family, tutors, and a harsh health service environment. Although researchers are concerned about the prevalence of anxiety disorder among medical students, more attention should be paid to the early screening and a valid tool for GAD screening needs to be generally accepted in this sample. However, the literature regarding this specific population has been relatively insufficient. The 7-item Generalized Anxiety Disorder Scale (GAD-7) (9), recommended by the Diagnostic and Statistical Manual of Mental Disorders, Fourth Edition (DSM-IV) (22) , is a common instrument used in the screening of generalized anxiety disorders because of its simplicity and operability. The GAD-7 has been translated into different languages, including Chinese, over the last two decades (23) (24) (25) (26) . The reliability, validity, and diagnostic capability of the GAD-7 have been confirmed, but the majority of previous psychometric studies focused on clinical settings rather than general populations (12, 23, (27) (28) (29) . Although the GAD-7 has been widely used for anxiety screening among medical students (30) (31) (32) , few studies have systematically evaluated its measurement properties in this sample. Besides, previous studies have focused on the psychometric performance of the overall scale, but little attention has been paid to the characteristics or measurement invariance of individual items. The measurement equivalence is an important attribute of a screening instrument, as it ensures the comparability of measurement values across different subsamples. Therefore, it is necessary to evaluate the GAD-7 comprehensively with methodologies that combine classical test theory (CTT) and item response theory (IRT). The IRT framework test the probability of subjects' response according to particular models and then evaluates parameters of the measurement tool. These methods were originally designed to evaluate examination tools and are recently widely used to assess the suitability of health-related scales (33, 34) . This study was designed to evaluate the reliability, validity, and measurement invariance of the GAD-7 using a sample of medical university students. We also aimed to provide reasonable suggestions of its application in practice. The study participants were 1,021 full-time medical postgraduates from six polyclinic hospitals of Peking University Medical College or Peking Union Medical College. These hospitals were Beijing Hospital, the First Hospital of Peking University, the People's Hospital of Peking University, the Third Hospital of Peking University, Peking Union Medical College Hospital, and the Cancer Hospital of The Chinese Academy of Medical Sciences. In each hospital, more than 50 percent of the total postgraduate students were selected during the survey. We estimated sample size on the basis of factor analysis and item sample ratio method. As some researchers recommend, a sample of 300-1,000 in factor analysis is excellent (35) and a sample item ratio between 10 and 20 indicates sufficient (36) . Respondents in the current study included 630 (61.71%) master's and 391 (38.29%) doctoral medical students. A cross-sectional questionnaire survey was conducted from April to June, 2020, and the management staff of each hospital collected the data. The ethics committee of Beijing Hospital approved this study (2020BJYYEC-231-01). All respondents gave informed consent and volunteered to participate in the study. The background and purpose of the survey, as well as informed consent, were explained on the first page of the questionnaire. In order to ensure an effective recovery rate, each respondent could receive a feedback report by email after submission. A total of 1,108 questionnaires were collected and 87 invalid questionnaires (either incomplete within the allotted time, had unidentifiable information, or too many repetitive responses) were excluded. Thus, 1,021 valid questionnaires were obtained and included in the analysis. General anxiety disorders were measured with a Chinese GAD-7 version. The Chinese GAD-7 version was firstly translated by He and colleagues using standard translation methods in 2010 (37) . In the current study, we quoted the Chinese GAD-7 version from He's study and further translated it back into English by a doctor of psychiatry, two specialists in medical education, and one English native overseas postgraduate. These cross-cultural adaptation procedures ensured the semantic equivalence between the translated version and the original. Besides, as the statements of questions in the GAD-7 are relatively concise, university students could understand the meaning easily, thus the face validity and content validity of the Chinese GAD-7 version were supported. Previous studies have demonstrated the GAD-7's appropriate screening utility in clinical samples and the general population (23, 27, 38, 39) . Its unidimensional structure has been demonstrated in many published studies employing different methodologies (27, 29) , with a few exceptions (12, 40) . The GAD-7 is a 7-item self-report measurement designed to screen the presence of general anxiety disorders over the previous 2 weeks (9) . Items consist of seven statements about worry or somatic tension, which are rated on a four-point Likert scale as follows: 0 (not at all); 1 (several days); 2 (more than half the number of days); and 3 (nearly every day), indicating frequency levels of GAD symptoms. The GAD-7 summed score ranges from 0 to 21, with cutoff points of 5, 10, and 15 allowing researchers to classify the anxiety as none/normal (0-4), mild (5-9), moderate (10) (11) (12) (13) (14) , and severe (15) (16) (17) (18) (19) (20) (21) (41) . However, the cutoff score for the prevalence of general anxiety disorders has not been consistent among multiple samples. The original validation study of the GAD-7 in the primary care setting, adopted 9/10 as the cutoff score (9) , while the recommended cutoff scores range from 7 to 13 for different versions (23, 26, 27, (41) (42) (43) (44) . Furthermore, a small number of studies have used a 4/5 score as an optimal cutoff (45) . The Patient Health Questionnaire 9-item depression scale (PHQ-9) (46), a valid self-administered depression screening and diagnostic tool, was used to measure depression symptoms. The Cronbach's α coefficient of PHQ-9 in this study was 0.89. The 10item Perceived Stress Scale (PSS-10) (47) was used to measure perceived stress. The PSS-10 is one of the most frequently used self-report psychological questionnaires, which is widely used across various cultures and populations (48) . It showed appropriate consistency reliability with an α coefficient of 0.91. The Athens Insomnia Scale (AIS) (49) was used to quantify the presence of insomnia among study participants. The AIS is widely used in the general population and included eight threepoint Likert items. It showed appropriate consistency reliability with an α coefficient of 0.87. Continuous variables were described as the mean ± standard deviation (mean ± SD), and categorical variables were described as numbers with percentages [n (%)]. The Student's t test or Wilcoxon rank sum test was used to compare the differences of GAD-7 scores among different groups. Spearman correlation coefficient was used to analyze the correlation between GAD-7 score and other measured outcomes. Statistical significance was accepted at the two-sided 0.05. Internal consistency of the scale was evaluated using the Cronbach's α coefficient and Guttmann's coefficient, with an α coefficient >0.7 indicating good internal reliability (50) . An exploratory factor analysis (EFA) using the principal component method was performed to explore the factor structure. Parallel analyses (PA) (51) were used to retain factors with 500 random data matrices. The retained eigenvalues should meet the K1 criterion (≥1) and greater than the average or the 95th percentile of the random samples. Factor loading >0.6 in exploratory factor analysis (EFA) is considered acceptable. Confirmatory factor analyses (CFA) with robust weighted least squares estimation were conducted in Mplus (version 7.4) in cases of violation of the multivariate normality assumption. χ 2 /df, root mean square error of approximation (RMSEA), comparative fit index (CFI), and normed fit index (NFI) were used to evaluate the fitness. The model is considered to have a good fit with a χ 2 /df of 5 or less, a RMSEA of 0.1, a CFI and NFI >0.90 (52) . The IRT analysis with a fitted Semejima graded response model was implemented to estimate the discrimination (a) and difficulty (b) parameters of seven items using the IRTPRO version 4.2 software. Prior to the implementation of the IRT, the unidimensionality assumption was tested using factor analyses. The local independence was confirmed using the χ 2 LD statistic (53) and residuals covariance (54) . A χ 2 LD statistic <10 and a standardized residual covariance <0.2 between two items indicated an acceptable level of local independence (54) . The item characteristic curve (ICC) was used to establish the relationship between subjects' potential trait and their responses, and the item information curve (IIC) was used to evaluate the measurement precision through the test information function (TIF). The measurement precision of a scale is sufficient when the total test information is above 16 (55) . In addition, by applying the posterior estimation method, the transformation relationship between the original sum score and IRT characteristic score was established (56) . Factorial invariance of the GAD-7 across age, gender, education, and residence was tested by a multigroup confirmatory factor analysis approach, which consisted of a series of nested confirmatory steps (57) . Configural invariance (free parameters), metric invariance (constraints of equivalent factor loadings), scalar invariance (further constraints of the intercepts), and strict invariance (further constraints of residual variances) models were tested across subgroups. A no-significant χ 2 (P > 0.05); a CFI value <0.01; and a RMSEA value <0.15 were used to compare the fit of nested models (58) . We examined measurement invariance of item parameters using differential item functioning (DIF) methods (59) . The DIF occurred when the relationship between the latent variable and item responses differed on item parameters across subgroups. The existence of the DIF suggests that the differences between groups may not be Frontiers in Psychiatry | www.frontiersin.org due to actual differences between groups in the survey variables, but to other factors, such as the measurement tool itself or unknown external factors (60) . A no-significant χ 2 (P > 0.05) at specific degrees of freedom indicated acceptable parameter invariance (60) . A total of 1,021 postgraduate students (26.01 ± 2.46 years) completed the survey, including 61.71% master and 38.29% doctoral students. The majority were female (65.36%) and clinical medical students (76.64%). The average GAD-7 score was 6.29 ± 3.58 and the distribution of points for each item was described in Table 1 . Among the participants, 34.28% (350/1,021) had no anxiety; 49.07% (501/1,021) had mild anxiety; 12.34% (126/1,021) had moderate anxiety; and 4.31% (44/1,021) had severe anxiety. The socio-demographic characteristics according to the GAD-7 scores of the postgraduate students were presented in Table 2 . Reliability, Validity, and Factor Structure of the GAD-7 The overall α coefficient of the GAD-7 was 0.93 and the Guttmann's coefficient was 0.89. The summed GAD-7 score was statistically significant correlated with scores of the PHQ-9 (r = 0.78, P < 0.001), PSS-10 (r = 0.71, P < 0.001), AIS-8 (r = 0.67, P < 0.001), and SWLS-5 (r = −0.38, P < 0.001). As Table 2 showed, the α coefficient of the scale was reduced when a specific item was removed. The intraclass correlation coefficients between scores of seven items and the summed GAD-7 score ranged from 0.71-0.87 (P < 0.001). The KMO statistic was 0.92 and the significance of Bartlett's test of sphericity (χ 2 = 4,997.63, df = 21, P < 0.001) indicated that the data was suitable for factor extraction. A parallel analysis employing the principal component method was used to determine the number of factors, and one common factor was extracted. The eigenvalue of this factor was 4.82 accounting for 66.02% of the variation and the scree plot was showed in Figure 1 . As Table 3 showed, the loadings of seven items on this factor were >0.7. A CFA with weighted least square estimation was used to test the onefactor structure of the GAD-7. The modification index between item 3 and item 4 was 83.52, and the CFA model was modified by establishing the residual covariation correlation between the two items. The adaptability of the modified model was then significantly improved (χ 2 /df = 3.48, CFI = 0.97, NFI = 0.96, RMSEA = 0.05) and the factor loading of each item in the CFA model was >0.6. This indicated that the unidimensional structure showed excellent suitability to the data. The CFA model of the GAD-7 is shown in Figure 2 . We then tested the factorial invariance using the multigroup confirmatory factor analysis (MGCFA) framework. The configural invariance model was used as a basic model and three restrictive models were tested step by step. As summarized in Table 4 , the metric invariance model and scalar invariance model showed excellent fitness across age, age, gender, education, and residence (P > 0.05, CFI < 0.01). The strict invariance model only showed acceptable fitness across residence (P = 0.236, CFI = 0.004). As the results of EFA and CFA supported the unidimensional structure of the GAD-7, we further conducted the χ 2 LD statistic matrix (Supplementary Table 1 ) and residual covariance matrix (Supplementary Table 2 ) between any two items to test its local independence. The χ 2 LD statistics were all <10 (0.42 to 9.30) and the residual covariances were <0.2 (−0.007 to 0.053), which indicated an appropriate local independence feature of the GAD-7. Among the two matrices, items 3 and 4 showed the highest χ 2 LD statistic (9.3) and highest residual covariance (0.053). An item response analysis with a Semejima graded model was used to estimate the parameters of seven items. The discrimination parameter of seven items ranged from 1.90 to 4.79, and the difficulty parameter ranged from −1.22 to 3.19 with a monotonically increasing trend. All seven items had sufficient test information with a corresponding local trait (θ ). We summarized the ICC and IIC of seven items in Figure 3 and parameter values are shown in Table 3 . We listed the conversion between the original GAD summed scores and the IRT trait scores (Supplementary Table 3) , and divided the horizontal coordinate of the test information curve of the GAD-7 into four anxiety category levels. As Figure 4 shows, the total test information of the GAD-7 among medical postgraduate students was 22.85, and the corresponding latent trait level located at 1.38. However, the test information within the range of mild anxiety symptoms (5 ≤ GAD score <10) was relatively low. We further analyzed the differential item function of each item across four subgroups. The results of parameter invariance are summarized in Table 5 . No statistically significant differences were found in either discrimination or difficulty parameters (P > 0.05) according to the χ 2 , which indicated excellent equivalence for the seven items. Furthermore, the test information function curves of the different subgroups were close to the curve of the total sample (Figure 4) . These curvilinear paths further supported the measurement invariance of the GAD-7. As far as the authors know, this is the first study to evaluate the psychometric properties of the GAD-7 among medical university students combining CTT and IRT. We observed a higher prevalence of general anxiety disorders (65.72%) than previous reports in China (18) . This indicated that psychological impairment was a common problem among Chinese medical students. In addition, the high incidence could also be attributed to the influence of COVID-19, as teaching tasks in universities of Beijing had not fully recovered during the survey period. Notably, the differences observed in the anxiety detection rate was related to the selection of the GAD-7's cutoff (16) . When the threshold of 9/10 was applied, the detection rate of general anxiety disorder dropped to 16.65%. The results of IRT analysis showed that the GAD-7 had considerably lower test information for subjects with mild anxiety symptoms. This innovative finding supported the importance of careful selection of cutoff values in clinical practice, and the necessity of clinical diagnosis in subjects with mild symptoms (GAD scores ranging from 5 to 10). No significant differences were found in the subjects' GAD scores across different age, gender, education status, or residence subgroups. This indicated weak associations between general anxiety and demographic characteristic among the medical university students. The GAD scores were closely related to family income and satisfaction (with college, major, or tutor), and were consistent with the results of previous studies of medical students (61). The above results show that negative emotions among medical students are an important potential risk factor of general anxiety disorders. In the current study, we implemented standardized backtranslation and cross-cultural adaptation procedures to ensure the content validity of the Chinese GAD-7 version (36). The GAD-7 had a good internal consistency reliability coefficient of 0.93, which is consistent with that reported in previous studies ranging from 0.74 (42) to 0.94 (25) . The strong correlation coefficient between the GAD-7 and PHQ-9 (r = 0.78) has also been observed among other samples (12, 13, 62) . These findings suggest that anxiety disorders frequently occur alongside depression symptoms. Several previous studies have also confirmed the association between the GAD-7 and factors such as stress (12) , sleep disorders (28) , and life satisfaction (28) . Significant correlations between the GAD-7 and theoretically related measurements support the scale's convergent validity and discrimination ability for subjects with differences in psychological status. These results are consistent with those of previous studies of multiple populations (13, 27, 41, 63) . Although the one-dimensional structure of the GAD-7 proposed by its developer is not consistent across all studies, the construct validity of its unidimensional structure was confirmed in the current study. This finding was consistent with the majority of published studies conducted in both the primary care setting and the general population (9, 25, 63) . Nevertheless, a two-factor structure was reported by Satomi (40) among Japanese adult populations, as well as Kertz's (12) study in an acute psychiatric sample. Heterogeneity of the sample and differences in methodology may explain these conflicting results. We modified the CFA model by establishing a residual correlation between item 3 ("Feeling afraid as if something awful might happen") and 4 ("Worrying too much about different things"). The residual pair between specific items is a common method used to improve the scale's fitness, and has been applied in previous studies among Portuguese college students (13) , American outpatients (64), and heterogeneous psychiatric populations (65) . There was some overlap between content of item 3 and 4. Furthermore, the LDχ 2 statistic between the two items (9.3) was higher than that of other pairs. This indicated that these two questions reflected an ambiguous trait, other than general anxiety (e.g., fear). When we removed either of the two items, the total test information of the GAD-7 was significantly reduced. Thus, we recommend retaining all items for specific applications. The metric and scalar invariance models of the unidimensional GAD-7 showed excellent equivalence across subgroups, which has also been confirmed in various clinical and general population studies (12, 13, 39, 64) . However, the strict invariance model was not equivalent across demographic characteristics. This might be related to the heterogeneity of residual covariance between different items among subgroups. The GAD-7 has good local independence among medical university students. This is an important characteristic of an ideal scale and one of the preconditions for IRT analysis that is often ignored by researchers (27) . The difficulty and discrimination parameters of seven items were within an appropriate range ( Table 3 ) and the total test information of the scale was relatively high (22.85) . These findings are consistent with those of Zhong's study in pregnant women (27) . Although some psychological experts suggested collapsing the response categories, "more than half the days" and "nearly every day" in a graded response scale, owing to potentially disordered thresholds (66, 67) , the difficulty parameters of the four response categories increased monotonically in the present results. This indicated that response categories were used in a reasonable and ordered manner. According to the summarized ICC (Figure 3) , the curves of the four response categories 0-4 were significantly spaced, which is inconsistent with the findings of another study of antepartum women in two low-income countries (68) . The seven items showed appropriate measurements in DIF analyses ( Table 5) . This indicated that the GAD-7 was fair among the subsamples, which is consistent with the findings of Pascal's study among primary care patients (69) . Besides, we used the test information curves (Figure 4) to describe the measurement precision of the GAD-7, according to different screening outcomes, which is helpful in choosing optimal cutoff scores. One novelty of the present study is the fact that the GAD-7 had a lower precision for persons with mild anxiety symptoms (with scores ranging from 5-10). Barthel also confirmed that the GAD-7 items measured well at higher anxiety levels, but not as well at lower levels (68) . These findings strengthen the necessity for the clinical diagnosis of persons with mild anxiety symptoms and rigorous exposition of cutoff scores in practical applications. The GAD-7 had relatively sufficient test information within the range of non-anxiety symptoms, as well as moderate and major symptoms, indicating that it is a valid screening tool for the sample. We further constructed the IIC (Figure 4 ) across different demographic characteristics, and the basic shapes of all curves were relatively similar. Moreover, the test information curves among subgroups showed very little fluctuation around the curve of the total sample, which also supported its measurement equivalence. This study had some limitations. Firstly, we did not confirm the inter-intra rater reliability and concurrent validity of the Chinese GAD-7 version. Secondly, the optimal cutoff was not identified owing to the lack of clinical diagnoses. Thirdly, the sample used in this study originated from only one city in China, and extrapolation to other populations needs to be further verified. In the future study, we will test the screening ability of specific anxiety scales in conjunction with clinical diagnosis, as well as expand the scope of random sampling nationwide. The 7-item General Anxiety Disorder Scale showed acceptable reliability, validity, and measurement invariance among Chinese medical postgraduates. The optimal cutoff score of the GAD-7 should be considered with caution, because of its insufficient measurement precision for symptoms of mild anxiety. The original contributions generated for this study are included in the article/Supplementary Material, further inquiries can be directed to the corresponding author/s. The studies involving human participants were reviewed and approved by the ethics committee of Beijing Hospital approved this study (2020BJYYEC-231-01). The patients/participants provided their written informed consent to participate in this study. Prevalence of depression, depressive symptoms, and suicidal ideation among medical students: a systematic review and meta-analysis Changes in counseling center client problems across 13 years Systematic review of depression, anxiety, and other indicators of psychological distress among US, and Canadian medical students The health of students in institutes of higher education: an important and neglected public health problem? Prevalence of anxiety and depressive disorders in medical students. Transversal study in medical students in the Saint The effects of two methods of pelvic examination instruction on student performance and anxiety Some norms and reliability data for the State-Trait Anxiety Inventory and the Zung Self-Rating Depression scale Mental disorders among college students in the World Health Organization World Mental Health Surveys -CORRIGENDUM A brief measure for assessing generalized anxiety disorder: the GAD-7 American Psychiatric Association Diagnostic Statistical Manual of Mental Disorders (DSM-IV) Selective influence of test anxiety on reading processes Validity of the Generalized Anxiety Disorder-7 scale in an acute psychiatric sample Factor structure and construct validity of the Generalized Anxiety Disorder 7-item (GAD-7) among Portuguese college students Depression, anxiety, and tobacco use: Overlapping impediments to sleep in a national sample of college students Prevalence and co-occurrence of substance use disorders and independent mood and anxiety disorders: results from the National Epidemiologic Survey on Alcohol and Related Conditions Medical student depression, anxiety and distress outside North America: a systematic review Mental health problems among medical students in Brazil: a systematic review and meta-analysis Prevalence of mental health problems among medical students in China: a meta-analysis The global prevalence of anxiety among medical students: a meta-analysis Mental health issues amongst medical students in Asia: a systematic review Depression, anxiety, stress among postgraduate medical residents: a cross sectional observation in Bangladesh Diagnostic and statistical manual of mental disorders Validation of the Generalized Anxiety Disorder-7 (GAD-7) among Chinese people with epilepsy Crippa José AdS. Factor structure, reliability, and item parameters of the Brazilian-Portuguese Version of the GAD-7 Questionnaire Cultural adaptation into Spanish of the generalized anxiety disorder-7 (GAD-7) scale as a screening tool Quick and easy selfrating of Generalized Anxiety Disorder: validity of the Dutch webbased GAD-7, GAD-2 and GAD-SI Diagnostic Validity of the Generalized Anxiety Disorder -7 (GAD-7) among pregnant women Psychometric evaluation of the Generalized Anxiety Disorder Screener GAD-7, based on a large German general population sample The psychometric properties of the Generalized Anxiety Disorder scale (GAD-7) among Korean university students High levels of depression and anxiety among Croatian medical and nursing students and the correlation between subjective happiness and personality traits A peer-support and mindfulness program to improve the mental health of medical students Anxiety and its association with preparation for future specialty: a cross-sectional study among medical students, Saudi Arabia Improving patient reported outcomes using item response theory and computerized adaptive testing Item response theory and health outcomes measurement in the 21st century Sample size used to validate a scale: a review of publications on newly-developed patient reported outcomes measures Cross cultural adaptation and psychometric validation of research instruments: a methodological review Reliability and validity of a generalized anxiety disorder scale in general hospital outpatients Prevalence of anxiety disorders among Finnish primary care high utilizers and validation of Finnish translation of GAD-7 and GAD-2 screening tools Measurement invariance of assessments of depression (PHQ-9) and anxiety (GAD-7) across sex, strata and linguistic backgrounds in a European-wide sample of patients after Traumatic Brain Injury Factorial validity and invariance of the 7-Item Generalized Anxiety Disorder Scale (GAD-7) among populations with and without self-reported psychiatric diagnostic status Anxiety disorders in primary care: prevalence, impairment, comorbidity, and detection Validation of the GAD-7 (Malay version) among women attending a primary care clinic in Malaysia Brief case finding tools for anxiety disorders: validation of GAD-7 and GAD-2 in addictions treatment Comparative efficacy of the generalized anxiety disorder 7-item scale and the Edinburgh Postnatal Depression Scale as screening tools for generalized anxiety disorder in pregnancy and the postpartum period Validation of the Generalized Anxiety Disorder-7 (GAD-7) and GAD-2 in patients with migraine The PHQ-9: validity of a brief depression severity measure A global measure of perceived stress The German Version of the Perceived Stress Scale (PSS-10): evaluation of dimensionality, validity, and measurement invariance with exploratory and confirmatory bifactor modeling Athens Insomnia Scale: validation of an instrument based on ICD-10 criteria Individual-patient monitoring in clinical practice: are available health status surveys adequate? A first course in factor analysis Structural Equations With Latent Variable Likelihood-based item-fit indices for dichotomous item response theory models Psychometric evaluation and calibration of health-related quality of life item banks Item banking to improve, shorten and computerize self-reported fatigue: an illustration of steps to create a core item bank from the FACIT-Fatigue Scale The value of item response theory in clinical assessment: a review Factorial invariance of the Five-Factor Model Rating Form across gender Evaluating goodness-of-fit indexes for testing measurement invariance Measurement invariance conventions and reporting: the state of the art and future directions for psychological research Effect of purification procedures on DIF analysis in IRTPRO Anxiety and depression in medical students related to desire for and expectations from a medical career Reliability and validity of the Portuguese version of the Generalized Anxiety Disorder (GAD-7) scale. Health Qual Life Outcomes Validation and standardization of the Generalized Anxiety Disorder Screener (GAD-7) in the general population Psychometric properties of the Generalized Anxiety Disorder Scale-7 (GAD-7) in outpatients with anxiety and mood disorders Beyond generalized anxiety disorder: psychometric properties of the GAD-7 in a heterogeneous psychiatric sample The Rasch measurement model in rheumatology: what is it and why use it? When should it be applied, and what should one look for in a Rasch paper? Dimensional assessment of depressive severity in the elderly general population: psychometric evaluation of the PHQ-9 using Rasch Analysis Psychometric properties of the 7-item Generalized Anxiety Disorder scale in antepartum women from Ghana and Côte d'Ivoire Psychometric analysis of the Generalized Anxiety Disorder scale (GAD-7) in primary care using modern item response theory