key: cord-0707467-efr3qgcp
authors: Johns, G.; Samuel, V.; Freemantle, L.; Lewis, J.; Waddington, L.
title: The global prevalence of depression and anxiety among doctors during the covid-19 pandemic: Systematic review and meta-analysis
date: 2022-02-01
journal: J Affect Disord
DOI: 10.1016/j.jad.2021.11.026
sha: 34b0c59e3253dc7200d29e6a0e29fa301baea839
doc_id: 707467
cord_uid: efr3qgcp

BACKGROUND: This review provides an estimate of the global prevalence of depression and anxiety symptoms among doctors, based on analysis of evidence from the first year of the COVID-19 pandemic. METHODS: A systematic review was conducted to identify suitable studies. Final searches were conducted on 3rd March 2021. Papers were initially screened by title and abstract, based on pre-agreed inclusion criteria, followed by full-text review of eligible studies. Risk of bias was assessed using the Joanna Briggs Checklist for Prevalence Studies. Data from studies rated as low or medium risk of bias were pooled using a random-effects meta-analysis. Sensitivity and subgroup analyses were conducted to explore heterogeneity. RESULTS: Fifty-five studies were included after full-text review. Of these, thirty studies were assessed as low or medium risk of bias and were included in primary analyses. These comprised twenty-six studies of depression (31,447 participants) and thirty studies of anxiety (33,281 participants). Pooled prevalence of depression and anxiety was 20.5% (95% CI 16.0%-25.3%) and 25.8% (95% CI 20.4%-31.5%) respectively. INTERPRETATION: Evidence from the first year of the pandemic suggests that a significant proportion of doctors are experiencing high levels of symptoms of depression and anxiety, although not conclusively more so than pre-pandemic levels. Differences in study methodology and variation in job demands may account for some of the observed heterogeneity. LIMITATIONS: Findings must be interpreted with caution due to the high heterogeneity and moderate risk of bias evident in the majority of included studies.

On 30th January 2020, the World Health Organisation (WHO) declared the coronavirus disease 2019 (COVID-19) outbreak a Public Health Emergency of International Concern, its highest level of alarm. An unparalleled global response followed, with local and national 'lockdowns', quarantines, travel restrictions, and physical distancing measures introduced in attempts to curb transmission rates. At the time of writing, there have been over 114 million confirmed cases and more than 2.5 million reported COVID-associated deaths (WHO, 2021).

In response to the unprecedented pressure on global health systems, there has been enhanced focus on the mental wellbeing of healthcare staff. In April 2020, The Lancet published a position paper outlining their suggested research priorities for the pandemic:

"The immediate research priorities are to monitor and report rates of anxiety, depression, self-harm, suicide, and other mental health issues both to understand mechanisms and crucially to inform interventions. This should be adopted across the general population and vulnerable groups, including front-line workers." (Holmes et al., 2020, p5)

The Job Demand-Resources (JD-R) model of occupational stress (Demerouti et al., 2001) offers a framework to understand these problems. The model hypothesises that as job demands increase so too does emotional strain, which negatively affects performance, whereas greater access to job resources is associated with enhanced engagement and performance. Job demands are conceptualised as the physical, psychological, social, and organisational features of a job that require sustained physical and/or psychological effort. Examples of job demands are high workload or emotionally demanding interactions with patients. Job resources are defined as the physical, psychological, social, or organisational aspects of a job that facilitate achievement of work-based goals, reduce job demands, and stimulate personal growth, learning, and development. Examples of job resources are performance feedback, autonomy, and skill variety. The theory suggests that job demands are associated with health impairments (e.g., poor mental or physical health), whereas job resources are associated with engagement and motivational processes (Bakker and Demerouti, 2017). The current pandemic can be considered a universal job demand on health care systems across the world. However, there will also be additional localised variability in job demands and resources. For example, insufficient staffing levels and underfunded services may create additional strain for healthcare workers.

Medics form an essential part of the global frontline pandemic response. Studies conducted outside of global crises have highlighted that medical students and doctors are already at increased risk of psychological distress, depression, anxiety, burnout, and suicidality, compared with the general population (De Sio et al., 2020; Dong et al., 2020; Tian-Ci Quek, 2019; Hayes et al., 2017; Dai et al., 2015; Dyrbye et al., 2006). As a result, there have been calls to improve the conceptual definition and measurement of wellbeing in medics (Brady et al., 2018; Wallace et al., 2009).

Studies conducted during the 2003 outbreak of severe acute respiratory syndrome (SARS) indicated significant psychological distress in 18% to 57% of health care workers (Tam et al., 2004; Chan and Huak, 2004; Phua et al., 2005; Nickell et al., 2004; Maunder et al., 2004). A study conducted one to two years after the SARS outbreak found high levels of burnout, psychological distress, and posttraumatic stress in healthcare workers (Maunder et al., 2006). However, a similar study by Lancee et al. (2008) found that the incidence of new episodes of psychiatric disorders in community populations was similar to, or higher than, that observed in health care workers two years post-outbreak.

Although a number of studies have focused on the prevalence of mental health outcomes in doctors during the current COVID-19 pandemic, to the authors' knowledge, there have been no systematic reviews conducted to analyze and synthesize data relating exclusively to doctors. Some meta-analyses of healthcare workers of multiple professions have included doctors (Santabárbara et al., 2021; Pappa et al., 2020; Salari et al., 2020; Luo et al., 2020), and sub-group analyses provide some evidence of high levels of psychological distress among medics. However, outcomes from these analyses are limited by review design (e.g., rapid reviews) and by underpowered sub-group meta-analyses for doctors. In addition, given the rate of publications during the pandemic, an up-to-date review is needed.

The current review will focus on the prevalence of symptoms of depression and anxiety during the COVID-19 pandemic. Previous meta-analyses have estimated the global prevalence of major depressive disorder and anxiety disorders to be 4.7% (4.4-5.0%) (Ferrari et al., 2013) and 7.3% (4.8-10.9%) respectively. The core features of depression are persistent depressed mood and anhedonia; other symptoms include psychomotor agitation or retardation, appetite changes, sleep problems, fatigue, feelings of low self-worth, poor concentration, and suicidal ideation (American Psychiatric Association, 2013). Anxiety is characterized by psychological and somatic symptoms, including autonomic arousal (e.g., palpitations, sweating, trembling, dry mouth, difficulty breathing, chest pain, nausea), restlessness, fatigue, difficulty concentrating, irritability, and sleep problems (American Psychiatric Association, 2013). Depression and anxiety are associated with impairments in cognitive functioning, including poorer performance on tests of memory, attention, executive function and motor function (Rock et al., 2014; Hallion et al., 2017; Moran, 2016; Eysenck et al., 2007; Runswick et al., 2018; Wilson, 2012). These cognitive, physiological, and behavioural consequences may be of particular concern among medical doctors, given the potential implications for professional competence and patient safety, as well as personal wellbeing.

The aim of this systematic review and meta-analysis is to analyze the evidence emerging from the first year of the COVID-19 pandemic to answer the following research questions:

• What is the global prevalence of depression and anxiety symptoms among doctors during the COVID-19 pandemic?
• What factors might explain differences in the prevalence of depression and anxiety symptoms among doctors during the COVID-19 pandemic?

This systematic review and meta-analysis was conducted in accordance with PRISMA (Page et al., 2021) and MOOSE (Meta-analyses of Observational Studies in Epidemiology) guidelines (Stroup et al., 2000). The review protocol was registered with PROSPERO and is available online (CRD42021228667).

The CoCoPop framework (Condition, Context, Population), for prevalence and incidence reviews, was used to develop the following inclusion criteria: (i) assessment of depression and/or general anxiety symptoms using a standardised and validated measure; (ii) conducted during the COVID-19 pandemic; (iii) practicing medical doctors working in any speciality, across the world. Studies were excluded based on the following criteria: (i) studies conducted outside of the pandemic timeframe; (ii) studies using non-standardised or unvalidated measures; (iii) studies that do not report prevalence for the target population or do not provide sufficient information to calculate prevalence; (iv) studies that have not separated professions in the data; (v) studies relating exclusively to medical students, non-practicing doctors, or non-medical doctors; (vi) pre-prints, or studies not published in a peer reviewed journal; (vii) studies with a sample size <139 (calculated according to minimum expected prevalence from previous literature (Vaughan and Morrow, 1989)); (viii) qualitative studies; (ix) articles inaccessible for full review or not published in English; (x) studies not reporting original research (e.g., literature review, article, commentary); (xi) studies focussing on mental health outcomes other than depression and/or general anxiety (e.g., stress, burnout, specific anxiety disorders).
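The minimum sample size in exclusion criterion (vii) can be illustrated with the standard single-proportion formula n = z²p(1−p)/d². The sketch below is an assumption-based illustration, not the authors' documented calculation: the expected prevalence (10%) and absolute margin of error (5 percentage points) are not stated in the review and were chosen here only because they are consistent with the reported threshold.

```python
import math


def min_sample_size(expected_prevalence, margin, z=1.96):
    """Smallest n that estimates a proportion within +/- margin at ~95% confidence."""
    p = expected_prevalence
    n = (z ** 2) * p * (1 - p) / margin ** 2
    return math.ceil(n)


# Assumed inputs (hypothetical, not reported in the review):
# 10% expected prevalence, 5 percentage-point margin of error.
required_n = min_sample_size(0.10, 0.05)
print(required_n)
```

Under these assumed inputs the formula gives a requirement of 139 participants, so studies with fewer participants would fall below it.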

A search strategy was developed following consultation with an expert librarian. Search terms were selected to identify records reporting on prevalence data for depression and anxiety in doctors during the COVID-19 pandemic. Full text searches were conducted using the following key search terms: (covid OR covid-19 OR "sars cov 2" OR "sarscov2" OR "corona virus") AND (doctor* OR physician* OR medic OR medics) AND (anxiety OR "anxiety symptoms" OR "anxiety disorder" OR anxious OR "generali?ed anxiety" OR panic OR worry OR depress* OR "mental health" OR "mental illness" OR "mental disorder*"). Four electronic databases (PubMed, CINAHL, Embase, PsychInfo) and one preprint database (MedRxiv) were searched. Final searches were conducted on 3rd March 2021. Search strategies were adapted for each database, where necessary. No restrictions were applied. An example of the search terms used is included in Supplementary Information 1 (SI1). Identified records were extracted to Zotero and then uploaded to Covidence (Covidence systematic review software, 2021).

Two independent reviewers (G.J. and L.F.) screened titles and abstracts, followed by all eligible full text papers, based on the pre-agreed inclusion criteria. Inter-rater reliability was substantial (K = 0.66/0.68). Two research supervisors (L.W. and V.S.) were available to resolve any disagreements.
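For readers unfamiliar with the agreement statistic reported above, the following is a minimal sketch of Cohen's kappa for two reviewers making include/exclude decisions. The decision lists are hypothetical and are not the review's screening data; the function name is an illustrative choice.

```python
def cohens_kappa(rater_a, rater_b):
    """Chance-corrected agreement between two raters over the same items."""
    if len(rater_a) != len(rater_b) or not rater_a:
        raise ValueError("both raters must rate the same non-empty set of items")
    n = len(rater_a)
    # Proportion of items on which the raters agree.
    observed = sum(a == b for a, b in zip(rater_a, rater_b)) / n
    # Agreement expected by chance from each rater's marginal frequencies.
    labels = set(rater_a) | set(rater_b)
    expected = sum(
        (rater_a.count(label) / n) * (rater_b.count(label) / n) for label in labels
    )
    if expected == 1:
        return 1.0  # both raters used a single identical label throughout
    return (observed - expected) / (1 - expected)


# Hypothetical screening decisions for four records.
reviewer_1 = ["include", "include", "exclude", "exclude"]
reviewer_2 = ["include", "exclude", "exclude", "exclude"]
print(cohens_kappa(reviewer_1, reviewer_2))
```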

Data extraction was conducted independently by G.J. and a third reviewer (J.L.), and cross-checked for reliability. Where essential data was missing, the corresponding authors were contacted to request information. The following data items were extracted: author, publication year, study design, recruitment method, data collection timeframe, geographical location, measures used, cut-off and severity thresholds. The following data were extracted for the target population only (i.e., doctors): sample size, sex, age, number of positive cases of depression and anxiety, response rate. In cases where prevalence information was missing, relevant calculations were made, where possible.

The primary outcome was the total number of positive cases of depression and/or anxiety among doctors during the pandemic, determined by the number of participants scoring above a pre-defined threshold on a validated depression or anxiety measure. Frequency data were collected for total sample (N), anxiety and/or depression cases (n), and resulting proportions with 95% confidence intervals (CI).
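As an illustration of how a study-level proportion and its 95% CI can be derived from n and N, the sketch below uses the Wilson score interval. This is an assumption: the review does not state which interval method was applied to individual studies, and the counts shown are hypothetical.

```python
import math


def wilson_interval(cases, n, z=1.96):
    """Return (proportion, lower, upper) using the Wilson score interval."""
    if n <= 0 or not 0 <= cases <= n:
        raise ValueError("need 0 <= cases <= n and n > 0")
    p = cases / n
    denom = 1 + z ** 2 / n
    centre = (p + z ** 2 / (2 * n)) / denom
    half_width = z * math.sqrt(p * (1 - p) / n + z ** 2 / (4 * n ** 2)) / denom
    return p, centre - half_width, centre + half_width


# Hypothetical study: 50 positive cases among 100 doctors.
p, lower, upper = wilson_interval(50, 100)
print(f"{p:.1%} (95% CI {lower:.1%}-{upper:.1%})")
```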

Risk of bias was independently assessed by G.J. and J.L. for all included studies using the Joanna Briggs Institute (JBI) Checklist for Prevalence Studies tool (Munn et al., 2015). The tool was developed for the purpose of increasing consistency in systematic reviews of prevalence data and has been recommended as the most appropriate tool for studies of this kind (Migliavaca et al., 2020). Study risk of bias was evaluated based on the following nine criteria: 1) Was the sample frame appropriate to address the target population? 2) Were study participants recruited in an appropriate way? 3) Was the sample size adequate? 4) Were the study subjects and setting described in detail? 5) Was data analysis conducted with sufficient coverage of the identified sample? 6) Were valid methods used for the identification of the condition? 7) Was the condition measured in a standard, reliable way for all participants? 8) Was there appropriate statistical analysis? 9) Was the response rate adequate, and if not, was the low response rate managed appropriately? Within the existing literature (Islam et al., 2020; Sarria-Santamera et al., 2021), level of bias is assessed by calculating the total number of criteria with a yes response and converting this score into a percentage (n/9). Studies scoring <50% are considered high risk of bias, 50-69% medium risk of bias, and ≥70% low risk of bias. The quality assessment tool was first piloted on a small number of studies. L.W. and V.S. were available for consultation and to resolve any disagreements.
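The scoring rule described above can be sketched as follows. The function name is illustrative; the thresholds are those stated in the text (with nine items, no score can fall between 69% and 70%).

```python
def risk_of_bias(yes_count, total_items=9):
    """Classify a study from the number of checklist items answered 'yes'."""
    if not 0 <= yes_count <= total_items:
        raise ValueError("yes_count must lie between 0 and total_items")
    percentage = 100 * yes_count / total_items
    if percentage < 50:
        return "high"
    if percentage < 70:
        return "medium"
    return "low"


# With nine items: 0-4 'yes' answers -> high, 5-6 -> medium, 7-9 -> low.
for yes in range(10):
    print(yes, risk_of_bias(yes))
```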

Studies assessed as high risk of bias were excluded from the primary analysis. Following consultation with expert statisticians, a meta-analysis for proportional data was conducted using the Metaprop (Nyaga et al., 2014) command of the software package STATA version 16.1 (StataCorp, 2019). To address potential weighting issues that can occur when including studies with proportions close to one or zero, which can disproportionately skew the outcome of meta-analysis, proportions were transformed using the Freeman-Tukey double arcsine method (Freeman & Tukey, 1950), and back-transformed for ease of interpretation (Barendregt et al., 2013). A DerSimonian & Laird (1986) random effects model was used to extract pooled prevalence, given the assumed differences in regional demographics and study design. The I² statistic was used to assess the statistical heterogeneity (Higgins et al., 2003). I² values <50% are considered low, 50-75% moderate, and >75% high. Subgroup analyses were conducted to explore sources of heterogeneity, as expected in meta-analyses of cross-sectional studies.
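The pooling steps described above (double arcsine transformation, DerSimonian-Laird random-effects weights, I²) can be sketched in plain Python. This is an illustrative approximation and not a reproduction of the Stata Metaprop command: the back-transformation uses the simple sin²(t/2) form rather than an exact inversion, the CI uses a normal approximation, and the study counts are hypothetical.

```python
import math


def freeman_tukey(cases, n):
    """Double arcsine transform of a proportion and its approximate variance."""
    t = math.asin(math.sqrt(cases / (n + 1))) + math.asin(
        math.sqrt((cases + 1) / (n + 1))
    )
    return t, 1.0 / (n + 0.5)


def back_transform(t):
    """Simplified inverse of the double arcsine transform (an approximation)."""
    t = min(max(t, 0.0), math.pi)  # keep within the valid range
    return math.sin(t / 2) ** 2


def pool_prevalence(studies):
    """studies: list of (cases, sample_size). Returns prevalence, CI and I2 (%)."""
    transformed = [freeman_tukey(x, n) for x, n in studies]
    y = [t for t, _ in transformed]
    v = [var for _, var in transformed]
    w = [1.0 / var for var in v]

    # Fixed-effect mean and Cochran's Q.
    fixed = sum(wi * yi for wi, yi in zip(w, y)) / sum(w)
    q = sum(wi * (yi - fixed) ** 2 for wi, yi in zip(w, y))
    df = len(studies) - 1

    # DerSimonian-Laird estimate of between-study variance.
    c = sum(w) - sum(wi ** 2 for wi in w) / sum(w)
    tau2 = max(0.0, (q - df) / c) if c > 0 else 0.0

    # Random-effects pooled estimate on the transformed scale.
    w_re = [1.0 / (vi + tau2) for vi in v]
    pooled = sum(wi * yi for wi, yi in zip(w_re, y)) / sum(w_re)
    se = math.sqrt(1.0 / sum(w_re))

    i2 = max(0.0, (q - df) / q) * 100 if q > 0 else 0.0
    return {
        "prevalence": back_transform(pooled),
        "ci": (back_transform(pooled - 1.96 * se), back_transform(pooled + 1.96 * se)),
        "i2": i2,
    }


# Hypothetical (cases, sample size) pairs, not data from the review.
print(pool_prevalence([(30, 200), (55, 250), (120, 400)]))
```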

Sensitivity analysis was conducted to explore the impact of individual studies (leave one out and cumulative analyses), and the impact of study quality and design (risk of bias, measure, severity threshold, and survey timeframe). Survey timeframe was split into the first three months of the pandemic (January to March 2020), and April 2020 onwards. In line with the JD-R model, subgroup analysis was conducted to explore the potential for variability in job demands and resources to explain heterogeneity of outcomes during the pandemic. Gross domestic product (GDP) per capita and doctors per 10,000 population were used as potential indicators of job demands and resources for each study. GDP per capita was split into three groups: <$10,000 per capita, $10-15,000 per capita, and >$25,000 per capita. Doctors per 10,000 population was split into four groups: <15.5, 15.5-19, 20-29, and >30. Geographical region was also explored as a potential source of heterogeneity, with studies grouped by continent; two studies were omitted from this subgroup analysis due to their global coverage. Sub-group analysis was only conducted for categories with a minimum of four studies. Reported outcomes are proportion (p), confidence interval (CI) and percentage prevalence (p × 100%). All statistical analyses were two-tailed and p < 0.05 was considered statistically significant.
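The leave-one-out step can be sketched as a loop that re-pools the data with each study removed and flags studies that move the estimate by at least one percentage point. The pooling function used here is a deliberately crude stand-in (total cases over total participants), not the random-effects model used in the review, and the study labels and counts are hypothetical.

```python
def crude_pool(studies):
    """Stand-in pooling rule: total cases divided by total participants."""
    cases = sum(x for x, _ in studies)
    total = sum(n for _, n in studies)
    return cases / total


def leave_one_out(studies, pool=crude_pool, threshold=0.01):
    """Return the overall estimate and the studies whose removal shifts it
    by at least `threshold` (absolute change in proportion)."""
    overall = pool(list(studies.values()))
    influential = {}
    for name in studies:
        remaining = [data for key, data in studies.items() if key != name]
        shift = pool(remaining) - overall
        if abs(shift) >= threshold:
            influential[name] = shift
    return overall, influential


# Hypothetical studies: label -> (cases, sample size).
example = {
    "A": (200, 1000), "B": (205, 1000), "C": (195, 1000),
    "D": (210, 1000), "E": (190, 1000), "F": (200, 1000),
    "G": (140, 200),  # small study with an unusually high proportion
}
overall, flagged = leave_one_out(example)
print(round(overall, 4), flagged)
```

Any pooling function with the same input shape can be passed as `pool`, so the same loop would work with a random-effects estimator.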

Publication bias was assessed via visual inspection of funnel plots (SI3 and SI4) and Egger's test (Egger et al., 1997), with p < 0.05 indicating publication bias.
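Egger's test regresses each study's standardised effect (effect divided by its standard error) on its precision (one over the standard error); an intercept that differs from zero suggests funnel-plot asymmetry. The sketch below computes the intercept, slope and t statistic using ordinary least squares. It stops short of a p-value, which would require a t-distribution function from a statistics library, and the inputs are hypothetical rather than the review's data.

```python
import math


def egger_regression(effects, std_errors):
    """Regress effect/SE on 1/SE; the intercept reflects small-study asymmetry."""
    k = len(effects)
    if k < 3 or k != len(std_errors):
        raise ValueError("need at least three studies with matching inputs")
    x = [1.0 / se for se in std_errors]                 # precision
    z = [y / se for y, se in zip(effects, std_errors)]  # standardised effect
    mean_x, mean_z = sum(x) / k, sum(z) / k
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    if sxx == 0:
        raise ValueError("standard errors must not all be identical")
    sxz = sum((xi - mean_x) * (zi - mean_z) for xi, zi in zip(x, z))
    slope = sxz / sxx
    intercept = mean_z - slope * mean_x
    residual_ss = sum((zi - intercept - slope * xi) ** 2 for xi, zi in zip(x, z))
    se_intercept = math.sqrt(residual_ss / (k - 2) * (1.0 / k + mean_x ** 2 / sxx))
    if se_intercept > 0:
        t_stat = intercept / se_intercept
    else:
        t_stat = math.copysign(math.inf, intercept)  # perfect fit, no residual noise
    return {"intercept": intercept, "slope": slope, "t": t_stat, "df": k - 2}


# Hypothetical effects (e.g. transformed proportions) and standard errors.
result = egger_regression([0.95, 1.02, 1.10, 1.21, 0.99], [0.03, 0.05, 0.08, 0.12, 0.04])
print(result)
```

The t statistic would be compared against a t distribution with k − 2 degrees of freedom to obtain the p-value.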

The grading of recommendations assessment, development, and evaluation (GRADE) system was used to assess the quality of the overall body of evidence and the level of confidence in the conclusions drawn (Guyatt et al., 2008). GRADE assessment considers factors over and above individual study risk of bias, such as imprecision, inconsistency, indirectness, study limitations and publication bias. Overall quality of evidence may be rated as high, moderate, low, or very low. All observational research begins as low quality and can be (less commonly) upgraded or (more commonly) downgraded, based on the five criteria outlined above (Balshem et al., 2011).

A total of 2359 records were identified following systematic searches of four databases and one pre-print server. After removal of duplicates, 1680 records were screened by title and abstract. Full text review was conducted on 161 papers, of which a further 106 studies were excluded. Fifty-five studies (see SI2 for references) were included in the quality assessment process. A further 22 studies were excluded from the primary analysis due to high risk of bias, leaving 33 studies assessed as medium or low risk of bias. Twenty-three studies reported data for depression and anxiety, seven reported data exclusively for anxiety, and three reported data exclusively for depression. Study characteristics and prevalence data for high risk of bias studies are presented in Supplementary Tables 1 and 2 (ST1 and ST2). A PRISMA diagram detailing the flow of information is presented in Fig. 1.

Risk of bias ratings for all 55 studies, assessed using the JBI Checklist for Prevalence Studies tool, are presented in ST3. Five studies were assessed as low, 28 as medium, and 22 as high risk of bias. Most studies used appropriate methods to identify and measure the condition(s) and reported appropriate statistical analysis. Setting and characteristics were also largely well described, although a small number of studies reporting on a wide range of health care workers were downgraded on this item, due to the lack of sufficient detail pertaining specifically to the target population of interest for this review (i.e., doctors). The predominant use of non-probability sampling methods reduced scores for many studies. This methodology typically indicates the absence of a sampling frame and random sampling approach, an inability to calculate a response rate, and introduces coverage bias. Some studies lost additional points due to inadequate reporting of data (e.g., absence of numerator and/or denominator), and some did not report sample size calculation, or provide sufficient information to calculate retrospectively.

The sample size of the studies ranged from 149 to 10,178. All studies employed a cross-sectional design. Full study characteristics are summarised in ST4.

A total of 31,447 participants from 26 studies were included; ten studies were based in Asia, seven in Europe, four in North America, two in South America, two in Africa, and one multi-national. Participants' mean (SD) age ranged from 28.0 (3) to 45.2 (13.3). The proportion of female participants ranged from 3.4% to 80.1%. The median number of participants per study was 467.5. Male vs female split was 45.9% vs 54.0% respectively (NB. sex data not reported for some studies).

A total of 33,281 participants from 30 studies were included. Ten studies were based in Asia, nine in Europe, five in North America, three in South America, two in Africa, and two were multi-national. Participants' mean (SD) age ranged from 28.0 (3) to 52.0 (11). The proportion of female participants ranged from 8.3% to 80.1%. The median number of participants per study was 502.5. Male vs female split was 46.6% vs 53.8%.

Seventeen studies used the Generalised Anxiety Disorder Scale-7 Item (GAD-7; Spitzer et al., 2006), thirteen used the Patient Health Questionnaire-9 Item (PHQ-9; Kroenke et al., 2001), seven used the Hospital Anxiety and Depression Scale (HADS; Zigmond and Snaith, 1983), three used the Depression Anxiety Stress Scale-21 item (DASS-21, short version of the DASS; Lovibond and Lovibond, 1995), three used the Patient Health Questionnaire-2 Item (PHQ-2; Löwe et al., 2005), one used the Generalised Anxiety Disorder Scale-2 Item (GAD-2; Kroenke Beck et al., 1988), and one used the Patient-Reported Outcomes Measurement Information System-Anxiety (PROMIS; Cella et al., 2010).

Point prevalence of depression ranged from 6.1% (95% CI 5.5-6.8%) to 73.4% (95% CI 65.9-79.7%) (Elhadi et al., 2020). Point prevalence of anxiety ranged from 5.9% (95% CI 4.1-8.3%) (Skoda et al., 2020) to 74.2% (95% CI 70.3-77.8%) (Jain et al., 2020), although only two out of the 26 depression studies and two out of the 30 anxiety studies reported prevalence of <10%. Point prevalence and confidence intervals for all individual studies are presented in ST5.

The pooled prevalence of depression for the 26 included studies was 20.5% (95% CI 16.0-25.3%), with a high degree of heterogeneity (I² = 98.931%), as presented in Fig. 2. The pooled prevalence of anxiety for the 30 included studies was 25.8% (95% CI 20.4-31.5%), with a similarly high degree of heterogeneity (I² = 99.190%), presented in Fig. 3.

One study affected the pooled prevalence of depression by ≥1%. The study in question (Elhadi et al., 2020) changed pooled prevalence by 1.7%. After running the analysis without this study, pooled prevalence was 18.8% (95% CI 14.6-23.3%). Cumulative analysis revealed heterogeneity only reached acceptability for a subset of thirteen studies (Chatzittofis et al., 2021; Civantos et al., 2020a,b; Fauzi et al., 2020; Florin et al., 2020; Hilmi et al., 2020; Khanna et al., 2020; Lai et al., 2020; Que et al., 2020; Vallée et al., 2020; Elhadi and Msherghi, 2021), all with proportions falling within a 7% range (95% CI 10.6-17.4%). For these studies, heterogeneity was reduced to moderate (I² = 65.063%) and pooled prevalence was 13.5% (95% CI 12.2-14.8%).

As presented in Table 1, between-group heterogeneity was not significant when analysed by measure (p = 0.062), severity threshold (p = 0.330), survey timeframe (p = 0.681), or risk of bias (p = 0.600).

Three studies affected the pooled prevalence of anxiety by ≥1% (Jain et al., 2020; Elhadi et al., 2020; Thomaier et al., 2020); the largest impact was a 1.5% change (Jain et al., 2020). After removing these three most influential studies, pooled prevalence was 21.8% (95% CI 17.3-26.7%). Cumulative analysis revealed that heterogeneity only reached acceptability for a subset of ten studies (Civantos et al., 2020a,b; Fauzi et al., 2020; Imran et al., 2020; Malgor et al., 2021; Shalhub et al., 2021; Elhadi and Msherghi, 2021; Kannampallil et al., 2020), all with proportions falling within an 8.5% range (95% CI 15.2-23.6%). For these studies, heterogeneity was reduced to moderate (I² = 58.054%) and pooled prevalence was 20.9% (95% CI 19.5-22.4%).

As presented in Table 2, between-group heterogeneity was statistically significant when analysed by measure (p = 0.034), severity threshold (p = 0.013), and survey timeframe (p = 0.038), but not by risk of bias (p = 0.089).

Secondary analysis was performed with all studies (i.e., including those assessed as high risk of bias). The prevalence of depression symptoms for the 16 studies assessed as high risk of bias was 34.6% (95% CI 23.8-46.1%, I² = 98.467%). When compared with the 26 primary studies assessed as medium or low risk of bias, between-group heterogeneity was statistically significant (p = 0.018) (see SI5). By contrast, the prevalence of anxiety symptoms for the 22 studies assessed as high risk of bias (27.0%, 95% CI 20.5-34.0%, I² = 98.918%) was not significantly different from the 30 studies assessed as medium or low risk of bias (p = 0.787) (see SI6).

Subgroup categorical information for each study is provided in ST6.

As presented in Table 3, between-group heterogeneity was statistically significant for studies of depression when analysed by GDP per capita (p = 0.014). Further analysis revealed significant heterogeneity between the <$10,000 and $10-15,000 groups (p = 0.005) but differences were not significant between other groups. Differences were not explained by geographical region (p = 0.282), or by doctors per 10,000 population (p = 0.198).

As presented in Table 4, between-group heterogeneity was statistically significant among anxiety studies when analysed by doctors per 10,000 population (p = 0.003). As expected, the highest pooled prevalence of anxiety was calculated for the group of studies with the lowest number of doctors per 10,000 population (<15.5) at 37.9% (95% CI 20.6-56.9%). However, the lowest rates of anxiety were not observed in either of the categories with the highest numbers of doctors per 10,000 population (20-29, >30) but rather for the group of studies within the 15.5-19 doctors per 10,000 population range, with a prevalence of 14.7% (95% CI 9.0-21.5%). Further analysis revealed significant heterogeneity between the 15.5-19 group, when compared with the <15.5 group (p = 0.013), and when compared with the 20-29 group (p = 0.001). GDP per capita was on the threshold of significance (p = 0.054). Differences were not explained by geographical region (p = 0.145).

Egger's test revealed that publication bias was not statistically significant for studies reporting prevalence of depression symptoms (p = 0.6765), nor for studies reporting anxiety symptoms (p = 0.8973) (see SI3 and SI4 for visual funnel plots).

The objective of this systematic review and meta-analysis was to provide an estimate of the global prevalence of depression and anxiety symptoms among doctors during the COVID-19 pandemic. The overall pooled prevalence of depression, calculated from 26 studies and 31,447 participants, was 20.5% (95% CI 16.0-25.3%). The overall pooled prevalence of anxiety, calculated from 30 studies and 33,281 participants, was 25.8% (95% CI 20.4-31.5%).

Findings are broadly comparable to earlier estimates for doctors, conducted within the first three to six months of the pandemic. Pappa et al. (2020) conducted a meta-analysis of health care workers up until mid-April 2020. Their subgroup analysis of six studies reporting anxiety data specifically for doctors revealed a pooled prevalence of 21.7% (95% CI 15.3-29.0%), while five studies reported depression data with a pooled prevalence of 25.4% (95% CI 16.6-35.2%). In the Santabárbara et al. (2021) meta-analysis of anxiety in health care workers, conducted up until mid-September 2020, a sub-group analysis of 13 studies of doctors reported a more modest pooled prevalence of 17% (95% CI 12.0-22.0%) for anxiety. This figure is comparable to the proportion calculated from the eight studies conducted in the first three months in the current study, but somewhat lower than the overall pooled estimate. However, direct comparisons are difficult due to the wide and overlapping confidence intervals and significant heterogeneity found across reviews.

The prevalence of depression and anxiety symptoms among doctors also falls within the range reported in research conducted during the SARS epidemic, which ranged from 18% to 57% (Tam et al., 2004; Chan and Huak, 2004; Phua et al., 2005; Nickell et al., 2004; Maunder et al., 2004; Koh et al., 2005). However, these studies reported data on the prevalence of psychological distress rather than symptoms of depression and anxiety. Furthermore, many of these studies focussed on the broader population of healthcare workers, rather than doctors, so a direct comparison is not possible.

The results of the current study are also broadly consistent with previous studies conducted prior to the pandemic, indicating very high prevalence of depression and anxiety among doctors. However, evidence of a clear increase compared with pre-pandemic estimates is lacking. As above, direct comparisons are difficult to make as much of the pre-pandemic literature reports the prevalence of psychological distress and/or burnout, rather than depression and anxiety, for this population. To the authors' knowledge, there has only been one systematic review of depression and anxiety in qualified doctors prior to the pandemic (Beyond Blue, 2010); however, pooled prevalence was not calculated due to the wide variation in point prevalence. The narrative summary reported depression as ranging from 14% to 60%, and anxiety ranging from 18% to 55%. Subsequently, a cross-sectional study based in the Netherlands reported prevalence of depression and anxiety among doctors to be 29% and 24% respectively (Ruitenburg et al., 2012). In 2017, a study conducted in Ireland reported 16.6% and 14.4% of doctors with symptoms of depression and anxiety of moderate severity or above (Hayes et al., 2017); although these figures are more modest (particularly in relation to anxiety symptoms) than those reported in the current study, they remain considerably higher than rates in the general population. Previous research has also found that higher levels of job demands are associated with reduced wellbeing in doctors (Lee et al., 2013; Teoh et al., 2021). A tentative hypothesis is that the absence of a clear increase in prevalence of depression and anxiety among doctors during the COVID-19 pandemic, compared with previous estimates, might suggest either that a ceiling effect of job demands has been reached, or that greater job resources have been made available during the pandemic to offset the increased demands.

Interestingly, a meta-analysis conducted for the general population, up to June 2020, estimated the global prevalence as 28.0% (95% CI 25.0-31.2%) for depression and 26.9% (95% CI 24.0-30.0%) for anxiety (Nochaiwong et al., 2021). These rates are significantly higher than pre-pandemic global estimates for the general population of 4.7% (4.4-5.0%) for depression (Ferrari et al., 2013) and 7.3% (4.8-10.9%) for anxiety. This suggests there may have been a large increase in depression and anxiety symptoms among the general population within the first few months of the pandemic, reaching the consistently high levels reported among doctors. Furthermore, while levels of anxiety in the Nochaiwong study appear similar to those reported for doctors in the current study (26.9% vs 25.8%), levels of depression appear significantly higher in the global general population compared to those observed in doctors in the current study (28.0% vs 20.5%). Given that reduced activity is associated with depression, this finding might be explained by the presumed greater levels of inactivity within the general population due to lockdown restrictions, whereas doctors, as essential workers, may have experienced a less severe loss of routine. It is also of note that the pre-pandemic Ferrari and Baxter meta-analyses used studies that estimated prevalence based on 'gold standard' diagnostic interview procedures rather than self-report, which may account for some of the difference in outcomes. The data from this study suggest that doctors continue to be a population at high risk of depression and anxiety, but the evidence does not support a clear increase in symptoms, compared with pre-pandemic data.

The subgroup analyses conducted in this review (geographical region, doctors per 10,000 population, GDP per capita) were able to explain some of the heterogeneity in depression and anxiety studies, but not consistently. When comparing prevalence based on GDP per capita, there was significant between-group heterogeneity for depression (p = 0.014), and borderline significance for anxiety (p = 0.054). As expected, the highest prevalence rates were recorded for studies in the lowest GDP per capita band (<$10,000), with pooled prevalence of 28.8% (95% CI 19.1-39.6%) for depression and 32.7% (95% CI 22.3-44.1%) for anxiety. Notably, however, in both subgroup analyses the lowest levels of depression and anxiety were not reported for countries with the highest GDP per capita (>$25,000), but for studies in the $10,000-15,000 band, with prevalence of depression at 13.3% (95% CI 9.0-18.4%) and of anxiety at 16.4% (95% CI 9.4-24.9%). These findings are consistent with previous research suggesting that, beyond a certain level of wealth and resource, additional benefit to emotional wellbeing is minimal (Kahneman and Deaton, 2010).

Findings are somewhat consistent with the JD-R model, which was used to select the subgroup comparisons of GDP per capita and doctors per 10,000 population as factors that may be expected to increase job demands and reduce job resources for doctors during the pandemic. Lowest GDP corresponded with highest rates of depression symptoms, and lowest numbers of doctors per 10,000 corresponded with highest rates of anxiety.

The methodological differences explored via sensitivity analyses (risk of bias, measure, severity threshold, survey timeframe) did not explain the heterogeneity for depression studies, apart from when comparing high risk of bias with low/medium risk of bias studies (p = 0.018). High risk of bias studies produced a prevalence of 34.6% (95% CI 23.8-46.1%), whereas low/medium risk of bias studies produced a prevalence of 20.5% (95% CI 16.0-25.3%). Conversely, all of the methodological differences were relevant in explaining the heterogeneity in anxiety studies, apart from risk of bias (high vs low/medium, p = 0.787).
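The comparison of high with low/medium risk of bias studies can be roughly sanity-checked from the reported figures alone. The following is a minimal sketch, not the authors' analysis: it assumes the two pooled estimates are independent and approximates each standard error from the width of its 95% CI on the percentage scale, which ignores the asymmetry introduced by any transformation used in the original pooling. The function names are illustrative.

```python
import math

def se_from_ci(lower, upper, z_crit=1.96):
    """Approximate a standard error from a 95% CI, treating it as symmetric (an assumption)."""
    return (upper - lower) / (2 * z_crit)

def two_group_z_test(est_a, ci_a, est_b, ci_b):
    """Two-sided z-test for the difference between two independent pooled estimates."""
    se_a = se_from_ci(*ci_a)
    se_b = se_from_ci(*ci_b)
    z = (est_a - est_b) / math.sqrt(se_a ** 2 + se_b ** 2)
    # Two-sided p-value under the standard normal distribution.
    p = math.erfc(abs(z) / math.sqrt(2))
    return z, p

# Pooled depression prevalence (%) and 95% CIs as reported in the text.
high_risk = (34.6, (23.8, 46.1))
low_medium_risk = (20.5, (16.0, 25.3))

z, p = two_group_z_test(high_risk[0], high_risk[1],
                        low_medium_risk[0], low_medium_risk[1])
print(f"z = {z:.2f}, p = {p:.3f}")
```

Under these simplifying assumptions the p-value comes out at roughly 0.02, of the same order as the reported p = 0.018; an exact match is not expected because the published test was run on the study-level data rather than on rounded summary intervals.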

The type of measure used in depression studies did not produce statistically significant differences in estimates (p = 0.062). Pooled prevalence was 16.1% (95% CI 10.4-22.8%) for the PHQ9 and 27.5% (95% CI 17.6-38.6%) for the HADS-D. However, for anxiety, there was a significant difference between studies using the GAD7 vs those using the HADS-A (p = 0.034). Pooled prevalence was 20.3% (95% CI 14.3-27.2%) for the GAD7 and 35.5% (95% CI 23.2-49.1%) for the HADS-A. This may be explained by potential differences in the underlying factor being measured. For example, a meta confirmatory factor analysis of the HADS identified a strong general factor. The authors suggested that it does not provide good separation between symptoms of anxiety and depression and recommended it may be best used as a measure of general distress (Norton et al., 2013).

Reporting of mild vs moderate and above symptoms did not produce statistically different prevalence estimates for depression (p = 0.330) but did for anxiety (p = 0.013). Studies reporting mild and above symptoms of anxiety produced a pooled prevalence of 37.2% (95% CI 25.0-50.4%), whereas studies reporting moderate and above symptoms produced a more modest estimate of 20.5% (95% CI 15.9-25.6%). The lack of consensus and consistency across studies regarding what constitutes clinically significant levels of anxiety symptoms, and the poor equivalence when comparing severity levels across different measures, present a challenge when attempting to estimate an overall prevalence (Clover et al., 2020).

The timeframe of data collection was not significant for depression studies (p = 0.681) but was for anxiety studies (p = 0.038). Interestingly, the pooled prevalence of anxiety symptoms was significantly lower in studies conducted within the first three months of the pandemic (17.2%, 95% CI 9.7-26.3%) compared with studies reporting data from April onwards (29.2%, 95% CI 22.5-36.4%), although this was based on a small subgroup of eight studies. This finding is in contrast to research in the UK general population between 23rd March and 9th August 2020, which suggests that symptoms of anxiety were higher in the first few months before gradually declining (Fancourt et al., 2021). The finding for doctors might be understood as the consequence of chronic stress on the medical workforce as the pandemic progressed. However, it is also of note that findings from the UK-based study (Fancourt et al., 2021) are not consistent with the pooled prevalence reported in a similar timeframe from the global meta-analysis (Nochaiwong et al., 2021). This inconsistency is reflective of the overall high variability in the evidence.

This review has several limitations. Firstly, there are a number of limitations associated with the methodology of the studies of interest. As with all observational research, causation cannot be inferred. The predominant use of non-probability sampling methods introduced the highest levels of bias. This methodology means that a sampling frame and a stratified random sampling approach are typically absent, which has implications for coverage bias and the ability to calculate a response rate. In addition, the widespread use of online-only surveys, although appropriate given the global context, may have introduced further coverage bias by excluding people who were too busy or overwhelmed to access their emails or social media. Other potential sources of bias include self-selection bias, which may be introduced by disproportionately attracting doctors with a past history of, or particular interest in, mental health. Conversely, social desirability bias can also be introduced by the use of self-report measures. All of these can influence study results. Another significant limitation is the high heterogeneity observed across studies. Heterogeneity is inherent in meta-analyses of this type of data, but limits confidence in the conclusions drawn. Given the between-study variability in geographical location, settings, and specialities, generalisability may be limited. Lack of consistency in methodological approaches also limits confidence in conclusions, including the use of a wide variety of questionnaires, differences in cut-offs and severity thresholds, and absence of 'gold standard' diagnostic interviews.
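To make the notion of heterogeneity concrete, the following is a minimal sketch of random-effects pooling of prevalence with Cochran's Q and the I² statistic. It is not the authors' Stata code. It assumes a Freeman-Tukey double arcsine transform, DerSimonian-Laird estimation of the between-study variance, and a simple sin²(t/2) back-transformation, which is an approximation. The study counts are hypothetical and are not taken from the included studies.

```python
import math

def freeman_tukey(cases, n):
    """Freeman-Tukey double arcsine transform and its approximate variance."""
    t = (math.asin(math.sqrt(cases / (n + 1)))
         + math.asin(math.sqrt((cases + 1) / (n + 1))))
    return t, 1.0 / (n + 0.5)

def pool_random_effects(studies):
    """DerSimonian-Laird random-effects pooling of (cases, n) pairs; needs 2+ studies.

    Returns (pooled prevalence, CI lower, CI upper, I-squared in %).
    """
    ts, vs = zip(*(freeman_tukey(c, n) for c, n in studies))
    w = [1.0 / v for v in vs]
    fixed = sum(wi * ti for wi, ti in zip(w, ts)) / sum(w)
    # Cochran's Q: weighted squared deviations from the fixed-effect mean.
    q = sum(wi * (ti - fixed) ** 2 for wi, ti in zip(w, ts))
    df = len(studies) - 1
    c = sum(w) - sum(wi ** 2 for wi in w) / sum(w)
    tau2 = max(0.0, (q - df) / c)  # between-study variance
    w_re = [1.0 / (v + tau2) for v in vs]
    pooled_t = sum(wi * ti for wi, ti in zip(w_re, ts)) / sum(w_re)
    se = math.sqrt(1.0 / sum(w_re))
    i2 = max(0.0, (q - df) / q) * 100 if q > 0 else 0.0

    def back(t):
        # Simplified back-transformation to a proportion (an approximation).
        return math.sin(t / 2) ** 2

    return back(pooled_t), back(pooled_t - 1.96 * se), back(pooled_t + 1.96 * se), i2

# Hypothetical (cases, sample size) pairs, for illustration only.
hypothetical_studies = [(30, 200), (120, 400), (45, 300), (210, 600), (18, 150)]
pooled, low, high, i2 = pool_random_effects(hypothetical_studies)
print(f"pooled = {pooled:.1%} (95% CI {low:.1%}-{high:.1%}), I2 = {i2:.0f}%")
```

When study-level prevalence ranges as widely as in this hypothetical input, I² is high and the random-effects weights become nearly equal across studies, which is one reason the pooled estimate carries a wide confidence interval.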

There are also several limitations associated with the methodology of the overall review. High risk of bias studies were excluded, with the aim of reducing overall bias and increasing homogeneity (Higgins et al., 2011; Detweiler et al., 2016). However, a drawback of analysis with a reduced sample is a reduction in overall precision. Sensitivity analysis incorporating high risk studies indicated that omitting these studies from the primary analyses of anxiety was not sufficient to explain heterogeneity. However, the significant difference in pooled prevalence in depression studies highlights the potential utility of this approach in avoiding overestimation of distress. Inter-rater reliability for risk of bias ratings was not available as a function within the software used. Reporting bias may have been introduced by the exclusion of gray literature, non-English language papers, and inaccessible papers. While this study covered symptoms of depression and anxiety, specific anxiety disorders and other mental health conditions were excluded. It may also have been useful to consider the influence of additional variables, including indicators of more localised job demands, such as local infection rates during the timeframe for each study, and indicators of resources, such as organisational, social and psychological factors. Finally, although this review covers more than twelve months of research conducted during the pandemic, any studies published after 3rd March 2021 will be absent from analyses. Given the rate at which new studies are being published, an updated meta-analysis may soon be required.

The overall quality of evidence likely falls within the low to very low range, as per GRADE assessment guidelines. All observational research begins as low quality. Given the wide-ranging point prevalence observed across studies, the broad confidence intervals around pooled prevalence estimates, and the high level of heterogeneity observed, this assessment appears to be a fair reflection. This means that the estimate of effect is uncertain and future research may change this estimate. Recommendations for improving the quality of future research are outlined below.

Despite these limitations, this review has a number of strengths. Firstly, risk of bias assessment highlighted a number of strengths in the individual studies. The vast majority of studies used appropriate and valid methods to identify depression and/or anxiety and measured the condition(s) in a standard and reliable way for all participants. Most studies appropriately described and reported the statistical analyses conducted. Setting and characteristics were also largely well described.

In consideration of the overall review, to our knowledge, this is the first systematic review and meta-analysis of the global prevalence of symptoms of depression and anxiety among doctors during the pandemic. The number of studies returned in our searches was unexpectedly high, enabling us to be more selective in the quality of the studies included for full analysis. Although high risk of bias studies were excluded from the primary analyses, secondary analysis was also conducted to compare high vs medium/low risk of bias studies. While between-group heterogeneity was not significant when comparing the risk of bias for anxiety studies, heterogeneity was significant for depression studies. The more modest pooled prevalence for depression, using just the lower risk studies, may therefore be considered a more accurate estimate. Data were extracted for cases above clinical cut-off thresholds; for the majority of studies, reported cut-offs were within the moderate severity range. In the few studies where a specific cut-off score was not reported, data were extracted for cases in the moderate and above categories. Studies reporting prevalence estimates based on predominantly mild symptoms are likely to provide an overinflated estimation of mental health conditions in this population; therefore, the pooling of predominantly moderate and above estimates may offer a more accurate reflection of the prevalence of clinically relevant symptoms in doctors than studies including data for all levels of symptom severity. Further strengths include the large number of overall participants from across the globe, spanning a wide range of clinical specialities and settings. Subgroup analyses, exploring the potential impact of job demands, provide some additional insight into factors that may be influencing prevalence.

Given the evidence for high levels of depression and anxiety symptoms among doctors across the world, health care services should consider multi-level approaches to support (Bakker and Demerouti, 2018). Firstly, organisational and structural changes are needed to ensure doctors have access to the most fundamental resources, such as time to sleep, eat, exercise, and spend time with others (Unadkat and Farquhar, 2020). Ongoing efforts should be made to destigmatise discussions around mental health (Galbraith et al., 2020). Formal and informal peer support systems may help to facilitate these conversations and should be encouraged (Behrman et al., 2020). Schwartz rounds are increasing in popularity, are well received by staff (Flanagan et al., 2020), and can normalize conversations around the emotional impact of work and reduce stigma. Similarly, formal and informal psychology input should be embedded within health services. Services should consider incorporating evidence-based and high-quality interventions, such as those based on mindfulness and cognitive-behavioural therapy, which have been found to be effective in reducing stress, anxiety, and depression for doctors and nurses (Melnyk et al., 2020; Murray et al., 2016). Systems to monitor the wellbeing of doctors should be in place, and in cases where one-to-one psychological support is required there should be clear and discreet pathways to referral.

Further longitudinal research is needed to monitor long-term outcomes and to explore potential differences in trajectory of mental health outcomes for doctors compared with other populations. Future research may benefit from greater consideration of individual, social and organisational demands and resources. Improvements to research methodology would also increase the overall quality of the evidence base and enable greater confidence in conclusions. Specifically, the adoption of random probability sampling methods is needed. There also needs to be more consistency in measurement. Future studies would benefit from adopting 'gold standard' diagnostic interview methods, using only measures with the strongest psychometric properties, utilizing cut-offs that optimize sensitivity and specificity in identifying clinically relevant symptoms, and reporting on a broader range of cut-offs in order to facilitate better comparisons with studies using alternative measures (Clover et al., 2020; Cameron et al., 2008).
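As an illustration of what choosing a cut-off that balances sensitivity and specificity can involve, the sketch below scans candidate thresholds on a questionnaire score and selects the one that maximises Youden's J (sensitivity + specificity - 1). This is one common criterion and is an assumption here rather than a method prescribed by this review. The scores and reference diagnoses are hypothetical.

```python
def sensitivity_specificity(scores, diagnosed, cutoff):
    """Sensitivity and specificity when a score >= cutoff counts as screen-positive."""
    tp = sum(1 for s, d in zip(scores, diagnosed) if d and s >= cutoff)
    fn = sum(1 for s, d in zip(scores, diagnosed) if d and s < cutoff)
    tn = sum(1 for s, d in zip(scores, diagnosed) if not d and s < cutoff)
    fp = sum(1 for s, d in zip(scores, diagnosed) if not d and s >= cutoff)
    return tp / (tp + fn), tn / (tn + fp)

def youden_j(scores, diagnosed, cutoff):
    """Youden's J statistic for a given cutoff."""
    sens, spec = sensitivity_specificity(scores, diagnosed, cutoff)
    return sens + spec - 1

# Hypothetical questionnaire scores paired with a reference-standard diagnosis.
scores = [7, 10, 11, 12, 14, 15, 18, 20, 1, 2, 3, 4, 5, 6, 8, 11]
diagnosed = [True] * 8 + [False] * 8

candidate_cutoffs = [5, 8, 10, 15]
best = max(candidate_cutoffs, key=lambda c: youden_j(scores, diagnosed, c))

for c in candidate_cutoffs:
    sens, spec = sensitivity_specificity(scores, diagnosed, c)
    print(f"cutoff >= {c}: sensitivity = {sens:.2f}, specificity = {spec:.2f}")
print("cutoff with highest Youden's J:", best)
```

Reporting sensitivity and specificity at several cut-offs, rather than at a single threshold, is what allows later reviews to compare studies that used different measures or severity bands.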

This systematic review and meta-analysis provides a comprehensive analysis of the global prevalence of depression and anxiety symptoms among doctors during the first twelve months of the COVID-19 pandemic. Symptoms of depression and anxiety are elevated among doctors, compared with earlier research from the general population, but not conclusively more so than pre-pandemic levels among doctors. Differences in study design and variation in job demands may account for some of the observed heterogeneity. Findings may help to quantify the needs of this population and guide health care systems to plan support as we recover from the pandemic, and prepare for other times of national or global crisis.

This review was conducted as part of doctoral training and is funded by NHS Wales.

GJ designed the study and protocol, conducted literature searches, screening, data extraction, quality assessment, statistical analysis, and wrote the manuscript. VS contributed to the design and coordination of the study and provided input into the final drafts. LF conducted independent screening and full text review. JL conducted independent data extraction and quality assessment. LW contributed to the design and coordination of the study, provided input into the final drafts and was the primary supervisor. All authors have approved the final manuscript.

The protocol was registered on PROSPERO and can be accessed online (CRD42021228667).

Diagnostic and Statistical Manual of Mental Disorders (DSM-5®)

Job demands-resources theory: taking stock and looking forward

Multiple levels in job demands-resources theory: implications for employee well-being and performance

GRADE guidelines: 3. Rating the quality of evidence

Meta-analysis of prevalence

Global prevalence of anxiety disorders: a systematic review and meta-regression

An inventory for measuring clinical anxiety: psychometric properties

Peer support for junior doctors: a positive outcome of the COVID-19 pandemic?

What do we mean by physician wellness? A systematic review of its definition and measurement

Psychometric comparison of PHQ-9 and HADS for measuring depression severity in primary care

The Patient-Reported Outcomes Measurement Information System (PROMIS) developed and tested its first wave of adult self-reported health outcome item banks

Psychological impact of the 2003 severe acute respiratory syndrome outbreak on health care workers in a medium size regional general hospital in Singapore

Impact of the COVID-19 pandemic on the mental health of healthcare workers

Mental health among head and neck surgeons in Brazil during the COVID-19 pandemic: A national study

Mental health among otolaryngology resident and attending physicians during the COVID-19 pandemic: National study

Apples to apples? Comparison of the measurement properties of hospital anxiety and depression-anxiety subscale (HADS-A), depression, anxiety and stress scale-anxiety subscale (DASS-A), and generalised anxiety disorder (GAD-7) scale in an oncology setting using Rasch analysis and diagnostic accuracy statistics

Veritas Health Innovation

Prevalence and correlates of psychological symptoms in Chinese doctors as measured with the SCL-90-R: a meta-analysis

Work-related stress risk and preventive measures of mental disorders in the medical environment: an umbrella review

The job demands-resources model of burnout

Meta-analysis in clinical trials

Risk of bias and methodological appraisal practices in systematic reviews published in anaesthetic journals: a meta-epidemiological study

Prevalence of suicide-related behaviors among physicians: a systematic review and meta-analysis. Suicide and Life-Threatening Behavior, 50

Systematic review of depression, anxiety, and other indicators of psychological distress among US and Canadian medical students

Bias in meta-analysis detected by a simple, graphical test

The Mental Well-Being of Frontline Physicians Working in Civil Wars Under Coronavirus Disease

Mental health of surgeons during the COVID-19 pandemic: An urgent need for intervention

Anxiety and cognitive performance: attentional control theory

Trajectories of anxiety and depressive symptoms during enforced isolation due to COVID-19 in England: a longitudinal observational study

Doctors' mental health in the midst of COVID-19 pandemic: The roles of work demands and recovery experiences

Global variation in the prevalence and incidence of major depressive disorder: a systematic review of the epidemiological literature

Reflection for all healthcare staff: a national evaluation of Schwartz rounds

Socio-economic and psychological impact of the COVID-19 outbreak on private practice and public hospital radiologists

Transformations related to the angular and the square root

The mental health of doctors during the COVID-19 pandemic

GRADE: an emerging consensus on rating quality of evidence and strength of recommendations

Cognitive control in generalized anxiety disorder: relation of inhibition impairments to worry and anxiety severity

What's up doc? A national cross-sectional study of psychological wellbeing of hospital doctors in Ireland

Measuring inconsistency in meta-analyses

The Cochrane Collaboration's tool for assessing risk of bias in randomised trials

Professional and Psychological Impacts of the COVID-19 Pandemic on Oncology Residents: A National Survey

Multidisciplinary research priorities for the COVID-19 pandemic: a call for action for mental health science

Psychological impact of COVID-19 pandemic on postgraduate trainees: A cross-sectional survey

Prevalence of Headache in Patients with Coronavirus Disease 2019 (COVID-19): a Systematic Review and Meta-Analysis of 14,275 Patients

Psychosocial work characteristics, burnout, psychological morbidity symptoms and early retirement intentions: a cross-sectional study of NHS consultants in the UK

COVID-19 pandemic: Psychological impact on anaesthesiologists

High income improves evaluation of life but not emotional well-being

Exposure to COVID-19 patients increases physician trainee stress and burnout

Psychological impact of COVID-19 on ophthalmologists-in-training and practising ophthalmologists in India

Risk perception and impact of severe acute respiratory syndrome (SARS) on work and personal lives of healthcare workers in Singapore: what can we learn? Med. Care

The PHQ-9: validity of a brief depression severity measure

Anxiety disorders in primary care: prevalence, impairment, comorbidity, and detection

Factors Associated With Mental Health Outcomes Among Health Care Workers Exposed to Coronavirus Disease

Prevalence of psychiatric disorders among Toronto hospital workers one to two years after the SARS outbreak

Correlates of physician burnout across regions and specialties: a meta-analysis

Anxiety and Depression Among Imaging Doctors in Post-COVID-19 Period

The structure of negative emotional states: comparison of the Depression Anxiety Stress Scales (DASS) with the Beck Depression and Anxiety Inventories

Detecting and monitoring depression with a two-item questionnaire (PHQ-2)

The psychological and mental impact of coronavirus disease 2019 (COVID-19) on medical staff and general public-A systematic review and meta-analysis

Factors associated with the psychological impact of severe acute respiratory syndrome on nurses and other hospital workers in Toronto

Brazilian vascular surgeons experience during the coronavirus (COVID-19) pandemic

Long-term psychological and occupational effects of providing hospital healthcare during SARS outbreak

Interventions to improve mental health, well-being, physical health, and lifestyle behaviors in physicians and nurses: a systematic review

Quality assessment of prevalence studies: a systematic review

Anxiety and working memory capacity: a meta-analysis and narrative review

Methodological guidance for systematic reviews of observational epidemiological studies reporting prevalence and incidence data

Systematic review of interventions to improve the psychological well-being of general practitioners

Psychosocial effects of SARS on hospital staff: survey of a large tertiary care institution

Global prevalence of mental health issues among the general population during the coronavirus disease-2019 pandemic: a systematic review and meta-analysis

The Hospital Anxiety and Depression Scale: a meta confirmatory factor analysis

Metaprop: a Stata command to perform meta-analysis of binomial data. Archives of Public Health, 72

The PRISMA 2020 statement: an updated guideline for reporting systematic reviews

Prevalence of depression, anxiety, and insomnia among healthcare workers during the COVID-19 pandemic: a systematic review and meta-analysis

Coping responses of emergency physicians and nurses to the 2003 severe acute respiratory syndrome outbreak

Psychological impact of the COVID-19 pandemic on healthcare workers: A cross-sectional study in China

Royal College of Physicians, 2015. Work and Wellbeing in the NHS: Why Staff Health Matters to Patient Care

The prevalence of common mental disorders among hospital physicians and their association with self-reported work ability: a cross-sectional study

The effects of anxiety and situation-specific context on perceptual-motor skill: a multi-level investigation

The prevalence of stress, anxiety and depression within front-line healthcare workers caring for COVID-19 patients: a systematic review and meta-regression

Prevalence of anxiety in health care professionals during the COVID-19 pandemic: a rapid systematic review (on published articles in Medline) with meta-analysis

Asúnsolodel-Barco, A, 2021. Systematic Review and Meta-Analysis of Incidence and Prevalence of Endometriosis

Global vascular surgeons' experience, stressors, and coping during the coronavirus disease 2019 pandemic

Psychological burden of healthcare professionals in Germany during the acute phase of the COVID-19 pandemic: Differences and similarities in the international context

A brief measure for assessing generalized anxiety disorder: the GAD-7

Stata Statistical Software: Release 16

Meta-analysis of observational studies in epidemiology: a proposal for reporting

Severe acute respiratory syndrome (SARS) in Hong Kong in 2003: stress and psychological impact among frontline healthcare workers

Doctors' working conditions, wellbeing and hospital quality of care: a multilevel analysis

The global prevalence of anxiety among medical students: a meta-analysis

Doctors' wellbeing: self-care during the covid-19 pandemic

Prospective and observational study of COVID-19's impact on mental health and training of young surgeons in France

World Health Organization, 1989. Manual of Epidemiology For District Health Management. World Health Organization

Physician wellness: a missing quality indicator

World Health Organisation. Coronavirus disease (COVID-19) pandemic

The psychological impact of covid-19 pandemic on medical staff in guangdong, china: A cross-sectional study

Acute psychological effects of Coronavirus Disease 2019 outbreak among healthcare workers in China: a cross-sectional study

UK NHS staff: stressed, exhausted, burnt out

Anxiety: attention, the brain, the body and performance. The Oxford handbook of sport and performance psychology

The hospital anxiety and depression scale

None.

Supplementary material associated with this article can be found, in the online version, at doi:10.1016/j.jad.2021.11.026.

The authors declare no conflicts of interest.