key: cord-0308463-qs22ppg5 authors: Schmitz, L. L.; Goodwin, J.; Miao, J.; Lu, Q.; Conley, D. title: The Impact of Late-Career Job Loss and Genotype on Body Mass Index date: 2020-11-23 journal: nan DOI: 10.1101/2020.11.20.20235895 sha: e14c3f40de47c066b693b578a6016a92a3b1f059 doc_id: 308463 cord_uid: qs22ppg5 Unemployment shocks from the COVID-19 pandemic have reignited concerns over the long-term effects of job loss on population health. Past research has highlighted the corrosive effects of unemployment on health and health behaviors. This study examines whether the effects of job loss on changes in body mass index (BMI) are moderated by genetic predisposition using data from the U.S. Health and Retirement Study (HRS). To improve detection of gene-by-environment (G x E) interplay, we interacted layoffs from business closures (a plausibly exogenous environmental exposure) with whole-genome polygenic scores (PGSs) that capture genetic contributions to both the population mean (mPGS) and variance (vPGS) of BMI. Results show evidence of genetic moderation using a vPGS (as opposed to an mPGS) and indicate genome-wide summary measures of phenotypic plasticity may further our understanding of how environmental stimuli modify the distribution of complex traits in a population. Recent unemployment shocks from the COVID-19 pandemic have left millions of older workers unemployed. In the U.S. alone, the seasonally adjusted unemployment rate for adults aged 55 and over jumped from 2.6 in February 2020 to as high as 13.6 in April 2020 1 . Recent evidence indicates unemployment rates for workers 55 and older have exceeded those of mid-career workers since the pandemic began-the first time in nearly 50 years that older workers have faced higher unemployment than mid-career workers 2 . For older workers in particular, the scarring effects of unexpected job loss could be severe. Job loss at older ages has been associated with longer periods of unemployment than any other age group 3, 4 , higher rates of depression and anxiety [5] [6] [7] , and a sharp increase in the need for medical care due to heightened stress levels and gaps in health insurance coverage 8, 9 . Further, when reemployed, older workers suffer significant wage penalties and lower levels of employer-offered pension and health insurance [10] [11] [12] . All these factors could trigger chronic stress and adverse changes in health and health behaviors 13, 14 . This study expands on past work by examining the degree to which underlying genetic predisposition moderates changes in body mass index (BMI) after a job loss. Genotype-byenvironment (G x E) interaction is a fundamental component of population variance for complex traits like BMI, but there has been limited success in identifying G x E effects in human populations due to several challenges, including the complexity of measuring environmental exposures, a need for statistical methods that can screen for genetic effects on phenotypic variability, and inadequate power to detect small G x E effects at loci across the genome [15] [16] [17] . To overcome the third challenge of lower power to detect individual effects in the context of multiple hypothesis testing, researchers have used whole-genome polygenic scores (PGSs) constructed from well-powered genome-wide association studies (GWAS) that summarize the genomic contribution to a trait or disease across common variants in the genome 18 . That is, PGSs aggregate thousands of genome-wide genetic influences on a phenotype into a single index using results from GWAS that estimate the association between genetic variants and the conditional mean of a phenotype, which we refer to herein as mGWAS. However, PGSs constructed from mGWAS may not capture the impact of loci that contribute to within-individual variance in an outcome that are more responsive to environmental stimuli (i.e., variance quantitative loci or vQTLs). Since estimating genetic contributions to within-person variability is hindered by a lack of large datasets with genotype data and longitudinal phenotypic data on participants, researchers have developed methods that can detect population-level variance effects that are not driven by mean effects, referred to herein as vGWAS 16, [19] [20] [21] [22] [23] . In this study, we apply summary statistics from both mGWAS and vGWAS to construct whole genome PGSs for BMI that capture mean effects (mPGS) and variance effects (vPGS) 24 . Evaluating both measures in a G x E framework is necessary because environmental shifts may moderate individuals' propensity for higher or lower BMI, and/or their propensity towards changes in BMI or BMI plasticity 21,24 . Our data come from the U.S. Health and Retirement Study (HRS). The HRS is a nationally representative, longitudinal study with genotype data and over twenty years of sociodemographic data on respondents, including individual-level exposures to involuntary job losses from business closures. We focus specifically on business closures because they are typically the byproduct of external, firm level decisions to restructure or relocate businesses and are therefore considered more exogenous than layoffs or firings, which may be correlated with unobserved health or worker characteristics that could bias G x E estimates [25] [26] [27] . The majority of G x E interaction studies use endogenous measures of the environment that cannot address the non-random distribution of genes across environments. This is important because G x E interactions can, in that case, be proxying a different, unmeasured E that is interacting with G, or G x G interactions (i.e., epistasis) or even E x E (if the measured genes proxy other environments). Specifically, in the case of job loss being endogenous (e.g., for cause), such a measure could be intertwined with a host of unobserved genetic or environmental influences that are associated with health and changes in BMI 28-30 . To address this, our empirical strategy interacts business closures with, respectively, an mPGS and vPGS in a regression-adjusted semiparametric difference-in-differences (DiD) propensity score matching framework that compares the BMI of those before and after an involuntary job loss with a control group that was not laid off. Combining propensity score matching with DiD estimation makes the model more robust to selection on observables and unobservables with time invariant effects (e.g., ability or worker preferences) 31 . This is necessary because although business closures are plausibly more exogenous than layoffs or firings, it is still possible that workers with unhealthy behaviors or poor health, for example, could select into more vulnerable or volatile industries 32 . To date, we are aware of only one other study that has leveraged a vPGS and a quasi-natural experiment (education reform in the UK) to detect G x E interaction effects on BMI and educational attainment 24 . Results from this study found evidence of mPGS and vPGS interaction effects, indicating that both forms of moderation need to be tested in G x E interaction studies. In the context of older workers in the U.S., we focus on changes in BMI for two reasons. First, BMI is an inexpensive, non-invasive proxy measure of adiposity that is available for all HRS waves and is predictive of metabolic syndrome and other more difficult to measure anthropomorphic measures like abdominal adiposity that increase risk for cardiovascular disease and type 2 diabetes 33 . In older adults, unintentional weight loss or frailty can also be harmful and indicative of decreased resistance to stressors, resulting in greater vulnerability to disease and . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) preprint drinking and BMI were concentrated among workers who were already pursuing unhealthy behaviors pre-job loss, indicating the effects of job loss may be especially problematic for highrisk individuals. However, because genetic data have only recently become available in large population studies, past research was not able to explore the possibility that the impact of job loss on intra-individual fluctuations in BMI may vary across the spectrum of genetic risk. Second, we focus on changes in BMI because BMI is currently the most well studied phenotypic trait in vGWAS. Previous meta-analyses of mGWAS have identified more than 100 genome-wide significant loci associated with BMI [44] [45] [46] [47] [48] . The largest cluster of highly significant genetic variants is located in the FTO (fat mass and obesity associated) gene region on chromosome 16. Studies suggest FTO polymorphisms increase obesity risk through subtle changes in food intake and preference and affect pathways in the central nervous system that regulate appetite 45, 49 . In particular, the SNP rs1421085 underlies the association between the FTO locus and obesity via activation of IRX3 and IRX5, which play a role in the differentiation of adipocyte subtypes 50 . Recent vGWAS have found evidence for loci with variance effects on BMI located in genes responsible for adipocyte differentiation (PPARG) and genes implicated in the . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) preprint The copyright holder for this this version posted November 23, 2020. ; https://doi.org/10.1101/2020.11.20.20235895 doi: medRxiv preprint pathology of obesity, diabetes, atherosclerosis, and cancer (FTO, PPARG, CCNL1, TCF7L2, ZNF668, GIPR) 16 . Most BMI-associated loci have their largest impact early in life or during adolescence 51 , although a few loci, which have also been associated with type 2 diabetes or coronary artery disease, exhibit stronger effects in older adults 52 . Past studies have also found genetic moderation of social aspects of the environment that affect BMI, including lifetime socioeconomic status (SES), social norms, birth cohort, and institutional policies [53] [54] [55] . Results indicate that an unexpected job loss did not seem to affect differences in BMI by mPGS; individuals with higher mPGS had higher levels of BMI regardless of whether they were in the treatment or control group. However, we do see suggestive evidence of genetic moderation by vPGS such that individuals in the treatment group varied in their genetic propensity for weight gain or loss, whereas similarly matched individuals in the control group did not. Genetic moderation is particularly pronounced in the lower half of the vPGS distribution; less plastic individuals in the bottom 50% appear to adjust more slowly to environmental changes, resulting in minor weight loss compared to similarly matched individuals in the control group. Results from an event time study analysis show that changes in BMI were detectable up to two years post job loss, but did not persist in the subsequent HRS wave, or up to four years post job loss. Matching quality and summary statistics. Table 2 . After matching, covariates should be balanced with little to no significant differences remaining. We included both the standardized bias and two-sample t-. CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) preprint The copyright holder for this this version posted November 23, 2020. ; https://doi.org/10.1101/2020.11.20.20235895 doi: medRxiv preprint tests for equality of the means to check for significant differences between covariates for both groups (Methods) 56 . Before matching, individuals affected by a business closure have lower socioeconomic standing, were less likely to have health insurance, and report worse mental health and health behaviors than continuously employed individuals. Labor statistics show they were more likely to work part time, for smaller firms, in the agriculture/fishing/farming, construction/mining, manufacturing, or trade industries, were more likely to be blue collar, and have lower job tenure than workers in the control group. We do not see any significant differences in BMI between treatment and control groups. After matching, covariates are more balanced overall, and the standardized biases for the majority of variables are at or below 5%, which indicates that mean differences between the treatment and control group are small and the balancing procedure was effective 57 . Notable exceptions include mean differences in education, industry, household income, access to health insurance, and smoking behavior. To minimize any remaining differences between groups, we control for all covariates in our empirical model. Importantly, we do not see any significant difference in the mPGS or vPGS between treatment and control groups before or after matching, indicating the absence of gene-environment correlation (rGE), or evidence of selection into the treatment group by underlying genetic predisposition. Construction and predictive performance of vPGS. Because we are incorporating mPGS and vPGS into our G x E interaction model, it is important to use a vPGS that captures variance effects that are distinct from mean effects 24 . To decorrelate the mean and variance effects, Young et al. proposed a dispersion effects test that can identify differences in the variance of the GWAS sample . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) preprint The copyright holder for this this version posted November 23, 2020. ; https://doi.org/10.1101/2020.11.20.20235895 doi: medRxiv preprint as a whole that are not driven by mean effects at the SNP level 16 . We used dispersion weights from Young et al. to construct vPGS in the HRS (Methods). The predictive performance of mPGS for BMI in the HRS and other population-based samples has been well studied [53] [54] [55] 58, 59 . To evaluate the predictive performance of the vPGS, we fit a Double Generalized Linear Model (DGLM) that allowed us to assess the association between the vPGS and the between-individual variance in BMI in a UK Biobank (UKB) test sample that is independent of our HRS testing sample (Methods). Table 2 reports associations from the DGLM with and without mPGS adjustment (Models 1 and 2, respectively). The dispersion vPGS is significantly associated with the population variance in BMI in the UKB test sample (p=1.01E-04), and this association holds after controlling for the mPGS (p=1.44E-04). We used a propensity score matched DiD model to evaluate the effect of job loss from a business closure (Methods). Table 3 shows separate propensity score adjusted DiD results from specifications with and without the mPGS and vPGS interactions. Columns 1-2 model changes in BMI without the mPGS and vPGS in the full HRS sample of workers aged 50-70 (1) and in the same-aged European ancestry analytic sample of workers with genotype data (2) . Columns 3-4 add the main effect (3) and mPGS interaction effect (4), and Column 5 displays results from the full model with the mPGS and vPGS main effects and interaction effects. All specifications adjust for the conditioning variables used in the propensity score matching that are reported in Table 1 . Results without PGSs in Columns 1 and 2 do not condition on genotype in the matching procedure or in the regression analysis. Coefficient pvalues that pass FDR correction are denoted with an asterisk. The results in Column 1 are similar in magnitude and direction to the HRS results reported by Deb et al, which find a positive, but insignificant main effect from business closures in the full . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) preprint The copyright holder for this this version posted November 23, 2020. ; https://doi.org/10.1101/2020.11.20.20235895 doi: medRxiv preprint HRS sample on changes in BMI in all ancestry groups 39 . Among genotyped, European ancestry respondents between the ages of 50 and 70 who report being in the labor force, the main effect is still insignificant but negative (Column 2). Columns 3-5 show that the inclusion of both the mPGS and vPGS is necessary to uncover a genetic main effect and an interaction effect: the mPGS captures a significant main effect of genotype on BMI (p=0.002), while the vPGS captures a significant G x E effect (p=0.011). Graphically, this can be seen in Figure 1 , which used estimated parameters from the DiD regression model to predict BMI at different values of the mPGS and vPGS for treated individuals in the wave following a job loss and for corresponding control individuals that were matched to . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) preprint The copyright holder for this this version posted November 23, 2020. ; https://doi.org/10.1101/2020.11.20.20235895 doi: medRxiv preprint treated individuals in the same HRS wave (Methods). In the mPGS figure, which plots predicted BMI based on the mPGS interaction model in Column 3, we see differences in the predicted level of BMI by mPGS, but no significant differences between groups. Conversely, in the vPGS figure, which plots predicted BMI based on the vPGS interaction model in Column 5, there are no differences in the predicted level of BMI by vPGS, but as indicated by the cross-over shape of the interaction, there is suggestive evidence of a G x E interaction, or evidence of environmental moderation by vPGS for the treatment group relative to the control group in the post-treatment wave. Significant differences between treatment and control groups can only be seen in the lower half of the vPGS distribution; individuals below the mean appear more likely to lose weight as a result of a business closure relative to workers with similar vPGS scores. These results seem to indicate that less plastic individuals may adapt more slowly to environmental shifts, at least of the type studied herein. We conducted an event time study (ETS) to assess the validity of our findings and to show the evolution in BMI by vPGS for the treatment and control groups up to four years post job loss (Methods). The assumption underlying the DiD research design is that in the absence of an involuntary job loss, BMI would have evolved similarly for the treatment and control groups (i.e., the "parallel trends" assumption). Figure 2 plots the coefficient estimates from the ETS model, which can be interpreted as the difference in BMI between treatment and control groups (Supplementary Table 3 ). The first panel of Figure 2 indicates the presence of parallel trends in BMI prior to a business closure for the full sample-i.e., the difference between treatment and control groups is close to zero and not statistically significant. We then estimated separate event time study regressions for respondents in the top and bottom 50% of the vPGS distribution to compare trajectories in BMI for treatment and control groups by vPGS. Similar to the results in . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) preprint The copyright holder for this this version posted November 23, 2020. ; https://doi.org/10.1101/2020.11.20.20235895 doi: medRxiv preprint Figure 1 , we found suggestive evidence that individuals in the bottom 50% of the vPGS distribution have a lower BMI on average compared to the control group (p=0.035). Effects did not appear to persist in the next HRS interview wave at t + 2, or up to four years post job loss. The figure plots the coefficients from an event time study model for the full sample and by samples stratified at the vPGS median (Methods). Error bars represent 95% confidence intervals. The reference category is BMI in t-2, or BMI at baseline before the job loss occurred for treated individuals (represented here as year 0). The dotted line depicts the approximate time point that the job loss occurred-i.e., some point between the baseline year and the current wave. Gene-environment interplay is a fundamental biological process that influences the diversity of outcomes we observe in human populations 60 . However, because genetic differences are tightly interwoven with environmental differences, it is challenging to identify genomic and environmental factors underlying phenotypic plasticity. The search for interaction effects is . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) preprint The copyright holder for this this version posted November 23, 2020. ; https://doi.org/10.1101/2020.11.20.20235895 doi: medRxiv preprint further complicated by the fact that the majority of GWAS methods are unable to separate genetic effects on phenotypic variability from effects on the mean or level of trait values 16 . As a result, most PGS x E interaction studies cannot detect interaction effects that are driven by loci that affect plasticity 24 . Research suggests that SNPs associated with the variance of BMI (vQTLs) are highly enriched for G x E interactions, and that these vQTLs play in an important role in cellular response to the external environment 16, 22 . In this study, we used a natural experiment to investigate whether the effect of job loss on BMI-a stressful and often debilitating lifetime event-is moderated by genetic predisposition. To incorporate genetic effects associated with the population variance in BMI, we followed Johnson et al. and used summary statistics from recently developed GWAS methods that can separate SNP mean and variance effects 16 to construct vPGS for BMI 24 . Using both an mPGS and a vPGS, we find stronger evidence for a vPGS x E interaction. A shift in the environment did not seem to affect differences in BMI by mPGS; individuals with higher mPGSs had higher levels of BMI regardless of whether they were in the treatment or control group. However, we do see differences between the treatment and control groups by vPGS such that less plastic or lower vPGS treated individuals showed signs of weight loss compared to similarly matched control individuals, perhaps because they adjust more slowly to an environmental shock compared to high vPGS individuals. ETS analysis suggests that treatment and control groups evolved similarly prior to treatment, and that findings are detectable up to two years post job loss. It is important to note that we deployed, by necessity, a vPGS that was constructed from weights that were trained in a discovery sample to predict variation between individuals net of mean effects. However, we are using this measure in analysis that examines within-subject variation. This is an important distinction that may inform the interpretation of our results. . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) preprint The copyright holder for this this version posted November 23, 2020. ; https://doi.org/10.1101/2020.11.20.20235895 doi: medRxiv preprint Namely, our theory is that highly plastic individuals are better able to adapt to changing environmental contexts and thus maintain a more stable weight in the face of job loss. We classify individuals as plastic or non-plastic based on their score from a genetic model that predicts whether an individual scores higher on a cross-individual model of dispersion (independent of mean effects). In a sense, this means that someone with a higher vPGS has more "noise" in their prediction than another individual-that is, s/he is less well-predicted from a levels (mean values) regression than someone with a lower score. Someone who is low on the plasticity score has a BMI that is predicted better by their levels effects than someone who is high on the score. This, in turn, we think is indicative of someone whose phenotype is more affected by non-additive genetic effects (i.e., epistasis) as well as by environmental effects. That is, imagine two groups: One with a low vPGS and one with a high vPGS. It is the lower-vPGS group that may have a narrow range around a population mean BMI of 25 (say, SD = 2 units), while those with a high vPGS may display the same mean BMI in their group (25) but have a wider dispersion (SD = 4 units). Thus, high vPGS individuals do a better job of buffering differences in environments they encounter and, as a result, their phenotypes vary more widely. In one sense of "plasticity" they are higher, as their phenotype varies more. Turning to our within-person analysis in the HRS, low vPGS-scoring individuals may be more "stable" in their weight from year to year, given the smaller "error" term from the levels regression. Indeed, when we simply compare the standard deviation of BMI within individuals in our HRS sample across waves by a quartile split in the vPGS, irrespective of treatment status, we find that individuals in the lowest quartile of the vPGS score range display a (non-significantly) lower within-person standard deviation in BMI than individuals in the highest quartile (Supplementary Table 4 ). Thus, in the absence of a specific, measured environmental shock, an . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) preprint The copyright holder for this this version posted November 23, 2020. ; https://doi.org/10.1101/2020.11.20.20235895 doi: medRxiv preprint individual with a low vPGS score derived from the cross-person discovery analysis has more phenotypic stability in the within-person analysis (i.e., less plasticity). This may be one reason why we are able to detect significant differences between treated and control individuals up to two years post job loss in the lower half of the vPGS distribution-i.e., in the presence of an unexpected job loss, it takes less plastic individuals longer to recover to their pre-job loss weight. On the other hand, it could be that less plastic individuals display a more stable weight trajectory overallregardless of environmental forces, rendering our ability to see a "specific" environmental effect more clearly. Although our results point to the former explanation, we acknowledge that we cannot draw any definitive conclusions from this study and more analysis is needed to determine the extent to which a vPGS constructed from a cross-person GWAS captures within-person vQTL effects. Overall, with only 374 individuals in our treatment group, we acknowledge that we are likely underpowered to detect precise G x E effects. This could in part explain why we are unable to detect effects between treatment and control groups in the upper part of the vPGS distribution. Specifically, because vPGS individuals display a higher within-person standard deviation in BMI, it may be harder to detect differences between treatment and control groups in smaller samples because their weight oscillates more between waves, independent of any particular treatment effect, as mentioned above. Conversely, it's possible that the true shape of the interaction does not display a crossover effect at higher levels of the vPGS-perhaps because of the larger withinor between-person standard deviation in BMI. Either way, due to a lack of detailed job loss data in other population studies that also collect genetic data on participants, we were unable to pursue replication of our quasi-experimental approach in other samples. Thus, we caution that our results . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) preprint The copyright holder for this this version posted November 23, 2020. ; https://doi.org/10.1101/2020.11.20.20235895 doi: medRxiv preprint are suggestive, and further analysis that can leverage quasi-experimental variation is needed to assess the validity of our approach. There are several limitations of the HRS data, all of which may bias our estimates downwards or reduce the precision of our estimates. First, we only observe BMI in the HRS every two years, which makes it difficult to assess stress-related changes in BMI that are more proximal to the timing of the event, or in the months immediately pre-and post-job loss. It is entirely possible that high vPGS individuals gained or lost more weight than low vPGS individuals in the months following a job loss but they bounced back quicker to their pre-job loss weight than low vPGS individuals, which would make it more difficult to detect differences between high vPGS treatment and control individuals in the subsequent HRS wave. Second, to obtain the largest sample of treated individuals, we were limited to using self-reports of BMI, which may induce measurement error in our estimates. In 2006, the HRS did start collecting in In addition, there is significant complexity surrounding obesity and aging such that differences in BMI may not indicate an actual change in body fat. Higher BMI at midlife is a risk factor for age-related disease and early mortality, however at older ages it might be somewhat protective of mortality because age-related diseases and aging itself are wasting conditions that stimulate significant weight loss. Therefore, while incrementally higher BMI in midlife is more likely a measure of risk for disease, later in life it may actually signal the absence of disease. In . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) preprint The copyright holder for this this version posted November 23, 2020. ; https://doi.org/10.1101/2020.11.20.20235895 doi: medRxiv preprint addition, individuals generally lose muscle mass with increasing chronological age, meaning older individuals could maintain a constant BMI while simultaneously losing lean body mass and gaining a greater portion of adiposity 62 . Thus, any increases in BMI from a job loss may be offset by these other countervailing trends that are inherent to the aging process, which may in part be another explanation the null findings we report for more plastic individuals in the top half of the vPGS distribution. Furthermore, the relatively nominal findings we report may reflect a greater culmination of environmental and lifestyle factors on adiposity in older adults that overwhelm any genetic effects. The genomic influence on BMI has been shown to both weaken over the life course and increase in magnitude since the current obesity epidemic began in the mid- Thus, we caution that PGSs constructed from European ancestry GWAS cannot be generalized to other ancestral populations. On the environmental side, limiting our analysis to white HRS respondents restricts the scope of potential job loss effects that we can observe. Race powerfully . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) preprint The copyright holder for this this version posted November 23, 2020. ; https://doi.org/10.1101/2020.11.20.20235895 doi: medRxiv preprint shapes structural-and institutionally-derived differences in occupational sorting and occupational opportunities across the life course 67, 68 . For example, white HRS respondents were more likely to work in higher status jobs with better working conditions than their Black counterparts 69 , and following a job loss, they were more likely to be reemployed or have additional economic resources to buffer stressful declines in income [70] [71] [72] , all of which may further bias effects from this study downwards. These limitations are counterbalanced by several strengths of our study. The use of a large, nationally representative cohort of individuals from the same ancestry group is an advantage in that it both increases our power to detect effects while also minimizing the presence of ascertainment bias and other selection issues. Having access to detailed, longitudinal job loss data in the HRS also allowed us to exploit a quasi-experimental research design that limited the treatment group to individuals who lost their job due to a business closure while also creating a control group that is matched on a rich suite of characteristics pre-job loss. Current G x E interaction studies that utilize population data are often unable to separate gene-environment correlation (rGE) from G x E effects, which limits our understanding of social-environmental effects on health 27 . Finally, to our knowledge, this is one of the first studies to integrate genetic measures that can separately capture phenotypic mean and variance effects into PGS x E interaction analysis. Genotype data on ~15,000 HRS participants was collected from a random subset of the ~26,000 total participants that were selected to participate in enhanced face-to-face interviews and . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. European ancestry sample included all respondents that had PC loadings within ± one standard deviations for eigenvectors one and two in the PC analysis of all unrelated study subjects and who self-identified as White on survey data. A second set of principal components was then calculated . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this this version posted November 23, 2020. ; https://doi.org/10.1101/2020.11.20.20235895 doi: medRxiv preprint within the European ancestry sample to further account for any population stratification within the sample. The genotype sample has been defined by the HRS and is available on dbGaP 80 . Mean polygenic score (mPGS) construction. We calculated a linear mPGS for the HRS sample based on a GWAS of 457,824 European ancestry individuals in the UK Biobank 48 . Imputed HRS genotype data were accessed through dbGap (phs000428). The mPGS BMI score was constructed in PRSice 81 by taking a weighted sum across the number of SNPs ( ) of the number of reference alleles (zero, one, or two) at that SNP multiplied by the effect size for that SNP ( ): GWAS summary statistics were pruned for linkage disequilibrium (LD) using the clumping procedure in PLINK (R 2 =0.1, range=1000kb) 82,83 . Since these GWAS summary statistics were pre-clumped, no LD-clumping or p-value threshold was implemented in PRSice. After LD clumping was applied, 90,326 SNPs were used to construct the BMI mPGS. The mPGS was standardized to have a mean of zero and a standard deviation of one for all analyses. participants of European ancestry. SNP weights in the vPGS were based on dispersion effects estimated in the UKB using the heteroskedastic linear mixed model (HLMM) approach 16 . Prepruned HLMM summary statistics were obtained from Young et al. 16 . We did not perform additional LD-clumping or p-value thresholding to filter variants. A total of 242,870 SNPs remained in the vPGS model after overlapping the HLMM summary statistics and HRS genotype data. The vPGS was constructed in PRSice 81 and standardized to have a mean of zero and a standard deviation of one for all analyses. . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this this version posted November 23, 2020. ; https://doi.org/10.1101/2020.11.20.20235895 doi: medRxiv preprint using European ancestry UKB samples identified from genetic PCs (data field 22006). To avoid overfitting, HRS samples were not used for model validation. Quality control procedures for the UKB genetic data have been described elsewhere 84 . We excluded participants recommended by UKB (data field 22010), those with conflicting genetically-inferred (data field 22001) and selfreported sex (data field 31), and those who withdrew from the study. We randomly apportioned UKB participants (N=406,873) into training (N=325,498) and testing sets (N=81,375), with an 80-20 split. We applied the HLMM approach to estimate the dispersion effect of each SNP on BMI using samples in the training set, controlling for sex, age, age 2 , age 3 , age × sex, age 2 × sex, age 3 × sex, genotyping array, and the first 40 genetic PCs. Following Young et al., we analyzed related and unrelated samples in the training set separately and performed fixed-effect meta-analysis to combine the results 16 . Related samples were inferred from genetic kinship (third-degree relatives or higher; data field 22021). Random effects were included to account for genetic relatedness in the analysis of related samples. We then pruned SNPs following Young et al. and used dispersion effect estimates to generate vPGS for samples in the testing set. We then fitted a Double Generalized Linear Model (DGLM) to associate the vPGS with the between-individual BMI variance in testing samples 85 . The DGLM takes the form of where # denotes the inverse normal-transformed BMI of individual , # is the vPGS of individual , is the vector of covariates including sex, age, age 2 , age 3 , age × sex, age 2 × sex, age 3 × sex, genotyping array, and the first 40 genetic principal components. Here, ' quantifies the effect of vPGS on the variability of BMI and is the parameter of interest in this analysis. The . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this this version posted November 23, 2020. ; https://doi.org/10.1101/2020.11.20.20235895 doi: medRxiv preprint vPGS was standardized with a mean of 0 and a variance of 1. We fitted DGLM using the dglm package 86 in R. To assess the performance of vPGS after adjusting for the effect of mPGS, we performed a standard, mean-effect GWAS of BMI on the training set and used the effect estimates to generate mPGS for the testing samples. GWAS summary statistics were pruned for LD using the clumping procedure in PRSice (R2=0.1, range=250kb) when calculating the mPGS 81 . We then fitted the same DGLM model as above with the mPGS added to the vector of covariates. The mPGS was standardized with a mean of 0 and a variance of 1. For each observation, we used information from two wavesbefore and after treatment. Before treatment (t-2), all respondents were working for pay either full-or part-time. At the following HRS interview two years later (t), respondents in the treatment group report they were no longer working for their previous-wave employer. These respondents were asked why they left their employer. Possible answers included 'business closed', 'laid off/let go', 'poor health/disabled', 'quit' 'family care', 'better job', 'retired', 'family moved', 'strike', 'divorce/separation', 'transportation/distance to work', and 'early retirement incentive/offer'. Respondents could report up to three reasons. Our definition of exogenous job loss includes observations that reported being laid off due to a business closure. We excluded workers who also stated that they quit or left for health reasons but included workers who stated they were also laid off or let go 40 . For the control group, we used individuals who reported working for the same employer the entire time they were in the sample-i.e., we did not include individuals in the control group if they ever quit their job or were laid off for any reason. Treated individuals are only in the analytic sample for two waves, or pre-and post-job loss. Control individuals can be in the analytic sample for multiple HRS waves. In the whole observation period, there were 374 instances of . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. A traditional DiD setting assumes that after conditioning on a vector of observables , the BMI of individuals in the treatment group would have evolved similarly over time to the BMI of individuals in the control group if they had never been laid off: Where #* − #*+, refers to the change in BMI before and after the treatment, BC denotes the treatment group indicator (i.e., whether an individual lost their job due to a business closure), anddenotes an individual in the control group with the same characteristics as individual in the treatment group. While conditioning on genotype and a rich set of covariates minimizes the . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this this version posted November 23, 2020. ; https://doi.org/10.1101/2020.11.20.20235895 doi: medRxiv preprint possibility of violating this assumption, other systematic differences between the treated and control groups may remain even after conditioning on observables. To minimize potential confounding from unobservable characteristics, we used the weights from propensity score matching (W) to reduce unmeasured differences between the treatment and control groups that could bias estimates: Covariates used to estimate the propensity score, or the probability of treatment, were also included in the DiD regression model. Thus, coefficients from the regression-adjusted semiparametric DiD matching estimator are considered "doubly robust" because the estimator is consistent if the regression model or the propensity score model is correctly specified 88, 89 . As a result, the DiD matching estimator accounts for selection on observable and unobservable variables with time invariant effects, or the model allows for systematic differences between treatment and control groups even after conditioning on observables 90 . Difference-in-differences (DiD) empirical strategy. Our empirical strategy can be broken down into three parts. First, we estimated propensity scores using a probit regression that regresses business closures on the mPGS and vPGS, as well as a rich set of covariates that are both standard in the job loss literature and satisfy the conditional independence assumption-i.e. they influence job loss and/or changes in BMI 87, 91 . In addition, we only conditioned on observables that were unaffected by job loss (or the anticipation of it), or variables that were either fixed over time or measured in t-2 57 . A complete list of covariates can be found in Supplementary Table 1 . To avoid losing observations with missing information on a covariate, we set missing values equal to zero and included an additional dichotomous variable that is equal to one if the observation is . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) preprint The copyright holder for this this version posted November 23, 2020. ; https://doi.org/10.1101/2020.11.20.20235895 doi: medRxiv preprint missing. As a result, matching is not only on observed values but also on the missing data pattern 31,92 . Throughout, we restricted our analysis to the region of common support, or the subset of individuals in the control group that were comparable to individuals in the treatment group 91 . Specifically, we dropped treatment observations whose propensity score was greater than the maximum or less than the minimum propensity score of the controls. We used the estimates from the probit regression to compute the weights for the control group with kernel matching, a nonparametric matching estimator that uses the weighted averages of all observations on common support to construct the counterfactual outcome 87, 90 . Specifically, the weight given to a non-treated individual was in proportion to the closeness of their observables to treated individual : Where is the propensity score for individual or in the treated or control group, respectively, [⋅] is the kernel function, and is the bandwidth parameter. We used the program psmatch2 93 in Stata 14 to compute ( , ) with the Epanechnikov kernel function and a bandwidth of 0.06 87 . In addition, when computing the weights, we performed exact matching on survey year and sex in t-2. This ensured 1) individuals who were laid off were matched with controls from the same time period, and 2) treated individuals were grouped with same-sex non-treated individuals. In the final step, we incorporated the weights from propensity score matching into the DiD regression model: . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) preprint The copyright holder for this this version posted November 23, 2020. ; https://doi.org/10.1101/2020.11.20.20235895 doi: medRxiv preprint Where BC is an indicator for job loss due to a business closing in the years between HRS survey waves, or between t-2 and t for individual , is a vector of observable time invariant and variant covariates measured at t-2, including the first 10 principal components of the genetic data. We also include *+, to control for baseline BMI, or to estimate deviations in BMI between t-2 and t. All regressions were estimated with robust standard errors. . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. This model is similar to the DiD model outlined above except the business closure term is replaced by a series of event terms that are the product of indicators for each HRS survey year ( ) relative to the survey year the respondent reported a job loss # * , ( − # * = ), and their treatment status The omitted category is the survey year prior to treatment ( ≠ −2). We also present ETS results for the full sample that includes controls for the vPGS. Each estimate of 8 gives the difference in BMI for treated individuals compared to non-treated individuals relative to this excluded year. If outcomes were evolving similarly for treated and untreated individuals prior to a business closure, the coefficient estimates for < 0 should be close to zero and not statistically significant. . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) preprint The copyright holder for this this version posted November 23, 2020. ; https://doi.org/10.1101/2020.11.20.20235895 doi: medRxiv preprint . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) preprint The copyright holder for this this version posted November 23, 2020. ; https://doi.org/10.1101/2020.11.20.20235895 doi: medRxiv preprint . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) preprint The copyright holder for this this version posted November 23, 2020. ; https://doi.org/10.1101/2020.11.20.20235895 doi: medRxiv preprint Table 1 and defined in detail in Supplementary Table 1. The first column includes all workers, regardless of ancestral background or the availability of genetic data. Columns 2-5 report results for the European ancestry analytic sample with genetic data. Individuals in the control group can have multiple observations. In the full HRS sample, unique N(control)=6,217; in the analytic sample, unique N(control)=3,564. *FDR corrected p-value<0.05. . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) preprint . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) preprint Occupational status a Binary (0/1) variables for blue collar and service workers. The omitted category is white collar workers. Job tenure a Current job tenure in years. Health status 1=excellent or very good self-reported health; 0=good, fair, or poor self-reported health. Health insurance 1=covered by a federal or employer-sponsored health insurance program; 0=otherwise. Exercise 1=exercises vigorously three or more times per week; 0=otherwise. Ever smoke cigarettes 1=smoked 100 or more cigarettes in their lifetime; 0=otherwise. Cigarettes per day a Total number of cigarettes smoked per day, excluding pipes or cigars. Variable is set equal to zero if respondent does not smoke. Drinks per week Total number of alcoholic drinks per week Doctor diagnosed psychiatric issue a 1=reports doctor diagnosed emotional or psychiatric problems; 0=otherwise. Survey year Binary (0/1) variables for 1994-2012. The omitted year is 1992. a Variables with additional category for missing values. Analyses also controls for the first 10 principal components of the European ancestry genetic data. . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) preprint The copyright holder for this this version posted November 23, 2020. ; https://doi.org/10.1101/2020.11.20.20235895 doi: medRxiv preprint . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) preprint The copyright holder for this this version posted November 23, 2020. Note: Analytic sample consists of white, non-Hispanic workers aged 50-70 who were not self-employed. . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) preprint The copyright holder for this this version posted November 23, 2020. ; https://doi.org/10.1101/2020.11.20.20235895 doi: medRxiv preprint Each regression is run separately for the full analytic sample (Column 1), and then stratified by high versus low vPGS groups (Columns 2 and 3). vPGS groups were stratified at the median. Additional covariates are listed in Table 1 and defined in detail in Supplementary Table 1 . Individuals in the treated and control groups can have multiple observations. Unique N(treated)=374; unique N(control)=3,564. . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) preprint The copyright holder for this this version posted November 23, 2020. ; https://doi.org/10.1101/2020.11.20.20235895 doi: medRxiv preprint BMI statistics were calculated for control and treated individuals for all waves they were observed in the HRS between the ages of 50 and 70 (regardless of whether or not all waves were included as treated or control observations in the final analytic sample). Person-wave observations (N=14,235). Total unique observations (N=3,938). . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) preprint The copyright holder for this this version posted November 23, 2020. ; https://doi.org/10.1101/2020.11.20.20235895 doi: medRxiv preprint Unemployment rates by age, sex, and marital status, seasonally adjusted. Labor Force Statistics from the Current Population Survey A first in nearly 50 years, older workers face higher unemployment than mid-career workers. Status of Older Workers Report Series Can unemployed older workers find work Older workers, retirement, and the great recession What makes retirees happy? The persistence of depressive symptoms in older workers who experience involuntary job loss: Results from the health and retirement survey Retirement and subjective well-being Intermittent lack of health insurance coverage and use of preventive services Rough passage: affordable health coverage for near-elderly Americans Earnings losses of displaced workers The effects of job displacement on job quality: Findings from the Wisconsin Longitudinal Study Job loss in the United States Unemployment and change of tobacco habits: a study of young people from 16 to 21 years of age Stress-related eating and drinking behavior and body mass index and predictors of this behavior Current Challenges and New Opportunities for Gene-Environment Interaction Studies of Complex Diseases Identifying loci affecting trait variability and detecting interactions in genome-wide association studies Interactions between polygenic scores and environments: Methodological and conceptual challenges Power and predictive accuracy of polygenic risk scores Recent developments in statistical methods for detecting genetic loci affecting phenotypic variability FTO genotype is associated with phenotypic variability of body mass index Quantile treatment effects of job loss on health Involuntary job loss as a risk factor for subsequent myocardial infarction and stroke: Findings from The Health and Retirement Survey Association analyses of 249,796 individuals reveal 18 new loci associated with body mass index Genetic studies of body mass index yield new insights for obesity biology New genetic loci link adipose and insulin biology to body fat distribution Meta-analysis of genome-wide association studies for height and body mass index in ∼700000 individuals of European ancestry Mixed-model association for biobank-scale datasets The bigger picture of FTO-the first GWAS-identified obesity gene FTO obesity variant circuitry and adipocyte browning in humans Polygenic prediction of weight and obesity trajectories from birth to adulthood The influence of age and sex on genetic associations with adult body size and shape: a large-scale genome-wide interaction study Gene-environment interactions related to body mass: School policies and social context as environmental moderators Lifetime socioeconomic status, historical context, and genetic inheritance in shaping body mass in middle and late adulthood The genome-wide influence on human BMI depends on physical activity, life course, and historical period The bias due to incomplete matching Some practical guidance for the implementation of propensity score matching Changing polygenic penetrance on phenotypes in the 20th century among adults in the US population Association of a genetic risk score with body mass index across different birth cohorts Phenotypic plasticity and the origins of diversity Mortality selection in a genetic sample and implications for association studies Association of a genetic risk score with BMI along the life-cycle: evidence from several US cohorts Generalization and dilution of association results from european GWAS in populations of non-European ancestry: the PAGE study Human demographic history impacts genetic risk prediction across diverse populations Genome-wide association studies in diverse populations The Structure of disadvantage: individual and occupational determinants of the black-white wage gap The Extent of occupational segregation in the United States: differences by race, ethnicity, and gender. Ind. Relations A Structural racism in the workplace: does perception matter for health inequalities? Jobs lost, jobs regained: an analysis of black/white differences in job displacement in the 1980s After the rainy day: how private resources shape personal trajectories following job loss and amplify racial inequality Family adaptations to income and job loss in the U.S Biological robustness Phenotypic plasticity's impacts on diversification and speciation Biological robustness: paradigms, mechanisms, and systems principles Social determinants of health and survival in humans and other animals An overview of the health and retirement study Cohort profile: the health and retirement study (HRS) Quality control report for genotypic data Health and retirement study (HRS) dbGaP study accession: phs000428.v2 PRSice: polygenic risk score software Second-generation PLINK: rising to the challenge of larger and richer datasets The UK Biobank resource with deep phenotyping and genomic data Genetic heterogeneity of residual variance -Estimation of variance components using double hierarchical generalized linear models Matching as an econometric evaluation estimator: evidence from evaluating a job training programme Toward a curse of dimensionality appropriate (CODA) asymptotic theory for semi-parametric models Nonparametric estimation of average treatment effects under exogeneity: a review Does matching overcome LaLonde's critique of nonexperimental estimators? Causal effects in nonexperimental studies: reevaluating the evaluation of training programs Matching methods for causal inference: a review and a look forward psmatch2: stata module to perform full Mahalanobis and propensity score matching, common support graphing, and covariate imbalance testing