key: cord-0944490-t8inkhvr authors: Raparelli, Valeria; Norris, Colleen M.; Bender, Uri; Herrero, Maria Trinidad; Kautzky-Willer, Alexandra; Kublickiene, Karolina; El Emam, Khaled; Pilote, Louise title: Identification and inclusion of gender factors in retrospective cohort studies: the GOING-FWD framework date: 2021-04-09 journal: BMJ Glob Health DOI: 10.1136/bmjgh-2021-005413 sha: 1ab9a46a90674ee9a0439bd119595882c7ae7f4e doc_id: 944490 cord_uid: t8inkhvr Gender refers to the socially constructed roles, behaviours, expressions and identities of girls, women, boys, men and gender diverse people. Gender-related factors are seldom assessed as determinants of health outcomes, despite their powerful contribution. The Gender Outcomes INternational Group: to Further Well-being Development (GOING-FWD) project developed a standard five-step methodology applicable to retrospectively identify gender-related factors and assess their relationship to outcomes across selected cohorts of non-communicable chronic diseases from Austria, Canada, Spain, Sweden. Step 1 (identification of gender-related variables): Based on the gender framework of the Women Health Research Network (ie, identity, role, relations and institutionalised gender), and available literature for a certain disease, an optimal ‘wish-list’ of gender-related variables was created and discussed by experts. Step 2 (definition of outcomes): Data dictionaries were screened for clinical and patient-relevant outcomes, using the International Consortium for Health Outcome Measurement framework. Step 3 (building of feasible final list): a cross-validation between variables per database and the ‘wish-list’ was performed. Step 4 (retrospective data harmonisation): The harmonisation potential of variables was evaluated. Step 5 (definition of data structure and analysis): The following analytic strategies were identified: (1) local analysis of data not transferable followed by a meta-analysis combining study-level estimates; (2) centrally performed federated analysis of data, with the individual-level participant data remaining on local servers; (3) synthesising the data locally and performing a pooled analysis on the synthetic data and (4) central analysis of pooled transferable data. The application of the GOING-FWD multistep approach can help guide investigators to analyse gender and its impact on outcomes in previously collected data. The distinction between sex and gender, which is clear and common in social sciences, has largely been neglected in health sciences. Indeed, sex and gender are often erroneously used and/or measured interchangeably. Given that sex and gender are not independent of each other, solely assessing one or the other cannot account for identified variations in health. 1 2 Furthermore, although the reasons explaining the increasing incidence of chronic diseases are incompletely understood, changing family, social, institutional roles and attitudes of men and women in the last decades ultimately play a role. Thus, a wide range of behavioural factors, psychosocial processes, personal, cultural and societal factors can create, suppress or amplify underlying biological health differences. 3 4 While differences in health status and outcomes have been attributed to biological sex, it is now increasingly recognised that both sex and gender influence the risk of developing certain diseases, presentation of symptoms, severity of illness, response to drugs or nonpharmacological interventions and seeking care behaviours. 5 More importantly, gender Well-being Development (GOING-FWD) framework is a feasible 5-step methodology to assess the impact of gender domains on clinical and patient-relevant outcomes in retrospective studies, allowing a crosscountries comparison. ► A multidisciplinary international team (ie, specialists in life science, social science and computer science; patient partners) built the methodology, guaranteeing that the interests of all stakeholders be (were?) represented. ► The lack of a standardised definition of gender and data accessibility/protection issues are expected obstacles to the applicability of GOING-FWD methodology. ► Strategies to minimise potential drawbacks of the methodology are provided such as the derived 'wish list' of gender-related factors or the application of privacy-enhancing technologies including tools for federated analysis. BMJ Global Health may also have a bearing on people's access to and uptake of health services and the resulting health outcomes experienced throughout the life-course. 6 Consequently, it is now understood that the intersectionality of gender with other social factors such as race, age, ethnicity, culture and sexual orientation, plays a central role in an individual's health. The integration of a gender-based framework in health research is a crucial and long-awaited development. 7 When considering gender aspects in the evaluation of clinical outcomes, the first challenge for scientists originates from the apparent lack of standardised method to measure the complexities that gender encompasses. Recently, through a Pan-Canadian collaboration of a multidisciplinary team of scientists, a comprehensive list of gender-related variables was established and collected in the setting of premature cardiovascular disease. Constructed with the aim of exploring the impact of gender on the clinical outcomes of young patients with acute coronary syndrome, the Gender and Sex Determinants of Cardiovascular Disease: From Bench to Beyond Gender Score (GPGS) was developed. 8 9 The GPGS measures a comprehensive group of genderrelated factors and offers a pragmatic means to prospectively explore the relationships between sex, gender and health outcomes. In patients with premature and established cardiovascular disease, gender factors, independent of biological sex, emerged as powerful predictors of the acquisition of risk factors as well as of 1-year adverse health outcomes. 9 10 Most significantly, regardless of sex, patients who exhibited gender factors most traditionally ascribed to women's identity and roles in society were more likely to have a recurrent cardiac event within the first year. While these results have important direct implications for expanding the measurement of gender determinants of health to other populations, they may also identify novel determinants of healthcare costs that could be averted. To facilitate the integration of sex and gender-based analyses, we developed a standard methodology that can be applied to retrospective studies for testing the associations of gender-related factors with clinical and patientrelevant outcomes. The Gender Outcomes INternational Group: to Further Well-being Development (GOING-FWD) is a personalised medicine project that explores the effect of sex and gender on outcomes across already available datasets using feasible approaches to perform both traditional and machine-learning-based analytics. It was recently cofunded by the Canadian Institutes of Health Research, and GENDER-NET plus, which is a part of the European EU H2020 initiative (http:// gender-net-plus. eu/ jointcall/ funded-projects/ going-fwd/). For the GOING-FWD project, around thirty accessible databases of observational studies and registries that include non-communicable chronic diseases (NCDs) among a four-country transatlantic network (ie, Austria, Canada, Spain and Sweden) were identified. The overarching aims of the GOING-FWD project were (1) to integrate sex and gender dimensions in applied health research and (2) to evaluate their impact on clinical cost-sensitive outcomes and patient-reported outcomes related to quality of life in NCDs including cardiovascular disease, metabolic disease, chronic kidney disease (CKD) and neurodegenerative disease. Each partner of the Consortium provided the data dictionary of the retrospective cohort studies conducted in their respective countries. The GOING-FWD Consortium is composed of investigators with multidisciplinary expertise in gender dimension, psychosocial science, computer science, epidemiology, endocrinology, internal medicine, renal and cardiovascular medicine, reproductive health, neuroscience, preclinical and clinical experimental research, health outcome research, nursing and biostatistics. The investigators were assigned to one of the three-work packages. The GOING-FWD methodology proposed therein is the result of the integrated activities carried out by the GOING-FWD investigators from March 2019 to December 2019. A five-step procedure was developed that can be applied to pre-existing observational cohorts for the integration of gender-related factors in assessing their association with selected health outcomes. GOING-FWD also has a patient partner advocate group. All interactions with patient partners are based on inclusiveness, support, mutual respect and cobuilding. For example, patient partners can assist in knowledge dissemination (eg, summer institutes, online educational materials, trainee journal club meetings, public forum presentations, may coauthor manuscripts and provide feedback on draft manuscripts during development and participate in teleconferences. A patient partner representative also attends monthly GOING-FWD steering meetings. A multistep methodology was developed as summarised in figure 1. Step 1: identification of gender-related variables Based on the data dictionaries provided by all participating centres, a preliminary list of the gender-related factors available in selected datasets was compiled by the coordinating centre. The template including the identified sex and gendered factors was presented and discussed at the first consortium meeting (Montréal, April 2019) by all investigators and stakeholders. Guided by the gender framework of the Women Health Research Network (ie, gender identity, gender role, gender relations and institutionalised gender) 11 (figure 2), and available literature in the four NCDs areas, the investigators created an optimal 'wish list' of gender-related variables/ factors: the definitions and validity of the proposed variables were discussed and expert consensus reached. Investigators considered variables that differ between men and women in terms of prevalence and/or identified (in the published literature) as exerting different effects on the outcomes of men and women as 'gender-related' variables. A revised draft template including additional gendered variables was created (table 1) . Step 2: definition of outcomes Each of the cohort data dictionaries was screened for outcomes of interest (including clinical/survival and patient-reported outcomes) by the coordinating centre. Similar to gendered variables, a second working group was tasked with developing a list of outcome variables, using the International Consortium for Health Outcome Measurement (ICHOM) framework 12 [The ICHOM Standard Sets are standardised outcomes, measurement tools, time points and risk adjustment factors for a given condition (eg, CKD, diabetes, etc). Developed by a consortium of experts and patients in the field of outcomes research, the ICHOM Standard Sets focus on clinical and patient-centred outcomes. By creating a standardised list of the outcomes based on the patient's priorities, the ICHOM framework ensures that the patient remains at the centre of care] and cost-sensitive variables and/or patient-reported outcome measures (PROMs) collected in all databases as the basic 'outcomes variable list'. A prespecified list of potential outcomes was created by all GOING-FWD investigators. Depending on the study population and type of dataset (eg, administrative, observational cohort study), we identified as relevant the following cost-sensitive outcomes: (1) inpatient outcomes including: in-hospital length of stay, in-hospital complications and/or death and readmission within 30 days of discharge; (2) outpatient outcomes including: access and/or numbers of visits and procedures, admissions, death, progression of disease and disability. We also looked for the availability of any PROMs, including symptoms (eg, pain), functioning, health related quality of life, depression and stress. The ICHOM specific-disease outcomes were considered for each of the four main clinical areas of interest. The revised draft template with outcomes, compiled by the investigators, was discussed and approved by all (table 2). Step 3: building of feasible final list The two lists were sent to each participating centre to rescreen their datasets for the presence of the identified sex-and gender-related as well as outcome variables. A cross-validation between gender-related and outcomes variables available per database was performed both locally and centrally. In case of disagreement or discordant definitions of variables among the wish-list and the actuallist, a discussion to reach consensus between coordinator centre and local principal investigators was performed. In principle, a more inclusive approach was pursued for both gender-related variables and outcomes definition. After the double check of wish-list and local actual-list, the final feasible list of variables (core dictionary) was built, and each country partner used the lists to apply to their respective research ethics boards according to the country regulations. Step 4: retrospective data harmonisation Once the final list was compiled, the harmonisation potential of gender-related and outcomes variables was assessed using the Maelström Research guidelines for rigorous retrospective data harmonisation and merging when possible 13 (Core Dataset). The harmonisation across the different databases is a premise for assessing the feasibility of big data analysis, as well as minimising deviations in data measurement across independently recruited databases. Data harmonisation methodology consists of assessing the presence and definitions of common variables across the different databases, followed by the creation of a harmonised dataset and subsequent extraction of information from study-specific datasets into the harmonised dataset. For example, while many datasets may record smoking status, the exact definition of this variable may differ between datasets: one may define this variable as dichotomous, others may quantify the number of cigarettes smoked by the participant. Through data harmonisation, a new variable definition is created to include the information from each of these datasets, which in this case would be reduced to smoking status as a dichotomous variable. Throughout this process, harmonised definitions that are created are scrutinised until a consensus is reached. Step 5: definition of data structure Beyond harmonisation, the structure and countryspecific management of health data was recognised as crucial to planning and conducting the final analysis addressing the relationships between the genderrelated factors. The analysis plans for each country will be based on the following options: (1) if data are not transferable even when anonymised-study-specific data analyses will be done locally followed by a meta-analysis combining study-level estimates; (2) for multiple cohorts in different countries, analyses will be done centrally, but the individual-level participant data will remain on local servers using a federated analysis approach; (3) the local data are synthesised and then a pooled analysis of the synthesised data is performed or (4) if the data are transferable: data will be pooled and analysed at a central location ( figure 3 ). The GOING-FWD methodology is a multistep process that provides insights on how to identify gender-related factors when variables have already been collected through the merging of several datasets and how to design the analysis plan on the core dataset. The multidimensionality of gender aspects might be effectively captured due to its complexity by non-traditional analytic approaches. The Big Data paradigm shift is significantly transforming healthcare and biomedical research. 14 Massive volumes of aggregated biomedical data often display different levels of granularity fostering the capability to explore, on large international scales, the effect of variables such as gender and/or sex. Big data allows researchers to overcome sample size issues and perform types of analysis such as interaction or mediation that would not be feasible and reliable in small cohorts/ studies. Nevertheless, there are some important issues related to data privacy and the merging of different databases when a cross-country comparison is planned, especially where issues on general data protection regulation need to be addressed in detail. Of note, the GOING-FWD challenges in ensuring data privacy and protection have foster our effort to develop techniques like synthetic data to make amalgamation of data possible. The GOING-FWD framework is a feasible methodology to foster the assessment of the gender impact on outcomes in retrospective studies. The screening of each dataset is a step that not only allows to identify the gender-related variables but also provides the rationale for selecting psycho-social factors that could be collected prospectively in the same cohort. The effort of investigating how sex and gender-related factors impact clinical and patient-related outcomes in NCDs is essential as it provides evidence for sex-tailored and gender-tailored interventions. We have learnt that a multidisciplinary team is a prerequisite for developing such methodology including gender experts and patient partners. Patients with lived experience can contribute to understanding what is really important for a specific disease which further strengthens the concept of patient empowerment in clinical practice. The international nature of GOING-FWD methodology highlights important considerations on the complexity of gender. Gender norms, identities and relations vary by culture, historical era, ethnicity, socioeconomic status, geographical location and other factors. We expect that the gender behaviours and attitudes captured by our variables may differ among women, men, and gender diverse individuals as well as between these groups. Gender norms also change overtime and across countries. Furthermore, everyone over time can be exposed to different degree of any gender-related factors. Impactful findings generated by retrospective analyses ca stimulate the scientific community to conduct a prospective collection on gender-factors in the design of future studies. Therefore, as researchers, we need to recognise the dynamic nature of integrating gender in clinical research questions and act accordingly. We also envisioned our multicountry analyses as an opportunity to capture institutional gender by including some country specific variables that are commonly available like the Gender Inequality Index (GII) developed by the United Nations Development Programme. 15 The GII is a composite measure to quantify gender inequality within a country and measures opportunity costs, reproductive health empowerment and labour market participation. Another similar measure of gender is the European Institute for Gender Equality's Gender Equality Index, which includes additional details about country specific domains of health, violence against women, work, money, knowledge time and power. 16 The idea is to relook and rethink on how we can gain the most from data on gender that are already available. Finally, the GOING-FWD approach is timely and might foster inclusion of gender in understanding the COVID-19 pandemic. In fact, the global COVID-19 economic and medical crisis could be the first outbreak where sex and gender differences are recorded and taken into account by researchers and policy makers. The GOING-FWD methodology will be instrumental in exploring the impact of various gender domains on outcomes across countries. In developing the GOING-FWD methodology, we have faced practical challenges. First, the lack of a standardised definition of gender-related factors is perceived as an obstacle to researchers even if they are interested in the topic. The low availability of gender-related factors in retrospective studies is not surprising but this should not preclude analyses. We strongly encourage clinical and even preclinical researchers to start from what they have even if only one gender-related factor is available. Merging more datasets allows us to perform analyses that incorporate interaction and mediation given large sample sizes. Second, in the current era, data accessibility and data protection issues in international networks can represent a deal breaker in pursuing this kind of research approach. Increasingly strict data protection regulations in many jurisdictions limit the ability to share sensitive health information. This requires the application of privacy enhancing technologies to enable the necessary analyses to be performed without the transfer of personal health information. Finally, harmonisation is a necessary step to allow big data analysis, but it is a time -consuming process and susceptible to pitfalls related to the quality of the process and difficulties of maintenance when several databases from different countries are merged. Personnel with explicit knowledge and skills are required to perform data harmonisation from both technical (ie, computing science, mathematics) and clinical (ie, life science) perspectives. We believe that our example of a derived 'wish list'" based on selected variables offers a standardised tool that can be widely used to explore the consistency of associations with health behaviours and outcomes. The GOING-FWD Consortium, a multidisciplinary network of Canadians and European researchers and patient partners, provides a framework that will support clinical researchers in integrating gender relevant factors in their research questions when using already collected databases hence providing solutions for the challenges that such approach poses. In conclusion, the application of a systematic multistep approach defining gender-related variables, the use of data harmonisation and country-specific data structure models, inform the identification and inclusion of gender factors in retrospective cohort studies. Gleaning important information on gender will not only strengthen current clinical practice but will also provide a stepping-stone for sex-tailored and gender-tailored interventions and care. Rubee Dev Angie Fullerton/MacCaul; Jeanette Smith Principal Investigators University of British Columbia, Canada; Monica Parry, Lawrence S. Bloomberg Faculty of Nursing, Canada Bloomberg Faculty of Nursing, Canada; Carola Deschinger British Columbia, Canada; Rachael Manion, Ontario, Canada. Contributors VR: conception, design, interpretation of data, drafting of the manuscript and revision; CMN: conception, design, interpretation of data AKW: design, interpretation of data, revision of the draft; KK: design, interpretation of data, revision of the draft; KEE: analysis and development of new IT solution, synthetic data; LP: conception, design, interpretation of data, drafting of the manuscript and revision; GOING-FWD Consortium: design contribution, acquisition of data and revision of the draft Number: GNP-78): The Canadian Institutes of Health Research (GNP-161904), La Caixa Foundation (LCF/PR/DE18/52010001), The Swedish Research Council (2018-00932) and The Austrian Science Fund (FWF, I 4209). VR was funded by the Scientific Independence of Young Researcher Program of the Italian Ministry of University LP have nothing to disclose; KEE is co-founder, director, and investor in Replica Analytics Ltd, a CHEO Research Institute / University of Ottawa spinoff company that develops data synthesis software Ethics approval Ethics approval for the project was obtained from the coordinator centre at McGill University Open access This is an open access article distributed in accordance with the Creative Commons Attribution Non Commercial (CC BY-NC 4.0) license, which permits others to distribute, remix, adapt, build upon this work non-commercially, and license their derivative works on different terms, provided the original work is REFERENCES 1 CIHR, Online Training Modules. Integrating Sex & Gender in Health Research Gender roles in persistent sex differences in health-related quality-of-life outcomes of patients with coronary artery disease Gender norms and health: insights from global survey data GBD 2013 Risk Factors Collaborators. Global, regional, and national comparative risk assessment of 79 behavioural, environmental and occupational, and metabolic risks or clusters of risks in 188 countries, 1990-2013: a systematic analysis for the global burden of disease study 2013 Clinical advances in sex-and Gender-Informed medicine to improve the health of all: a review Gender differences in utilization of preventive care services in the United States Sex and gender analysis improves science and engineering A composite measure of gender and its association with risk factors in patients with premature acute coronary syndrome Sex versus gender-related characteristics: which predicts outcome after acute coronary syndrome in the young? The contribution of gender to apparent sex differences in health status among patients with coronary artery disease Better science with sex and gender: facilitating the use of a sex and gender-based analysis in health research International Consortium for health outcomes measurement Maelstrom research guidelines for rigorous retrospective data harmonization Predicting the Future -Big Data, Machine Learning, and Clinical Medicine Gender inequality index (GII), 2020. Available Gender equality index