key: cord-0704839-kqmt6twu authors: Cox, Lisa Ann; Hwang, Stephen; Haines, Jonathan; Ramos, Erin M.; McCarty, Catherine A.; Marazita, Mary L.; Engle, Michelle L.; Hendershot, Tabitha; Pan, Huaqin (Helen); Hamilton, Carol M. title: Using the PhenX Toolkit to Select Standard Measurement Protocols for Your Research Study date: 2021-05-26 journal: Curr Protoc DOI: 10.1002/cpz1.149 sha: e8e0638e1552976da83ccd6b1e71ff26dbffe356 doc_id: 704839 cord_uid: kqmt6twu The goals of PhenX (consensus measures for Phenotypes and eXposures) are to promote the use of standard measurement protocols and to help investigators identify opportunities for collaborative research and cross‐study analysis, thus increasing the impact of individual studies. The PhenX Toolkit (https://www.phenxtoolkit.org/) offers high‐quality, well‐established measurement protocols to assess phenotypes and exposures in studies with human participants. The Toolkit contains protocols representing 29 research domains and 6 specialty collections of protocols that add depth to the Toolkit in specific research areas (e.g., COVID‐19, Social Determinants of Health [SDoH], Blood Sciences Research [BSR], Mental Health Research [MHR], Tobacco Regulatory Research [TRR], and Substance Abuse and Addiction [SAA]). Protocols are recommended for inclusion in the PhenX Toolkit by Working Groups of domain experts using a consensus process that includes input from the scientific community. For each PhenX protocol, the Toolkit provides a detailed description, the rationale for inclusion, and supporting documentation. Users can browse protocols in the Toolkit, search the Toolkit using keywords, or use Browse Protocols Tree to identify protocols of interest. 
The PhenX Toolkit provides data dictionaries compatible with the database of Genotypes and Phenotypes (dbGaP) and Research Electronic Data Capture (REDCap), along with data collection worksheets, to help investigators incorporate PhenX protocols into their study design. The PhenX Toolkit also provides resources to help users identify published studies that used PhenX protocols. © 2021 The Authors. Current Protocols published by Wiley Periodicals LLC.
Basic Protocol: Using the PhenX Toolkit to support or extend study design
The PhenX Toolkit (https://www.phenxtoolkit.org/) is a catalog of recommended measurement protocols intended to help standardize data collection. In the PhenX Toolkit, a "protocol" is the methodology used to collect the data to address a specific concept or measure. PhenX protocols address a wide range of research domains and are suitable for use in genomic, biomedical, clinical, epidemiologic, and translational research studies (Stover, Harlan, Hammond, Hendershot, & Hamilton, 2010). The Toolkit provides detailed protocols for data collection as well as tools to help investigators incorporate these protocols into their research studies. Use of PhenX protocols facilitates collaboration and cross-study analyses, potentially increasing the scientific impact of individual studies. The PhenX Toolkit is a freely available resource developed through a consensus process. PhenX is driven by the scientific community, including oversight by a Steering Committee, recommendations of Working Group (WG) experts, and feedback from the broader scientific community (Maiese et al., 2013). The PhenX Steering Committee established the criteria for WGs to consider as they decide which protocols to recommend for inclusion in the Toolkit (Table 1). PhenX WGs use these guidelines as they review and select well-established, broadly validated protocols.
The PhenX Toolkit currently includes 29 research domains (Table 2) and 6 specialty collections (Table 3) that together include 874 protocols. Research domains add breadth to the Toolkit, whereas specialty collections add depth to the Toolkit in specific areas of research (Figs. 1 and 2). Several National Institutes of Health (NIH) institutes have issued guide notices announcing the availability of PhenX protocols for specific areas of research. Social Determinants of Health (SDoH) and COVID-19 specialty collections have recently been added to the Toolkit. When designing a new study or expanding an existing study, investigators may visit the Toolkit to search, browse, review, and select PhenX protocols to include in their study. Using PhenX protocols can help investigators ensure that their study data will be compatible with data from other studies that implement these standard data collection methods (Phillips et al., 2017). Knowing that PhenX protocols are recommended by experts, investigators can be confident that they are adding high-quality protocols to their studies even when those protocols are from research topics outside their realm of expertise.
Table 2 (research domains)
[…] (19)
Anthropometrics (31)
Cancer (12)
Cancer outcomes and survivorship (15)
Cardiovascular (14)
Demographics (16)
Diabetes (16)
Environmental exposures (15)
Gastrointestinal (15)
Genomic medicine implementation (15)
Geriatrics (16)
Infectious diseases and immunity (17)
Neurology (33)
Nutrition and dietary supplements (15)
Obesity (14)
Ocular (18)
Oral health (15)
Pediatric development (18)
Physical activity and physical fitness (19)
Pregnancy (21)
Psychiatric (31)
Psychosocial (25)
Rare genetic conditions (20)
Reproductive health (24)
Respiratory (21)
Skin, bone, muscle, and joint (10)
Smoking cessation, harm reduction, and biomarkers (11)
Social environments (19)
Speech, language, and hearing (23)
Note: The number in parentheses is the number of protocols in the domain.
Table 3 (specialty collections)
[…] (12)
• Ethnicity, race, and demographics (13)
• History, treatment, and outcomes (8)
• Information resources (4)
• Psychosocial and mental health (14)
• Socioeconomic (12)
Mental health research
• Suicide (21)
• Post-traumatic stress psychopathology (including PTSD) (14)
• Eating disorders (19)
• Early psychosis clinical services (26)
• Early psychosis translational research (10)
Social determinants of health
• Social determinants of health: core (16)
• Individual social determinants of health (12)
• Structural social determinants of health (10)
Substance abuse and addiction
• Assessment of substance use and substance use disorders (15)
• Substance-specific intermediate phenotypes (21)
• Substance use-related neurobehavioral and cognitive risk factors (9)
• Substance use-related psychosocial risk factors (13)
• Substance use-related community factors (8)
• Substance use-related co-morbidities and health-related outcomes (10)
Tobacco regulatory research
• Tobacco regulatory research-host: social/cognitive (12)
• Tobacco regulatory research-host: biobehavioral (12)
• Tobacco regulatory research: agent (10)
• Tobacco regulatory research: vector (24)
• Tobacco regulatory research: environment (13)
Note: The number in parentheses is the number of protocols in the collection.
The Basic Protocol provides step-by-step guidance on using the PhenX Toolkit to identify and incorporate standard phenotype and environmental exposure protocols into a study. The Basic Protocol walks through browsing and searching the Toolkit for measurement protocols, reviewing text of and additional information about identified protocols, and using Toolkit features and resources to download protocols for inclusion in a study. The PhenX Toolkit is a web-based resource that makes it easy for researchers to identify standard data collection measurement protocols to include in their studies.
The value of standard measures is widely recognized (Bennett et al., 2011) because they facilitate cross-study analysis, allow studies to be combined to increase statistical power, and allow comparisons between studies to facilitate validation of results. Without standard protocols, investigators must harmonize data collected using different but conceptually related methodologies, which can be a time-consuming and difficult process.
An up-to-date Web browser, such as Google Chrome, Microsoft Edge, Safari, or Firefox, is required for this protocol.
An investigator with expertise in diabetes is designing a new research study. She has several diabetes protocols in her study design and would like to consider expanding the scope of her study to include protocols related to cardiovascular health, environmental health, and SDoH. She is unsure of the recommended standard data to collect and the collection methods to follow for a few of these measures, so she visits the PhenX Toolkit to identify recommended protocols for phenotypes and exposures that could be included in her new study. She navigates the Toolkit to consider protocols that extend beyond her area of expertise to add to the study design. Additionally, the investigator knows that data collected using PhenX protocols can be readily compared or combined with data from other studies that have used the same PhenX protocols. The investigator follows these steps to explore and identify protocols in the PhenX Toolkit:
1. Navigate to the PhenX Toolkit website at https://www.phenxtoolkit.org/.
   a. The investigator notes several ways to access information from the home page, including a search bar and buttons such as "Research Domains" and "Browse Protocol Tree" (Fig. 3). To start, the investigator decides to use the search bar on the home page.
2. Search the Toolkit for "diabetes" using the search bar.
   a. The search bar allows users to type in words, phrases, or topics of interest.
   For this study, the investigator searches for the term "diabetes."
   b. This search returns 27 protocols (Fig. 4).
   c. The investigator already plans to include a few bioassays in her study and is curious to see what bioassays are included in the Toolkit. Using the filters on the left under "Refine by:", she selects "Bioassay" under "Data Collection Mode" (Fig. 5).
   d. The investigator sees "Oral Glucose Tolerance Test" listed among the search results. She has already included this test in her study design. Clicking that link displays a description of the protocol, details of specimen collection, specific instructions for administration, and related protocols. Essential Protocols collect information necessary for data interpretation and are suggested in the left-hand panel. Related Protocols are other protocols in the Toolkit that may be of interest; they are suggestions only and are not required for interpretation of a protocol. Tabs on this page provide additional material, including information on administration, details, source, variables, and measure. After reviewing this information, the investigator confirms that this protocol is the same one she has already included in her new study design.
   e. Upon further review of each tab, the investigator sees on the "Details" tab (Fig. 6) that links are provided to other research standards such as caDSR Common Data Elements (CDE), Logical Observation Identifiers Names and Codes (LOINC), and Human Phenotype Ontology. These links will be useful to review because she regularly uses the Oral Glucose Tolerance Test protocol. She also notes that a Data Collection Worksheet (DCW) or Data Dictionary (DD) can be downloaded directly through the "Download" drop-down menu above the "Protocol" panel (Fig. 7).
3. Select protocols via "Add to My Toolkit."
   a. Because the investigator already uses the Oral Glucose Tolerance Test, she decides to add the protocol to My Toolkit so she can review the additional resources displayed in the "Details" panel (Fig. 6). After she adds the protocol to My Toolkit, she can save it to access and download later (Fig. 7). She will also have quick access to directly download a DCW or DD, which are available at the top of the "Details" panel.
   b. More information and guidance on My Toolkit can be found at https://www.youtube.com/watch?v=Gz_4LLaQmr0.
4. Use My Toolkit and review Essential Protocols.
   a. In My Toolkit, the investigator may see that some Essential Protocols are associated with the protocols she has selected. An alert indicates that Essential Protocols are needed to interpret the data. Each of the Essential Protocols can be reviewed and added to My Toolkit. For example, the Current Age protocol is essential to interpreting the Oral Glucose Tolerance Test protocol (Fig. 8). Alternatively, users can quickly add Essential Protocols by selecting "Add all Essential Protocols to your Toolkit" (Fig. 9). Clicking the Essential Protocol name will bring the user to the "Details" panel for that protocol. After adding the suggested Essential Protocols to My Toolkit, the investigator logs off to resume later. The protocols she has selected are saved in My Toolkit.
      Figure caption: The "Details" panel on the "Protocol" page.
      Figure caption: Protocol resources can be downloaded from the "Protocol" panel.
   b. Satisfied with identifying the Oral Glucose Tolerance Test, the investigator goes back to the PhenX Toolkit home page and this time clicks "Research Domains" (Fig. 10). She notices the Cardiovascular Domain (Fig. 11) and wonders if the PhenX Toolkit includes cardiovascular protocols that might be appropriate to consider for her new study. When she clicks "Cardiovascular," the investigator sees the 14 protocols that make up the Cardiovascular domain (Fig. 12).
   c. After scanning the 14 protocols in the Cardiovascular domain, she notices protocols that she would like to review further and clicks the protocol names to view additional details for each. She adds "Blood Pressure (Adult/Primary)" and "Family History of Heart Attack" to My Toolkit (Fig. 13).
      Figure 12. Browsing protocols in the Cardiovascular Domain.
   d. How to become a Toolkit Registered User: To save protocols and return to them at a later date, the user can register and create a profile. First, click the "Add to My Toolkit" button to save one or more protocols for convenience (Fig. 14). After registering, users can access features such as the ability to name and save multiple Toolkits and share them with collaborators. Registered users can also opt to receive the PhenX Toolkit Newsletter to learn about new content releases and access the Link Your Study feature.
Exploring the Toolkit, the investigator finds additional standard protocols relevant to her study design. The following steps detail how to use Smart Search, Advanced Search, and the Protocol Tree Browser.
5. Use Smart Search to find all protocols from a specific source (e.g., National Health Interview Survey [NHIS]).
   a. The Smart Search query bar at the top of the page allows a user to search all protocols in the Toolkit for those with matching metadata, including protocol source, specific instructions for protocol administration, and descriptions.
   b. Because the investigator is interested in identifying protocols assessing social determinants of health and nutrition, she decides to search the Toolkit for a large national study, which she thinks might contain these types of protocols.
      She chooses NHIS. The investigator types the search term "NHIS" into the Smart Search bar at the top of the PhenX Toolkit webpage. The search returns 50 relevant results (Fig. 15).
      Figure 15. Using the Smart Search query for National Health Interview Survey (NHIS) protocols.
   c. The investigator selects the "Sugar Intake (Added)" protocol, which looks like a good match to include in her new diabetes study. She adds this protocol to My Toolkit (Fig. 16).
6. Use Advanced Search, under the "Search" drop-down menu on the PhenX Toolkit home page, to fine-tune search operations (Fig. 17). The investigator decides to search here for a protocol that measures healthcare access, and she has two options. Smart Search combs through protocol metadata and key fields, so its results are highly specific. Smart Search is also the default operation of the search bar at the top of each page. The other option is Text Search, found only on the Advanced Search page. Text Search looks for word matches in the entire protocol text rather than just key metadata fields. A full Text Search on any relevant term returns a broader set of results, along with any Supplemental Information, which is listed at the end of the results. The investigator types "healthcare access" into Advanced Search and clicks the Text Search button (Fig. 18). The results are displayed, and she sees "Access to Healthcare Services." After reviewing the protocol, she decides to add it to My Toolkit. More information and guidance on searching the Toolkit can be found at https://www.youtube.com/watch?v=yAycTLYdFAM.
7. One protocol the investigator would still like to identify is a measure related to Environmental Health, which might affect diabetes. She goes back to the PhenX Toolkit home page and clicks the "Tree" button (Fig. 19) (Fig. 20).
   b. She clicks to open the "Environmental Health" branch, followed by "Air Pollution," then clicks the Air Quality Index protocol. Information about the protocol is then displayed, so she can easily review the protocol there or go to the protocol page (Fig. 20). She decides to add the Air Quality Index protocol to My Toolkit using the button on the top right of the screen.
   c. More information and guidance on using the Browse Protocol Tree can be found at https://www.youtube.com/watch?v=JSL7nqEF6AY.
Following these steps, the investigator identified several standard protocols for inclusion in her study design. The investigator can download protocol documents immediately or save them in My Toolkit, where she can download them individually or as a set. If the investigator is logged in as a registered user, her selections are saved, and she can return later to review and download the protocol documents and even share the protocols with collaborators directly from within the Toolkit.
The features and parameters we describe next detail Toolkit resources designed to assist users in implementing standard data collection using these protocols. We also discuss how to appropriately cite use of the Toolkit and of protocols provided by the Toolkit.
The standard procedure outlined in each protocol in the PhenX Toolkit ensures consistent data collection across different studies. Data need to be collected using the same methodology in order for the data to be truly comparable (Fig. 21). The Toolkit provides several resources to assist with these research implementation phases. Using standard protocols allows for comparisons among studies to validate results and allows results to be combined across studies to increase statistical power in downstream analyses. This ultimately increases the scientific impact of individual studies.
Investigators should consider several factors when using the PhenX Toolkit.
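The Essential Protocols check described in step 4 can be pictured as a simple dependency lookup. The Python sketch below is illustrative only: the dictionary and function names are hypothetical, nothing here calls a PhenX Toolkit interface, and the only Essential Protocol link it encodes is the one named in the text (Current Age for the Oral Glucose Tolerance Test).

```python
# Hypothetical in-memory model of "My Toolkit"; this is not a PhenX Toolkit API.
# Only the Oral Glucose Tolerance Test -> Current Age link comes from the text.
ESSENTIAL = {
    "Oral Glucose Tolerance Test": ["Current Age"],
}


def missing_essential(selected, essential=ESSENTIAL):
    """Return Essential Protocols needed by `selected` that are not yet selected."""
    chosen = set(selected)
    missing = []
    for name in selected:
        for dep in essential.get(name, []):
            # Report each missing Essential Protocol once, in first-seen order.
            if dep not in chosen and dep not in missing:
                missing.append(dep)
    return missing


# Protocols the investigator adds during the walk-through above.
my_toolkit = [
    "Oral Glucose Tolerance Test",
    "Blood Pressure (Adult/Primary)",
    "Family History of Heart Attack",
    "Sugar Intake (Added)",
    "Access to Healthcare Services",
    "Air Quality Index",
]

to_add = missing_essential(my_toolkit)
print(to_add)
```

Run against the six protocols selected in the walk-through, the sketch reports Current Age as the one protocol still to add, which mirrors the alert shown in My Toolkit.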
Users should understand the concept of Essential Protocols and know how to download and use Data Collection Worksheets and Data Dictionaries. Users who would like to see real-world examples of studies citing the PhenX Toolkit, or review funding opportunity announcements (FOAs) recommending use of PhenX measures, can access this information from the drop-down menu under the "Resources" tab on the navigation bar. From the "Resources" tab, the "Publications & Presentations" page shows publications for research studies that have used PhenX Toolkit protocols in their study design.
Toolkit users can download protocol details, including Data Collection Worksheets (DCWs), directly from the "Details" panel on the protocol page or from My Toolkit. The DCW is provided as a Microsoft Word document to facilitate easy integration into existing study documentation. Investigators can generate basic or full reports for selected protocols and download a DCW (Fig. 22). Here the user can see details related to data collection for saved protocols. The Toolkit offers a DCW (Fig. 23) that helps investigators integrate PhenX protocols into their existing research. The DCW captures each data item required and collected in the measurement protocol, which helps ensure that investigators collect all necessary information during data collection.
PhenX Data Dictionaries (DDs) describe the variables associated with the selected protocols. These data dictionaries are available in three formats: (1) the RTF format captures skip patterns commonly used in epidemiologic studies (Fig. 24); (2) the CSV format is compatible with the database of Genotypes and Phenotypes (dbGaP) data submission packet (Fig. 25); and (3) the REDCap ZIP file format is suitable for upload to studies being implemented in REDCap (Fig. 26). The REDCap data dictionary can be imported as a standalone REDCap project or added to an existing project for data collection (Fig. 26).
Figure caption: The protocol preview from the Protocol Tree Browser page.
Figure caption: Using standard measures allows datasets across different domains to be combined and/or compared, increasing the scientific impact of individual studies in genome-wide association studies.
Figure caption: REDCap data dictionaries can be imported as new projects and can be used immediately for data collection.
The Link Your Study (LYS) feature collects basic study information from registered users who are using PhenX protocols in their research. The LYS feature helps users find other investigators employing the same PhenX protocols in their studies to facilitate opportunities for cross-study analysis.
To view papers citing the PhenX Toolkit, navigate to the "Resources" menu from the home page and select "Publications & Presentations," then select "Publications Citing PhenX" (Fig. 27).
Figure caption: The "Publications Citing PhenX" page can be found under the "Resources" menu.
Instructions for citing the PhenX Toolkit in publications are linked from the home page (Fig. 28) and can be found at https://www.phenxtoolkit.org/help/citation. An example citation is: "Measures incorporated in this study were selected from the PhenX Toolkit version February 23, 2021, Ver 37.0." The date and version number should reflect the version from which the user downloaded the protocols for use.
The PhenX Toolkit provides recommended standard data collection protocols that are suitable for a variety of study designs. Because not all measurement protocols are suitable for all study designs, it is the investigator's responsibility to select appropriate protocols for inclusion in their study. The Toolkit home page includes a button to the "Toolkit Guidance" page (Fig. 29), where investigators can find links to references for conducting research with human subjects, including Certificate of Confidentiality, the NIH Genomic Data Sharing Policy, A Code of Ethics for Public Health, the National Human Genome Research Institute (NHGRI) informed consent resource, and references pertaining to study design. The Toolkit Guidance guidelines are included in every downloaded PhenX report.
Each protocol has varying needs for equipment and expertise for administration (Fig. 30). By reviewing the Equipment Needs and Requirements (under the "Administration" tab), investigators can identify barriers or needs they may encounter when administering a protocol. The "Administration" tab also describes the personnel and training required for the protocol. Reviewing this information up front helps investigators determine whether they can follow the protocol and prepare for the study accordingly.
By including protocols from the PhenX Toolkit in new or existing studies, investigators will be able to combine or compare their data with data from other studies incorporating the same PhenX protocols. This facilitates cross-study analysis and increases the statistical power to identify genetic associations with complex diseases and traits, gene-gene interactions, and gene-environment interactions. Promoting and incorporating standard measures ultimately increases the impact of individual studies. To date, 394 FOAs have been issued from NIH and the Department of Defense that encourage the use of PhenX protocols. Widespread adoption of PhenX protocols will augment the impact of biomedical research studies and ultimately improve the health and well-being of the population.
Brevity of protocol administration is considered when protocols are selected for inclusion in the Toolkit. Protocols that take more than 15 minutes, on average, for an unaffected individual are noted in the Toolkit.
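When planning a study visit, the 15-minute note can be used as a rough screen for participant burden. The Python sketch below is a minimal illustration under stated assumptions: the function name, protocol names, and administration times are placeholders invented for the example, and only the 15-minute threshold comes from the text.

```python
# Threshold taken from the text: protocols averaging more than 15 minutes are noted.
LONG_PROTOCOL_MINUTES = 15


def summarize_burden(minutes_by_protocol, threshold=LONG_PROTOCOL_MINUTES):
    """Return (total minutes, sorted names of protocols exceeding the threshold)."""
    total = sum(minutes_by_protocol.values())
    long_ones = sorted(
        name for name, minutes in minutes_by_protocol.items() if minutes > threshold
    )
    return total, long_ones


# Placeholder names and times; none of these values come from the Toolkit.
planned = {"protocol_a": 5, "protocol_b": 20, "protocol_c": 15, "protocol_d": 45}

total, flagged = summarize_burden(planned)
print(total, flagged)
```

A protocol that takes exactly 15 minutes is not flagged here, because the text describes protocols that take more than 15 minutes.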
Protocols that meet the other requirements for inclusion and require the least burden (time is considered, among other factors) on the investigator and participant are ideal choices for inclusion in the Toolkit (Fig. 30).
Figure caption: The Toolkit Guidance is provided in basic and full reports.
Ramos: writing review and editing. Catherine McCarty: writing review and editing. Mary Marazita: writing review and editing writing original draft. Tabitha Hendershot: writing original draft. Huaqin (Helen) Pan: writing review and editing. Carol Hamilton: writing review and editing.
Phenotype harmonization and cross-study collaboration in GWAS consortia: The GENEVA experience
The PhenX Toolkit: Get the most from your measures
PhenX-establishing a consensus process to select common measures for collaborative research
PhenX measures for phenotyping rare genetic conditions
PhenX: A toolkit for interdisciplinary genetics research
Key References
Using the PhenX Toolkit to add standard measures to a study
Using the PhenX Toolkit to add standard measures to your study
Research reported in this publication was supported by the National Human Genome Research Institute of the National Institutes of Health, Award Number U41HG007050, with co-funding from the National Institute on Drug Abuse (NIDA), the National Heart, Lung, and Blood Institute (NHLBI), the Office of the Director (OD), the Office of Behavioral and Social Sciences Research (OBSSR) of the National Institutes of Health (NIH), and Administrative Supplement Award Number 3U41HG007050-08S1 funded by the Office of the Director (OD), the Office of Behavioral and Social Sciences Research (OBSSR), the National Institute on Minority Health and Health Disparities (NIMHD), and the National Cancer Institute (NCI) of the NIH.
The content is solely the responsibility of the authors and does not necessarily represent the official views of the NIH. The authors would like to acknowledge the guidance of the PhenX Steering Committee, contributions of all Working Groups, Expert Review Panels, and research panel members, and the efforts of the overall RTI/NHGRI/NIH PhenX project team. Project participants are acknowledged at https://www.phenxtoolkit.org/about/teams.
Authors have no financial or personal relationship between themselves and others that might bias their work.
Data sharing is not applicable to this article as no new data were created or analyzed in this study.