key: cord-0881675-2vhiq4tr authors: Alahmar, Ayman; AlMousa, Mohannad; Benlamri, Rachid title: Automated clinical pathway standardization using SNOMED CT- based semantic relatedness date: 2022-03-31 journal: Digit Health DOI: 10.1177/20552076221089796 sha: 12df6ef1dbe1576a8400856c13bc1acbe5f87850 doc_id: 881675 cord_uid: 2vhiq4tr The increasing number of patients and heavy workload drive health care institutions to search for efficient and cost-effective methods to deliver optimal care. Clinical pathways are promising care plans that proved to be efficient in reducing costs and optimizing resource usage. However, most clinical pathways are circulated in paper-based formats. Clinical pathway computerization is an emerging research field that aims to integrate clinical pathways with health information systems. A key process in clinical pathway computerization is the standardization of clinical pathway terminology to comply with digital terminology systems. Since clinical pathways include sensitive medical terms, clinical pathway standardization is performed manually and is difficult to automate using machines. The objective of this research is to introduce automation to clinical pathway standardization. The proposed approach utilizes a semantic score-based algorithm that automates the search for SNOMED CT terms. The algorithm was implemented in a software system with a graphical user interface component that physicians can use to standardize clinical pathways by searching for and comparing relevant SNOMED CT retrieved automatically by the algorithm. The system has been tested and validated on SNOMED CT ontology. The experimental results show that the system reached a maximum search space reduction of 98.9% within any single iteration of the algorithm and an overall average of 71.3%. The system enables physicians to locate the proper terms precisely, quickly, and more efficiently. 
This is demonstrated using case studies, and the results show that human-guided automation is a promising methodology in the field of clinical pathway standardization and computerization. Clinical pathway (CP) is a multidisciplinary, structured health care plan in which therapeutic and diagnostic medical interventions performed by doctors, nurses, and other staff members for a specific procedure or diagnosis are sequenced on a timeline. [1] [2] [3] CPs can reduce physicians' mental effort and cognitive load to allow them to focus on thought-requiring, more complex health care activities. 3 Therefore, CPs have the potential to improve patient outcomes and satisfaction. CPs also contribute to reducing the length of stay (LOS) in hospitals and controlling overall public health care costs. 3, 4 Automation of modern health care systems covers many fields and necessitates the computerization of hospital processes to streamline health care services, reduce paperwork, collect digital data for data analytics, and control costs 5, 6 . The CP was not an exception in this regard, as many studies reported the development of computerized CPs [7] [8] [9] [10] [11] [12] [13] [14] [15] [16] . By analyzing the literature, we found that the common theme in most studies is that the computerization process was directed mainly toward either connecting CPs with Electronic Medical Record (EMR) systems or computerizing CPs without standardizing their contents. This is viewed as a limitation because the focus of previous research was not on CP digitization; rather, the emphasis was mainly on EMRs or producing isolated CP systems that are difficult to integrate with all other digital health care systems. Table 1 is a comparison table that shows the limitations of CP computerization without standardization. Therefore, one crucial area that remains to be investigated in CP automation is CP digitization, which can be facilitated by standardizing the clinical terms used in CPs. 
However, there is a gap today between the clinical terms used to define CP interventions and the clinical terms used in health information systems (HISs). On one hand, CPs use institutions' local informal terms; on the other hand, HISs use and refer to standardized terms such as SNOMED Clinical Terms (Systematized Nomenclature of Medicine - Clinical Terms, abbreviated as SNOMED CT or SCT) 17, which is the CP standardization terminology system adopted in this research. Non-standardized CPs are difficult to share across hospitals. In addition, non-standardized CP terms cannot be captured in HISs. CP data that go missing because of this gap pose a challenge to using data analytics in health care. The existing CP terminology gap can be traced to the following reasons:

1. CPs are developed internally in hospitals through internal meetings attended by local staff members.
2. CPs are commonly developed for internal use. Thus, the emphasis is on a language that is understandable by the institution's own health care members and administrators.
3. CPs are usually circulated as paper-based documents, or as tables and charts that are distributed to staff members and posted on poster boards or on the doors of in-patients' hospital rooms.

The situation in many hospitals is that once CPs are developed and circulated internally, there is a future demand to computerize them. 10, 18 At that point, the terminology gap already exists, which complicates the computerization and integration process. The terminology gap can be addressed by correctly standardizing CP terms so that CPs can be digitized and easily integrated with other HISs. In general, local CP terms that need to be standardized can be divided into three categories:

1. Category 1: Local CP terms that exist in SNOMED CT but do not accurately match the correct SNOMED CT terms (i.e. are not approved by domain experts as the correct SNOMED CT terms). For example, the SNOMED CT term "Intracranial hemorrhage (SCTID: 1386000)" was used incorrectly for the approved term "Subarachnoid intracranial hemorrhage (SCTID: 21454007)."
2. Category 2: Local CP terms that exist in other terminology systems (e.g. LOINC [19], ICD-10 [20], and ICD-10-PCS [21]) and must be mapped to the adopted system (SNOMED CT in this study). For example, mapping the ICD-10-PCS term "Resection of Appendix, Percutaneous Endoscopic Approach (0DTJ4ZZ)" to the SNOMED CT term "Laparoscopic appendectomy (SCTID: 6025007)."
3. Category 3: Local CP terms that are nonexistent in any terminology system.

The objective of this research is to address the challenges in CP term standardization of Category 1 and Category 2 by exploiting semantic relations embedded in SNOMED CT. Standardization reduces missing CP data (i.e. CP data that are not captured in EMRs or other HISs), enhances the use of data analytics in health care, and reduces medical errors in hospitals. Addressing Category 3 local CP term standardization requires different AI techniques, which will be considered in future work. Our domain experts in this research were physicians and nurses from the Regional Stroke Unit at the Thunder Bay Regional Health Sciences Centre in Thunder Bay, Ontario, Canada. In Category 1, domain experts perform a tedious manual search in the SNOMED CT hierarchy to locate the correct standardized terms. In Category 2, finding an initial similar term from SNOMED CT is straightforward, but locating an accurate SNOMED CT term based on the initial term requires a tedious manual search. A solution to this standardization challenge would be to automate the process of searching for accurate SNOMED CT standardized terms based on local/inaccurate initial terms. CP standardization is challenging to automate because of patient safety considerations associated with the medical data stored in CPs.
In this research, we hypothesize that the concepts of semantic relatedness, through the use of semantic relations in SNOMED CT ontology, can help humans in solving this standardization challenge by intelligently automating the tedious process of searching for and locating the most probable terminology corrections. 22, 23 The use of ontology semantic relations is justified by the fact that initial terms are similar in their meaning and semantic to the target terms (e.g. referring to the same body parts, similar types of medical conditions, etc.). The proposed approach in this research adopts SNOMED CT terminology system and achieves the balance between automatic machine intervention and domain experts control to ensure patient safety. Semantic relatedness is a form of measurement that quantitatively identifies the level of connectedness between two concepts based on all existing semantic relations. 23 On the other hand, semantic similarity is a metric that can be defined as a quantitative measure of likeness between terms based on their taxonomic distribution within a domain ontology. 22 Both semantic relatedness and semantic similarity are metrics over the terms; however, semantic relatedness includes any relation between the terms, while semantic similarity only includes "is-a" relations. 22, 23 For example, in SNOMED CT, "ischemic stroke", whose SNOMED CT ID (SCTID) is 422504002, is similar to "cerebrovascular accident", whose SCTID is 230690007 (there is an "is-a" relation between the terms in the ontology), but is also related to "ischemia", whose SCTID is 52674009 (there is a "Due-to" relation between the terms in the ontology), as shown in Figure 1 . 17 Our method in this research includes all semantic relations in SNOMED CT ontology (not restricted to is-a relations), and thus, it is a semantic relatedness approach. As highlighted in AlMousa et al. and Liu et al. 
[24, 25] , semantic relations play an intrinsic role in computing the semantic relatedness, thus a semantic score based on the various semantic relations embedded in SNOMED CT ontology is used in this research. It is important to mention that since CPs are treatment plans of interventions and procedures performed on patients, caution should be exercised when utilizing automatic standardization of medical terms in CPs for safety reasons. Therefore, machine-based standardization methods should be considered as decision support methods rather than decision-making methods. The final decision-makers in medical CP term standardization are terminology-knowledgeable human domain experts. Therefore, in this paper, we present an approach where a list of terms is proposed by our algorithm; the final decision, however, is made by the domain experts. The proposed approach was validated by detailed case studies and by a dataset of 14 pairs of SNOMED CT terms (see the section on Experimental Results and Discussion and Table 6 for more details). The major contribution of this paper can be summarized as follows: The field of CP digitization and standardization is a new and emerging field with a few publications. [26] [27] [28] To the best of our knowledge, this is the first study attempting to enhance the field with the automation of the CP term standardization process. Our semantic score approach is a holistic multi-relational approach that can explore all types of relations in the SNOMED CT ontology. 24 The rest of the paper is organized as follows. The next section addresses technical background and related work, and in particular, it covers SNOMED CT and CP Computerization and Standardization. Our method of CP standardization using SNOMED CT-based semantic relatedness is presented next and is followed by the experimental results and discussion. Finally, conclusions are drawn and future research work is presented in the last section. 
The Systematized Nomenclature of Medicine -Clinical Terms (officially abbreviated as SNOMED CT, and sometimes referred to as SCT) is a systematically organized computer processable collection of medical terms providing codes, terms, definitions, and synonyms that are used in reporting and medical documentation. SNOMED CT was adopted in this work as the base of CP standardization because it is the most comprehensive health care clinical terminology in the world 29 . The main purpose of SNOMED CT is to encode the meanings that are used in health care/health informatics and to support the effective clinical recording of data with the goal of improving patient care and health decision-making. SNOMED CT comprehensive coverage includes body structures, organisms, clinical findings, symptoms, diagnoses, procedures, and other etiologies, substances, pharmaceuticals, devices, and specimens. Table 2 shows the names of the nineteen (19) top classes in the structure of the SNOMED CT ontology. Top classes have is-a relations with the root class "SNOMED CT Concept," as shown in Figure 2 . Besides is-a relations, SNOMED CT concepts have many other relations/attributes such as associated-with, contained-in, due-to, finding-site, has-ingredient, and is-about. Relations in SNOMED CT are organized based on their roles in the ontology (e.g. relations used to define clinical finding concepts, relations used to define procedure concepts, relations used to define body structure concepts). Table 3 shows example relations and their definitions. The January 31, 2021 release of the SNOMED CT International Edition included 350,000 + concepts that provide the core general terminology for electronic health records (EHRs). The SNOMED CT logical model ( Figure 3 ) defines the way in which each type of component and derivative is related and represented in SNOMED CT. The core component types are concepts, descriptions, and relationships. 
The logical model therefore specifies a structured representation of the concepts used to represent clinical meanings, the descriptions used to refer to these concepts, and the relationships between the SNOMED CT concepts. 17

Figure 6. Contextualized relevancy questions based on the "Ischemic stroke" child SNOMED CT (SCT) terms.

CPs are novel health care management plans that contain details of the interventions and procedures of patients' treatment and follow-up. An example CP for ischemic stroke is shown in Figure 4. CPs are essential sources of data and data analytics in health care. [26] [27] [28] However, a key factor that is impeding CP-based health data analytics and hindering the smooth transfer of CP data to other HISs is that CPs are prepared in hospitals without attention to standardizing their medical terms. [26] [27] [28] After a thorough review of CP research found in the literature, along with discussions with our domain experts at the Thunder Bay Regional Health Sciences Centre, it was clear that most CPs are currently developed using nonstandard local terms and abbreviations. [30] [31] [32] [33] [34] [35] [36] [37] [38] This situation makes CPs prone to human error and makes it difficult to exchange them across medical institutions, thus limiting their adoption worldwide. This also causes the loss of valuable CP data because existing HISs use standardized terminology systems when encoding their medical terms. To give an example, the instruction "Complete Hx and PE" that is part of a CP for open appendectomy 30 cannot be captured in HISs. To address this issue, we recently published a CP standardizing and digitizing framework 26 in which an instruction like this is standardized manually as "Complete History and Physical Examination" where "History and Physical Examination" is a standard SNOMED CT term whose SNOMED CT identification number is 53807006.
26 However, manual CP term standardization is a tedious and time-consuming process, and therefore, automation to CP term standardization was introduced in this research. CP computerization was the focus of many studies in the literature. [8] [9] [10] [11] [12] [13] [14] [15] [16] For example, Liu et al. 14 proposed an ontological approach for real-time CP monitoring. The main objective of their study was to establish communication between the CP and the electronic medical record system with the ability to display reminders on medical activities. The CP used in their prototype was related to unstable angina. Blaser et al. 16 developed a prototype CP system that was embedded within an EMR system to guide patients' treatment. For a more detailed and recent review, the reader is referred to Alahmar et al. 26 The literature review shows that the common theme in nearly all studies is that the goal of the computerization process was limited to connecting CPs with EMRs. We view this as a limitation because full CP digitization can only be achieved by standardizing their medical terms, which is an area of research that was not investigated in previous work related to CP computerization. This resulted in large amounts of CP data omitted in EMRs/HISs, and not utilized in data analytics. Our proposed solution for this challenge is to encode CP data using internationally recognized medical reference terminology (SNOMED CT). SNOMED CT encoding is realized by representing each CP medical term by its equivalent SNOMED CT term and SNOMED CT ID (SCTID). As mentioned above, this study introduces an automated CP standardization technique. Table 4 shows a comparison between this research and recent similar work in the literature. To automate the CP standardization process, we propose a semantic-based search algorithm that exploits various semantic relations. The algorithm supports physicians and domain experts in their task of standardizing CP terms. 
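As a rough illustration of the kind of structure such a search operates on, the Python sketch below models a tiny fragment of the ontology using only terms and relations quoted in this paper (ischemic stroke, cerebrovascular accident, ischemia, and occlusive stroke). The names `Concept`, `ONTOLOGY`, `children_of`, and `non_taxonomic` are hypothetical; they are not taken from the authors' implementation or from any SNOMED CT library.

```python
from dataclasses import dataclass, field


@dataclass
class Concept:
    sctid: str
    name: str
    # Taxonomic ("is-a") parents, stored as SCTIDs.
    parents: list = field(default_factory=list)
    # Non-taxonomic relations as (relation name, target SCTID) pairs.
    relations: list = field(default_factory=list)


# Fragment built only from the examples given in the text (assumed layout).
ONTOLOGY = {
    "230690007": Concept("230690007", "Cerebrovascular accident"),
    "52674009": Concept("52674009", "Ischemia"),
    "422504002": Concept(
        "422504002",
        "Ischemic stroke",
        parents=["230690007"],
        relations=[("Due to", "52674009")],
    ),
    "373606000": Concept("373606000", "Occlusive stroke", parents=["422504002"]),
}


def children_of(sctid, ontology=ONTOLOGY):
    """Return the SCTIDs of concepts that have an is-a link to `sctid`."""
    return [c.sctid for c in ontology.values() if sctid in c.parents]


def non_taxonomic(sctid, ontology=ONTOLOGY):
    """Return the non-is-a relations of a concept; these drive relatedness."""
    return list(ontology[sctid].relations)
```

In this representation, semantic similarity would look only at `parents`, whereas a semantic relatedness approach also consults `relations`.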
The main intuition is that the similarity between SNOMED CT terms can help the algorithm find and propose the accurate SNOMED CT terms while searching the hierarchy of terms within the SNOMED CT ontology. Thus, the use of semantic relations can help domain experts in the CP validation process by automating the process of finding the correct terms which are usually similar to the local terms. In a large number of cases, the inaccurate local term might be either a sibling or an abstract term of the desired term. For example, the correct and accurate term "Occlusive stroke", whose SCTID is 373606000 is the subclass of the inaccurate term (i.e. initial term) "ischemic stroke", whose SNOMED SCTID is 422504002. The algorithm is described below (see Algorithm 1). The algorithm is implemented in an interactive software system that assists domain experts in determining the accurate standardized SNOMED CT (SCT) terms. The algorithm limits the possible terms to only those that are semantically relevant to the initial term, and presents a set of contextualized relevancy questions. The contextualized relevancy questions are obtained from the non-taxonomic relations that each child SCT term has. Figure 5 shows the SCT term "ischemic stroke (SCTID: 422504002)" with its child terms, and the non-taxonomic relations associated with it (i.e. "Finding site", "Associated morphology", and "Due to"). Based on each non-taxonomic relation of the child terms, a question set and options are presented to the domain experts to provide the appropriate context, as shown in Figure 6 . Line 4 in Algorithm 1 initializes the root with the parent of the initial SCT term based on the semantic relation "is-a." Then, iterative steps limit the candidate SCT terms to the most relevant and semantically similar terms to the initial term based on Eq. 3, where the F 1 -measure (i.e. 
the harmonic mean of the precision and recall, see equations (1) and (2)) represents the similarity score between the expert's selection of the contextualized options and the semantic relations and associations of the child SCT terms. Note that the similarity score, presented in equation (3), takes a value in the range [0, 1] for any given term, where zero means not contextually similar/related and one means highly contextually related. For instance, based on the domain expert's answers presented in Figure 6 and the non-taxonomic relations of "Occlusive stroke" and "Perinatal arterial ischemic stroke", their similarity scores are determined as 0.91 and 0.6, respectively (Figure 7). Finally, the domain expert is presented with a limited number of plausible SCT terms to either select an appropriate term (as the approved standardized term) or expand a child term to repeat the process until the standardized term is found.

$$ \text{Precision} = \frac{TP}{TP + FP} \quad (1) $$

where TP are the true-positive observations that represent the agreement between the expert's selection and the existing semantic associations with the SCT term, and FP (false positive) are the semantic associations that are present in the child SCT term but are not selected by the expert (Figure 8).

$$ \text{Recall} = \frac{TP}{TP + FN} \quad (2) $$

where FN are the false-negative observations that represent the semantic associations that are selected by the expert but are not present in the child SCT term (Figure 8).

$$ \text{Similarity score} = F_1 = \frac{2 \times \text{Precision} \times \text{Recall}}{\text{Precision} + \text{Recall}} \quad (3) $$

Here, we present the experimental results and discussion for common CP standardization scenarios. The scenarios are related to local CP terms from Category 1 and Category 2 (as described in the introduction). In this scenario, we consider CP local terms that are used in terminology systems other than the SNOMED CT system (e.g. LOINC, ICD-10 and its variants like ICD-10-PCS, etc.). It should be noted that this is an example from Category 2 local terms.
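The similarity score of equations (1) to (3) can be sketched in a few lines of Python. This is a minimal illustration rather than the authors' code: it assumes that both the expert's selections and a child term's semantic associations are represented as sets of (relation, value) pairs. The function name `similarity_score` and the placeholder values are hypothetical.

```python
def similarity_score(selected, associations):
    """F1-style agreement between expert selections and a child term's associations."""
    tp = len(selected & associations)  # selected by the expert and present in the term
    fp = len(associations - selected)  # present in the term but not selected
    fn = len(selected - associations)  # selected but absent from the term
    if tp == 0:
        return 0.0  # no agreement; this also avoids a division by zero
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


# Placeholder values only; these are not actual SNOMED CT attribute values.
expert = {
    ("Finding site", "site-A"),
    ("Associated morphology", "morph-X"),
    ("Due to", "cause-Y"),
}
child = {
    ("Finding site", "site-A"),
    ("Associated morphology", "morph-X"),
}
score = similarity_score(expert, child)
```

In this toy case TP = 2, FP = 0, and FN = 1, so the precision is 1, the recall is 2/3, and the score is 0.8.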
The main challenge in converting these terms into standardized terms is finding the accurate SNOMED CT terms, because terminology systems contain many terms that look similar but do not convey exactly the same medical interventions or meanings. A typical scenario would be a term written in the CP from ICD-10 terminology, whereas the CP is required to be fully standardized using SNOMED CT prior to its inclusion in a hospital's IT system. An example is given below. In this scenario, the ICD-10-PCS term "Resection of Appendix, Percutaneous Endoscopic Approach (0DTJ4ZZ)" was found in the CP. A straightforward similar term from SNOMED CT would be "Excision of appendix (SCTID: 80146002)" because both terms refer to similar medical procedures. For domain experts, the existence of "endoscopic approach" in the original term is important and must be reflected in the approved SCT term. The steps below outline how the accurate SNOMED CT term can be inferred based on the semantic relatedness approach. Note that the accurate term will be inferred to be "Laparoscopic appendectomy (SCTID: 6025007)." Initially, the algorithm starts by retrieving the parent SCT term (the parent of 80146002), which is the SCT term "Partial excision of large intestine (SCTID: 27010001)." Then, for each child SCT term, the algorithm retrieves the set of relevant questions with their respective answers based on all semantic relations associated with that child SCT term in the SNOMED CT ontology (Algorithm 1, lines 7-11). These lines form an iteration in the algorithm, which is defined as a loop over the child SNOMED CT terms of the root SNOMED CT term under consideration. For the first iteration, the algorithm displays two contextually relevant questions based on the two existing relations (or attributes) possessed by child concepts (i.e. "Method" and "Procedure site - Direct"). All four child SCT terms, in this iteration, share the same option for the "Method" relation (i.e. "method: Excision - action"); however, they differ in the "Procedure site - Direct" relation (i.e. "Appendix structure", "Cecum structure", "Colon structure", or "Rectum structure"). Based on the expert's response to the "Procedure site - Direct" question ("Appendix structure"), line 12 in Algorithm 1 (presented in Eq. 4) computes the threshold value. Then, for this iteration, out of the four child terms ("Excision of appendix", "Excision of cecum", "Excision of colon", and "Resection of rectum"), the algorithm will display only the initial SCT term (i.e. "Excision of appendix"), as it is the most relevant to "Procedure site - Direct: Appendix structure," and the expert will have the option to expand it and begin a second iteration. In the second iteration, by expanding the term "Excision of appendix (SCTID: 80146002)", a new set of contextually relevant questions is displayed to experts based on the new attribute relations of the child terms. Figure 9 shows these contextually relevant questions and the answers chosen by the domain experts. Based on these questions and answers, a new similarity score threshold value is computed using equation (4). Next, the SCT terms with a similarity score above or equal to the current iteration's threshold value are added to the candidate list of terms to be displayed to the expert and ranked according to their similarity score, from highest to lowest. Finally, the expert decides on the approved standardized SCT term, which is "Laparoscopic appendectomy (SCTID: 6025007)," with a similarity score of 0.86. Figure 10 shows the iteration process resulting from Algorithm 1, showing the similarity scores for each SCT term and the threshold value for each iteration. The threshold value is defined in Eq. 4 as the number of options selected by the expert divided by the sum of selected options and nonselected relations.
This ensures that the calculated threshold value reflects the expert's minimum agreement with the presented options.

$$ \text{Threshold} = \frac{SO}{SO + NSR} \quad (4) $$

where SO is the number of selected options and NSR is the number of nonselected relations.

The next scenario relates to an initial SNOMED CT term "Intracranial hemorrhage (SCTID: 1386000)" and an approved term "Subarachnoid intracranial hemorrhage (SCTID: 21454007)." Both terms are used to encode hemorrhagic stroke, a common medical condition. The subtle difference between the terms is that intracranial hemorrhage refers to bleeding that occurs when a blood vessel within the skull is ruptured or leaks, whereas subarachnoid intracranial hemorrhage is a type of stroke caused by bleeding into the space surrounding the brain (not within the skull). Similar to the first scenario, Figure 11 shows the iterative process described in Algorithm 1 for selecting the standardized approved SNOMED CT term, which is, in our case, "subarachnoid intracranial hemorrhage (SCTID: 21454007)" given the initial term "intracranial hemorrhage (SCTID: 1386000)." Table 5 shows the details of each of the three iterations required in this case, including the percentage reduction of the SNOMED CT search space after each iteration. In iteration 1, among 104 SNOMED CT child terms, the algorithm utilizes the expert's answers to the contextualized questions to reduce the candidate correct terms to only 4 possible terms, achieving a 96.2% reduction in the search space. Iteration 2 finds 14 child terms and limits the candidate terms to 2, achieving an 85.7% search space reduction. The correct SNOMED CT term "subarachnoid intracranial hemorrhage (SCTID: 21454007)" was located in iteration 3 among the four existing child terms. This scenario demonstrates the capability of the proposed algorithm in significantly reducing the search space for the semantically relevant SNOMED CT terms.
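The thresholding step and the reported search space reductions can be reproduced with a short Python sketch. It assumes that the similarity scores have already been computed, reuses the two scores quoted earlier for the ischemic stroke example (0.91 and 0.6), and takes the search space reduction to be the fraction of child terms that are filtered out, which matches the percentages quoted above. The counts of selected options and nonselected relations in the example are invented for illustration, and all function names are hypothetical.

```python
def threshold(selected_options, nonselected_relations):
    """Eq. 4: share of the presented relations that the expert answered."""
    total = selected_options + nonselected_relations
    return selected_options / total if total else 0.0


def keep_candidates(scores, cutoff):
    """Keep terms scoring at or above the cutoff, ranked from highest to lowest."""
    kept = [(name, s) for name, s in scores.items() if s >= cutoff]
    return sorted(kept, key=lambda item: item[1], reverse=True)


def search_space_reduction(num_children, num_candidates):
    """Percentage of child terms removed from consideration in one iteration."""
    return 100.0 * (num_children - num_candidates) / num_children


# Scores quoted in the text; the SO and NSR counts below are hypothetical.
scores = {"Occlusive stroke": 0.91, "Perinatal arterial ischemic stroke": 0.6}
cutoff = threshold(selected_options=3, nonselected_relations=1)
candidates = keep_candidates(scores, cutoff)

# Child and candidate counts reported for the intracranial hemorrhage scenario.
iteration_1 = round(search_space_reduction(104, 4), 1)
iteration_2 = round(search_space_reduction(14, 2), 1)
```

With these inputs, `iteration_1` and `iteration_2` evaluate to 96.2 and 85.7, in agreement with the reductions reported for iterations 1 and 2.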
To demonstrate the robustness of the proposed algorithm, a dataset of 14 pairs of SNOMED CT terms was examined, where the domain experts determined a target SNOMED CT term, given an initial local CP term, as shown in Table 6 . Based on the responses of the experts to the contextualized questions for each pair, the algorithm demonstrates a consistent search space reduction and convergence to the target term for all scenarios, as shown by the experimental results in Table 7 . The experimental results show that the search space reduction reached 98.9% on individual iterations and an overall average reduction of 71.3% considering the largest search tree in each iteration (i.e. the search tree that has the largest number of children). The proposed algorithm was implemented in a prototype software system that we made available on GitHub * . The developed software (with its graphical user interface (GUI)) enables domain experts to search for a SNOMED CT term by ID or name. Then, based on its children's semantic relations within SNOMED CT ontology, the system presents all possible relations and options, as shown in Figure 12 . Once the expert selects the relevant answers, the system filters out child SNOMED CT terms with scores below the threshold calculated using equation (4) . This, in turn, allows for further expansion and searching within subsequent children, or selecting a child term as a standardized target term that is approved by domain experts. Figure 12 shows the main screen of the developed software system and outlines its components as described above. It is worth mentioning that this research is not proposing a new standard but rather adopting an existing international standard. SNOMED CT is a well-known standard that has been successfully used and maintained worldwide since 1999. This contributes to the success and adoption of the proposed method in this research. 
Automation in CP standardization must be addressed carefully due to patient safety considerations associated with the medical data stored in CPs. In this research, we demonstrated a new automated CP standardization approach that achieves a balance between machine intervention and domain experts' control. Experimental results reveal that the proposed method achieves a high percentage of SNOMED CT search space reduction, which saves domain experts time and helps them to efficiently locate the correct standardized CP terms. The developed standardization approach can be used in other ontologically represented domains like standardizing clinical/medical terms in EMRs and medication prescriptions. The emphasis in this research was on the standardization of initial local CP terms that exist in SNOMED CT, but are not accurate based on domain expert's judgment. Therefore, a key task in the presented approach was to apply semantic relatedness to search the SNOMED CT ontology for terms with semantic relatedness to the initial SNOMED CT terms. The main limitation of this research is that local CP terms or abbreviations that are nonexistent in any terminology system were not addressed. Thus, our future investigation in this domain will concentrate on standardizing this type of local CP terms. In such cases, the initial term cannot be found within SNOMED CT ontology. Thus, standardizing the terms requires matching the local term to an external knowledge base. Such an approach would involve the use of NLP techniques different than the technique used in this study (such as word embedding methods). 
Clinical pathway in cardiovascular disease management
Application of fast-track surgery combined with a clinical nursing pathway in the rehabilitation of patients undergoing total hip arthroplasty
Defining barriers and enablers for clinical pathway implementation in complex clinical settings
Application of data mining techniques to predict the length of stay of hospitalized patients with diabetes
Real-time medical emergency response system: exploiting IoT and big data for public health
Blockchain for public health care in smart society
Implementation of an electronic health record integrated clinical pathway improves adherence to COVID-19 hospital care guidelines
Electronic stroke CarePath: integrated approach to stroke care
An ontological modeling approach to align institution-specific Clinical Pathways: Towards inter-institution care standardization
Operationalizing prostate cancer clinical pathways: An ontological model to computerize, merge and execute institution-specific clinical pathways
Computerizing clinical pathways: ontology-based modeling and execution
An ontology model for clinical pathway audit
The eClinical care pathway framework: a novel structure for creation of online complex clinical care pathways and its application in the management of sexually transmitted infections
An ontology-based real-time monitoring approach to clinical pathway
An ontology-based hierarchical semantic modeling approach to clinical pathway workflows
Improving pathway compliance and clinician performance by using information technology
Clinical pathways and order sets in Epic
Logical Observation Identifiers, Names, and Codes (LOINC)
The ICD-10 Procedure Coding System (ICD-10-PCS)
Semantic similarity from natural language and ontology analysis
The state of the art in semantic relatedness: a framework for comparison
Exploiting nontaxonomic relations for measuring semantic similarity and relatedness in WordNet
Concept vector for semantic similarity and relatedness based on WordNet structure
Ontological framework for standardizing and digitizing clinical pathways in healthcare information systems
SNOMED CT-based standardized e-clinical pathways for enabling big data analytics in healthcare
Optimizing Hospital Resources using Big Data Analytics with Standardized e-Clinical Pathways
Utilization of clinical pathway on open appendectomy: a quality improvement initiative in a private hospital in the Philippines
Developing a clinical pathway for somatic symptom and related disorders in pediatric hospital settings
A clinical pathway for the postoperative management of hypocalcemia after pediatric thyroidectomy reduces blood draws
The effectiveness of a clinical pathway in liver surgery: a case-control study
Optimizing the Management of Spasticity in people with spinal cord damage: a clinical care pathway for assessment and treatment decision making from the ability network, an international initiative
Implementation of prospective, surgeon-driven, risk-based pathway for pancreatoduodenectomy results in improved clinical outcomes and first year cost savings of $1 million
Clinical- and cost-effectiveness of the STAR care pathway compared to usual care for patients with chronic pain after total knee replacement: study protocol for a UK randomised controlled trial
Enhanced recovery after surgery pathway for patients undergoing cardiac surgery: a randomized clinical trial
Development of clinical pathway for non-surgical management of chronic periodontitis
A new electronically based clinical pathway for schizophrenia inpatients: a longitudinal pilot study

Acknowledgments: We extend a sincere thank you to the Thunder Bay Regional Health Sciences Centre (TBRHSC) and The Ottawa Hospital for providing us with sample CPs. We also wish to thank our domain experts, Dr A. Hassan and Dr K. Darko, as well as the nurses from the Regional Stroke Unit, who helped us throughout our CP automation research.

*https://github.com/mohannad57/SNOMED_CT_Alg.