key: cord-0853972-hvv067o8
authors: Mendoza Ladd, Antonio; Diehl, David L
title: Artificial intelligence for early detection of pancreatic adenocarcinoma: The future is promising
date: 2021-04-07
journal: World J Gastroenterol
DOI: 10.3748/wjg.v27.i13.1283
sha: 9a27f02111d54c1cb4421f8155f3db8e694b40c1
doc_id: 853972
cord_uid: hvv067o8

Pancreatic ductal adenocarcinoma (PDAC) is a worldwide public health concern. Despite extensive research efforts toward improving diagnosis and treatment, the 5-year survival rate at best is approximately 15%. This dismal figure can be attributed to a variety of factors including lack of adequate screening methods, late symptom onset, and treatment resistance. Pancreatic ductal adenocarcinoma remains a grim diagnosis with a high mortality rate and a significant psy-chological burden for patients and their families. In recent years artificial intelligence (AI) has permeated the medical field at an accelerated pace, bringing potential new tools that carry the promise of improving diagnosis and treatment of a variety of diseases. In this review we will summarize the landscape of AI in diagnosis and treatment of PDAC.

Current modalities for the diagnosis and treatment of pancreatic ductal adenocarcinoma (PDAC) remain disappointing. With an overall survival rate of 3%-15%, it is one of the deadliest malignancies in the world [1] . Although currently it is the 7 th leading cause of cancer death worldwide, recent trends suggest that in North America and Europe PDAC will soon become 2 nd and 3 rd respectively [2, 3] . Several factors contribute to the poor survival statistics of PDAC: Lack of adequate screening tests, delayed diagnosis, and sub-optimal treatment options. Consequently, improvements in all these areas are desperately needed.

Recent technological advances have led to the increased application of artificial intelligence (AI) in different disciplines. Because computers can store and analyze larger amounts of data than the human brain, AI has the potential to achieve unmet needs in medicine. Since improving outcomes for PDAC is an area of urgent need, this review will provide a summary of current and future applications of AI in the diagnosis and management of PDAC.

AI is a branch of computer science dedicated to developing models aimed at performing functions comparable to those accomplished by the human brain. Machine learning (ML) is the area of AI that deals with developing computer models capable of learning specific tasks through the repetition of calculations derived from large amounts of data [4] . These computer models analyze data through repetitive calculations using mathematical self-derived algorithms that are constantly adjusted until the model produces the desired outcome. Once the combination of adjustments necessary to achieve the outcome has been discovered, the computer then "learns" how to perform that specific task [5] .

Broadly speaking, ML can be either supervised or unsupervised. The difference lies in whether the desired outcome of interest is previously known by the computer. In supervised learning, a computer is first introduced to a training dataset (the "input") as well as the desired outcome of interest (the "output"). The computer then analyzes the input making the necessary adjustments to the algorithm until it consistently produces the desired output [6] . This type of learning requires large amounts of training data that has been pre-labeled ("curated") by a human operator. Once the training of the machine is completed, a different dataset is used to test its performance (testing data). In unsupervised learning, the computer is introduced to unlabeled data. The machine then sorts it using the algorithm to identify features within it that can be grouped and analyzed further to reach a specific outcome [7] . Because the data in unsupervised ML is not curated, larger amounts of training data are required than for supervised ML.

To date, most of the ML used in medicine has been supervised; and it has been made possible due to the emergence of a relatively new discipline called radiomics. Radiomics studies the conversion of digital medical images into data that can be then subjected to statistical analysis [8] . Using computer technology, predefined quantitative features are extracted from computed tomography (CT), magnetic resonance Imaging (MRI) or other imaging modalities. An essential step prior to this data extraction however, is lesion "segmentation". Segmentation is in simple terms, delineation of the lesion within an image. In supervised ML a human operator delineates the lesion prior to the algorithm operating on the data. In unsupervised ML, the algorithm learns how to segment the lesion of interest by itself.

Once the quantitative information of the lesion in the image is extracted, it is April 7, 2021 Volume 27 Issue 13 analyzed utilizing artificial neural networks (ANN). An ANN is a group of interconnected computers with a structure similar to a neural network in the human brain. Each computer represents a neuron or "node", and each connection a synapse or "weight" (Figure 1 ). ANNs are organized in "layers" (groups of nodes) and the most basic model contains three: (1) Input layer (which receives the data), (2) Hidden layer (which performs the calculations and analyses), and (3) Output layer (which produces the final output) [9] . Deep neural networks contain more than one hidden layer and therefore can learn to analyze data with higher complexity levels; this is termed "deep learning" [10] . The different layers work in a hierarchical structure to produce the desired output.

Endoscopic ultrasound (EUS) is currently one of the most useful imaging modalities in the diagnosis of PDAC. The overall sensitivity of EUS guided biopsies for the diagnosis of PDAC reaches 98%, but its specificity can be as low as 20% [11] . The accuracy of EUS depends on both operator and lesion-related factors. The most important operator-related factor is the amount of experience performing EUS. On occasions, variations in gastroduodenal anatomy, (i.e. after partial gastric resection, or presence of a duodenal stricture) can significantly limit the ability of operator to visualize the pancreas. On the other hand, lesion-related factors include patient's body habitus, the presence of acute inflammation (when EUS is done immediately after an episode of acute pancreatitis) or the presence of chronic pancreatitis (CP), particularly in the presence of parenchymal calcifications. Sensitivity of EUS for PDAC in the presence of CP can be as low as 54% [12, 13] . In addition, CP and autoimmune pancreatitis can occasionally form pseudotumors, which may complicate image analysis by the endosonographer.

Several studies have reported on the application of AI in the analysis of EUS images of pancreatic diseases [14] [15] [16] [17] [18] [19] [20] [21] [22] (Table 1) . For the most part, these studies have focused on evaluating the accuracy of ANNs in differentiating CP from PDAC. Norton et al [14] analyzed still EUS images previously selected by experts who did not perform the procedure and were blinded to the final diagnosis using an ANN. A total of 21 patients with PDAC and 14 with CP were included. Four features were analyzed in each image by the ANN, achieving an overall accuracy of 89%.

In a similar study, Das et al [17] retrospectively analyzed the performance of an ANN in differentiating PDAC from normal pancreas and CP. A total of 56 patients (22 normal, 12 CP and 22 PDAC) were studied. Their AI algorithm identified PDAC with an area under the curve of 0.93. The differences in accuracy in this study may have been secondary to the more stringent criteria in the definition of CP and the higher number of image features analyzed by the ANN. A larger study performed by Zhu et al [18] , analyzed 262 patients with PDAC and 126 with CP and reported that their algorithm reached an overall accuracy of 94%.

Elastography is an imaging method developed to establish the differences in consistency ("strain") between normal and abnormal tissue during EUS. Such differences are portrayed in a color-coded overlay on the EUS image, with red correlated with softer tissue and blue with harder [19] . A multicenter study reported an overall accuracy, sensitivity and specificity of 84%, 88% and 83% respectively in differentiating PDAC from CP [20] . This study used ANN analysis of histograms from elastography images previously selected by experts blinded to the patients' diagnoses. Another similar small study of 68 patients reported an accuracy rate of 90% [21] .

Contrast agents have also been developed to aid in the differentiation between PDAC and CP. Săftoiu et al [22] reported that ANN analysis of contrast-enhanced EUS images could establish a difference with an area under the curve of 94%. Few studies have focused on using standard EUS B mode images to diagnose pancreatic tumors in the absence of CP. These have reported accuracies up to 99% [15, 16] .

Computerized tomography (CT) is perhaps the most common medical imaging modality being explored with AI. The analysis of CT images of neoplastic lesions involves three main steps: detection, characterization and monitoring of change over time [23] . Most of the available studies have focused on AI-assisted characterization of lesions, which is equivalent to the previously defined concept of segmentation. Three studies have applied AI to the analysis of CT images in PDAC for diagnostic purposes [24] [25] [26] . In a small study of 15 healthy patients and 44 with a variety of pancreatic tumors, Fu et al [24] reported that their algorithm achieved an overall sensitivity of 76%. In a retrospective case control study of 380 patients (190 cases and 190 controls) Chu et al [25] reported an accuracy of 99% in their computer-derived algorithm. Meanwhile, a prospective study by Liu et al [26] reported an accuracy of 76%. AI has also been utilized to establish correlations between the CT images of PDAC and their subsequent biological behaviors [27, 28] . Two important ongoing projects are worth mentioning. The Felix Project funded by the Lustgarten Foundation is a multidisciplinary study carried out by a group at Johns Hopkins University. Using deep learning computer models with manually segmented images from 156 PDAC cases and 300 normal controls, the group reported a sensitivity April 7, 2021

Volume 27 Issue 13 Figure 1 Basic anatomy of an artificial neural network. Input layer, hidden layer (may have more than one) and output layer. All nodes are interconnected through wieghts (arrows). and specificity of 94% and 99% respectively in their initial report [29] . Based on these encouraging results, the group's next step is to expand their analysis to 575 normal and 750 PDAC patients. The group's ultimate goal is to fine tune the performance of their algorithm prior to expanding it to larger externally derived datasets. The second ongoing study is being conducted by the Alliance of Pancreatic Cancer Consortium Imaging Working Group (2). Their objective is to collect pre-and postdiagnosis CT, magnetic resonance imaging (MRI) and transabdominal ultrasound images from patients ultimately diagnosed with PDAC. These images will be used to create a repository that will later be shared and analyzed. The ultimate goal of the project is to develop AI models that can predict the appearance of PDAC and diagnose the disease in its early stages.

Segmentation of MRI images by AI has been reported to be more technically challenging than CT images [30, 31] . A few studies have reported that ML models can be trained to accurately identify pathology in the pancreas, although the literature on PDAC is scarce. Devi et al [32] reported that an ANN could accurately identify a variety of abnormal pancreatic findings with a 96% accuracy. Similarly, Gao et al [33] reported that an AI model differentiated normal from abnormal pancreas at a level comparable to humans (77% vs 82% respectively). In the only published study aimed specifically at identifying PDAC in MR images, Liang et al [34] reported that a convoluted neural network (a variety of ANN) performed similarly to humans in identifying the lesion.

Artificial Intelligence processing of MRI images has been also applied in the context of PDAC therapy. Spieler et al [35] reported that an ANN accurately delineated pancreatic tumors prior to radiotherapy. Zhao et al [36] reported similarly positive results. In a study utilizing AI aiming to automatically calculate the dose of stereotactic body radiation therapy, Campbell et al [37] demonstrated that an ANN-calculated dose was comparable to a human-calculated one. By applying radiomics to MRI images, investigators have reported that their quantitative data can be correlated with aspects such as tumor subtype, survival and response to chemotherapy [38, 39] . This same technology has made possible other studies showing that MR images data can be used to predict relapse after PDAC treatment [40, 41] .

The evolution of AI has resulted in models with the capacity of analyzing data beyond quantitative image features. This type of ML requires a higher complexity in the architecture of the neural networks given the broad range of variables analyzed at any given point in time. These models have been applied in the development of algorithms that can accurately identify patients with or at risk of developing PDAC based on several clinical variables. April 7, 2021

Volume 27 Issue 13

Muhammad et al [42] utilized an ANN to analyze a large patient population derived from the National Health Interview Survey and the Prostate, Lung, Colorectal and Ovarian trial. The authors developed and trained the ANN with > 800000 patients of which 898 had PDAC. Analyzing variables such as demographics, comorbidities, race and family history, the model predicted the development of PDAC with an AUC of 0.85. In a similar manner, albeit with a lower accuracy, Klein et al [43] utilized data from the PanScan Consortium to develop a model that predicted high risk of PDAC among patients of European ancestry with an AUC of 0.61.

New onset diabetes has been adopted as a marker for patients with high risk of developing PDAC within the following 3 years [44] . As such, it has been the subject of analysis by AI techniques. Hsieh et al [45] compared the PDAC prediction accuracy of new onset diabetes when analyzed by a regular logistic regression or an ANN. A total of 3092 PDAC cases were identified from a population of > 1000000 patients. Interestingly, the logistic regression slightly outperformed the ANN (AUROC 0.7 and 0.64% respectively). In a complex study combining PubMed data and clinical information, Zhao et al [46] utilized an innovative weighted Bayesian network that accurately predicted PDAC with an AUROC of 0.91. In a similar but simpler study, Sanoob et al [47] reported that an ANN can accurately diagnose PDAC based on a combination of signs and symptoms [47] . ANNs that analyze clinical data have also been used in determining patient survival and performance after PDAC treatment [48, 49] .

Currently, pathologists are responsible for interpretation of histology specimens, and this process is dependent on their previous training, level of experience and individual skills. In an attempt to standardize interpretation and reduce human bias, AI techniques have been applied to pathology specimen analysis [50] .

Application of AI in pathology is dependent on creation of a high-resolution digital image from the glass slide. This step is called "whole slide imaging" (WSI). WSI is accomplished by use of glass slide scanners and the technology to support them [51] . WSI and the necessary IT infrastructure to support its clinical use is referred to as "digital pathology". It is expected that eventually, pathology workflow will move away from pathologists looking at glass slides through a microscope to pathologists reviewing digital images of slides on high resolution computer screens. Food and Drug Administration clearance for these devices is relatively new (2017), and widespread adoption of digital pathology technology is still in its early phases. There is an increasing amount of data on validation of WSI compared to microscope viewing of glass slides, for example a recent study showed good concordance of slide interpretation of frozen sections done with glass slides/microscope compared to WSI/digital pathology workstation [52] Similar excellent intraobserver concordance between glass slides and digital pathology has been shown for routine clinical workload in surgical pathology [53] .

There is already significant work and available commercial devices that can bring AI computing power to aid in the interpretation and screening of biopsies, although commercial expansion of this is in its infancy [54] . Much of the work regarding AI in pathology concerns prostate and breast malignancy, since the incidence of these malignancies is fairly high, and the clinical need and potential commercial applications present a more attractive corporate opportunity for device sales. However, with continued digitization of glass slides, particularly of pancreatic malignancy (both FNA specimens and surgical pathology) it is hoped that an enlarging curated group of cases can serve as a training set for AI analysis regarding pancreatic cancer.

There has been only a limited number of investigations of AI in pathology for the diagnosis of pancreatic cancer. Although the sensitivity and specificity of EUS guided FNA samples is in average > 90% [55] , some specimens still fall under the "atypical cells" category, posing a considerable diagnostic dilemma. In a recent study, Momeni-Boroujeni et al [56] studied the performance of an ANN in reclassifying EUS-FNA specimens originally labeled as "atypical" by pathologists. Among a group of 31 patients in whom the final diagnosis had been previously established by other diagnostic methods, the ANN's overall accuracy for adequately reclassifying the specimen as malignant or benign was 77%.

Two of the main factors driving the high mortality of PDAC are suboptimal understanding of its malignant behavior and its unpredictable treatment response rate. April 7, 2021

Volume 27 Issue 13

Advances in AI-assisted genetic and molecular profiling of PDAC have recently broadened insight on these factors [57] . Recent data showed that early diagnosis could be possible through AI analysis of the transcription products of certain PDAC genes. These studies have reported sensitivity and specificity ranging from 88%-95% and 83%-95% respectively [58, 59] . AI has also been utilized to match PDAC biological information with chemical properties of specific drugs in order to develop models capable of predicting response to these specific agents [60, 61] .

Technological advances in the last 50 years have exponentially increased the amount and quality of data available for medical decision making. Hence, it is becoming increasingly evident that new methods for storage and analysis of it are necessary. Although the concept of AI or its applications in medicine may still seem foreign for most practitioners, it is rapidly positioning itself as an indispensable tool to reduce human error. As sophisticated and elegant our diagnostic and therapeutic capacities may be currently, they remain inevitably limited by our subconscious and conscious bias, as well as our wide range of intellectual and technical skills. William Osler's famous quote: "medicine is a science of uncertainty and an art of probability" will likely never be proven false.However the degree of uncertainty and probability considered tolerable in modern medicine is constantly shrinking. The advent of AI brings, in theory, the promise of reducing and even eliminating these shortcomings Nevertheless, this promise is one that needs to be taken cautiously. There are several hurdles that must be overcome before AI can see widespread adoption in medical care. One limitation is the current lack of adequate standardization. Uniform protocols for data collection, processing, storage, reproduction and analysis must be established and standardized. Furthermore, different types of data may require different AI technologies. For example, ANNs trained to adequately classify histologic slides of pancreatic biopsies which have been fixed and stained with a specific method, may underperform, or not perform at all, when presented with slides prepared in a different manner. Creating such universal protocols, although possible, will be laborious and expensive.

Another concern is with the ethical handling of information. AI systems require vast amounts of data, and therefore, its implementation demands reliable methods of patient data de-identification. This is indispensable to ensure patient confidentiality, since one of the pillars of AI is data sharing. On the other hand, de-identified data needs to maintain its traceability, in order to allow individual practitioners to retrieve it and make the necessary decisions at the bedside. Three different models have been developed for data sharing in AI, and all have their advantages and disadvantages [2, [62] [63] [64] [65] .

Centralized models require sharing of large amounts of data by different sources (i.e institutions). This data is uploaded into a central server that carries out the algorithmic adjustments. Once trained, the central server shares the finalized algorithm with the individual sources for internal use. The main drawback of this model is that centralization of the information in the server may increase the risk of a security breach, as the individual source no longer controls the information (Figure 2) . In distributed or federated models, each source develops and adjusts its own algorithm with internal data. Once each source has fine-tuned their algorithm, they share its parameters with a central server. The central server then utilizes all the individual parameters to develop a centralized algorithm that later gets returned to the source for internal use (Figure 3 ). The main advantage of this model is that data is not shared with the central server. In hybrid models features of both models are present. A data repository is created to be shared by both data providers and the central server. The data repository then develops an algorithm using each individual institution's data before it gets sent to the central server. The central server then updates the master algorithm before it gets sent back to the data repository (Figure 4 ).

The quality of data currently utilized in AI needs to be improved. Most of the AI systems used in data analysis so far have been trained and tested with rather small datasets originating from within local institutions. This raises the issue of information bias. These datasets lack the degree of diversity necessary to mirror the scenarios human providers face during routine clinical practice. For AI systems to perform adequately, the datasets need to be sufficiently diverse in all the possible variables that come into play when making clinical decisions (demographics, medical and/or family history, physical and laboratory findings and others). Therefore, datasets need to analyzes all the data and develops and algorithm that is sent to each institution. This algorithm is then used by each institution to analyze its own internal data in the future.

Figure 3 Federated artificial intelligence information sharing system. Each individual institution develops its own algorithm with internal data. Once the algorihms are developed, their parameters are shared with the central server. The server then develops a master algorithm using all the individual parameters. The master algorighm is sent back to the institutions for its internal use.

originate from a variety of sources for them to be representative and inclusive, not from a limited number of large academic medical centers or research institutions.

Another shortcoming is the fact that the average ANN functions as a "black box" [66] . As such, how a specific variable in a dataset is weighted by specific nodes in the network is currently uninterpretable. When evaluating the performance of any ANN, clinicians, mathematicians and computer scientists need to understand the "reasoning" that occurs within the hidden layers. Although questioning of ANNs is possible through mathematical reasoning, it does not reflect clinical decision making. Understanding the way in which ANNs analyze information is paramount for improving their performance and correcting errors that can lead to fatal consequences.

Finally, much of the promise of applying AI in medical image analysis depends on the ability to get actionable results rapidly. The current need for multiple intricate post-processing steps prior to its analysis, indicates that much more work must be April 7, 2021

Volume 27 Issue 13 Figure 4 Hybrid artificial intelligence infromation sharing system. A data repository is created as an intermediary between the institutions and the central server. The data repository develops one or more algorithm(s) with the data. The parameters of these algorithms are then shared with the central server to create a master algorithm which is then returned to the repository. New data coming from the institutions is then used by the repository to create new parameters that are then sent to the central server to renew the master algorithm.

done to develop this into a technology that provides "on-the-fly" results [63] . Because of the limitations enumerated above, it is clear that major improvements to the technology need to occur before AI can support everyday activities in clinical practice. It is evident however, that medicine has reached a "point of no return" regarding application of AI. Overcoming these hurdles will require collaboration between academic centers, industry, computer scientists, venture capitalists, regulating organizations and governments ( Figure 5 ). The main goals of this collaboration should be streamlining development of standard data platforms, lowering technology costs and making the technology more "user friendly" so that any provider can use it in real time.

With further research, AI could have a large impact on the diagnosis and treatment of PDAC in the future. Novel screening methods are needed, and AI analysis of large comprehensive clinical datasets may yield opportunity for early detection or even predict development of PDAC before a visible lesion can be seen on imaging. An AI protocol which prescreens computed tomography or magnetic resonance imaging prior to a radiologist reading the study could ensure that lesions will not be missed due to human error. "On the fly" AI assistance with endoscopic ultrasound imaging could help the endosonographer optimally target a needle biopsy of a mass. The pathologist can be assisted by an AI algorithm that analyzes the biopsy on the slide that is being read. And AI could prove very useful for following response to treatment and even in suggesting optimal treatment regimens, with personalized treatment strategies based on biological profiling. While AI applications in PDAC are still in the very early stage of development, further investment in research could lead to substantial improvements in screening, early diagnosis, and treatment.

Smart Intelligent Computing and Applications

Performance of deep learning for differentiating pancreatic diseases on contrastenhanced magnetic resonance imaging: A preliminary study

Auto-segmentation of pancreatic tumor in multi-parametric MRI using deep convolutional neural networks

Automatic segmentation of abdominal anatomy by artificial intelligence (AI) in adaptive radiotherapy of pancreatic cancer

Markerless Pancreatic Tumor Target Localization Enabled By Deep Learning

Neural network dose models for knowledge-based planning in pancreatic SBRT

A machine learning algorithm predicts molecular subtypes in pancreatic ductal adenocarcinoma with differential response to gemcitabinebased vs FOLFIRINOX chemotherapy

A machine learning model for the prediction of survival and tumor subtype in pancreatic ductal adenocarcinoma from preoperative diffusion-weighted imaging

Use of Machine-Learning Algorithms in Intensified Preoperative Therapy of Pancreatic Cancer to Predict Individual Risk of Relapse

Development of a Novel Multiparametric MRI Radiomic Nomogram for Preoperative Evaluation of Early Recurrence in Resectable Pancreatic Cancer

Pancreatic cancer prediction through an artificial neural network

An absolute risk model to identify individuals at elevated risk for pancreatic cancer in the general population

Model to Determine Risk of Pancreatic Cancer in Patients With New-Onset Diabetes

Development of a prediction model for pancreatic cancer in patients with type 2 diabetes using logistic regression and artificial neural network models

Combining PubMed knowledge and EHR data to develop a weighted bayesian network for pancreatic cancer prediction

Artificial neural network for diagnosis of pancreatic cancer

An Evaluation of Artificial Neural Networks in Predicting Pancreatic Cancer Survival

Machine learning of clinical performance in a pancreatic cancer database

Challenges in the Development, Deployment, and Regulation of Artificial Intelligence in Anatomic Pathology

Whole Slide Imaging: Technology and Applications

Validation of Remote Digital Frozen Sections for Cancer and Transplant Intraoperative Services

Validation of a digital pathology system including remote review during the COVID-19 pandemic

Artificial intelligence applications in endoscopic ultrasound

Endoscopic Ultrasound for Early Diagnosis of Pancreatic Cancer

Computer-assisted cytologic diagnosis in pancreatic FNA: An application of neural networks to image analysis

Machine Learning and Network Analyses Reveal Disease Subtypes of Pancreatic Cancer and their Molecular Characteristics

Meta-analysis of transcriptome data identifies a novel 5-gene pancreatic adenocarcinoma classifier

PDAC-ANN: an artificial neural network to predict pancreatic ductal adenocarcinoma based on gene expression

The Cancer Cell Line Encyclopedia enables predictive modelling of anticancer drug sensitivity

Machine learning prediction of cancer cell sensitivity to drugs based on genomic and chemical properties

Distributed deep learning networks among institutions for medical imaging

Multi-Institutional Deep Learning Modeling Without Sharing Patient Data: A Feasibility Study on Brain Tumor Segmentation

Alternative models for sharing confidential biomedical data

Distributed learning on 20 000+ lung cancer patients -The Personal Health Train

Privacy and accountability in black-box medicine