key: cord-0050206-g6xwvg76 authors: Chen, Hsinchun; Zeng, Daniel; Yan, Ping title: ESSENCE date: 2009-07-14 journal: Infectious Disease Informatics DOI: 10.1007/978-1-4419-1278-7_10 sha: d15d78ad692850c07a3baa7e3188d00f026d04ee doc_id: 50206 cord_uid: g6xwvg76 The Electronic Surveillance System for the Early Notification of Community-Based Epidemics (ESSENCE) was developed by the Johns Hopkins University Applied Physics Laboratory (JHU/APL) in collaboration with the Maryland Department of Health and Mental Hygiene, the District of Columbia Department of Health, and the Virginia Department of Health under the sponsorship of the Defense Advanced Research Projects Agency (DARPA). It is now used in the Department of Defense Global Emerging Infections System (DoD-GEIS). It is currently deployed in the National Capital Area (NCA) (Lombardo et al., 2004). The system monitors both military and civilian healthcare data daily for early outbreak detection and warning, fusing information from multiple data sources that vary in their medical specificity, spatial organization, scale, and time-series behavior (Burkom et al., 2004). ESSENCE has gone through a series of important development stages, and its most current prototype is ESSENCE IV. Figure 10-1 shows the system architecture of ESSENCE. It collects public health status information from three major channels: clinical data, nonclinical syndromic data, and health events-related information. The accessibility of the collected information is managed by either disclosure control or sharing polices to ensure the privacy of personal healthcare information. Automated outbreak detection and alerting are supported. Situation and threat awareness and epidemiology investigation support are integrated with secured Web-based visualization and user interfaces. (DoD-GEIS). It is currently deployed in the National Capital Area (NCA) (Lombardo et al., 2004) . The system monitors both military and civilian healthcare data daily for early outbreak detection and warning, fusing information from multiple data sources that vary in their medical specificity, spatial organization, scale, and time-series behavior . ESSENCE has gone through a series of important development stages, and its most current prototype is ESSENCE IV. Figure 10-1 shows the system architecture of ESSENCE. It collects public health status information from three major channels: clinical data, nonclinical syndromic data, and health events-related information. The accessibility of the collected information is managed by either disclosure control or sharing polices to ensure the privacy of personal healthcare information. Automated outbreak detection and alerting are supported. Situation and threat awareness and epidemiology investigation support are integrated with secured Webbased visualization and user interfaces. 3. Information about external surveillance activities in the NCA. The process for data collection is automated. For the electronically available clinical or nonclinical data, the system constantly polls from the hospital information system via a query software . For publicly available health event-related news, the information is collected via electronic media. External surveillance activities are continually communicated among the public health officials and epidemiologists manually or electronically. Daily counts are placed into the following syndrome groups (Table 10-1) (Lombardo et al., 2004) . (Each of these groups is defined by a specific set of ICD-9 codes.) The free text chief complaints are processed and classified into syndrome categories with either a natural language processing algorithm or a weighted keyword matching based parser (Lombardo, activities. This information in most cases is available via electronic media. Likewise, the occurrence of high-profile events in the community may change improved timely reporting, optimized automated data transmission, and more . For spatial anomaly detection, the Kulldorff scan statistic as implemented in the SaTScan software is used as a primary spatial analysis tool. A modified version of scan statistic is also developed to produce approximate clusters of space-time interaction. In ESSENCE, the outbreak detection methods take a "data-fusion" approach that includes multiple data streams. Burkom and Elbert applied the Kulldorff statistic to multiple data sources in ESSENCE by treating them as covariates while using whatever spatial information is available in each source . A multiple univariate strategy can also be applied to the multiple data stream analysis, by treating each data stream separately with a univariate outbreak detection method. Then a consensus approach based on Bayes Belief Network (BBN) is used to combine the outputs of the multiple univariate algorithms to optimize the decision (Burkom et al., 2004) . The BBN approach increases the sensitivity while controlling the false-alert rate. Table 10-2 lists the three categories of outbreak detection methods currently employed in ESSENCE. The outbreak detection capacity with ESSENCE has been tested in a few studies. In the 2003 study , several outbreak scenarios were developed to test the performance, each scenario consisting of a series of real data streams with a simulated outbreak superimposed. The value of multiple data sources added to the detection performance is discussed by plotting the performance of the algorithms for respiratory syndrome as a function of the number of infected people and the involvement of different data sources (ER visits, absenteeism data, OTC influenza medication sales, and school absentee totals). It shows that the absenteeism data contributes to the timeliness in the detection by 2 days and require a smaller population of infected people. In the Bio-ALIRT evaluation program, three of the ESSENCE's detection algorithms (Provider-count-adjusted MSPC, multiple univariate EWMA, and Bayes Belief Network combination) aggregating multiple data sources were tested for respiratory or gastrointestinal syndromes . Sensitivity and timeliness are measured as performance assessment metrics. The performance results of the three methods are summarized in (Lombardo et al., 2004) . In general, the provider-count-adjusted MSPC and multiple univariate EWMA algorithm reduce the median detection time by 5 days for the most constrained false-alert rate, whereas the BBN improved timeliness by 2 days. The BBN also detected an additional outbreak at the lowest specificity (Lombardo et al., 2004 ). ESSENCE provides a map-based visualization tool that can display both raw case/event data and clusters/hotspots identified by scan statistics. The user can enter zip codes or click on an area on the map to select subsets of data of interest. The details about cases or events are presented as tables or time-series graphs. ESSENCE provides the second portal listing alerts generated as the output of the detection processes. These lists consist of color-coded flags to indicate the extent of deviations from the baseline normalcy. Upper confidence limits (UCLs) for the daily predictions are computed and used as thresholds for alerts. If an observed case count exceeds the 95% UCL but not the 99% UCL, a low-level (yellow) alert is generated. If the count exceeds the 99% UCL, a high-level (red) flag results. The user can organize the alert lists for selected data of interest. They can also sort these lists by various criteria and access data or port them to the map visualization tool to view the spatial distribution of flagged cases. The third ESSENCE tool, the query portal, allows a user to select subsets of data and data elements from drop-down menus and view these data elements over a user-specified timeframe as graphs or tables. The fourth portal can be used to generate summary reports, which can then be exported outside of ESSENCE for further analysis. The user can select any data elements in the archive and view historic counts as well as upward or downward trends. Information dissemination in ESSENCE is based on user roles and jurisdictions. A basic function of ESSENCE is to deliver alerts and surveillance information to the military and civilian public health authorities in the NCA. The system provides detection outputs as well as the details of underlying data streams via secure Web sites. Information is provided in many separate information layers. In ESSENCE, this data layering approach was implemented to facilitate the distribution to various user roles. "For example, a user who logs on from an emergency room may be able to see only the emergency room data from his or her jurisdiction, whereas a user recognized as a director of epidemiology would have access to all the information within his or her jurisdiction as well as the shared information from the surrounding jurisdictions in the region" . provide the following project link and some key readings for the readers who might be interested in learning more details about the ESSENCE system A systems overview of the Electronic Surveillance System for the Early Notification of Community-based Epidemics (ESSENCE II) Role of Data Aggregation in Biosurveillance Detection Strategies with Applications from ESSENCE Electronic Surveillance System for the Early Notification of Community-Based Epidemics (ESSENCE II), Framework for Evaluating Syndromic Surveillance Systems