key: cord-0785738-2uddatfh authors: Kotorov, Rado; Chi, Lianhua; Shen, Min title: Personalized Monitoring Model for Electrocardiogram Signals: Diagnostic Accuracy Study date: 2020-12-29 journal: JMIR Biomed Eng DOI: 10.2196/24388 sha: a80dc9bd243276de63a4b24f63a093a0d8f7cba6 doc_id: 785738 cord_uid: 2uddatfh BACKGROUND: Due to the COVID-19 pandemic, the demand for remote electrocardiogram (ECG) monitoring has increased drastically in an attempt to prevent the spread of the virus and keep vulnerable individuals with less severe cases out of hospitals. Enabling clinicians to set up remote patient ECG monitoring easily and determining how to classify the ECG signals accurately so relevant alerts are sent in a timely fashion is an urgent problem to be addressed for remote patient monitoring (RPM) to be adopted widely. Hence, a new technique is required to enable routine and widespread use of RPM, as is needed due to COVID-19. OBJECTIVE: The primary aim of this research is to create a robust and easy-to-use solution for personalized ECG monitoring in real-world settings that is precise, easily configurable, and understandable by clinicians. METHODS: In this paper, we propose a Personalized Monitoring Model (PMM) for ECG data based on motif discovery. Motif discovery finds meaningful or frequently recurring patterns in patient ECG readings. The main strategy is to use motif discovery to extract a small sample of personalized motifs for each individual patient and then use these motifs to predict abnormalities in real-time readings of that patient using an artificial logical network configured by a physician. RESULTS: Our approach was tested on 30 minutes of ECG readings from 32 patients. The average diagnostic accuracy of the PMM was always above 90% and reached 100% for some parameters, compared to 80% accuracy for the Generalized Monitoring Models (GMM). Regardless of parameter settings, PMM training models were generated within 3-4 minutes, compared to 1 hour (or longer, with increasing amounts of training data) for the GMM. CONCLUSIONS: Our proposed PMM almost eliminates many of the training and small sample issues associated with GMMs. It also addresses accuracy and computational cost issues of the GMM, caused by the uniqueness of heartbeats and training issues. In addition, it addresses the fact that doctors and nurses typically do not have data science training and the skills needed to configure, understand, and even trust existing black box machine learning models. Background An electrocardiogram (ECG) is a medical test that records the electrical activities of the heart. It is widely used by medical practitioners for diagnosing cardiac conditions by detecting irregular heart rhythms and abnormalities [1] . In some cases, arrhythmic heartbeats can be lethal and the risk of sudden death is significant without remote patient monitoring (RPM) [2] . Therefore, it is highly desirable for patients to have an efficient ECG remote monitoring system that can identify life-threatening situations and send alerts to their health care providers [3] . Lately, the demand for remote ECG monitoring has increased drastically because of the COVID-19 pandemic. To prevent the spread of the virus and keep individuals with less severe cases out of hospitals, more patients are having heart disease diagnosed and monitored remotely, while at home. The accuracy of the ECG signal classifier is becoming more important because false alarms can overwhelm the system. Therefore, classifying the ECG signals accurately and sending alerts to health care professionals in a timely fashion are urgent problems that need to be addressed. The classification of ECG signals is an extremely challenging problem as there are no defined optimal classification rules. Many researchers have focused on developing different machine learning models, such as Bayesian framework [4] , random forest [5] , gradient boosting [6] , ensemble boosting [7] , and support vector machine [8] , among others, and achieved relatively high accuracy. To extract different features for models, various techniques were proposed, such as principal component analysis [9] [10] [11] [12] , wavelet transform [13] , and filter banks [14] . Deep learning methods such as deep neural networks [15] , convolutional neural networks [16] [17] [18] [19] [20] , and recurrent neural networks [21] are also applied extensively to classification problems. Deep learning can be a powerful tool for solving cognitive problems [22] , but accurately training such models requires large amounts of labeled data [23] . For clinical applications, due to limited patient contact, variation in medical care, and privacy issues, getting a large amount of high-quality data can be very challenging [24] , and the efficacy of deep learning methods can be greatly affected by the lack of training data [25] [26] [27] [28] [29] . According to Chen et al's [30] investigation of the training time of deep learning and machine learning methods, deep learning requires a longer training time compared to conventional machine learning algorithms. In addition, building and maintaining the computational infrastructure required for deep learning can be too costly for small health care organizations to implement. Thus, a less computationally expensive method is needed to effectively resolve the issue. Another limitation of deep learning methods is that the models are not able to capture the individuality of the ECG features and patterns [31] . Most of the deep learning models are generalized models and are not able to be built to the individual level due to a lack of data [26] . However, each patient has unique heartbeats and the waveforms can be completely different on an individual level. Hence, accuracy might be an issue for these models when using real-time data. Traditional machine learning methods usually require more effort related to data preprocessing and feature engineering compared to deep learning models [29] . In addition, they tend to be like black boxes to medical practitioners without a data science background. The lack of interpretability can hinder health care providers' decision making process and communication with patients. In this paper, we propose a Personalized Monitoring Model (PMM) for ECG monitoring based on motif discovery to address the abovementioned challenges. Motif discovery is a method for analyzing large amounts of time series data. In the health care domain, it has been used for trend analysis and data summarization [32] . Motifs are defined as frequently recurring patterns in certain time series [33] . In a motif discovery process, a similarity search is conducted based on a certain similarity threshold to detect and locate previously defined patterns. In a similarity search, the distances between time series subsequences are calculated, which indicates how similar two subsequences are. The primary aim of this research is to create a robust and easy-to-configure solution for monitoring ECG signals in real-world settings. We developed a technique for building personalized prediction models to address the limitations of generalized models [31] . The main strategy of the model is to extract personalized motifs for each patient and use the motifs to predict the rest of the readings of that patient using an artificial logical network. By performing a systematic analysis and evaluation, we will investigate the hypothesis that the proposed PMM is more accurate and efficient than generalized models. In most cases, doctors and nurses do not have a data science background and the existing machine learning models might be difficult to configure. Hence, a new technique will be required as RPM becomes more common, as has occurred due to the COVID-19 pandemic. The main goal is to develop a technique that allows doctors, nurses, and other medical practitioners to easily configure a personalized model for RPM. The proposed model can be easily understood and configured by medical practitioners, since it requires less training data and fewer parameters to configure. In this section, we discuss the proposed PMM for ECG data in detail. The process includes time series sampling, personalized motif discovery, and motif-based prediction using an artificial logical network. We treated each patient's ECG measures as individual data; the recording is a time series. An ECG is a medical test that detects heart problems by measuring the electrical activity generated by the heart as it contracts. An ECG complex is composed of different components, or waves, that represent the electrical activity in specific regions of the heart. ECG readings from healthy hearts have a characteristic shape. If the ECG reading is a different shape, that could suggest a heart problem. During this stage, there is an important parameter, t, which represents the training ratio (0