key: cord-0987267-h5ehfhat authors: Donohue, Cara; Khalifa, Yassin; Perera, Subashan; Sejdić, Ervin; Coyle, James L. title: Characterizing Effortful Swallows from Healthy Community Dwelling Adults Across the Lifespan Using High-Resolution Cervical Auscultation Signals and MBSImP Scores: A Preliminary Study date: 2021-09-18 journal: Dysphagia DOI: 10.1007/s00455-021-10368-3 sha: a4e5a86a597ce5eddfd84fd0366dbf0f76e4f66e doc_id: 987267 cord_uid: h5ehfhat There is growing enthusiasm to develop inexpensive, non-invasive, and portable methods that accurately assess swallowing and provide biofeedback during dysphagia treatment. High-resolution cervical auscultation (HRCA), which uses acoustic and vibratory signals from non-invasive sensors attached to the anterior laryngeal framework during swallowing, is a novel method for quantifying swallowing physiology via advanced signal processing and machine learning techniques. HRCA has demonstrated potential as a dysphagia screening method and diagnostic adjunct to VFSSs by determining swallowing safety, annotating swallow kinematic events, and classifying swallows between healthy participants and patients with a high degree of accuracy. However, its feasibility as a non-invasive biofeedback system has not been explored. This study investigated 1. Whether HRCA can accurately differentiate between non-effortful and effortful swallows; 2. Whether differences exist in Modified Barium Swallow Impairment Profile (MBSImP) scores (#9, #11, #14) between non-effortful and effortful swallows. We hypothesized that HRCA would accurately classify non-effortful and effortful swallows and that differences in MBSImP scores would exist between the types of swallows. We analyzed 247 thin liquid 3 mL command swallows (71 effortful) to minimize variation from 36 healthy adults who underwent standardized VFSSs with concurrent HRCA. Results revealed differences (p < 0.05) in 9 HRCA signal features between non-effortful and effortful swallows. Using HRCA signal features as input, decision trees classified swallows with 76% accuracy, 76% sensitivity, and 77% specificity. There were no differences in MBSImP component scores between non-effortful and effortful swallows. While preliminary in nature, this study demonstrates the feasibility/promise of HRCA as a biofeedback method for dysphagia treatment. Within clinical settings, a common challenge for dysphagia practitioners remains the lack of inexpensive, portable, and non-invasive dysphagia management methods available for assessment and treatment. To diagnose dysphagia, instrumental methods remain the gold standard [e.g., videofluoroscopy (VF), fiberoptic endoscopic evaluation of swallowing (FEES)]. While these methods are objective and provide insight into swallowing physiology, there are limitations to performing them including the cost, limited access in some settings (and countries), exposure to radiation (i.e., VF), and inability for some patients to participate in the examination (e.g., patient size, COVID-19 restrictions, patient desire to forgo further imaging studies). In addition to this, few accurate and non-invasive methods to provide biofeedback during dysphagia treatment are readily available within clinical settings and few clinicians are trained in deploying these methods [1] . FEES and VF have been implemented as biofeedback methods for dysphagia treatment and have been shown to be advantageous for patient/caregiver education and developing individualized treatment plans [2] [3] [4] . In fact, clinician feedback and participant/patient performance have been shown to be more accurate for certain swallowing maneuvers using VF compared to other biofeedback methods [e.g., surface electromyography (sEMG)] [3, 4] . However, dysphagia treatment using only VF for biofeedback is unrealistic within clinical settings due to the cost, radiation exposure, and time constraints/accessibility [3, 4] . Due to the limitations of FEES and VF as biofeedback methods for treatment, other non-invasive modalities such as sEMG have been explored. Yet a study examining concurrent VF and sEMG found very weak to moderate correlations between submental sEMG durations and temporal kinematic measures of hyolaryngeal displacement using VF images when participants performed the Mendelsohn maneuver [5] . A recent systematic review that examined biofeedback methods used in dysphagia treatment found that accelerometry, sEMG, and tongue manometry were the most frequently used in research studies [6] . In three studies, visual biofeedback using sEMG and accelerometry led to significantly improved hyoid bone displacement (compared to a control) during dysphagia treatment that targeted functional swallowing exercises such as the effortful swallow and Mendelsohn maneuver [6] . While these results are promising, study limitations included small sample sizes, the heterogeneity of patients, and mixed evidence regarding whether biofeedback results in clinically meaningful, functional changes in swallowing [5] [6] [7] [8] [9] . More specifically in studies using accelerometry, low quality studies have been implemented with flawed study designs and the use of subjective and non-validated swallowing outcome measures [6] [7] [8] [9] . Due to the limitations of current biofeedback modalities, innovative methods for providing continuous monitoring and biofeedback during dysphagia treatment are under investigation. One such modality is a novel wearable electromyography sensor-array patch that has demonstrated similar signal quality as traditional, commercially available sEMG during water swallow tasks [10] . Another potential biofeedback modality currently being explored is high-resolution cervical auscultation (HRCA) [11] . HRCA is a method of characterizing swallow function that integrates information from acoustic and vibratory signals from non-invasive sensors (contact microphone, tri-axial accelerometer) attached to the anterior laryngeal framework during swallowing. Following collection of HRCA signals, HRCA signal features are extracted using advanced signal processing techniques to use the HRCA signal features as input to machine learning algorithms to provide insight into swallowing physiology using human ratings of VF images as the "ground truth." HRCA has demonstrated promise as a dysphagia screening method and potential diagnostic adjunct to VF by classifying safe and unsafe swallows (as measured by the penetration-aspiration scale) [11] [12] [13] [14] [15] [16] [17] , tracking hyoid bone displacement in healthy adults and patients with suspected dysphagia [18, 19] , annotating temporal swallow kinematic events in healthy adults and patients with suspected dysphagia (e.g., durations of upper esophageal sphincter opening and laryngeal vestibule closure) [20] [21] [22] , categorizing swallows between healthy participants and different patient populations [23, 24] , and detecting clinical ratings of swallow physiology in patients with suspected dysphagia using the Modified Barium Swallow Impairment Profile (MBSImP) [25] with a high degree of accuracy [19, 21] . However, the utility of HRCA's capabilities to noninvasively characterize these physiologic events, many of which are targets of behavioral augmentation via compensatory swallowing maneuvers (e.g., effortful swallow, Mendelsohn maneuver), and differentiate between swallows in which they are accurately deployed without imaging verification, has yet to be investigated. In our clinical work, we have observed difficulty by patients in generalizing training in these maneuvers to accurate performance when assessed using VF, likely due to the lack of ongoing performance evidence in the training stage in which mass practice is deployed in clinic and home programs. Success of such an effort to provide ongoing, noninvasive indications of accurate or inaccurate performance would be of value in demonstrating preliminary efficacy of HRCA as a potential biofeedback method for dysphagia treatment. Compensatory swallowing maneuvers (e.g., effortful swallow, Mendelsohn maneuver) are common dysphagia rehabilitation techniques that are used to improve swallow function in patients with dysphagia by altering swallowing physiology. The effortful swallow is one type of compensatory swallowing maneuver that is frequently deployed in clinical settings for patients with dysphagia and has been explored in research studies in both healthy adults and patients with dysphagia. Following dysphagia treatment targeting the effortful swallow, some patients with dysphagia have exhibited decreased pharyngeal residue and decreased penetration/aspiration, but no changes in upper esophageal sphincter (UES) opening diameter, duration of UES opening, laryngeal elevation, or hyoid movement [26] [27] [28] [29] . In healthy adults, the research evidence is mixed regarding the impact of effortful swallows on swallowing physiology. For example, one study in healthy adults found that effortful swallows led to longer durations for temporal swallow kinematic measurements (e.g., hyoid movement duration, duration of UES opening) and increased pyriform sinus residue [30] . Other studies in healthy adults have found no differences in airway protection or swallowing efficiency between non-effortful and effortful swallows [29] . While research studies have examined differences in temporal kinematic measurements between non-effortful and effortful swallows in healthy adults, no studies have examined differences between non-effortful and effortful swallows using a clinical rating tool (e.g., MBSImP), and few researchers have investigated non-invasive or non-imaging alternatives to VF that are capable of determining whether the effortful swallow maneuver is accurately performed once a patient has been properly trained. Such a system holds potential for enhancing clinician judgment of accurate performance (e.g., clinician feedback to patient) which is the source of accurate clinical cuing and patient performance for effortful swallows to mitigate maladaptive learning of the maneuver [3, 4] . Therefore, this study investigated 1. Whether HRCA can differentiate between non-effortful and effortful swallows performed by the same individuals; 2. Whether there are differences in MBSImP components #9 (anterior hyoid excursion), #11 (laryngeal vestibular closure), and #14 (pharyngoesophageal segment opening) between non-effortful and effortful swallows. We hypothesized that HRCA combined with signal processing and machine learning algorithms would classify swallows as non-effortful or effortful with a high degree of accuracy and that there would be differences in MBSImP component scores #9, #11, and #14 between non-effortful and effortful swallows. The Institutional Review Board for this institution approved this research study. All participants provided written informed consent. Data analyses were performed on 247 thin liquid swallows from 36 healthy community dwelling adults across the lifespan (19 male) between the ages of 49 and 86 (mean age 65.53 ± 7.67 years). This subset of data is part of an ongoing prospective study that aims to analyze swallow function in healthy community dwelling adults across the lifespan. Participants were enrolled in this research study based on the following inclusionary criteria based on participant report: no history of swallowing difficulties, history of a neurological disorder, prior surgery to the head or neck region, or chance of being pregnant (female participants). Participants underwent a standardized videofluoroscopic swallow study (VFSS) procedure with concurrent HRCA and were imaged in the lateral plane. For non-effortful swallows, participants swallowed ten thin liquid boluses in a randomized presentation order (five 3 mL boluses via spoon, five self-selected "comfortable" cup sips). For the 3 mL boluses via spoon, participants were instructed to "Hold the liquid in your mouth and wait until I tell you to swallow it." For the comfortable cup sips, participants were instructed to "Take a comfortable sip of liquid and swallow it whenever you're ready." For effortful swallows, participants swallowed one practice thin liquid water bolus and two 3 mL thin liquid barium boluses via spoon. During the practice effortful swallow, participants were instructed to "Swallow hard using all your throat muscles." For the effortful swallows that were recorded using VFSSs, participants were instructed to "Hold the liquid in your mouth and wait until I tell you to swallow it" and then to "Swallow hard" during the exam. For analyses purposes, only the 3 mL thin liquid boluses via spoon were used to compare the non-effortful and effortful swallows to minimize variation (e.g., bolus volume, utensil, command swallow). See Table 1 for the bolus characteristics for swallows used for analyses for this study. The average fluoro time for participants was 1.06 min. VFSS procedures were conducted using a standard fluoroscopy system (Precision 500D system, GE Healthcare, LLC, Waukesha, WI) at a pulse rate of 30 pulses per second (PPS). A frame grabber module (AccuStream Express HD, Foresight Imaging, Chelmsford, MA) captured the raw video signals at a rate of 73 frames per second (FPS). Prior to analysis, the video files were downsampled to 30FPS. HRCA signals were collected concurrently during the VFSSs via a tri-axial accelerometer (ADXL 327, Analog Devices, Norwood, Massachusetts) that was powered by a 3 V output (model 1504, BK Precision, Yorba Linda, California) and a contact microphone. The accelerometer and contact microphone were placed in custom casings to ensure adequate contact for signal acquisition during data collection. The non-invasive HRCA sensors were placed on the anterior laryngeal framework at the level of the cricoid cartilage with tape after cleaning participants' neck region with alcohol pads. The sensors were carefully placed to avoid interfering with VFSS images, to ensure adequate signal acquisition, and to ascertain alignment of the tri-axial accelerometer with the participant's neck [11, 31] . The precise placement of the accelerometer and contact microphone can be viewed in Fig. 1 . Signals from the accelerometer and the microphone were hardware bandpass filtered from 0.1 to 3000 Hz (model P55, Grass Technologies, Warwick, Rhode Island), amplified, and digitized using a data acquisition device (National Instruments 6210 DAQ) at a sampling rate of 20 kHz with the Signal Express program in LabView (National Instruments, Austin, Texas). Before analysis, the signals were then downsampled to 4 kHz to smooth out high frequency noise. Prior to data analysis for this study, one trained rater segmented video files into individual swallow segments with ongoing intra-rater reliability within a 3-frame tolerance of 100% based on randomly re-coding one out of every ten swallows. Another trained rater coded 10% of swallows for inter-rater reliability with intra-class coefficients (ICCs) of at least 0.9 [32] . The methods for swallow segmentation have been described in previous publications [14, 33] . No other temporal kinematic measurements were performed aside from identifying the onset and offset of each swallow, and the sole purpose of these measurements was to segment the video files into individual swallows. An MBSImP-certified clinician completed all MBSImP ratings for components #9, #11, and #14. Before performing swallowing ratings, inter-rater reliability was established by completing the MBSImP reliability test with at least 80% exact agreement for all MBSImP component scores. Ongoing intra-rater reliability was maintained by randomly selecting one swallow to re-code every ten swallows with 100% exact agreement. Inter-rater reliability was conducted on 10% of swallows by another certified MBSImP clinician with 79% exact agreement for components #9, #11, and #14. An autoregressive model was used to build a digital finite impulse filter to remove the device noise associated with each of the sensors. The filters were designed to remove the baseline noise present in the sensors' output when no physical input is applied. Afterward, motion artifacts and low frequency noise, such as head movement, were removed using fourth-order splines. Finally, wavelet denoising was used to eliminate any additional noise that might exist in the signals Fig. 1 Placement of HRCA neck sensors during data collection [21] [17, 20, 21] . The onset and offset of swallows were taken from the segmented videos after applying the proper sampling mapping between videos and signals. The signals were then segmented using the mapped onset and offset times for feature extraction [33] . A summary of the features extracted from the HRCA signals and the explanations of their meanings can be viewed in Table 2 . Nine features were extracted from the contact microphone and the three directions of the tri-axial accelerometer (anterior-posterior, superior-inferior, medial-lateral) for a total of 36 signal features. This set of features has been proven effective in differentiating between HRCA signals from different types of swallows and extraction of multiple swallow kinematics [17, 21, 24, 34] . We fit linear mixed models to examine the association between HRCA signal features, non-effortful swallows, and effortful swallows. We used multiple supervised machine learning classifiers (e.g., support vector machines [SVM], Naïve Bayes, decision trees, linear discriminant analysis) that use HRCA signal features as input to classify swallows as non-effortful or effortful. The supervised machine learning classifiers were deployed using the entire set of HRCA signal features (n = 36), the features that were statistically significant (n = 9), and the linearly independent features [as determined by performing a principal component analysis (PCA)]. A leave-one-out procedure was used to evaluate the classification accuracy of all the used classifiers. A leaveone-out procedure involves training the classifier on the entire data set except for one randomly selected swallow which is used to test the accuracy of the classifier. This training and testing procedure were repeated until all swallows in the data set were tested at least once. The accuracy, sensitivity, and specificity of all supervised machine learning classifiers were then calculated. We fit linear mixed models to determine if there were differences in MBSImP component scores #9, #11, and #14 between the non-effortful and the effortful swallows. SPSS (IBM, Armonk, NY) was used to fit the linear mixed models. MATLAB (The MathWorks, Inc., Natick, MA) and R (The R Foundation) were used to construct and evaluate the performance of the supervised machine learning classifiers. Results revealed that there was a statistically significant (p < 0.05) difference in 9 HRCA signal features between the non-effortful and effortful swallows. Complete results from the linear mixed model can be viewed in Table 3 . From the microphone signals, statistically significant features included standard deviation, peak frequency, spectral centroid, bandwidth, and wavelet entropy. From the anterior-posterior and medial-lateral accelerometer axes, standard deviation was the only statistically significant feature. From the superior-inferior accelerometer axis, statistically significant features included standard deviation and wavelet entropy. Figures 2 and 3 illustrate two examples of the differences in signal features (e.g., standard deviation, peak frequency) between the non-effortful and effortful swallows. After evaluating the performance of all supervised machine learning classifiers using the entire set of HRCA signal features (36), the features that were statistically significant (9) , and the statistically independent features; decision trees and linear discriminant analysis had the best performance. Using the 9 most significant HRCA signal features as input, decision trees classified swallows as non-effortful or effortful with 76% accuracy, 76% sensitivity, and 77% specificity. A complete summary of the performance of the different supervised machine learning classifiers can be viewed in Table 4 . For MBSImP component scores, results from the linear mixed model revealed that there were no significant differences (p > 0.05) in MBSImP component scores #9, #11, and #14 between the non-effortful and effortful swallows. Table 5 shows a complete summary of the MBSImP component scores for the non-effortful and effortful swallows. This study found that HRCA combined with advanced signal processing and machine learning techniques could accurately and autonomously classify swallows from healthy adults as non-effortful or effortful without imaging. This is of particular clinical interest given the results indicating that analysis of the VF data, which is commonly used to confirm treatment effect in training of the effortful swallow, did not generate significant differences in the MBSImP components measured. While preliminary in nature, these results provide evidence regarding the potential of HRCA as a biofeedback method and an indicator of accurate performance for use by the clinician in providing reinforcement to the patient, for dysphagia treatment protocols in the future. These results are especially encouraging given that participants had minimal training (i.e., one practice swallow) prior to performing two effortful swallows during the videofluoroscopic evaluation. Having an inexpensive, non-invasive, portable, and easy-to-use method for providing biofeedback during dysphagia treatment would significantly improve current dysphagia management of patients by providing clinicians and patients with immediate insight into performance of swallowing maneuvers and exercises such as the effortful swallow. These findings expand upon previous research studies that have demonstrated the potential of HRCA as an effective dysphagia screening method and adjunct to VF when instrumental swallow evaluations are not feasible and provide evidence to support pursuing HRCA as a biofeedback method. Interestingly, in addition to these findings, we did not detect a statistically significant difference in MBSImP component scores (#9, #11, #14) between non-effortful and effortful swallows. These results contribute to the mixed evidence base examining differences between non-effortful and effortful swallows in healthy adults [29, 30] . It is also possible that MBSImP ratings may not be sensitive enough to detect subtle changes in swallowing physiology because the MBSImP is a somewhat subjective, ordinal rating scale with a limited range of scores. This may particularly be true in the present study because all participants were healthy community dwelling adults with no report of swallowing difficulties; leading to a ceiling effect with MBSImP ratings. Future studies should replicate this research work by examining HRCA's ability to classify non-effortful and effortful swallows with a larger sample of swallows that includes swallows from both healthy adults and patients with dysphagia. Including a larger and more variable sample of swallows will assist in improving the accuracy of the supervised machine classifier and may also allow us to detect differences in MBSImP component scores between non-effortful and effortful swallows. In addition to this, future studies should examine HRCA's ability to provide real-time continuous biofeedback during a treatment session targeting effortful swallows. It will be important to explore HRCA's ability to provide insight into performance of other swallowing maneuvers/exercises that would benefit from biofeedback (e.g., Mendelsohn maneuver) as well. The purpose of this research study was to determine the efficacy of HRCA as an inexpensive, non-invasive, and portable dysphagia biofeedback method. Because of the preliminary nature of this study, a relatively small sample of swallows were included for analyses (n = 247), which may have resulted in inadequate statistical power for comparing MBSImP component scores between non-effortful and effortful swallows. In addition to this, only swallows from healthy community dwelling adults across the lifespan were included in the analysis and only three MBSImP component scores were examined. This likely limited the range of swallows included (e.g., finite range of MBSImP component scores, limited severity range) and also limits the generalization of findings to patients with dysphagia in clinical settings. Due to time constraints while collecting data in a University hospital, participants received minimal training or practice prior to completing effortful swallows, which may have impacted their performance. In addition to this, we did not confirm accurate performance of effortful swallows with a validated measurement tool such as sEMG or manometry, Minimal distention/partial duration; marked obstruction of flow (2) 0 (0) 3 (4.23) No distention with total obstruction of flow (3) 0 (0) 0 (0) so it is possible not all participants performed this compensatory maneuver correctly. Data were collected using a strict, standardized VF protocol to minimize radiation exposure to healthy community dwelling adults. As such, participants only swallowed thin liquid boluses and only two effortful swallows were collected from each participant during the VF procedure. Future studies should examine HRCA's ability to classify non-effortful and effortful swallows across various conditions (e.g., bolus volume, bolus viscosity, utensil) and across more trials from each participant. This preliminary study found that HRCA signal features combined with decision trees and linear discriminant analysis classified swallows as non-effortful or as an effortful swallow with up to 76% accuracy, 76% sensitivity, and 77% specificity. These results provide promising evidence regarding the efficacy of using HRCA as a monitoring system and biofeedback method for dysphagia treatment in the future. Future studies should expand upon these findings to improve the machine learning algorithm performance and to further validate HRCA as a biofeedback method by analyzing a larger number of swallows (e.g., patient and healthy community dwelling adults) and by exploring the efficacy of using HRCA as a biofeedback method for other dysphagia treatment targets (e.g., Mendelsohn maneuver). This inexpensive, non-invasive, and portable method has the potential to transform dysphagia rehabilitation by providing real-time feedback regarding treatment performance. Basic concepts of surface electromyographic biofeedback in the treatment of dysphagia: a tutorial Use of fiberoptic endoscopic evaluation of swallowing (FEES) in patients with amyotrophic lateral sclerosis Kinematic visual biofeedback improves accuracy of learning a swallowing maneuver and accuracy of clinician cues during training Kinematic visual biofeedback improves accuracy of swallowing maneuver training and accuracy of clinician cues during training in stroke patients with dysphagia The relationship between submental surface electromyography and hyo-laryngeal kinematic measures of Mendelsohn maneuver duration Does therapy with biofeedback improve swallowing in adults with dysphagia? A systematic review and meta-analysis Biofeedback therapy using accelerometry for treating dysphagic patients with poor laryngeal elevation: case studies Swallowing training combined with game-based biofeedback in poststroke dysphagia Development of innovative feedback device for swallowing therapy Validation of a novel wearable electromyography patch for monitoring submental muscle activity during swallowing: a randomized crossover trial Dysphagia screening: contributions of cervical auscultation signals and modern signal-processing techniques Classification of penetration-aspiration versus healthy swallows using dual-axis swallowing accelerometry signals in dysphagic subjects A comparative analysis of swallowing accelerometry and sounds during saliva swallows A comparative analysis of DBSCAN, K-means, and quadratic variation algorithms for automatic identification of swallows from swallowing accelerometry signals Baseline characteristics of cervical auscultation signals during various head maneuvers Deep learning for classification of normal swallows in adults Silent Aspiration Detection in High Resolution Cervical Auscultations Neck sensor-supported hyoid bone movement tracking during swallowing Tracking hyoid bone displacement during swallowing without videofluoroscopy using machine learning of vibratory signals Upper esophageal sphincter opening segmentation with convolutional recurrent neural networks in high resolution cervical auscultation How closely do machine ratings of duration of UES opening during videofluoroscopy approximate clinician ratings using temporal kinematic analyses and the mbsimp? Dysphagia Estimation of laryngeal closure duration during swallowing without invasive X-rays Detection of swallow kinematic events from acoustic high-resolution cervical auscultation signals in patients with stroke A preliminary investigation of whether HRCA signals can differentiate between swallows from healthy people and swallows from people with neurodegenerative diseases MBS measurement tool for swallow impairment-MBSImp: establishing a standard Videomanometric analysis of supraglottic swallow, effortful swallow, and chin tuck in patients with pharyngeal dysfunction Supraglottic swallow, effortful swallow, and chin tuck did not alter hypopharyngeal intrabolus pressure in patients with pharyngeal dysfunction A therapeutic maneuver for oropharyngeal dysphagia in patients with Parkinson's disease A systematic review of the physiological effects of the effortful swallow maneuver in adults with normal and disordered swallowing Alterations to swallowing physiology as the result of effortful swallowing in healthy seniors Methodology for detecting swallowing sounds Intraclass correlations: uses in assessing rater reliability Non-invasive identification of swallows via deep learning in high resolution cervical auscultation recordings Characteristics of dry chin-tuck swallowing vibrations and sounds We have no conflict of interest to declare.Ethical Approval All procedures performed in studies involving human participants were in accordance with the ethical standards of the institutional and/or national research committee and with the 1964 Helsinki declaration and its later amendments or comparable ethical standards. Informed consent was obtained from all individual participants included in the study.