key: cord-0922561-j3s94pym
authors: Shen, Chih-Hsiung; Chen, Wei-Lun; Wu, Jung-Jie
title: Research on Multiple Spectral Ranges with Deep Learning for SpO(2) Measurement
date: 2022-01-02
journal: Sensors (Basel)
DOI: 10.3390/s22010328
sha: ba02916bd868cd3ee1b927f60c11ec3a345a57f0
doc_id: 922561
cord_uid: j3s94pym

Oxyhemoglobin saturation by pulse oximetry (SpO(2)) has always played an important role in the diagnosis of symptoms. Considering that the traditional SpO(2) measurement has a certain error due to the number of wavelengths and the algorithm and the wider application of machine learning and spectrum combination, we propose to use 12-wavelength spectral absorption measurement to improve the accuracy of SpO(2) measurement. To investigate the multiple spectral regions for deep learning for SpO(2) measurement, three datasets for training and verification were built, which were constructed over the spectra of first region, second region, and full region and their sub-regions, respectively. For each region under the procedures of optimization of our model, a thorough of investigation of hyperparameters is proceeded. Additionally, data augmentation is preformed to expand dataset with added noise randomly, increasing the diversity of data and improving the generalization of the neural network. After that, the established dataset is input to a one dimensional convolution neural network (1D-CNN) to obtain a measurement model of SpO(2). In order to enhance the model accuracy, GridSearchCV and Bayesian optimization are applied to optimize the hyperparameters. The optimal accuracies of proposed model optimized by GridSearchCV and Bayesian Optimization is 89.3% and 99.4%, respectively, and trained with the dataset at the spectral region of six wavelengths including 650 nm, 680 nm, 730 nm, 760 nm, 810 nm, 860 nm. The total relative error of the best model is only 0.46%, optimized by Bayesian optimization. Although the spectral measurement with more features can improve the resolution ability of the neural network, the results reveal that the training with the dataset of the shorter six wavelength is redundant. This analysis shows that it is very important to construct an effective 1D-CNN model area for spectral measurement using the appropriate spectral ranges and number of wavelengths. It shows that our proposed 1D-CNN model gives a new and feasible approach to measure SpO(2) based on multi-wavelength.

The oxyhemoglobin saturation by pulse oximetry (SpO 2 ) is an important health indicator. In pulse oximetry, the oxyhemoglobin saturation of a healthy person is usually 95-100%, and a reading below 90% is considered a dangerously low value. Since shortness of breath is the main symptom of severe COVID-19, pulse oximetry is used as an early warning sign to track their blood oxygen levels in case patients with supplemental oxygen in the hospital. SpO 2 monitoring helps doctors understand whether the patient's vital signs are stable and make appropriate treatment immediately.

At present, the measurement of SpO 2 with photoelectric technology uses red light near 660 nm and near-infrared light near 940 nm as the light source [1, 2] , with PPG (Photoplethysmography) to read the light intensity and convert the signal into sequence data for further analysis. PPG has developed into two types of measurement principle, namely a transmission-type and reflection-type. The reflection system is common in We present here the novel use of deep learning architecture with one-dimensional convolutional neural networks (1D-CNN) to overcome the aforementioned shortcomings in current retinal oximetry model for directly quantifying SaO 2 and SpO 2 concentration concurrently in a reproducible, robust way. Deep learning architecture has been finding more and more applications in the biomedical field both in image recognition and spectral identification. For spectroscopy, deep learning architecture with 1D-CNN have incredible advantage over statistical regression techniques, owed mainly to their ability to learn and weigh the importance of different spectral regions automatically. Furthermore, they learn this weighting of spectral characteristics with no prior knowledge of the constituent's absorption spectra. This is extremely interesting for us to develop SpO 2 spectroscopy with multi-wavelength spectral absorption based on the deep learning architecture. Furthermore, Equation ( 2) is applicable to dual-wavelength measurement, which use the ratio of Q to further calculate the SpO 2 . For the calculation of multi-wavelength, there does not exist a general formula to give the SPO 2 from multiple inputs from the reading of spectral meter. Especially for more thorough analysis of multi-wavelength measurement, the number of inputs need not be fixed and several different ranges of spectral detection are required and analyzed in the work. Hence, a 1D-CNN is used in this study as a method to achieve the multi-wavelength measurement which isn't fixed on the number of wavelengths. In this study, the model input is transmission spectra, and the model output is ten SpO 2 values.

Medical data can generally be divided into two types: images and numerical values. At present, most of the applications in two-dimensional CNN are applied to identify image data. However, spectral data is part of the sequence value, and it is not directly related to two-dimensional CNN. Therefore, 1D-CNN will be a powerful tool for us to achieve our goals.

1D-CNN is becoming more and more widely used in medical treatment currently. For example, in 2020, Ramisİleri and other researchers detect EOG (Electrooculogram) signals around the eyeballs and use 1D-CNN to determine whether the subject has dyslexia. Finally, the accuracy of their experimental results was 73.6128 ± 2.8155% [10] . Besides, 1D-CNN is widely used to achieve detection and diagnosis of symptoms [11] [12] [13] [14] . It is obvious that 1D-CNN has high potential in judging numerical data.

Before we use 1D-CNN, our previous study has used deep neural networks (DNN) to train a predictive model to measure SpO 2 [15] . In the study, 12-channel spectra are used to train the DNN model constructed by three hidden layers with 200 neural, and finally a single neuron is used as the output. It is different from 1D-CNN prediction that output the probability of each category. 1D-CNN's operation mode can be regarded as a scalar inner product of many different filters and spectral data. Each filter is composed of patches with different weights. These filters that move on the data are also called kernel maps. It moves in a single fixed direction on the sequence data. In this way, the neural network will automatically extract the important features of each data as the identification basis. Equation (3) shows the mathematical expression of the 1D-CNN operation, where H k is the length of the convolution kernel, U k is the length of the sequence data, and k is the number of steps required to scan a single data. Besides, h k-i is the i th row of convolution kernel and u i is the i th row of sequence data. In addition, 1D-CNN has a fixed width of convolution kernel, it is unnecessary to specified the width separately. Therefore, the input shape of the 1D-CNN is a 3D tensor, which is batch size, steps, and input dimension in order.

For a successful neural network model, it is necessary to have appropriate and highquality training data. In 1999, Moritz Friebel and other researchers analyzed blood from 400 nm to 2500 nm [16] . They showed the absorption coefficient graph of Hb and HbO 2 in Figure 1 . From the figure, the absorption coefficients from 600 nm to 1100 nm are highly distinguishable. Therefore, this study chooses the long wavelength range of visible light to infrared light as the training data of neural network. row of sequence data. In addition, 1D-CNN has a fixed width of convolution kernel, it is unnecessary to specified the width separately. Therefore, the input shape of the 1D-CNN is a 3D tensor, which is batch size, steps, and input dimension in order.

For a successful neural network model, it is necessary to have appropriate and highquality training data. In 1999, Moritz Friebel and other researchers analyzed blood from 400 nm to 2500 nm [16] . They showed the absorption coefficient graph of Hb and HbO2 in Figure 1 . From the figure, the absorption coefficients from 600 nm to 1100 nm are highly distinguishable. Therefore, this study chooses the long wavelength range of visible light to infrared light as the training data of neural network. In this study, multi-wavelength sensor is chosen to meet the requirements of the ideal measurement wavelengths. Due to breakthroughs in spectral measurement technology, this sensor has a lower cost and a smaller size. In 2019, J.-S. Botero-Valencia and other researchers used these sensors, and a neural network to convert the multi-channel light intensity that was originally discrete in the spectra into a continuous curve [17] . It also shows that the SpO2 value calculated by the traditional dual-wavelength pulse oximeter is mainly susceptible to motion artifacts, background light, and low perfusion state errors. Spectral analysis has been identified as a good way to improve calculations. Several researches show that multi-wavelength improves most of the defect of traditional oximeter. We propose to use 12-wavelength spectral absorption measurement to improve the accuracy of SpO2 measurement, and build different datasets according to the spectral characteristics. Besides, dataset is added with noise randomly, increasing the diversity of data and improving the generalization of the neural network. The final result shows that a spectrometer constructed through a neural network has a resolution of up to 5 nm and a maximum error of less than 2%, which shows the necessity of multi-wavelength spectral measurement for SpO2.

In this study, the setup of measurement includes three parts, including light source, light sensor and the architecture of light sensing, and signal processing. In this study, multi-wavelength sensor is chosen to meet the requirements of the ideal measurement wavelengths. Due to breakthroughs in spectral measurement technology, this sensor has a lower cost and a smaller size. In 2019, J.-S. Botero-Valencia and other researchers used these sensors, and a neural network to convert the multi-channel light intensity that was originally discrete in the spectra into a continuous curve [17] . It also shows that the SpO 2 value calculated by the traditional dual-wavelength pulse oximeter is mainly susceptible to motion artifacts, background light, and low perfusion state errors. Spectral analysis has been identified as a good way to improve calculations. Several researches show that multi-wavelength improves most of the defect of traditional oximeter. We propose to use 12-wavelength spectral absorption measurement to improve the accuracy of SpO 2 measurement, and build different datasets according to the spectral characteristics. Besides, dataset is added with noise randomly, increasing the diversity of data and improving the generalization of the neural network. The final result shows that a spectrometer constructed through a neural network has a resolution of up to 5 nm and a maximum error of less than 2%, which shows the necessity of multi-wavelength spectral measurement for SpO 2 .

In this study, the setup of measurement includes three parts, including light source, light sensor and the architecture of light sensing, and signal processing.

Since the light transmission-type of SpO 2 measurement is used in this study, a measuring light source that contains both visible light and infrared light having a stable light intensity is indispensable. On the other hand, a xenon lamp is used with high luminous efficiency, long life, and the emission spectral range (400~1100 nm) completely covers the range required for research. In order to stabilize the light source and temperature, a 6.4 V constant voltage power supply is adapted to replace the original battery for UltraFire 9P xenon flashlight to maintain a better power stabilization.

The multi-wavelength measurement including visible light and infrared light ranges was defined for our measurement and 1D-CNN neural network architecture. Two sixchannel sensors, AS7262 and AS7263, are used, which cover visible light and infrared light ranges and conduct measurement of transmission light. AS7262 captures six wavelengths of 450 nm, 500 nm, 550 nm, 570 nm, 600 nm, and 650 nm, and its full width at half maximum (FWHM) is 40 nm. AS7263 captures the remaining six wavelengths, which are 610 nm, 680 nm, 730 nm, 760 nm, 810 nm, and 860 nm, and its full width at half maximum (FWHM) is 20 nm. Among these 12-channel, the NIR (near infrared region) area sensed by AS7263, will become the focus of our discussion.

Two embedded modules are used to acquire data from two sensors and read the light intensity value through I 2 C communication protocol. The data of these 12-channel are read by the two embedded modules and transmitted to the edge computing system for SpO 2 calculations.

Using a finger clip to fix the finger position can improve the stability of the measurement data during the measurement process. The 3D printed finger clip designed according to human's finger is used in this study, as shown in Figure 2 . The shield of ambient light is used on the end of the finger to ensure the sensor to receive the light transmitted by fingers. There is a rectangular window on the top for light projection, and a pupil at the bottom for penetrated light transmission.

Since the light transmission-type of SpO2 measurement is used in this study, a measuring light source that contains both visible light and infrared light having a stable light intensity is indispensable. On the other hand, a xenon lamp is used with high luminous efficiency, long life, and the emission spectral range (400~1100 nm) completely covers the range required for research. In order to stabilize the light source and temperature, a 6.4 V constant voltage power supply is adapted to replace the original battery for UltraFire 9P xenon flashlight to maintain a better power stabilization.

The multi-wavelength measurement including visible light and infrared light ranges was defined for our measurement and 1D-CNN neural network architecture. Two sixchannel sensors, AS7262 and AS7263, are used, which cover visible light and infrared light ranges and conduct measurement of transmission light. AS7262 captures six wavelengths of 450 nm, 500 nm, 550 nm, 570 nm, 600 nm, and 650 nm, and its full width at half maximum (FWHM) is 40 nm. AS7263 captures the remaining six wavelengths, which are 610 nm, 680 nm, 730 nm, 760 nm, 810 nm, and 860 nm, and its full width at half maximum (FWHM) is 20 nm. Among these 12-channel, the NIR (near infrared region) area sensed by AS7263, will become the focus of our discussion.

Two embedded modules are used to acquire data from two sensors and read the light intensity value through I 2 C communication protocol. The data of these 12-channel are read by the two embedded modules and transmitted to the edge computing system for SpO2 calculations.

Using a finger clip to fix the finger position can improve the stability of the measurement data during the measurement process. The 3D printed finger clip designed according to human's finger is used in this study, as shown in Figure 2 . The shield of ambient light is used on the end of the finger to ensure the sensor to receive the light transmitted by fingers. There is a rectangular window on the top for light projection, and a pupil at the bottom for penetrated light transmission. In order to make the equipment more stable during the measurement process, a 3D printed light sensing case beneath the finger clip device is designed and fabricated, as shown in Figure 3 . In order to make the equipment more stable during the measurement process, a 3D printed light sensing case beneath the finger clip device is designed and fabricated, as shown in Figure 3 . Raspberry Pi 4 Model B is selected as the edge computing module for result computing and historical record maintenance. The spectral signal from the embedded modules can be sorted out, and input into the pre-trained neural network model for measurement, and displayed on the screen. Figure 4 is the complete measurement configuration of the equipment. The light source emitted by the xenon lamp and focusing on the finger clip through the convex lens. When the subject puts the index finger of the left hand into the finger clip, the light source will project on the skin close to the nail, and the attenuated light penetrating the finger will pass through the beam splitter and project on the AS7262 and AS7263 light sensors. The measurement accuracy of this system is complicated and mainly relies on several major devices which limit the accuracy and resolution of measurement. Firstly, the two six-channel sensors, AS7262 and AS7263 are used with 16 bits A/D converter which gives ±1 LSB Max (±0.0015% of Full Scale) with no missing codes. The measurement covers the visible light and infrared light ranges with peak sensitivities at six wavelengths, each with 40 nm/ 20 nm full width at half maximum (FWHM) for AS7262 and AS7263, respectively. Table 1 shows some important features of the AS7262/AS7263 sensors. 

Raspberry Pi 4 Model B is selected as the edge computing module for result computing and historical record maintenance. The spectral signal from the embedded modules can be sorted out, and input into the pre-trained neural network model for measurement, and displayed on the screen. Figure 4 is the complete measurement configuration of the equipment. The light source emitted by the xenon lamp and focusing on the finger clip through the convex lens. When the subject puts the index finger of the left hand into the finger clip, the light source will project on the skin close to the nail, and the attenuated light penetrating the finger will pass through the beam splitter and project on the AS7262 and AS7263 light sensors. The measurement accuracy of this system is complicated and mainly relies on several major devices which limit the accuracy and resolution of measurement. Firstly, the two sixchannel sensors, AS7262 and AS7263 are used with 16 bits A/D converter which gives ±1 LSB Max (±0.0015% of Full Scale) with no missing codes. The measurement covers the visible light and infrared light ranges with peak sensitivities at six wavelengths, each with 40 nm/ 20 nm full width at half maximum (FWHM) for AS7262 and AS7263, respectively. Table 1 shows some important features of the AS7262/AS7263 sensors. The illuminated intensity on the fingers from Xenon light source is calibrated and normalized to eliminate the temperature drift of intensity. The SpO2 data are acquired by a standard meter, Rossmax SB100 with 1% resolution which is used to calibrate and label the output of training dataset from our proposed model. The accuracy of Rossmax SB100 is within ±2% over the range of SpO2, 70~99%. Moreover, the signal obtained by the sensor is transmitted to the embedded modules through the DuPont cable, and the reading time interval of the light sensor is 0.7 s due to the 0.6 s for integration time of light sensors and 0.1 s for the computation of edge computing module. Finally, the data is transmitted to the neural network model, which was trained in Raspberry Pi to make continuous measurement of SpO2. The The illuminated intensity on the fingers from Xenon light source is calibrated and normalized to eliminate the temperature drift of intensity. The SpO 2 data are acquired by a standard meter, Rossmax SB100 with 1% resolution which is used to calibrate and label the output of training dataset from our proposed model. The accuracy of Rossmax SB100 is within ±2% over the range of SpO 2 , 70~99%. Moreover, the signal obtained by the sensor is transmitted to the embedded modules through the DuPont cable, and the reading time interval of the light sensor is 0.7 s due to the 0.6 s for integration time of light sensors and 0.1 s for the computation of edge computing module. Finally, the data is transmitted to the neural network model, which was trained in Raspberry Pi to make continuous measurement of SpO 2 . The results will be displayed on the screen with the designed human-machine interface and recorded in chronological order at the same time automatically.

The experiment flow chart is shown in Figure 5 . The experimental process is divided into three stages. There are data establishment, 1D-CNN model configuration and training, and equipment execution and verification. The focus of this study is the second stage. Besides, there are three issues in this experiment that need to be analysis, training with different spectral channels, random noise addition, and hyperparameters optimization, respectively. 

The experiment flow chart is shown in Figure 5 . The experimental process is divided into three stages. There are data establishment, 1D-CNN model configuration and training, and equipment execution and verification. The focus of this study is the second stage. Besides, there are three issues in this experiment that need to be analysis, training with different spectral channels, random noise addition, and hyperparameters optimization, respectively. 

When the experiment and training of neural networks are performed, two issues need to be considered, namely over-fitting and under-fitting. It is a challenge that trained model can complete the SpO 2 measurement data under all conditions, especially when the SpO 2 measurement is performed in the actual subjects, the actual experimental dataset is not easy to get acquisition with all conditions thoroughly. When a neural network is trained with a small dataset, this will cause the network remember the training dataset instead of learning the general characteristics of our experimental SpO 2 measurement data. For this reason, the model performs well on the training dataset but does not perform well on the test dataset. When a small dataset provides a bad description of our problem, it may lead to a problem that is difficult to learn. Obtaining more data is a very expensive and difficult task when SpO 2 is below 95%. At this point, two techniques including noise addition and normalization are adopted to obtain better model performance. In this research, the noise on neural networks will be applied and be analyzed in detail. This technique not only reduces overfitting, but also optimizes our model faster and the overall performance is improved noticeably.

When we collect neural network training data, traditional measurements require blood tests, and a large amount of continuous data cannot be obtained. Although spectral analysis has been identified as a good way beyond the dual-wavelength pulse oximeters, in order to achieve calibration, the SpO 2 data are acquired by a standard meter, Rossmax SB100 under a severe and fixed conditions to avoid the drawbacks of dual-wavelength pulse oximeters.

In order to obtain reliable and valid data, the SpO 2 of right hand is measured by a standard meter, Rossmax SB100, and the SpO 2 of left hand is measured by the constructed measuring device to simulate hypoxia by holding breath to obtain the spectra and SpO 2 at the same time. 215 sets of measured light intensity of each wavelength and the corresponding time is recorded, which is helpful for labeling the SpO 2 of standard meter. Finally, the multi-wavelength spectra of SpO 2 from 81% to 99% is completed.

High-quality data must have completeness and reliability. Therefore, the multiwavelength spectral data collected by the measuring device is normalized between 0 and 1 by Equation (4), which is denoted as I normalized , as shown in Figure 6 . Furthermore, I max is the maximum of the spectral data, and I min is the minimum of the spectral data. Normalized data have two major benefits. First, they can effectively reduce the number of iterations required by the gradient descent method. Secondly, they makes the data in different dimensions comparable without concerning the accuracy influence of a large number in certain dimensions. In the data configuration stage, the spectra of each SpO2 concentration is expanded to 1000 rows through the random noise to match the input dimension of the 1D-CNN. Each dataset will have 1000 rows of SpO2 spectral data, satisfying the input dimension of the neural network. Among the measurement in Figure 6 , it is worth to noticed that the normalized spectra curve at the spectral region (450 nm, 500 nm, 550 nm, 570 nm, 600 nm, and 610 nm) denoted as first region is difficult to be distinguished since the spectral absorbance of oxyhemoglobin is relatively indistinguishable in this spectral range [13] . On the other hand, the normalized spectral curves at the second region (650 nm, 680 nm, 730 The data with the least noise interference in each concentration is selected as the effective data, which is denoted as I eff . The SpO 2 dataset is created in an interval of two with SpO 2 99%, 97%, 95%, 93%, 91%, 89%, 87%, 85%, 83%, and 81% separately to maintain excellent resolution for neural network measurement, as shown in Figure 6 , and put it into the written program for preprocessing. Random noise with five conditions as 0%, 1%, 2%, 5%, and 10% is applied uniformly to expand the number of data and increase the generalization ability of the model. Hence, the neural network can be more accurate when facing different subjects.

In the data configuration stage, the spectra of each SpO 2 concentration is expanded to 1000 rows through the random noise to match the input dimension of the 1D-CNN. Each dataset will have 1000 rows of SpO 2 spectral data, satisfying the input dimension of the neural network. Among the measurement in Figure 6 , it is worth to noticed that the normalized spectra curve at the spectral region (450 nm, 500 nm, 550 nm, 570 nm, 600 nm, and 610 nm) denoted as first region is difficult to be distinguished since the spectral absorbance of oxyhemoglobin is relatively indistinguishable in this spectral range [13] . On the other hand, the normalized spectral curves at the second region (650 nm, 680 nm, 730 nm, 760 nm, 810 nm, and 860 nm) are easier to be distinguished since the spectral absorbance of oxyhemoglobin is relatively indistinguishable in this spectral range [13] . Hence, three datasets for training and three datasets for verification are built, which are constructed over the spectra of the first region, second region, and full region, respectively.

In this study, the model configures with two convolutional layers and a pooling layer twice, followed by hidden layer and an output layer, shown in Figure 7 . The convolutional layers and the hidden layers both use Relu as the activation function due to its high performance on convergence [18, 19] . The output layer uses Softmax and cooperates with categorical cross-entropy as the loss function due to the excellent performance of Softmax when solving multi-category problem [20, 21] . 

After completing the neural network training, the SpO2 measurement needs to be implemented by putting the model on our configured hardware device. Besides, there are three tasks for us to complete the measurement equipment. At the beginning, designing program and human-machine interface and insert into the edge computing module with the trained model is the most primary task in this stage. Secondly, although a light shield has applied on the sensor end, it cannot completely prevent the ambience light. Therefore, it is significantly to filter the noise out. Lastly, since the input shape required by 1D-CNN is sequential data, each input data must be expanded into the length set by the neural network model.

To investigate the effectiveness of spectral regions for deep learning for SpO2 measurement, three datasets for training and verification are built, which are constructed over the spectra of the first region, second region, and full region and their sub-regions, respectively. For each region under the procedures of optimization of our model, a thorough of investigation of hyperparameters is proceeded firstly. Therefore, the estimated sensitive hyperparameters are analyzed by GridSearchCV which will not only search for the best In terms of output, since the original output result of the CNN is the weight of each category, it is different from our ideal output. Therefore, the final result is the sum of multiplying the SpO 2 labels and their weights of each category, shown in Equation (5), where L i is the i th SpO 2 label and W i is the i th corresponding weight.

After completing the neural network training, the SpO 2 measurement needs to be implemented by putting the model on our configured hardware device. Besides, there are three tasks for us to complete the measurement equipment. At the beginning, designing program and human-machine interface and insert into the edge computing module with the trained model is the most primary task in this stage. Secondly, although a light shield has applied on the sensor end, it cannot completely prevent the ambience light. Therefore, it is significantly to filter the noise out. Lastly, since the input shape required by 1D-CNN is sequential data, each input data must be expanded into the length set by the neural network model.

To investigate the effectiveness of spectral regions for deep learning for SpO 2 measurement, three datasets for training and verification are built, which are constructed over the spectra of the first region, second region, and full region and their sub-regions, respectively. For each region under the procedures of optimization of our model, a thorough of investigation of hyperparameters is proceeded firstly. Therefore, the estimated sensitive hyperparameters are analyzed by GridSearchCV which will not only search for the best parameters of the model, but also automatically cross-validate all training sets and retrain the model [22] . The first parameter analyzed is the dropout ratio which shows a sensitive affection of accuracy in our model. The appropriate dropout ratio can effectively prevent neural network from overfitting. Figure 8a shows the impact of different dropout ratios. On the accuracy of the neural network and the remaining hyperparameters are fixed and chosen during the grid searching of dropout parameter. After the searching, the neural network has the highest accuracy when dropout ratio is 0.35 which is marked with a shaded circle. The second relatively sensitive parameter is learning rate. As shown in Figure 8b , the results show it has the highest accuracy rate when the learning rate is 9 × 10 −4 , marked with a shaded circle. In this searching of parameters, if the learning rate is too high, the loss value will oscillate after converging to a certain value, and it will not converge to the global best solution. On the contrary, if the learning rate is too low, the convergence speed will be slowed, and a high-precision model may not be obtained due to trapped in the local optimum. 

In order to improve model performance and obtain high accuracy, it is necessary to optimize hyperparameters. GridSearchCV and Bayesian optimization (BO) are used to adjust the hyperparameters. GridSearchCV will search all of the parameters being assigned completely. BO will refer to the past results and constantly updates the probability model to focus the hyperparameters that may be the best solution, which will effectively increase the rate of searching [23] . Before deciding on the final searching scope, GridSearchCV and BO search coarsely, and a smaller searching range is arranged according the previous optimization results at the end. The searching conditions of hyperparameters with GridSearchCV and BO are shown in Tables 2 and 3, respectively. 

In order to improve model performance and obtain high accuracy, it is necessary to optimize hyperparameters. GridSearchCV and Bayesian optimization (BO) are used to adjust the hyperparameters. GridSearchCV will search all of the parameters being assigned completely. BO will refer to the past results and constantly updates the probability model to focus the hyperparameters that may be the best solution, which will effectively increase the rate of searching [23] . Before deciding on the final searching scope, GridSearchCV and BO search coarsely, and a smaller searching range is arranged according the previous optimization results at the end. The searching conditions of hyperparameters with GridSearchCV and BO are shown in Tables 2 and 3, respectively. The automatic adjustment of hyperparameters can effectively acquire the optimal parameters of CNN and also improve the time cost of manual adjustment. At the beginning of optimization procedures, we compare the accuracy of the model, where hyperparameters generated by GridSearchCV and BO are shown at Tables 4 and 5. The tables show that although most the model obtained by GridSearchCV has good accuracy, the performance of model using BO shows better results, which reveals that the model of this study can be optimized more effectively using BO. 

It is important to investigate the issues of validity of 1D-CNN Model to the spectral measurement and the accuracy and validation of spectral regions for different models from datasets constructed from different spectra on neural network training will be analyzed and discussed. In Figure 6 we have carried out three datasets, which are constructed over the spectra of the first region, second region, and full region, respectively. Therefore, the model accuracy trained with different datasets and noise addition with GridsearchCV and Bayesian Optimization are shown in Figure 9 . denoted as S14 including three wavelengths of 450 nm, 570 nm, and 610 nm over the first region. Furthermore, the second region spectra are also divided as follows: the first subregion denoted as S21 including three shorter wavelengths of 650 nm, 680 nm, and 730 nm; the second sub-region denoted as S22 including three middle wavelengths of 730 nm, 760 nm, and 810 nm; the third sub-region is denoted as S23 including three longer wavelengths of 760 nm, 810 nm, and 860 nm; and the last sub-region is denoted as S24 including three wavelengths of 650 nm, 760 nm, and 860 nm over the second region. Figures 10 and  11 shows the accuracy of the models trained by the above training sets of four sub-regions in each region under different noise ratios. It shows that the neural network trained by S11 and S23 which includes three longer wavelengths of 450 nm, 500 nm, 550 nm, and 760 nm, 810 nm, 860 nm performs the higher accuracy. It indicates that training dataset with more distinctive results can effectively improve the performance of the model. However, compared to the model trained by the six-channel of the second region, the model trained by S23 shows lower accuracy and it shows that the training data set still needs a sufficient number of wavelengths and high-quality data to effectively improve the performance of the model. Although the spectral measurement with more features can improve the resolution ability of the neural network, the results reveal that the training with the dataset of the shorter six-channel of the first region is redundant. This analysis also shows that it is very important to construct an effective 1D-CNN model area for spectral measurement using the appropriate spectral region and number of wavelengths. This analysis shows that it is very important to construct an effective 1D-CNN model area for spectral measurement using the appropriate spectral region and number of wavelengths. The results show that the accuracy of model trained by the first region dataset with GridsearchCV and Bayesian optimization is lower than the other region. It may be caused by the intervals between each normalized spectra curve at the spectral region are closer than the curves of the other regions. In the full region dataset, although it has the most complete data to obtain features, it is still affected by data of the first region, and it is difficult to achieve the best accuracy. In addition, since the data set constructed by the second region is highly distinguishable, the trained model shows excellent accuracy in every noise ratio.

Especially for more detailed analysis, we further explore the division of the spectral channel of the first and second region into four different sub-regions, respectively, in order to further analyze whether the neural network can obtain higher-precision measurements only through a few specific channels. Four sub-regions of first region spectra are as follows: the first sub-region denoted as S11 including three shorter wavelengths of 450 nm, 500 nm, and 550 nm; the second sub-region denoted as S12 including three middle wavelengths of 500 nm, 550 nm, and 570 nm; the third sub-region is denoted as S13 including three longer wavelengths of 570 nm, 600 nm, and 610 nm; and the last sub-region denoted as S14 including three wavelengths of 450 nm, 570 nm, and 610 nm over the first region. Furthermore, the second region spectra are also divided as follows: the first sub-region denoted as S21 including three shorter wavelengths of 650 nm, 680 nm, and 730 nm; the second sub-region denoted as S22 including three middle wavelengths of 730 nm, 760 nm, and 810 nm; the third sub-region is denoted as S23 including three longer wavelengths of 760 nm, 810 nm, and 860 nm; and the last sub-region is denoted as S24 including three wavelengths of 650 nm, 760 nm, and 860 nm over the second region. Figures 10 and 11 shows the accuracy of the models trained by the above training sets of four sub-regions in each region under different noise ratios. It shows that the neural network trained by S11 and S23 which includes three longer wavelengths of 450 nm, 500 nm, 550 nm, and 760 nm, 810 nm, 860 nm performs the higher accuracy. It indicates that training dataset with more distinctive results can effectively improve the performance of the model. However, compared to the model trained by the six-channel of the second region, the model trained by S23 shows lower accuracy and it shows that the training data set still needs a sufficient number of wavelengths and high-quality data to effectively improve the performance of the model. important to construct an effective 1D-CNN model area for spectral measurement using the appropriate spectral region and number of wavelengths. This analysis shows that it is very important to construct an effective 1D-CNN model area for spectral measurement using the appropriate spectral region and number of wavelengths. 

In this section, the models trained for each condition of noise will be analyzed. Figure  9 shows that the model trained by noise-free addition has lower accuracy than the rest of the noise addition group. Therefore, by adding noise to the training dataset, the immunity of the neural network to noise, that is, the generalization ability of model will be greatly improved. In Figure 9 , when the noise addition ratio increase, model accuracy is promoted. However, the accuracy declines slightly after 2%. Therefore, there are at least 2% of random noise is added to effectively improve the accuracy of the neural network. From Figure 9 , the 2% noise addition will produce the greatest benefit to our optimized model. In summary, investigation of characteristics of the training data with proper noise will help to improve the anti-disturbances ability of neural network. Although the spectral measurement with more features can improve the resolution ability of the neural network, the results reveal that the training with the dataset of the shorter six-channel of the first region is redundant. This analysis also shows that it is very important to construct an effective 1D-CNN model area for spectral measurement using the appropriate spectral region and number of wavelengths. This analysis shows that it is very important to construct an effective 1D-CNN model area for spectral measurement using the appropriate spectral region and number of wavelengths.

In this section, the models trained for each condition of noise will be analyzed. Figure 9 shows that the model trained by noise-free addition has lower accuracy than the rest of the noise addition group. Therefore, by adding noise to the training dataset, the immunity of the neural network to noise, that is, the generalization ability of model will be greatly improved. In Figure 9 , when the noise addition ratio increase, model accuracy is promoted. However, the accuracy declines slightly after 2%. Therefore, there are at least 2% of random noise is added to effectively improve the accuracy of the neural network. From Figure 9 , the 2% noise addition will produce the greatest benefit to our optimized model. In summary, investigation of characteristics of the training data with proper noise will help to improve the anti-disturbances ability of neural network.

According to the analysis in Sections 4.1 and 4.2, this study chose the model trained by the 2nd region dataset with 2% noise addition. Moreover, Table 6 shows the most accuracy model hyperarameters obtained by BO, and Table 7 shows a more in-depth discussion and evaluation on the measurement ability of the model in SpO 2 from 99% to 90%, which including the maximum error, average error, and standard deviation, so that the capability of measurement in each concentration can be realized clearly. According to the analysis in Table 7 , the maximum error of all concentrations is less than 2%, which is smaller than the maximum of traditional measurement deviation, proved that this new approach has higher degree of accuracy. Besides, the standard calibration is constructed by fixed condition of each SpO 2 to fit the trend line of this CNN model and obtain the equation and R 2 value. Figure 12 shows the calibration of predicted SpO 2 for our proposed model according to the reading of SB100, which needs 8 s to reach stability, since the resolution of standard meter, Rossmax SB100, is 1% of the resolution of regular specification (which is far below the resolution of our proposed model as 0.1%). At the same time, another influencing factor worth considering is the measurement synchronization between the systems during the calibration measurement. Due to the time to achieve stability of output value in the measurement of different systems, and the physiologically stable distribution of SpO 2 , there will be also some influences which causes the less deviation between calibration and measurement. The linear regression analysis of proposed model that the maximum error of the model is smaller than the value of Table 7 , and the slope of the fitting curve with linear form is close to 1. Therefore, it shows that the predicted result of the model is highly correlated with the measured value of SB100, and further strengthens the feasibility of this new approach. Furthermore, the R 2 value is as high as 0.97, which shows that the predicted result of the model is highly correlated with the measured value of SB100, and further strengthens the feasibility of this new approach. According to the analysis in Table 7 , the maximum error of all concentrations is less than 2%, which is smaller than the maximum of traditional measurement deviation, proved that this new approach has higher degree of accuracy. Besides, the standard calibration is constructed by fixed condition of each SpO2 to fit the trend line of this CNN model and obtain the equation and R 2 value. Figure 12 shows the calibration of predicted SpO2 for our proposed model according to the reading of SB100, which needs 8 s to reach stability, since the resolution of standard meter, Rossmax SB100, is 1% of the resolution of regular specification (which is far below the resolution of our proposed model as 0.1%). At the same time, another influencing factor worth considering is the measurement synchronization between the systems during the calibration measurement. Due to the time to achieve stability of output value in the measurement of different systems, and the physiologically stable distribution of SpO2, there will be also some influences which causes the less deviation between calibration and measurement. The linear regression analysis of proposed model that the maximum error of the model is smaller than the value of Table 7 , and the slope of the fitting curve with linear form is close to 1. Therefore, it shows that the predicted result of the model is highly correlated with the measured value of SB100, and further strengthens the feasibility of this new approach. Furthermore, the R 2 value is as high as 0.97, which shows that the predicted result of the model is highly correlated with the measured value of SB100, and further strengthens the feasibility of this new approach. Moreover, in our previous study used DNN to measure SpO 2 , the total relative error is 0.76%, while the total relative error measuring by 1D-CNN model is 0.46%, which indicates that the accuracy of result has been highly improved. Besides, the results measuring by the 1D-CNN model have a lower standard deviation, which means that the overall measurement has a higher degree of stability.

To verify the validation of model and application of measurement system, a timevarying measurement of SpO 2 is developed. In the following, the moving average filter will be applied to remove noise from the sensor reading. During the dynamic response measurement, the output of SpO 2 concentration of spectral measurement based on our 1D-CNN model with optimization is compared with the reading of standard meter of SB100 at the same time. The SpO 2 under test varies gradually from 98 to 82 for 135 samples within 94.5 s and the sampling time is 0.7 s for each reading. Figure 13 shows three curves including the readings from the original prediction, SB100 and the signal after two filters. The green line represents the original data, and the orange line represents the signal after two filters: the median filter and the moving average filter. The signal of our proposed spectral measurement based on 1D-CNN Model is much more smooth and varies within a reasonable range compared to the reading of SB100, and even shows better signal quality than the readings of SB100.

To verify the validation of model and application of measurement system, a timevarying measurement of SpO2 is developed. In the following, the moving average filter will be applied to remove noise from the sensor reading. During the dynamic response measurement, the output of SpO2 concentration of spectral measurement based on our 1D-CNN model with optimization is compared with the reading of standard meter of SB100 at the same time. The SpO2 under test varies gradually from 98 to 82 for 135 samples within 94.5 s and the sampling time is 0.7 s for each reading. Figure 13 shows three curves including the readings from the original prediction, SB100 and the signal after two filters. The green line represents the original data, and the orange line represents the signal after two filters: the median filter and the moving average filter. The signal of our proposed spectral measurement based on 1D-CNN Model is much more smooth and varies within a reasonable range compared to the reading of SB100, and even shows better signal quality than the readings of SB100. 

In this research, a technique for SpO2 of spectral measurement based on a 1D-CNN model is proposed and verified. In order to investigate the multiple spectral regions used for the deep learning of SpO2 measurement, we observed several spectral regions with large signal responses in SpO2 concentration and their boundary wavelengths to construct different spectral regions to analyze the validity of the proposed model. We found in the measurement in Figure 6 that the normalized spectral curve of the spectral region (450 nm, 500 nm, 550 nm, 570 nm, 600 nm, 610 nm) represented as the first region shows less response [13] . On the other hand, the normalized spectral curve of the second region (650 nm, 680 nm, 730 nm, 760 nm, 810 nm, 860 nm) is easier to distinguish since the spectral absorbance is relatively higher [13] .

Three datasets for training and verification are built based on our analysis and it will also give the limitation of proposed model which can be possible improved by the hybrid Figure 13 . Original spectral data and measurement results after moving average filtering.

In this research, a technique for SpO 2 of spectral measurement based on a 1D-CNN model is proposed and verified. In order to investigate the multiple spectral regions used for the deep learning of SpO 2 measurement, we observed several spectral regions with large signal responses in SpO 2 concentration and their boundary wavelengths to construct different spectral regions to analyze the validity of the proposed model. We found in the measurement in Figure 6 that the normalized spectral curve of the spectral region (450 nm, 500 nm, 550 nm, 570 nm, 600 nm, 610 nm) represented as the first region shows less response [13] . On the other hand, the normalized spectral curve of the second region (650 nm, 680 nm, 730 nm, 760 nm, 810 nm, 860 nm) is easier to distinguish since the spectral absorbance is relatively higher [13] .

Three datasets for training and verification are built based on our analysis and it will also give the limitation of proposed model which can be possible improved by the hybrid deep neural network (HDNN) model with dropout of redundant spectral ranges. Our analysis shows it is extremely important to adopt an appropriate spectral region for building an effective 1D-CNN model region for the spectral measurement. The dataset is added with noise randomly, increasing the diversity of data and improving the generalization of the neural network. After that, the established dataset is delivered input to a 1D-CNN to obtain a measurement model of SpO 2 . Two optimization methods including GridSearchCV and Bayesian Optimization are applied to optimize the hyperparameters. The optimal accuracies of proposed model after optimization by GridSearchCV and Bayesian Optimization is 89.3% and 99.4%, respectively, trained with 2% random noise and the dataset at the spectral region of six wavelengths including 650 nm, 680 nm, 730 nm, 760 nm, 810 nm, and 860 nm. The total relative error of the best model is only 0.46%, optimized by Bayesian optimization. It shows that our proposed 1D-CNN model gives a new and feasible approach to measure SpO 2 based on multi-wavelength. Moreover, we show that the multi-wavelength spectral measurement with deep learning architecture of 1D-CNN has low error in the elimination of the effect of tissue and has improved the accuracy of SpO 2, which can be widely applied to the other optical or radiometric spectroscopic measurement with complicated algorithms.

Design and development of pulse oximeter

Determination of SpO2 by Spectral Analysis of Data from a Low Cost Pulse Oximeter

Multiwavelength pulse oximetry: Theory for the future

Calibration-free measurement of the oxygen saturation in human retinal vessels. Ophthalmic Technol

Pulse oximetry theory and calibration for low saturations

Low cost calibration free Pulse oximeter

A novel method of measurement of oxygen saturation in arterial blood

A novel calibration-free method of measurement of oxygen saturation in arterial blood

New method to diagnosis of dyslexia using 1D-CNN

Epileptic seizure detection using EEG signals based on 1D-CNN Approach

Detection of obstructive sleep apnoea using features extracted from segmented time-series ECG signals using a one dimensional convolutional neural network

ECG signal analysis for patient with metabolic syndrome based on 1D-convolution neural network

Wearable devices acquired ECG signals detection method using 1D convolutional neural network

Analysis of SpO2 concentration measurement based on multi-wavelength with CNN and DNN model

Optical properties of circulating human blood in the wavelength range 400-2500 nm

Multi-channel low-cost light spectrum measurement using a multilayer perceptron

Learning and recovery in the ReLU model

Deep sparse rectifier neural networks

Softmax discriminant classifier

Biomedical Engineering Towards the Year 2000 and Beyond (Cat

Hybrid software obsolescence evaluation model based on PCA-SVM-GridSearchCV

Bayesian optimization for accelerating hyper-parameter tuning

The authors declare no conflict of interest.