key: cord-0717056-18k4po39 authors: Wu, KeQing; Duan, ShengBao; Wang, YuJue; Wang, HongMei; Gao, Xin title: Convolutional neural network-based automatic classification for incomplete antibody reaction intensity in solid phase anti-human globulin test image date: 2022-03-07 journal: Med Biol Eng Comput DOI: 10.1007/s11517-022-02523-1 sha: 5271e802e7ec169a149d1ed8c8634e7897c1d47a doc_id: 717056 cord_uid: 18k4po39

The precise classification of incomplete antibody reaction intensity (IARI) in the hydrogel chromatography medium high density medium solid-phase Coombs test is essential for haemolytic disease screening. However, accurate classification of IARI requires an automatic and contactless method. Here, we present a deep ensemble learning model that integrates five different convolutional neural networks into a single model for IARI classification. A dataset of 1628 IARI images with corresponding IARI category labels ((-), (1+), (2+), (3+), and (4+)) was used. We trained our model using 1302 IARIs and validated its performance using 326 IARIs. The proposed model achieved accuracies of 100%, 99.4%, 99.4%, 100%, and 100% in the (-), (1+), (2+), (3+), and (4+) categories, respectively. The results were compared with those of manual classification by immunologists (average accuracy: 99.8% vs. 88.3%, p < 0.01). With model assistance, all three immunologists achieved increased accuracy (average accuracy: +6.1%), with the average accuracy of junior immunologists increasing by up to 11.3%. The time required for model classification was 0.094 s·image⁻¹, whereas manual classification required 5.528 s·image⁻¹. The proposed model can thus substantially improve the accuracy and efficiency of IARI classification and facilitate the automation of haemolytic disease screening equipment. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1007/s11517-022-02523-1.
Contact: HongMei Wang, wanghm@sibet.ac.cn; Xin Gao, xingaosam@163.com

Acute blood loss, anaemia, and coagulopathy are treated using blood transfusion [1, 2]. Improper or incompatible blood transfusion increases the risk of haemolytic diseases (e.g., haemolytic disease in newborns, autoimmune haemolytic disease, drug immune haemolytic diseases), renal failure, and even death [3] [4] [5]. The incomplete antibody reaction intensity (IARI) is the main factor responsible for incompatible transfusion. IARI is divided into five categories ((-), (1+), (2+), (3+), (4+)), with higher intensity categories causing more serious incompatible transfusion [6]. Therefore, IARI multi-classification tests are essential before blood transfusion [7] [8] [9]. Currently, the haemolytic IARI multi-classification test mainly uses the micro-column gel immune-assay Coombs test (MGIA-Coombs test), as it has high sensitivity and strong interpretability [10]. However, fibrin in the plasma can erroneously trap red blood cells (RBCs) at the top of the gel column, leading to a high false-positive rate with the MGIA-Coombs test [11] [12] [13] [14]. The solid-phase red cell adherence Coombs test (SPRCA Coombs test) was then proposed; it pre-coats anti-human globulin (AHG) on U-bottom microwells to prevent RBCs from binding to fibrin, which reduces false-positive results [15]. However, the SPRCA Coombs test requires a tedious and error-prone washing process for RBC suspensions to separate sensitised RBCs from free fibrin, which in turn causes false-negative results [16] [17] [18] [19] [20]. Recently, the hydrogel chromatography medium high density medium solid-phase Coombs test (HCM-HDMS Coombs test) was proposed. It uses hydrogel chromatography medium (HCM) as the separation solution in the reaction-and-separation chamber to separate sensitised RBCs from free fibrin, thus eliminating the washing process and effectively reducing false-negative results [21].
However, this chamber obstructs the view when incompatible IARI is observed in the HCM-HDMS Coombs test, which hampers accurate visual classification, whereas moving the chamber away results in reagent contamination and leakage. Further, observation by immunologists is subjective and varies between observers [22]. Therefore, contactless, automatic, and intelligent multi-classification methods are needed to enhance the practical value of the HCM-HDMS Coombs test.

Deep learning has achieved remarkable success in medical image classification [23, 24]. In particular, convolutional neural networks (CNNs) can automatically and effectively extract high-level semantic features of images, reducing the need for handcrafted feature engineering. Recently, Liu et al. applied a CNN model to the multi-classification of COVID-19 pneumonia, other common pneumonia, and normal controls using CT images and achieved an accuracy of 92.49% [25]. In another study, CNN models were used for the automated multi-classification of cells in the epithelial tissue of oral squamous cell carcinoma, with an accuracy of 97.5% [26]. Tessema et al. demonstrated the potential of integrating a deep learning-based automatic model into the quantitative multi-classification of blood cells, with an average accuracy of 80.6% [27]. Thus, we hypothesised that CNN methods have the potential to achieve automatic and intelligent IARI multi-classification, which would help immunologists and clinicians improve clinical efficiency and accuracy. However, among the five IARI categories, the number of poorly positive samples in (1+), (2+), and (3+) is much smaller than that in (-) and (4+). This imbalanced distribution of IARI categories shifts the decision boundary of CNNs during training.
Further, the distinguishing characteristics between adjacent categories, such as (-) and (1+); (1+), (2+), and (3+); and (3+) and (4+), are not particularly obvious, which seriously affects the ability of CNNs to learn and identify them automatically. Moreover, bubbles, particulates, and other artefacts in the reaction mixture of the HCM-HDMS Coombs test hinder the classification task. Therefore, the existing CNN models described above cannot be used directly to solve IARI multi-classification effectively.

In this study, we aimed to establish a novel deep learning model for the automatic classification of IARI that is robust to the imbalanced distribution of IARI categories and to the interference of artefacts in the HCM-HDMS Coombs test. An ensemble learning framework is used to reduce the influence of the imbalanced sample distribution and obtain accurate classification results. A convolutional block attention module (CBAM) is used to avoid the interference of artefacts by combining pixel-level channel interaction relationships and spatial location information.

In total, 1725 blood samples were collected from the Suzhou Blood Centre and the First Affiliated Hospital of Soochow University, China; these were kept at 4 °C and used within 1 week. The IARI of each blood sample was obtained using the HCM-HDMS Coombs test, and 97 samples (5.62%) whose IARI category could not be obtained accurately were excluded. A total of 1628 IARI samples (94.38%) were selected, and the number in each IARI category was as follows: 650 (-), 230 (1+), 68 (2+), 130 (3+), and 550 (4+). IARI images were captured from U-microplate bottoms with a digital camera in a closed image acquisition space with a stable light field. Each image had a size of 229 × 230 pixels. Images of samples from the five IARI categories are shown in Fig. 1.
To ensure that all IARI samples were correctly classified, the labels were determined by three professional immunologists. If the labels differed, the suspected samples were re-tested using the MGIA-Coombs test to obtain the correct category. In total, 1302 IARIs (80%) were used as a training dataset to develop the deep learning model, and another 326 IARIs (20%) were used as a testing dataset for model evaluation [28]. Table 1 lists the number of labelled samples for each category in the datasets.

Because we used a deep neural network-based model for classification, the training dataset alone was not sufficient to give the network invariance and robustness. Data augmentation is a common procedure for generating sufficient training data for CNN-based models, so we used the data augmentation package from Torchvision-Transform (https://pytorch-cn.readthedocs.io/zh/latest/torchvision/torchvision-transform/) and augmented the training dataset by image cropping, horizontal and vertical flipping, rotation at four fixed angles (0°, 90°, 180°, and 270°), and zooming.

In this study, we proposed an end-to-end deep learning model based on CNNs to classify the IARI, as shown in Fig. 2a, via two main stages. First, five sub-models were built, each predicting the IARI category. Second, the sub-models were combined in parallel into an ensemble model that obtains the final category using a collective decision mechanism. The details are presented below.

In the first stage, to address the problem that a single model cannot fully capture the detailed features distinguishing adjacent IARI categories, five different CNN-based frameworks, namely the Alex Deep Convolutional Neural Network (AlexNet), Visual Geometry Group (VGG) Network, Residual Network (ResNet), Inception Network, and Dense Convolutional Network (DenseNet), were adapted to classify the IARIs with improved classification performance.
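The flip and fixed-angle rotation steps of the augmentation described above can be illustrated without any library. The sketch below is not the Torchvision pipeline used in the study; it is a dependency-free illustration on a toy image stored as nested lists, and the helper names (`hflip`, `vflip`, `rot90`, `augment`) are hypothetical. Cropping and zooming are omitted.

```python
from typing import List

Image = List[List[int]]  # toy grayscale image: rows of pixel values


def hflip(img: Image) -> Image:
    """Mirror the image left-right."""
    return [row[::-1] for row in img]


def vflip(img: Image) -> Image:
    """Mirror the image top-bottom."""
    return [row[:] for row in img[::-1]]


def rot90(img: Image, k: int = 1) -> Image:
    """Rotate the image clockwise by k * 90 degrees."""
    out = [row[:] for row in img]
    for _ in range(k % 4):
        # Reversing the rows and transposing is a clockwise quarter turn.
        out = [list(col) for col in zip(*out[::-1])]
    return out


def augment(img: Image) -> List[Image]:
    """Return the distinct variants produced by flips and 0/90/180/270 rotations."""
    variants: List[Image] = []
    for k in range(4):
        rotated = rot90(img, k)
        for candidate in (rotated, hflip(rotated), vflip(rotated)):
            if candidate not in variants:  # drop duplicates
                variants.append(candidate)
    return variants


toy = [[1, 2], [3, 4]]
variants = augment(toy)
```

For an image without symmetry, these operations yield eight distinct variants, because a vertical flip equals a 180° rotation followed by a horizontal flip.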
AlexNet transforms the linear mapping between features into a nonlinear relationship to simulate any polynomial [29]. The VGG Network reduces the computation of each convolution layer and captures more abundant features by stacking small convolution kernels [30]. ResNet adds residual blocks, which ease the training of deeper networks [31]. The Inception Network balances network depth and width and reasonably reduces the dimensions [32]. DenseNet uses feature information more efficiently through dense connections and reduces gradient vanishing [33].

Further, the many bubbles, particulates, and other artefacts in IARIs lead to useless extracted features and hinder classification. To focus on effective areas and suppress useless features, a CBAM was added to each CNN framework as a hybrid attention mechanism that combines the channel and spatial dimensions [34]. In the channel dimension, average pooling was used to aggregate channel interaction information, and maximum pooling was used to infer finer channel information to further improve the representation power of the network. In the spatial dimension, average pooling and maximum pooling were concatenated to generate an efficient feature descriptor for extracting valid feature location information. As shown in Fig. 2b, compared with the original CNN, CBAM was inserted only between the feature extractor and the classifier of each CNN, rather than within the feature extraction process. This emphasises the crucial feature information and ensures effective feature extraction in the five CBAM-CNNs (CBAM-AlexNet, CBAM-VGG, CBAM-ResNet, CBAM-Inception, and CBAM-DenseNet).

In the second stage, the five CBAM-CNNs were combined in parallel to form an ensemble model [35]. A collective decision mechanism, referred to as the relative plurality voting (RPV) module, was also constructed and added to the ensemble model.
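Plurality voting over the predictions of the sub-models can be sketched in a few lines. The example is a minimal illustration rather than the authors' implementation: the function name `plurality_vote`, the example votes, and the tie-breaking rule (lowest class index wins) are assumptions, since no tie rule is stated here.

```python
from collections import Counter
from typing import Sequence

CATEGORIES = ["(-)", "(1+)", "(2+)", "(3+)", "(4+)"]


def plurality_vote(votes: Sequence[int]) -> int:
    """Return the class index that received the most votes.

    A relative plurality is enough: the winner does not need more than
    half of the votes. Ties go to the lowest class index (an assumption).
    """
    counts = Counter(votes)
    best = max(counts.values())
    return min(label for label, n in counts.items() if n == best)


# Hypothetical class indices predicted by the five sub-models for one image.
sub_model_votes = [2, 2, 1, 2, 3]
final_category = CATEGORIES[plurality_vote(sub_model_votes)]
```

With five voters and five classes, a category can win with as few as two votes, which is what distinguishes relative plurality voting from majority voting.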
Based on the RPV module, the intensity category with the most votes among all sub-models was taken as the final category, providing a more reasonable decision boundary for the model. The ensemble model with the RPV module makes full use of multiple networks to offset the limitations of a single network and reduce the overall classification error rate [36].

For IARI images, a corresponding ensemble classification model was developed. To achieve quick convergence of the proposed model, a training dataset comprising the five IARI categories was used to train the CBAM-CNN models. The loss function for training, Loss, was cross-entropy. In its definition, $C$ is the number of CBAM-CNN models, $l_c$ is the loss of one CBAM-CNN model, $N$ is the number of IARI images, $x_i$ is one IARI image, $\tilde{x}_i$ is the nonlinear transformation of $x_i$, $y_i$ is the IARI category corresponding to $x_i$, $\theta$ is the parameter set of the model, and $p$ is the probability output from the model.

Using the PyTorch open-source libraries as a back end, the ensemble model was implemented on an Ubuntu 16.04 computer with one Intel Xeon CPU, an NVIDIA RTX 2080 Ti GPU, and 32 GB of available RAM.

Four metrics, Accuracy, Precision, Recall, and F-score, were used to quantitatively evaluate the performance of the model for each IARI category, and these are defined as follows:

$$\mathrm{Accuracy} = \frac{TP + TN}{TP + TN + FP + FN}$$

$$\mathrm{Precision} = \frac{TP}{TP + FP}$$

$$\mathrm{Recall} = \frac{TP}{TP + FN}$$

$$F\text{-score} = \frac{(1 + \beta^2)\,\mathrm{Precision} \cdot \mathrm{Recall}}{\beta^2\,\mathrm{Precision} + \mathrm{Recall}}$$

where true positive (TP) is the number of positives correctly predicted by the classification model, and true negative (TN) is the number of negatives correctly predicted; false positive (FP) and false negative (FN) are the numbers of samples incorrectly predicted as positive and as negative, respectively. β is the weight in the F-score calculation that balances Precision and Recall, and was set to 1.
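For a five-class problem, these four metrics are usually computed per category in a one-vs-rest fashion from a confusion matrix. The sketch below assumes that convention (the text does not spell it out); the function name `per_class_metrics`, the matrix orientation, and the toy three-class matrix are illustrative and are not study data.

```python
from typing import Dict, List


def per_class_metrics(cm: List[List[int]], beta: float = 1.0) -> List[Dict[str, float]]:
    """One-vs-rest metrics per class; cm[t][p] counts true class t predicted as p."""
    n = len(cm)
    total = sum(sum(row) for row in cm)
    results = []
    for k in range(n):
        tp = cm[k][k]
        fn = sum(cm[k]) - tp                          # class k predicted as another class
        fp = sum(cm[t][k] for t in range(n)) - tp     # other classes predicted as k
        tn = total - tp - fn - fp
        precision = tp / (tp + fp) if tp + fp else 0.0
        recall = tp / (tp + fn) if tp + fn else 0.0
        denom = beta ** 2 * precision + recall
        f_score = (1 + beta ** 2) * precision * recall / denom if denom else 0.0
        results.append({
            "accuracy": (tp + tn) / total,
            "precision": precision,
            "recall": recall,
            "f_score": f_score,
        })
    return results


# Toy three-class confusion matrix (not data from the study).
toy_cm = [
    [5, 1, 0],
    [0, 4, 1],
    [0, 0, 9],
]
metrics = per_class_metrics(toy_cm)
```

With β = 1 the F-score reduces to the harmonic mean of Precision and Recall, so a class with equal Precision and Recall has an F-score equal to both.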
In imbalanced datasets, macro-averaged metrics average a metric over all categories with equal weight, so that each category is represented fairly regardless of its frequency [37, 38]. The macro-performance of the overall ensemble model for all categories (n = 5) was evaluated using the macro-average $X_{\mathrm{avg}}$ [39] [40] [41], as follows:

$$X_{\mathrm{avg}} = \frac{1}{n} \sum_{i=1}^{n} X_i$$

where $X_i$ is the value of metric $X$ for category $i$. To quantify the comparison of classification performance between the model and the immunologists, the Kappa coefficient was used to measure the consistency between the predicted values and true values as follows [42]:

$$\kappa = \frac{p_0 - p_e}{1 - p_e}$$

where $p_0$ represents Accuracy and $p_e$ is the agreement expected by chance.

Statistical analysis was conducted using R software (version 3.5.1, https://www.r-project.org/). Accuracy, Precision, Recall, F-score, and the Kappa coefficient were used to evaluate the performance of the CNN models and the immunologists. The Kappa coefficient was computed with cohen.kappa() from the concord package in the consistency analysis. Pearson's chi-square test, computed with chisq.test, was applied to assess the differences in performance between manual classification and the proposed ensemble learning model. Statistical significance was set at p < 0.01.

The implementation details of the CBAM-CNN model training were as follows: batch size, 32; epoch number, 50; the adaptive moment estimation (Adam) optimiser [43] was used to tune the parameter set θ; and the initial learning rate was set to 5e-4. To demonstrate the effectiveness of our proposed model, six metrics (Accuracy_avg, Precision_avg, Recall_avg, F1-score_avg, Kappa, and Time) were used to evaluate the classification performance of the five independent CBAM-CNN models and the ensemble model. The corresponding performance of all the models is listed in Table 2.

To verify the clinical utility of our proposed model, we conducted a mind-machine comparison experiment [45].
This experiment compared the classification performance of the proposed model, the immunologists, and the immunologists re-classifying with model assistance (human-machine integration experiment), with particular attention to the performance in each category. The classification performance for each category and on average of the proposed model, the three immunologists (Immunologist-1 and Immunologist-2 with about 2 years of experience each, and Immunologist-3 with 5 years of experience), and the immunologists with model assistance is shown in Table 3. In particular, the time taken for model classification was 0.094 s·image⁻¹, approximately 60 times faster than that taken by the immunologists (Immunologist_avg = 5.528 s·image⁻¹).

Further, to present the results intuitively, confusion matrices [46] were generated for the classification results of the ensemble model, the immunologists, and the immunologists with model assistance, as shown in Fig. 3. Confusion matrices, which cross-tabulate predicted classes against actual classes, are widely used in deep learning. In the confusion matrices, the abscissa represents the true label, and the ordinate represents the predicted label. In the red area, the numbers indicate the amount of data predicted for each category. A deeper red indicates a larger amount of data predicted in that category; a deeper red on the main diagonal indicates higher prediction accuracy, and lighter cells elsewhere indicate a lower error rate. In the green area, the percentages in the right-most column represent the Precision of each category, the percentages in the bottom row represent the Recall of each category, and the percentage in the lower right corner represents the total Accuracy. A deeper green indicates better prediction performance. As shown in Fig. 3, the total accuracy of the immunologists was also improved (Accuracy: Imm-1, 59.5% vs. 73.3%, Imm-2, 89.0% vs. 92.3% and Imm-3, 89.0% vs. 92.3%). In the model-assisted experiment, Immunologist-3 achieved the highest accuracy (Accuracy_total = 92.3%), which was close to that of the model (Accuracy_total = 99.4%).

This study presents a fast, fully automatic deep learning model based on CNNs for IARI classification. It is an end-to-end hybrid processing method that combines the ensemble model with the CBAM and RPV modules and can accurately divide the intensity of IARI into five categories. Compared with the independent models, this decision-level fusion design dramatically improved the classification efficiency and fitted the IARI dataset more precisely. More importantly, the proposed model achieved better classification performance than the immunologists and effectively improved their classification accuracy.

Table 2 shows the macro-averaged metrics of the single and ensemble models for IARI multi-classification. All the models achieve an Accuracy_avg of more than 90% over all categories because deep learning models can automatically mine subtle and deep features related to the IARI that cannot be perceived manually. However, the performance of the single models differs between categories. As shown in Supplementary Material Table S1, the accuracies of the ResNet model in the (-) and (3+) categories were 77.6% and 84.7%, respectively, and the accuracy of the DenseNet model in the (-) category was only 79.2%. Compared with the single models, the ensemble model substantially improves classification accuracy both in single categories and overall. As shown in Table 3, the accuracies of all categories were above 99%, and the maximum improvement in overall accuracy was 8.3% (Accuracy_avg: ResNet 91.3% vs. ensemble model 99.6%).
The ensemble model improves the model fit efficiently; moreover, it is insensitive to outliers, which reduces the decision boundary shift [47] [48] [49] [50] [51]. In addition, through its collective decision mechanism, the ensemble model synthesises information from several sub-models with different structures and has been shown to reduce average error and combine the strengths of models in exploring diverse data patterns [52] [53] [54]. Moreover, adding a poorly performing model will not reduce the overall classification skill, because the ensemble model has a net gain compared with the single models [55, 56]. Given the above, the ensemble model can reduce the risk of relying on a single prediction distribution and extract richer semantic feature information than the single CNN models (for example, each sub-model assigns a different pixel-level probability to boundary regions during training), which benefits classification accuracy [57] [58] [59] [60] [61] [62].

Table 3 Comparison between the ensemble models and three immunologists in each sub-category. Notes: Imm-n denotes Immunologist-1, Immunologist-2, Immunologist-3, and Immunologist-avg; w/o represents the immunologist without model assistance; w represents the immunologist with model assistance; "↑" indicates that the result of the immunologist with model assistance is better than that of the immunologists

As shown in Table 2, CBAM has a limited effect on overall model performance: it slightly increases the accuracy of the models except for the VGG and Inception models. However, CBAM reduces the cross-adjacent category errors, especially those of the CBAM-CNN models (Table S1), and reduces the rate at which blood artefacts are mistaken for the (4+) category.
CBAM, which can be flexibly introduced into various models, partially preserves the channel interaction information and spatial location information while gathering clues about the features of the actual class object and giving a meaningful focus on the input images through element-wise operations [63] [64] [65] [66] [67] [68] [69] [70] [71] [72] [73] [74]. Thus, the CBAM-CNN models make more robust and plausible classification decisions. Table 3 shows that the proposed ensemble model with CBAM gives the best performance among all models, both in the single categories and overall.

We further compared the classification performance of the immunologists and our deep learning model. As illustrated in Table 3, the performance of the model was higher than the average performance of three junior immunologists with varying experience. For the immunologists, the (-) and (4+) categories were relatively easy to classify, whereas the (1+), (2+), and (3+) categories were more prone to errors. The results show that the more experienced immunologist classified with higher accuracy. The immunologists also re-classified the samples with the assistance of our proposed model to verify its clinical utility. In the (1+), (2+), and (3+) categories, the performance of the immunologists improved greatly, and the auxiliary effect was most obvious for the immunologists with relatively little experience. Further, analysis of the time required for classification showed that the calculation time of the proposed model is at the millisecond level and is approximately 60 times faster than manual classification. Thus, the model holds great potential for real-time assistance, especially for junior immunologists.

The confusion matrices clearly and intuitively showed the classification performance of the model and the immunologists in each category.
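A confusion matrix of the kind discussed here, together with the Kappa coefficient and a macro-average derived from it, can be computed as in the following sketch. It is a plain-Python illustration, not the R `cohen.kappa()` routine used in the study; the function names, the row/column orientation, and the six toy labels are assumptions made for the example.

```python
from typing import List, Sequence


def confusion_matrix(y_true: Sequence[int], y_pred: Sequence[int], n_classes: int) -> List[List[int]]:
    """cm[t][p] counts samples of true class t that were predicted as class p."""
    cm = [[0] * n_classes for _ in range(n_classes)]
    for t, p in zip(y_true, y_pred):
        cm[t][p] += 1
    return cm


def cohen_kappa(cm: List[List[int]]) -> float:
    """Cohen's kappa, (p0 - pe) / (1 - pe); undefined when pe equals 1."""
    n = len(cm)
    total = sum(sum(row) for row in cm)
    p0 = sum(cm[k][k] for k in range(n)) / total                    # observed agreement
    true_totals = [sum(cm[k]) for k in range(n)]
    pred_totals = [sum(cm[t][k] for t in range(n)) for k in range(n)]
    pe = sum(r * c for r, c in zip(true_totals, pred_totals)) / total ** 2  # chance agreement
    return (p0 - pe) / (1 - pe)


def macro_average(values: Sequence[float]) -> float:
    """Unweighted mean over categories, so rare classes count as much as common ones."""
    return sum(values) / len(values)


# Toy labels for three classes (not data from the study).
y_true = [0, 0, 1, 1, 2, 2]
y_pred = [0, 0, 1, 2, 2, 2]
cm = confusion_matrix(y_true, y_pred, 3)
kappa = cohen_kappa(cm)
macro_recall = macro_average([cm[k][k] / sum(cm[k]) for k in range(3)])
```

Kappa discounts the agreement that would occur by chance given the class totals, which is why it is a stricter summary than raw accuracy on an imbalanced dataset.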
Higher accuracy rates generally indicate better results, but FP and FN are also important and should not be ignored in clinical medicine. Reducing the FP and FN rates can significantly reduce the possibility of medical errors. There were no serious errors in the IARI classification using the ensemble model: no strongly positive IARI ((4+) category) was identified as poorly positive ((1+), (2+), (3+) categories) or negative ((-) category), and no negative IARI was identified as positive. The model also did not show cross-adjacent category errors within the poorly positive categories. In contrast, the immunologists often confused (-) with (1+), (3+) with (4+), and the categories within the poorly positive group, resulting in unsatisfactory classification results. Our analyses revealed that automatic classification is feasible and reliable and can significantly outperform the immunologists. We observed that the performance of the immunologists improved markedly with the assistance of the proposed model: the main diagonal became deeper, indicating that the accuracy of each category increased; the other areas became lighter, indicating that the cross-adjacent category errors were reduced; and the green area became deeper, indicating an improvement in Accuracy_total, Precision, and Recall. These results demonstrate that our proposed model may be used as a reference for assisting immunologists.

In addition, in this experiment, the batch size was set to 32 under a data-parallelisation strategy to suit the IARI dataset, so that the gradient updates move the CNN models in the right direction for accurate IARI classification [75] [76] [77] [78]. With the batch size fixed, a learning rate of 5e-4 keeps the generalisation performance from degrading and lets the CNN models achieve their best performance, because small batch sizes require small learning rates [79] [80] [81].
The number of epochs was set to 50 because all the models had reached stable convergence by then. With these well-chosen parameters, the models are robust and achieve good results.

However, our research had some limitations. First, our datasets were derived from the same source, and the AHG test was performed in the same laboratory, so the data contained limited variance; thus, the generalisation of the model needs to be externally verified at multiple centres. Second, the model could only differentiate negative, poorly positive, and strongly positive reactions. In clinical practice, there is a suspected category, (±), which immunologists find hard to distinguish from negative and poorly positive reactions and for which there were insufficient images to build the CNN model. Thus, in future, extending the dataset to multi-vendor and multi-centre platforms may further improve the performance of the model. At the same time, we will qualitatively distinguish the boundary between (±) and the other classes described in this study.

In this study, we presented a deep ensemble learning model based on CNN models that can accurately classify IARI into multiple categories. This model can aid immunologists in differentiating distinct clinical patients by providing an objective and accurate evaluation of IARI categories, which could reduce the risk of haemolytic diseases. The model holds great potential for fully automatic equipment and promises to promote intelligent AHG test classification.

The online version contains supplementary material available at https://doi.org/10.1007/s11517-022-02523-1.
Anemia and blood transfusion in critically ill patients
Critical issues in hematology: anemia, thrombocytopenia, coagulopathy, and blood product transfusions in critically ill patients
Pre-transfusion testing
Immunological aspects of blood transfusions
Impact of allergy screening for blood donors: relationship to nonhemolytic transfusion reactions
A new approach to detection of incomplete antibodies using hydrogel chromatography medium
A new test for the detection of weak and incomplete Rh agglutinins
An "incomplete" antibody in human serum
Pre-analytical issues and interferences in transfusion medicine tests. In: Accurate Results in the Clinical Laboratory: A Guide to Error Detection and Correction
The gel test: a new way to detect red cell antigen-antibody reactions
Mixed field agglutination: unusual causes and serological approach
Evaluation of the use of gel card system for assessment of direct coombs test: weighing the pros and cons
Effects of micro-magnetic field at the surface of 316L and NiTi alloy on blood compatibility
Blood compatibility of polymers: in vitro and in vivo tests
A solid phase antibody screen
A solid phase and microtiter plate hemagglutination method for pretransfusion compatibility testing
Comparison between a solid-phase low-ionic-strength solution antiglobulin test and conventional low-ionic-strength antiglobulin test: assessment for the screening of antierythrocyte antibodies
Comparison between the manual method of indirect coombs via gel technology and solid phase red cell adherence
Evaluation and implementation of the gel test for indirect antiglobulin testing in a community hospital laboratory
Method of evaluation of process of red blood cell sedimentation based on photometry of droplet samples
A new reliable test for crossmatching: microplate hydrogel immunoassay technology
Principles of laboratory techniques
The significance of interdisciplinary integration in academic research and application
Artificial intelligence (AI)-aided disease prediction
Clinically applicable AI system for accurate diagnosis, quantitative measurements, and prognosis of COVID-19 pneumonia using computed tomography
Automated classification of cells into multiple classes in epithelial tissue of oral squamous cell carcinoma using transfer learning and convolutional neural network
Quantitative analysis of blood cells from microscopic images using convolutional neural network
Using deep learning for image-based plant disease detection
ImageNet classification with deep convolutional neural networks
Very deep convolutional networks for large-scale image recognition
Deep residual learning for image recognition
Rethinking the inception architecture for computer vision
Densely connected convolutional networks
CBAM: Convolutional block attention module. Lecture Notes in Computer Science (including subseries
Plurality voting-based multiple classifier systems: statistically independent with respect to dependent classifier sets
Analysis of a plurality voting-based combination of classifiers
Text categorization with class-based and corpus-based keyword selection
A re-examination of text categorization methods
Training algorithms for linear text classifiers
Overview of the third text retrieval conference (TREC-3)
Robust multiclass classification for learning from imbalanced biomedical data
Statistical significance and normalized confusion matrices
Adam: A method for stochastic optimization
Metrics for multi-class classification: an overview
A cross-modal 3D deep learning for accurate lymph node metastasis prediction in clinical stage T1 lung adenocarcinoma
Confusion matrices and rough set data analysis
Margin-based pareto ensemble pruning: an ensemble pruning algorithm that learns to search optimized ensembles
Dynamic classifier selection: recent advances and perspectives
A survey of selective ensemble learning algorithms
New discoveries in balanced ensemble learning
An ensemble simulation approach for artificial neural network: an example from chlorophyll a simulation in Lake Poyang
Can multi-model combination really enhance the prediction skill of probabilistic ensemble forecasts?
Nudging, ensemble, and nudging ensembles for data assimilation in the presence of model error
Theoretical examination of a multimodel composite for seasonal prediction
Evaluating perturbation impact on key travel models, census statistical disclosure control research project 1
The ensemble approach to forecasting: a review and synthesis
Multistrategy ensemble learning: reducing error by combining ensemble learning techniques
An experimental comparison of three methods for constructing ensembles of decision trees: bagging, boosting, and randomization
Arcing classifiers
Bundling heterogeneous classifiers with advisor perceptrons
Ensemble of training models for road and building segmentation
Squeeze-and-excitation networks
A polarization fusion network with geometric feature embedding for SAR ship classification
Learning deep features for discriminative localization
What is the best multi-stage architecture for object recognition? IEEE 12th international conference on computer vision
Linear spatial pyramid matching using sparse coding for image classification. IEEE Conference on computer vision and pattern recognition
Object recognition with features inspired by visual cortex
Forecasting the subway passenger flow under event occurrences with multivariate disturbances
DRSNet: Novel architecture for small patch and low-resolution remote sensing image scene classification
Learning mid-level features for recognition
Multi-scale spatial-spectral fusion based on multi-input fusion calculation and coordinate attention for hyperspectral image classification
Co-attentive multi-task convolutional neural network for facial expression recognition
Improving scalability of parallel CNN training by adjusting mini-batch size at run-time
The effect of batch size on the generalizability of the convolutional neural networks on a histopathology dataset
Practical recommendations for gradient-based training of deep architectures. Neural networks: Tricks of the trade
Dynamically adjusting transformer batch size by monitoring gradient direction change
On the importance of initialization and momentum in deep learning. International conference on machine learning
Impact of training set batch size on the performance of convolutional neural networks for diverse datasets
Revisiting small batch training for deep neural networks

Competing interests: The authors declare no competing interests.

Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Institute of Biomedical Engineering and Technology, whose main research direction is precision medicine based on intelligent computing.

ShengBao Duan is a professor at the CAS Key Lab of Bio-Medical Diagnostics, mainly researching the development of in vitro diagnostic reagents related to blood transfusion safety.

YuJue Wang is a member of the CAS Key Lab of Bio-Medical Diagnostics, mainly researching the development of in vitro diagnostic reagents related to blood transfusion safety.

HongMei Wang is a professor at the CAS Key Lab of Bio-Medical Diagnostics, mainly researching the development of in vitro diagnostic reagents related to blood transfusion safety.

Xin Gao is a professor at the Suzhou Institute of Biomedical Engineering and Technology, Chinese Academy of Sciences.