key: cord-0216436-s6rbcpkb authors: Qu, Ruyi; Yang, Yi; Wang, Yuwei title: COVID-19 Detection Using CT Image Based On YOLOv5 Network date: 2022-01-24 journal: nan DOI: nan sha: 23dcc1c1c666921facf65a11affe174cf8ba68a3 doc_id: 216436 cord_uid: s6rbcpkb Computer aided diagnosis (CAD) increases diagnosis efficiency, helping doctors providing a quick and confident diagnosis, it has played an important role in the treatment of COVID19. In our task, we solve the problem about abnormality detection and classification. The dataset provided by Kaggle platform and we choose YOLOv5 as our model. We introduce some methods on objective detection in the related work section, the objection detection can be divided into two streams: onestage and two stage. The representational model are Faster RCNN and YOLO series. Then we describe the YOLOv5 model in the detail. Compared Experiments and results are shown in section IV. We choose mean average precision (mAP) as our experiments' metrics, and the higher (mean) mAP is, the better result the model will gain. mAP@0.5 of our YOLOv5s is 0.623 which is 0.157 and 0.101 higher than Faster RCNN and EfficientDet respectively. The COVID-19 pandemic has broken down the global medical order tremendously, which triggers an urgent need to improve the level of medical care. [1] Normally, COVID-19 diagnosis can be diagnosed via polymerase chain reaction, costing a long time almost 12 hours. If we need to get the quick result, the chest radiographs, obtained in minutes, can be an optimal choice. [2] Because COVID-19 looks similar to other viral and bacterial pneumonias on chest radiographs, which is difficult to diagnose. There are some medical misdiagnosis when radiologists make the diagnosis. Computer aided diagnosis (CAD) [3] increases diagnosis efficiency, helping doctors providing a quick and confident diagnosis, it has played an important role in the treatment of COVID-19. The computer vision technology mainly effects in the domain related with images, such as CT, MRI and PET scans. In this paper, we focus on the radiographs categorization, which contains three kinds: the negative for pneumonia, typical, indeterminate and atypical COVID-19. We describe the related objection detection methods in section II. And section III shows our model for this task: YOLO v5 while section IV introduces the experiment results and compared experiments. In the last section V, we draw the conclusion and proposed the further adjustment in the future. Numerous approaches on the objection detection have been proposed in recent years, and there are two mainly traditional streams. The mainstream target detection algorithm based on deep learning is divided into two-stage target detection algorithm and single-stage target detection algorithm according to the generation stage of candidate box. 1) Two-stage method In view of the low precision of traditional target detection algorithms, Girshick et al. [6] proposed R-CNN, a goal detection algorithm based on deep learning, which pioneered a precedent. Ren et al. [7] proposed the Fast R-CNN algorithm, on the basis of Fast R-CNN added the candidate window network (region proposal network, RPN), the candidate window network by setting different scale anchors (anchor) to extract candidate boxes, instead of selective search and other traditional candidate box generation methods, to achieve end-to-end network training, improve the network computing speed. The Faster R-CNN network consists of convolution layers, RPN networks, ROI of interest, classification, and regression layers 2) One-stage method In view of the inefficiency of the two-stage target detection algorithm, YOLO (you only look once) v1 [8] discarded the candidate box extraction branch in the algorithm and directly implemented feature extraction, candidate box classification and regression in the same branchless deep convolution network, making the network structure simple and increasing the detection speed from 7 frames/s of Fast R-CNN to 45 frames/s. So that the goal detection algorithm based on deep learning can meet the needs of real-time detection tasks under the computing power of the time. With the emergence of YOLov1, the goal detection algorithm based on deep learning begins to have two and single stages. l Our Contribution ü We applied the model YOLOv5 for objective detection. ü We introduce our dataset and do some analysis. ü In the experiments process, we do the comparing experiments and the result shows that our model performed better than the other models. The whole structure of YOLOv5 is shown in the following figure 1: Figure 1 : YOLOv5 Architecture Backbone structure is the network core of YOLOv5 technology, which aims to extract information for utilization and processing in the process of image input. In the network structure model, gradient information often has many repetitive problems, which can be solved by combining CSPNet technology, so that gradient changes and feature diagrams can be integrated together, and the number of parameters of the model and the FLOPS value of the model are controlled in a lower range, thus improving the accuracy and efficiency of reasoning, and also achieving the goal of reducing the size of the model. PANet is based on the Musk R-CNN network framework and the FPN network framework, and the dissemination of information is optimized and improved. In the process of extracting features, the network enhances the bottom-up path and improves the propagation of the underlying features. For the third network path, the data information characteristic mapping in the previous network stage is the input of this stage, operated according to the 3×3 convolution method, and the characteristic mapping value of each stage output will be directly connected with the path information of the same stage, effectively enhancing the joint utilization of high-level and low-level information. The damaged information paths between regions and features can also be recovered quickly under the action of adaptive feature pools, and the candidate regions are aggregated at each feature layer to avoid random allocation. The basic components of YOLOv5 are five parts: In our task, the important metrics we focus is the (mean average precision) mAP score. [9] From the confusion matrix, we can get the how to calculate the precision and recall. We can draw the PR curve and compute the area under the curve to judge the model's performance. In the objection detection task, we focus on the intersection over union (IOU). B_gt represents the actual ground frame (Ground Truth, GT) of the target, and B_p represents the predicted frame. By calculating the IOU of the two, it can be judged whether the predicted detection frame meets the conditions. Table 1 : compared experiments Yolov5s networks are the least deep and the smallest width of feature maps in the Yolov5 series. The latter three are on this basis to deepen, and constantly widen. EfficientDet [10] proposed to model scale the network such as the backbone of the detector. We know that the higher mAP is, the better result the model will gain. mAP@0.5 of our YOLOv5s is 0.623 which is 0.157 and 0.101 higher than Faster RCNN and EfficientDet respectively. Above, we have solved the task on abnormality detection, and this section will conclude our work. In our task, we solve the problem about abnormality detection and classification. The dataset provided by Kaggle platform and we choose YOLOv5 as our model. We introduce some methods on objective detection in the related work section, the objection detection Then in the section III we describe YOLOv5 model in the detail. Compared Experiment and results are shown in section IV. We choose mean average precision (mAP) as our experiments' metrics, and the higher (mean )mAP is, the better result the model will gain. mAP@0.5 of our YOLOv5s is 0.623 which is 0.157 and 0.101 higher than Faster RCNN and EfficientDet respectively. Severe acute respiratory syndrome-related coronavirus-The species and its viruses, a statement of the Coronavirus Study Group Lung infection quantification of COVID-19 in Fifty Years of Computer Analysis in Chest Imaging: Rule-Based Computer-aided diagnosis of diabetic retinopathy: a review Neocognitron: a self-organizing neural network model for a mechanism of pattern recognition una_ected by shift in position Rich feature hierarchies for accurate object detection and semantic segmentation Faster r-cnn: Towards real-time object detection with region proposal networks YOLO-LITE: a real-time object detection algorithm optimized for non-GPU computers Estimating average precision with incomplete and imperfect judgments Efficientdet: Scalable and efficient object detection