key: cord-1006265-s7wvm5fu authors: Junior, Claudio Pedrassoli; Cavallieri, Gabriela Vidotto; da Silva, Francisco Assis; Fernandes, Geovana Letícia; Nai, Gisele Alborghetti; Salge, Ana Karina Marques; Puhle, Josiano Guilherme; de Resende e Silva, Debora Tavares; Pereira, Danillo Roberto; de Azevedo Mello, Fabíola; Favareto, Ana Paula Alves; Rossi, Renata Calciolari title: Digital image processing: a useful tool in the analysis of lung injuries caused by chronic inhalation of agricultural herbicides date: 2021-06-07 journal: Environ Sci Pollut Res Int DOI: 10.1007/s11356-021-14692-x sha: 73d887200b6b7ca03baa95b16d64da7a6df5c064 doc_id: 1006265 cord_uid: s7wvm5fu The herbicide 2,4-dichlorophenoxyacetic acid (2,4-D) is widely used in agriculture to control various weeds. The objective of this study was to use the digital image processing method to identify alveolar lesions in the lungs of rats submitted to chronic 2,4-dichlorophenoxyacetic acid (2,4-D) inhalation exposure. We used forty adult male Wistar rats. The rats were divided into four groups: control group (CG), low concentration group (LCG), medium concentration group (MCG), and high concentration group (HCG). In a 6-month exposure period, we used two boxes connected to ultrasonic nebulizers for herbicide spraying. After this period, the rats were euthanized for the collection and study of lung tissue. For each image, counts of injuries and blisters were performed automatically using a methodology based on digital image processing techniques. For analysis of the results, an electronic database (Excel®) was created. We used the Pearson method for correlation analysis; values of p <0.05 were considered significant. In the evaluation of healthy alveoli, we recorded positive and significant correlations between analysis from a pathologist and computational analysis. In the evaluation of injured alveoli, we recorded a positive but non-significant correlation between analysis from a pathologist and computational analysis. These results show the effectiveness of digital image processing when evaluating alveolar integrity. The success of plantations often depends on weed control actions, such as pesticide use. Since 2008, Brazil has been the global leader in the consumption of crop protection products. The abusive use of pesticides is directly related to numerous health problems and affects workers in the agribusiness production chain, residents of areas near production centers, and consumers who ingest derivatives of herbicide-aided crops (Abrasco 2016; Gomes 2002) . Inhaled particles can settle in the airways, where they interfere with lung-cleaning mechanisms, or deep in lungs. In cases of long-term exposure, these effects can eventually contribute to the development of chronic obstructive pulmonary disease (COPD), emphysema, and lung cancer. Lungs serve as gateways to systemic circulation. Thus, chronic inhalation of gases and vapors may cause systemic damage to various organs and systems, including the liver and bone marrow (Firetti et al. 2012) . The health and computing fields have been collaborating to improve parameters related to medical diagnosis. Digital image processing is an example of these efforts; it uses the extraction of patterns from a set of samples to acquire knowledge automatically (Pereira and Artificial intelligence myth and science 2005) . Digital image processing methods can be divided into two basic categories: supervised and unsupervised learning. Supervised learning methods pre-classify a sample set (training set) to extract sample patterns, in which each example used for training is labeled with the value of interest. A supervised learning algorithm searches for patterns among the value labels, while using all relevant information. Such methods need to be classified by a specialist (Koerich 2012) . When using supervised learning methods, the measure of accuracy must be estimated after the training session. This occurs through a set of tests or validations, in which samples are labeled 5 . Using this method, each sample is classified, and the predicted label is compared with the desired one. This requires an adjustment to make the obtained label equal to the real label of the sample; otherwise, a classification error appears. Through this criterion, the accuracy percentage of the digital image processing method can be calculated, which is then used as an accuracy metric to represent the precision of the classifier (Coppin 2010) . The performance and accuracy of most digital image processing methods are greatly influenced by internal parameters of the algorithms and selection of the characteristics of the data sets for the classification process (Coppin 2010) . The findings of this study can be used to make companies more aware of health risks and initiate discussions on sustainable projects that improve health and quality of life. This study aimed to use the digital image processing method to identify alveolar lesions in the lungs of rats submitted to chronic 2,4-D inhalation exposure. The experimental protocol was approved by the Ethics Committee for Use of Animals (CEUA) at UNOESTE and was carried out in accordance with the Guide for the Care and Use of Laboratory Animals of the National Institute of Health (Committee for Update of the guide for the care and use of the laboratory animals 2011). Adult male Wistar rats (200-250g, n = 40) were used, with an average age of 60 ± 5 days. They were placed in collective plastic cages (5 rats per cage) that measured 30×16×19 cm. The rats were kept in a temperature-controlled room (22 ± 2°C, 50-60% relative humidity) with a 12-h light/dark cycle (7 am to 7 pm, light period; 7 pm to 7 am, dark period). Throughout the experimental period, the rats were exposed to the herbicide at the same time of day. The rats received filtered water and commercial rat feed (Supralab®, Alisul, Brazil) ad libitum. All rats from each specific group were exposed to the herbicide at the same time. We used a commercial formulation of (2,4-dichlorophenoxy) acetic acid (2,4-D; Nortox S.A., Arapongas, Paraná, Brazil, registered in the Ministry of Agriculture, Livestock and Food Supply # 03009). The formulation constituted of 806g/L of dimethylamine salt of (2,4-dichlorophenoxy) acetic acid (2,4-D), 670g/L of 2,4-D acid equivalent, and 424g/L of inert ingredients. The 2,4-D formulation was diluted in 0.9% sodium chloride. Rats were randomly assigned to four experimental groups (n = 10/group), as shown in Table 1 . Considering that the rats had the same characteristics (e.g., sex, approximate weight), we randomly divided the rats upon their arrival from the central vivarium by randomly distributing their cages to each exposure group. In the animal experimentation laboratory, the rats were identified by Arabic numbers written on their tails. Rats from different groups were exposed for five consecutive days per week in a 180-day period. The rats were exposed with the concentration for their specific experimental group (Table 1) (de Mello et al. 2018; de Mello 2020) . Exposure occurred via inhalation in plastic boxes (32×24×32cm) that were connected to an ultrasonic nebulizer (Pulmosonic Star®, Brazil). Daily exposure periods lasted approximately 15 min; this is the required time for nebulization of the entire solution (de Mello et al. 2018; de Mello 2020) . The 2,4-D concentrations for each experimental group were chosen in accordance with standard product application and agronomic prescription to different cultures. For better interpretation and comparison of results in the literature, 2,4-D concentrations used in agriculture (grams of active ingredient per hectare -g a.i. ha -1 ) were adjusted to the box dimensions and converted to the units of mg/m −3 and ppm (de Mello et al. 2018; de Mello 2020) . All rats were exposed to the herbicide for 6 months. They were then euthanized after this period. Anesthesia and euthanasia were performed by sodium thiopental (Syntec, USA) with doses of 100 mg/kg of weight administered in the peritoneal cavity. Indications of death were reflex loss and absence of respiratory movements and heartbeat. g.i.a/ha = grams of active ingredient per hectare, considering environmentally relevant concentrations, according to product application in agriculture ppm = parts per million After euthanasia, autopsies were performed to remove lungs for further analysis. For the maintenance of lung morphological architecture and constant pressure (20 cmH2O), we used 10% buffered formalin in the trachea region. For processing and paraffin block production, we collected lung tissue at random from central and peripheral areas of the lungs of each rat (PARKINSON et al. 2011) . Sample sections of 5μm were stained with hematoxylin eosin for selection of the most appropriate airways for pulmonary epithelium evaluation. We selected areas with preserved histological architecture. Pathological analyzes were performed by a researcher with experience in pathology. We considered alveoli with similar architectural patterns as healthy. Images classified as "altered" were submitted to the machine-learning process. Classifications were based on specifications used by Ruppert et al. (2012) , which considered alveolar damage as the following: involvement and uniform temporal appearance of alveolar collapse; hyaline membranes; obliterative fibrosis; formation of neo-septa and moderately organized fibrosis; intraalveolar hemorrhage characterized by the presence of blood in alveolar spaces; enlarged and swollen alveolar septa, usually combined with mononuclear inflammatory infiltrates. Normal and injured cells in each image were counted automatically using a methodology based on digital image processing techniques. The images were pre-processed to segment each cell. The segmented cells were then classified as normal or injured, according to metrics obtained from cell images and empirically defined parameters (later described in this section). Figure 1 shows an example of an original image of lung cells with a region marked by two dashed lines in the upper left corner. This demarcated region was cut out from this specific image in order to explain the methodology adopted for pre-processing and cell segmentation. In the image selection process for machine learning, the evaluator who selected the images characterized as "normal" was blinded; the images were previously given random numbers by other researchers. . Each lung cell image (Fig. 2, (a) example of the cropped image of the region marked in Fig. 1 ) was converted from RGB (red, green, blue) color space of the original image to HSV (hue, saturation, value) color space (Fig. 2) (b) . Hue defines the color component. Saturation defines color purity. Value (value/brightness) defines the amount of light (Gonzalez and Woods 2010) . In the sequence, an Otsu (1979) threshold was applied only to the "H" channel of Fig. 2 (b) , which generated Fig. 2 (c) image. This image contains a significant highlighted (in red) cell region. For greater cell enhancement, we also used the YUV color space in Fig. 2 (c) image. YUV color space (Y is gray scale or brightness information, U and V are the chromaticity or color information in the components) is different from RGB, as it works with separate light and color components (Russ 1999) . The image was then converted from the YUV color space to RGB, which resulted in Fig. 2 (d) image. This image shows the cells changing as they move away from the edges. In addition, we converted Fig. 2 (d) image from the RGB color space to the HSV 7 color space. Of the three channels, we only used the "S" channel. We applied channel "S" (a binary threshold) to this image to generate Fig. 2 (e) image. This provides a binary image of the cells separated from the edges without damaging the contour. This process was necessary, as the images of the original cells varied in illumination and were often unclear. Figure 2 (f) image shows a cell that was segmented by a region-filling algorithm (Gonzalez and Woods 2010). We demarcated the cell by a perimeter (a rectangle surrounding the cell) and calculated the minimum rectangle. The binary image demonstrated in Fig. 2 (f) (shown with more detail in Fig. 3 ) contains a seeded cell. It serves as an example of the metrics used for classification of each cell. These metrics include the following: (Fig. 1) . b Image (a) converted from the RGB color space to HSV 7 . c Image (b) after applying an Otsu 8 threshold to channel "H." d Image (c) converted from the YUV 9 color space to RGB. e Image (d) converted from the RGB color space to HSV (Gonzalez and Woods 2010) and with application of a binary threshold in the "S" channel, resulting in a binary image. f Demarcated cell The classification of cells as normal and injured was carried out with regard to rules and minimum/maximum parameters. These parameters were empirically attributed to the metrics defined and obtained from the segmented cells of the images. We excluded cells with the following attributes: cells found at the edge of the images, cells less than 43×43, cells with a total size less than 725, cells with perimeters less than 175, and cells with widths less than 37. From the cells that remained after the previously mentioned exclusions, we used cells with the following attributes: areas less than or equal to a maximum area of 20,500, perimeters less than or equal to a maximum perimeter of 1250, sizes less than or equal to a maximum rectangle of 490×490. An electronic database (Excel®) was created for the analysis of the results. Pearson's correlation was performed for correlation analyzes, as the samples were normal and homogeneous (Kolmogorov-Smirnov normality test). Values of p <0.05 were considered significant. To determine the minimum sample size for comparison between scores of the eight groups, we calculated the samples using the "pwr" packet (available in Program R) and performed variance analysis (CHAMPELY 2016; R DEVELOPMENT CORE TEAM 2019). After obtaining the minimum number of elements per sample, 15% more elements were added to each sample for adaption to possible non-parametric distribution (Beaver and Lehmann 2006). The following parameters were used: a) Test power = 80% b) Significance level = 5% c) Number of groups for comparison = 8 d) Effect size (Cohen's D) = 0/50 Based on these data, we concluded that at least 12 elements were needed per group. Thus, the value of the sample calculation was 10%. We analyzed fragments of lungs from 40 rats that were subjected to chronic 2,4-D exposure. No rats died in the experimental phase of this study. In addition, no complications occurred that would make this study unfeasible. In the evaluation of healthy alveoli, we found positive (r 2 = 0.9234) and significant (p <0.001) correlations between analysis from an observing pathologist and computational analysis (Fig. 4) . Our study examined damaged alveoli with areas of inflammatory infiltrates. With damaged alveoli, we recorded a positive (r 2 = 0.2130) and non-significant (p = 0.2584) correlation between analysis from an observing pathologist and computational analysis (Fig. 5 ). Our study demonstrates the effectiveness of digital image processing to identify healthy alveoli, as we recorded positive and significant correlations between pathologist and computational analyses. But different characteristics of the alveolar injuries caused by 2,4-D exposure possibly interfered in the ability to classify damaged alveoli. Several studies in the literature have examined lung lesion classification. Distinguishing injured lung tissue from uninjured (or healthy) tissue was the subject of a study by Sluimer et al.; they used digital image processing with 657 regions of interest from 116 data sets to distinguish injured from uninjured tissue (Sluimer et al. 2003) . A study by Mansoor et al. (2017) adapted the imaging algorithm and used the location of ribs to estimate the volume of lungs in 400 images with various diseases. They performed a second part of the process after significant differences were found between estimated volumes and segmentation results. In this second part, characteristics based on texture were used to identify abnormal patterns in the image, which were then included in the result (Mansoor et al. 2017) . Fig. 4 In healthy alveoli, correlation between analysis from the observing pathologist and computational analysis Fig. 5 In altered alveoli, correlation between analysis from the observing pathologist and computational analysis Wang et al. (2009) conducted a study with 31 volumes of normal cases and 45 volumes of abnormal cases. Before thresholding and applying texture extraction, they used an algorithm to label 3D-related components in the first 30 slices. They then identified the trachea location, which was used as a seed to trace the airways and prevent interference with lung segmentation. To extract information from digital images, the images (and parts/objects that compose it) must be divided. This process is called image segmentation. It can be formally defined as an identification process (through a numerical algorithm) of the homogeneous regions of a specific image and the edges or limits of these regions. In most cases, these homogeneous regions are objects, and their edges are contoured (WANG et al. 2009 ). Since the start of the COVID-19 pandemic, image processing of lungs has been frequently used to optimize diagnosis determination (as nucleic acid tests can result in falsenegatives for COVID-19) and monitor confirmed COVID-19 cases (LI et al. 2021) . A systematic review demonstrated that digital processing of radiological images of the lungs to diagnose COVID-19 reduced false-positives and false-negatives (Ghaderzadeh and Asad 2021) . Through the use of computed topography, machine learning has shown effectiveness in diagnosing other pulmonary complications and diseases, such as pulmonary emphysema (HUMPHRIES et al. 2020) , fibrosis (CHRISTE et al. 2019) , and pulmonary nodules (Nasrullah et al. 2019) . These findings corroborate with the results of our study and show the effectiveness of digital image processing to diagnose lung lesions and pulmonary alterations, as well as justify the increasing use of digital image processing in clinical practice and scientific research. The previously mentioned studies aimed to segment the lungs and lesions automatically by using a data set, without human interaction or a starting point of the anatomical location from other structures/previously defined models. But until now, no studies have used digital image processing with data and algorithms to examine the effect of herbicides on lung function. The types of alterations caused by chronic 2,4-D inhalation exposure have not yet been defined; thus, it is not possible to obtain data for comparison. However, various studies have reported the occurrence of pulmonary alterations after acute and subacute oral exposure to 2,4-D. In one study, 2,4-D was added to the drinking water of rats. But rather than the lungs, this specific study investigated effects on the antioxidant defense system. It reported that herbicide ingestion led to a significant increase in the concentration of malondialdehyde (MDA) lipid peroxidation content (CELIK and TULUCE 2007) . By analyzing leukocytes from bronchialveolar lavage fluid, Kaur et al. (2021) reported lung inflammation in mice after oral exposure to high and low doses of 2,4-D for 90 days. The cytotoxicity of 2,4-D was analyzed in an in vitro study with human lung cells; the study reported an increase in apoptosistype cell death that was dependent on the herbicide dose. The study also reported that the induction of pulmonary cytotoxicity by 2,4-D occurs due to the rupture of the tubulinmicrotube cell network (Ganguli et al. 2014) . The human and rat respiratory systems are highly similar (FUSCO et al. 2002) . Thus, this study can aid future research involving the analysis of histological images of human lung tissues to identify alveolar lesions through machine learning. However, to increase the amount of data for more accurate and effective image processing, studies must investigate the specific lung tissue lesions of herbicide-exposed populations. Our study examined various types of histopathological alterations that characterize alveolar lesions, which made it difficult to identify a single pattern for image classification. In contrast, with healthy alveolar images, we recorded a significant correlation between pathologist and computational analyses. However, to obtain satisfactory image processing and diagnostic accuracy, a greater quantity of models and data volume is needed. Computational analysis of the data revealed the existence of intact and healthy alveoli and damaged alveoli after herbicide 2,4-D exposure. This demonstrates the potential of digital data processing to improve medical diagnosis and research related to inflammation and mucus from chronic 2,4-D exposure. Few studies have used digital image processing methods in histopathological analysis. Further studies are necessary to understand immune system behavior after exposure to different doses of agricultural herbicides. Determination of toxicity of subacute treatment of some plant growth regulators on rats pwr: basic functions for power analysis. R package version Computer-aided diagnosis of pulmonary fibrosis using deep learning and CT images Committee for Update of the guide for the care and use of the laboratory animals: environment, housing and management. In: Guide for the care and use of laboratory animals Evaluation of genotoxicity after acute and chronic exposure to 2, 4-dichlorophenoxyacetic acid herbicide (2, 4-D) in rodents using machine learning algorithms Avaliação da cavidade nasal de camundongos submetidos à exposição ao herbicida ácido 2,4-diclorofenoxiacético Analysis of strategic variables for the development of agriculture in the Pontal do Paranapanema region Modelo experimental de enfisema pulmonar em ratos induzidos por papaína 0 1 4 ) 2 , 4 -Dichlorophenoxyacetic acid induced toxicity in lung cells by disruption of the tubulin-microtubule network Deep learning in the detection and diagnosis of COVID-19 using radiology modalities: a systematic review Ambiente e Lung Deep learning enables automatic classification of emphysema pattern at CT Exposures to 2,4-Dichlorophenoxyacetic acid with or without endotoxin upregulate small cell lung cancer pathway Computer-aided diagnosis of COVID-19 CT scans based on spatiotemporal information fusion Deep learning of the marginal form: applications to the segmentation of the pulmonary field in pediatrics Automated lung nodule detection and classification using deep learning combined with multiple strategies A threshold selection method from gray-level histogram Diagnostic necropsy and selected tissue and sample collection in rats and mice Artificial intelligence myth and science Clinical manifestations and pulmonary histopathological analysis related to different diseases in patients with fatal pulmonary thromboembolism -an autopsy study Computer-aided diagnosis in high resolution CT of the lungs Topological structural analysis of digitized binary images by border following. Computer Vision, Graphics and Image Processing Automated segmentation of lungs with severe interstitial lung disease in CT Publisher's note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations