key: cord-0788679-0dn0zye5 authors: Fang, Hao; Xin, Shiwei; Pang, Huishan; Xu, Fan; Gui, Yuhui; Sun, Yan; Yang, Nai title: Evaluating the effectiveness and efficiency of risk communication for maps depicting the hazard of COVID‐19 date: 2021-08-05 journal: Trans GIS DOI: 10.1111/tgis.12814 sha: 64758c5f3307ebb1c2a0f9beb26cd2db2bb34bdd doc_id: 788679 cord_uid: 0dn0zye5 COVID‐19 maps convey hazard and risk information to the public, which play an important role in the risk communication for individual protection. The aim of this study is to improve the effectiveness and efficiency of communicating the specific risk of COVID‐19 maps. By testing 71 subjects from Wuhan, China, this study explored how color schemes (cool, warm, and mixed colors) and data presentation forms (choropleth maps, graduated symbol maps) influence visual cognition patterns, risk perception, comprehension, and subjective satisfaction. The results indicated that the warm scheme (yellow/red) has significant strengths in visual cognition and understanding, and the choropleth map (vs. the graduated symbol map) has significant strengths in risk expression. On subjective satisfaction, the combination of the mixed scheme (blue/yellow/red) and the choropleth map scored highest mean value. These results have implications for enhancing the focused functions of COVID‐19 maps that fit different terms: in the early and medium terms of disease transmission, choropleth maps with warm or cool colors should be considered as a priority design for their better risk perception. When the epidemic conditions are on the upturn, a better reading experience combination of choropleth maps with mixed colors can be considered. | 3 FANG et Al. In early research, color was defined as a primary visual variable by Bertin (1983) . It has become a consensus in cartography that color is the most significant element affecting map reading (Cai, Mao, & Zhou, 2000; Jan, 2010; Ling, Wang, & Ding, 2017; Robinson, 1960) . In thematic maps, color schemes encode and translate complex statistical data into visualizations (Ling et al., 2017) , affecting readers' visual strategies and cognitive patterns (Bostrom et al., 2008) , which is reflected in reading interests, memories, and emotions (Elliot & Maier, 2014; Nicholson-Cole, 2005 ; Thompson et al., 2015) . A color-enhanced image could transmit more threat than gray-scaled images for readers (Ash, Schumann III, & Bowser, 2014) . By using high-contrast color schemes which clearly distinguish from the background in hazard maps, readers are better able to perceive and understand hazard and risk information (Kaufmann & Ramirez-Andreotta, 2020) . Among all colors, red is the commonest for warning and the most salient color that stands out (Bostrom et al., 2008) . Certain research has empirically shown that reading information with cool or warm colors leads to different emotions and task performances, but the conclusions were inconsistent (Bartram, Patra, & Stone, 2017; Cyr, Head, & Larios, 2010; Madden, Hewett, & Roth, 2000; Thompson et al., 2015) . Mehta and Zhu (2009) studied the cognitive and behavioral influence owed to different colors, implying that red and blue could activate different motivations, and the red (vs. blue) color evokes a higher level of emotion. Elliot, Maier, Moller, Friedman, and Meinhardt (2007) got a reverse result; they found that red (vs. green or blank) color has a negative effect on information access because people consider red a sign of failure and danger, leading to cognitive avoidance. Though red is widely used as a warning color, Elliot and Maier (2014) later reviewed a considerable number of empirical research articles and proposed that the meanings and effects of colors are closely bonded to the context, which means that even the same color may indicate converse semantics. Kwallek, Lewis, and Robbins (1988) drew a similar conclusion that red would evoke anxiety. In addition, some researchers thought that mixed color schemes containing two main feature dimensions (hues of both warm and cool, and lightness) have advantages in visual searching (e.g., Shortridge, 1982) . This opinion is also supported by Mersey (1990) and Breslow, Ratwani, and Trafton (2009) , with the latter finding that a multicolored scheme performed best in identification tasks; the former study concluded that individuals can distinguish a greater number of different hues than different shades of a single hue in maps. So, we could assume that the COVID-19 maps with mixed color (vs. warm or cool color) have a richer visual feel and superior reading experience. According to the literature cited, which color scheme can balance risk warning and accuracy comprehension for COVID-19 maps needs further study. We propose the following hypothesis: Hypothesis 1 Different color schemes have significant effects on risk perception, comprehension, and satisfaction in COVID-19 maps. External forms of data presentation in hazard maps sort data into nominal or quantitative attributes and visually display them through thematic maps (Dent, Torguson, & Hodler, 2008; Slocum, McMaster, Kessler, & Howard, 1999) . A thematic map is a type of map designed to describe a scenario emphasizing specific geographical features in a certain area. It encodes and maps the numerical levels into specific visual parameters, such as color, size, shape, and geometry, which impact readers in the perceptual and cognitive domain (Dent et al., 2008) . In terms of psychology, Hegarty and Just's (1993) work pointed out that a schematic representation with features and relationships helps readers to grasp integrated formation and problem-solving, and they identified three phases of such cognitive possessing: (1) visual perception of primary reading; (2) interpretation of schematic characteristics; and (3) comprehension of indications with internal experience. In terms of cartography, Fairbairn, Andrienko, Andrienko, Buziek, and Dykes (2001) implied that appropriate methods of data classification are crucial to the depiction of visualization. Cheong et al. (2020) found a relationship between different cartographic representations of risk and readers' decision-making, and they indicated a pattern of less-preferred graphical symbolizations associated with slower responses and lower-risk route choices. Based on cognitive map theories, Hou, Rashid, and Lee (2017) compared different types of thematic maps and arrived at a result that the key to improve information transmission effectiveness and user experience is ensuring the validity of representation. Thematic symbology (e.g., choropleth, graduated/proportional symbol), for example, is used in existing maps to visualize quantitative data that summarize potential impacts of hazards (Kostelnick et al., 2013 ). An earlier study by Brewer and Pickle (2002) proposed that choropleth maps describing statistical data classification with color schemes and representation forms have been a common expression method in epidemiology. However, there are still shortcomings to the choropleth method when its application is to disease maps. Barrozo, Pérez-Machado, Small, and Cabral-Miranda (2016) proposed that a choropleth map may mislead readers on a uniform distribution, with its colors filling entire regional areas. Similarly, Calka, Nowak Da Costa, and Bielecka (2017) implied that choropleth maps in natural hazards risk assessment have a defect in the incompatibility of mapping units used for data analysis and visualization. Shortridge (1982) inferred that graduated symbol maps are more suitable for presenting differences between levels, using diagrams with separate visual dimensions such as size and shape. Meanwhile, Wang, Sun, Wang, Jiang, and Lv (2003) referred to choropleth maps expressing characteristics and distributions of phenomena more vividly. The opinions of Chen and Jiang (2001) , along with Liao (2003) , agreed with the above; that is, choropleth maps have advantages in terms of expressing distribution range and quantity difference, while graduated symbol maps are more appropriate and accurate to describe dynamic changes and magnitudes of a single unit. Thus, which representation form is the most suitable for COVID-19 maps needs further research, and a hypothesis is proposed as follows: Hypothesis 2 Different data presentation forms have significant effects on risk perception, comprehension, and satisfaction in COVID-19 maps. In thematic maps, data presentation forms and color schemes have a joint effect on the visual expression. Taylor (1991) , who established the core and main body of contemporary cartography, connected information transfer theory with visual cognition theory in a visualized theoretical system, which mainly consisted of data presentation and color scheme. Carswell, Frankenberger, and Bernhard (1991) also indicated that inappropriate color and data presentation would make key message lost in a redundant visual, resulting in overloaded and disturbed cognition. So, a hypothesis is proposed as follows: Hypothesis 3 Different color schemes and data presentation forms have significant interaction on risk perception, comprehension, and satisfaction in COVID-19 maps. Eye movements, reflecting the processing of visual cognition and mental activity (Çöltekin, Fabrikant, & Lacayo, 2010; Fabrikant, Rebich-Hespanha, Andrienko, Andrienko, & Montello, 2008) , have been acknowledged as an efficient tool, along with task experiments, to evaluate the effectiveness and efficiency of information delivery in the domain of geovisualization (Brodersen, Andersen, & Weber, 2002; Jacob & Karn, 2003; Koua, MacEachren, & Kraak, 2006; Li & Chen, 2012) . Some examples of the eye-tracking technique assisting in hazard/ risk map design and evaluation are as follows. Zheng, Li, Fang, and Qian (2016) examined how users translate and comprehend information in maps by collecting fixation and sequence data with an eye-tracking method to design hazard emergency maps. Meyer et al. (2012) interviewed users and tested them with a compound eyetracking task so as to provide suggestions to improve flood maps for oriented groups. Brychtova, Paszto, Marek, and Panek (2013) assessed the validity of a risk map website with eye-tracking for the purpose of optimizing the layouts of websites and maps for emergency events. Golebiowska, Opach, and Rød (2017) used hazard maps as experimental material and analyzed the eye-tracking parameters of subjects to evaluate the usability of geovisualization tools. In addition, eye-tracking is also applied in studies of hazard maps to evaluate comprehension and decision-making; for example, Korporaala and Fabrikant (2019) explored how risk uncertainties and time limitations influence decision-making in different representation maps with behavioral tasks and eye-tracking. In conclusion, eye-tracking parameters in hazard and risk map reading tasks help us compare differences in visual strategies and attentive patterns between users. This approach helps to formalize and quantify the framework of hazard maps, optimizing the layout of visual elements and improving the effectiveness of risk communication. Thus, this study used the eye-tracking technique to measure and compare visual cognition patterns in COVID-19 maps. There were 71 subjects randomly recruited via cellular phone text messaging in Wuhan, of which 23 (32.4%) were males and 48 (67.6%) were females. They ranged in age from 20 to 45 and their careers were wide-ranging, including teaching staff, administrative staff, graduate students, interpreter, security personnel, and so forth, with the majority above college degree. All the subjects lived long-term in Wuhan, Hubei Province, China and had no familiarity with areas other than Wuhan. The majority of subjects had experience of browsing COVID-19 maps via mobile phones. All subjects participating in this experiment declared that they had normal eyesight (or correctedto-normal eyesight) and normal color sense, and each was paid ¥20 as a reward. We selected 37 well-known online media sources (e.g., JHU CSSE, mapbox, dxy.cn, and Tencent News; https://www. arcgis.com/apps/opsda shboa rd/index.html#/bda75 94740 fd402 99423 467b4 8e9ecf6; https://www.mapbox.cn/ coron aviru smap/#3.91/27.16/113.28; https://ncov.dxy.cn/ncovh 5/view/pneum onia; https://news.qq.com/zt202 0/ page/feiyan.htm?from=timel ine&isapp insta lled=0#/globa l) containing COVID-19 maps by rank on research engines (Baidu and Google). By analyzing the color schemes and data presentation forms of these COVID-19 maps, three types of color schemes were determined: cool, warm, and mixed. Cool color schemes mainly used blue as a filler color, ranging in lightness from light to dark as the magnitude increases. Warm color schemes mainly used a brighter yellow, red, or orange for lower levels of magnitude and a darker red for higher levels of magnitude. A mixed color scheme filled the map with both warm and cool color, always ranging from low-brightness blue (lower magnitudes) to low-brightness red (higher magnitudes). Therefore, we used sequential yellow/red as the warm color scheme, sequential single-hued blue as the cool color scheme, and diverging blue/yellow/red as the mixed color scheme. All the color schemes used in experimental materials were selected from ColorBrewer, an online tool providing effective color schemes for cartographers. The convention of "light is less, dark is more" (Garlandini & Fabrikant, 2009; He, Song, & Li, 2016) was followed in the coding relationship between magnitude and luminance to ensure colorblind safeness. The variable levels of data representation forms were also extracted from COVID-19 maps of the 36 media sources selected, and the statistical tallies showed that choropleth maps accounted for 29 (78.38%) and graduated symbol maps accounted for 19 (51.35%); among these, 12 (32.43%) used both choropleth and graduated symbol maps. Other forms accounted for 1 (2.7%). Therefore, choropleth and graduated symbol maps were taken as two variable levels of data representation form. The choropleth map generally divides the area into regional or natural units, filling with colors or textures which map the statistical classifications. This representation method emphasizes depicting the phenomena and developments with large-area colors. In comparison, the graduated symbol map generally uses administrative regions as units, using charts, tables, and other forms of statistical graph in each unit, focusing on the total amount, composition, and variables within units (He et al., 2016) . Each subject in this study needed to accomplish a three-part measurement, including eye-tracking, behavioral tasks, and subjective scales. Seven metrics of dependent variables were measured in total: (1) first fixation duration; (2) fixation count; (3) average saccade amplitude; (4) accuracy rate; (5) response time; (6) risk perception; and (7) subjective satisfaction. First fixation duration, fixation count, and average saccade amplitude were taken for eye-tracking metrics. First fixation duration records the time of the eyes' first fixation before gazing to the next point, which generally has a timespan of 200-250 ms. It validly indicates the primary processing of the early stage of visual stimuli (Dong et al., 2019; Li & Chen, 2012) . Fixation count is the total number of fixations during the eye-tracking experiment. A higher fixation count reflects more complicated experimental materials to be processed or a higher reading interest of readers. Average saccade amplitude is the average distance of gazing from one focus to another, and implies the breadth of visual search and the difficulty of the task (Yan et al., 2013) . A larger fixation count presents a larger amount of information perceived by the reader at each gazing point, and vice versa, so it is an indicator of reading efficiency and complexity (Zhang & Ye, 2006 ). Accuracy rate and response time of tasks were behavioral metrics to measure the comprehension accuracy and efficiency in the COVID-19 maps. According to Amar, Eagan, and Stasko's (2005) "low-level mathematical and cognitive actions" on task classification, three types of task questions were designed to present the fundamental process of information retrieval from COVID-19 maps. Examples of task questions are given in Table 1 . The within-group questions are the same, while the between-group questions are different. The complexity between the three types of task questions increases sequentially, but the questions are still low-level in mathematical cognitions and operations. Moreover, in order to avoid the familiarity effect and experience interference, Wuhan was excluded from the task objects for its well-known condition. All these questions were verified via a pre-test. Risk perception and the satisfaction of subjects were measured by seven-point Likert scales, as in Table 2 . Arranging all the cities in Hubei Province according to their order of magnitude, Xiangyang (T4, Table 2 ) was one of the cities in the median level, so we chose it as the question-oriented city of risk perception. According to the levels of independent variables referred to above, we designed experimental materials with three types of color scheme (cool, warm, and mixed) and two forms of data presentation (choropleth maps and graduated symbol maps), displaying the distribution of COVID-19 infections in Hubei Province. The statistical data of infected people were obtained from the website of the Chinese Center for Disease Control and Prevention (http:// www.china cdc.cn/jkzt/crb/zl/szkb_11803/ jszl_11809/). In order to avoid the familiarity effect, datasets used in choropleth and graduated symbol maps were different within groups. The choropleth maps (a1, a2, a3) were based on the number of people diagnosed with COVID-19 in Hubei Province on February 5, 2020, and the data were classified into five levels (10-99, 100-499, 500-999, 1,000-4,999, 5,000-50,000). Graduated symbol maps (b1, b2, b3) were based on the number of people diagnosed with COVID-19 in Hubei Province on March 1, 2020, and the data were classified into five levels (10-99, 100-499, 500-999, 1,000-1,999, 2,000-11,000). All the material maps ( Figure 1 ) were made in ArcMap 10.2 software. In the eye-tracking part of the experiment, a Tobii Pro Fusion eye-tracker was connected to a computer with a sampling rate of 600 Hz, accuracy of 0.3°, and precision of 0.06° RMS (Figure 2 ). The stimuli were presented via Tobii Pro Lab software. In the behavioral task and scale parts of the experiment, stimuli were displayed via the E-prime software program. TA B L E 1 Task questions for measuring accuracy and efficiency of comprehension This experiment adopted a mixed design, using color schemes as between-group variables and data representation forms as within-group variables. A mixed design considers both interactive correlation and single variable effects for multiple-factor experiments, composing between-group and within-group designs. When an experiment has multiple variables, a mixed design has advantages in terms of validity and efficiency, and could avoid errors caused by repeated measurements in a within-group design (Cai & Yan, 2011; Yuan & Wang, 2002) . In total, 71 subjects were randomly allocated in three groups: 24 in the cool color group, 24 in the warm color group, and 23 in the mixed color group. Each subject would view two COVID-19 maps, which were the same in terms of color scheme and different in terms of data presentation form. The experiment was conducted from June 5 to 15, 2020. Human ethics has been approved by the Research Ethics Board of China University of Geoscience. The laboratory was located in a quiet and well sound-insulated room. We sanitized the experimental devices and ventilated the room in the interval between participants. Researchers wore masks and reminded subjects to wear a mask during the experiment. Light was mainly provided by the computer screen. The range of head movement allowed was within a threedimensional range of 50 cm W × 36 cm H × 70 cm D, and the allowed operating distance from the monitor was within 45-70 cm. The procedure of the eye-tracking experiment is presented in Figure 3 (warm color group) as example. First, an introduction is shown on the screen along with a verbal explanation to inform the subjects about the experiment's aims and requirements. After the subjects sit in front of the screen, the distance to the screen is adjusted. Then they are required to keep their heads as still as possible and gaze directly at the screen for F I G U R E 2 Experimental devices (including eye-tracker, recorder, and demo program) F I G U R E 3 Procedures of eye-tracking experiment (warm color group for example) calibration in the Tobii Pro Lab program. After the calibration is finished, the subjects are told to browse the following pages as per their normal habits. A fixation point presented in the center of the screen lasts for 500 ms, followed by one map lasting for 20,000 ms. Afterwards, the screen goes blank for a 1,000 ms rest. This is then repeated, presenting a fixation point for 500 ms and another map for 20,000 ms. In order to eliminate the order effect, the display order of the two maps is random. It takes about 1 min to finish this part of the experiment for each subject. After completing the eye-tracking experiment, the subjects rested for about 1 min while the researcher quit the Tobii Pro lab software and entered the E-prime program. All the stimuli in this part of the experiment are Notes: F (F-distribution) is a continuous probability distribution that arises frequently as the null distribution of a test statistic. The F-distribution has two parameters: degrees of freedom numerator (dfn) and degrees of freedom denominator (dfd). "The dfn is the number of degrees of freedom that the estimate of variance used in the numerator is based on. The dfd is the number of degrees of freedom that the estimate used in the denominator is based on" (Freund, Wilson, & Mohr, 2010) . t is an inferential statistic; p is the probability of obtaining test results as extreme as the results actually observed during the test; η² is the strength of association. *p <0.05; **p < 0.01; ***p < 0.001. presented in the E-prime program, and the procedure is shown in Figure 4 . First, the subjects read the introduction presented on the screen to know the steps of the procedure, then they enter the practice program before the formal experiment. COVID-19 maps of the China region are used for viewing. Through answering the province-oriented task questions with clicks on the maps, subjects can familiarize themselves with the operations until they are ready. In the formal experience, a corresponding introduction is also presented. In the behavioral measurement, the questions with options attached are shown below the map, three pages in total. Participants are first required to answer each question according to the map. The response time is unlimited, but the subjects are asked to choose options in as short a time as possible. After completing three task questions, the screen layout still shows the map, but the task questions are replaced with a seven-point risk scale. Subjects respond with their risk perception of cities whose quantitative magnitudes are in the median level of the map, choosing and clicking the extent to describe the risk extent they feel. The following page is a seven-point subjective satisfaction scale, independently presented (without maps) in the center of the screen. Subsequently, there is a blank page for 3,000 ms. Finally, the above process is repeated, with another representation form map with a set of questions. The order of the two maps is random. It takes about 5 min to finish this part of the experiment for each subject. The eye-tracking data was exported in TXT format from Tobii Pro Lab software and imported into Microsoft Office Excel 2010, sorting the data into a format compatible with SPSS software. The behavioral data were merged with E-prime and imported into SPSS. Two persons' data were skipped because they were invalid-one whose eye movements failed to be recorded because his/her head position moved; the other because they did not complete the behavioral tasks. Therefore, 69 valid subjects remained. All statistical data were analyzed with SPSS 18.0 software. The results in Table 3 demonstrate the differences in individuals' eye movement data when they use COVID-19 maps. The multi-factor analysis of variance (ANOVA) results of the first fixation duration are shown in Table 3 . The interaction effect between color schemes and data presentation forms was statistically significant, F(2, 66) = 4.957, p < 0.05, and the comparisons between six variable combinations are described in Figure 5 . The results of a simple effect test following significant interactions (Table 4) showed that the first fixation duration of choropleth maps was significantly longer than that of graduated symbol maps under the premise of mixed color (p < 0.05), and the first fixation duration of mixed color was significantly longer than that of warm color or cool color under the premise of choropleth maps (p < 0.001). The main effect of data presentation forms on first fixation duration was significant, F(1, 66) = 5.954, p < 0.05. Testing the results with post-hoc multiple comparisons (Table 5 ) on the variable of data presentation forms, the first fixation duration of choropleth maps was significantly longer than that of graduated symbol maps (p < 0.05). The main effect of color schemes was significant, F(2, 66) = 6.977, p < 0.01. The results of post-hoc testing (Table 5) show that on the variable of color schemes, the first fixation duration of warm color was significantly shorter than that of cool color (p < 0.05) or mixed color (p < 0.001). The multi-factor ANOVA results of average saccade amplitude are shown in Table 3 . The interaction between color schemes and data presentation forms was significant, F(2, 66) = 3.780, p < 0.05, and the comparisons between variable combinations are shown in Figure 6 . The results of simple effects on interaction showed (Table 4) TA B L E 7 Results of post-hoc test following significant main effects that the average saccade amplitude of choropleth maps was higher than that of graduated symbol maps under the premise of warm color (p < 0.001), and the average saccade amplitude of cool color was significantly lower than that of warm color under the premise of graduated symbol maps (p < 0.05). The main effect of data presentation forms was significant on average saccade amplitude, F(1, 66) = 3.780, p < 0.05. Testing the results with post-hoc multiple comparisons (Table 5 ) on the variable of data presentation forms, the average saccade amplitude of choropleth maps was higher than that of graduated symbol maps (p < 0.05). The multi-factor ANOVA results of fixation count are shown in Table 3 . There was no significant interaction between color schemes and data presentation forms. The main effect of data presentation forms was significant, F(1, 66) = 4.768, p < 0.05. Testing the results with post-hoc multiple comparisons (Table 5) , the fixation count of the choropleth maps was significantly more than that of graduated symbol maps (p < 0.05). The multi-factor ANOVA results of accuracy rate and response time are shown in Table 6 . The interaction between color schemes and data presentation forms on response time was significant, F(2, 66) = 3.939, p < 0.05, and the comparisons between the variable combinations are shown in Figure 7 . Testing the simple effect of interaction, the response time of choropleth maps was significantly longer than that of graduated symbol maps under the premise of mixed color (p < 0.05). The main effect of color schemes was significant (p < 0.01) on the response time. Post-hoc testing results (Table 5) show that the response time of mixed color is significantly longer than that of warm color (p < 0.01) and cool color (p < 0.05). On the variable of accuracy rate, the main effect of color schemes was significant (p < 0.01). Post-hoc testing (Table 5) shows that the accuracy rate of mixed color is significantly higher than that of warm color (p < 0.05) and cool color (p < 0.01) (Tables 7 and 8 ). The multi-factor ANOVA results of risk perception are shown in Table 9 . There was no significant interaction between color schemes and data presentation forms. The main effect of data presentation forms on risk perception was significant, F(1, 66) = 22.901, p < 0.001. Testing the main effect with post-hoc multiple comparisons (Table 10) , the risk perception of choropleth maps was significantly higher than that of graduated symbol maps (p < 0.001). The multi-factor ANOVA results of satisfaction are shown in Table 9 . There was no significant interaction. The main effect of data presentation forms on satisfaction was significant, F(1, 66) = 4.067, p < 0.05. Testing the main effect with post-hoc multiple comparisons (Table 10) , the satisfaction of choropleth maps was significantly higher than that of graduated symbol maps (p < 0.05). The experimental results show that both color schemes and data presentation forms influence risk communication of COVID-19 maps, but the aspects they impact are not exactly the same. Color schemes and data presentation forms have interaction effects on the comprehension efficiency and visual cognition patterns. In terms of the main effects of single variables, the color schemes have significant effects on initial visual processing, comprehension effectiveness, and efficiency, and the data presentation forms have significant effects on visual patterns, subjective satisfaction, and risk perception. The eye-tracking results show the impact of color schemes on the early stage of cognitive processing. According to the indications about eye-tracking parameters (Yan et al., 2013) , a shorter first fixation duration means the visual object is more attractive. As warm color had a significantly shorter first fixation duration than cool or mixed colors in all maps, and warm and cool colors had a significantly shorter first fixation duration than mixed color in choropleth maps, the warm color was the most attractive, while the mixed color was the worst. Attention theories declare that only a few stimuli can be caught and sequentially processed with limited resources (Treisman & Gelade, 1980) . There are two ways to guide attention: one is a down-top pattern driven by stimuli, which means that the salient objects more easily obtain attention (Johnston, Hawley, Plewe, Elliott, & DeWitt, 1990) ; the other is a top-down pattern driven by working memory, which means that attention automatically matches the external properties of objects with a similar internal experience of working memory (Olivers, Meijer, & Theeuwes, 2006; Soto, Heinke, Humphreys, & Blanco, 2005) . Severtson and Vatovec (2012) referred to the yellow/red scheme as better for attention-getting in risk expression, whether in the top-down or down-top patterns. Moreover, red is the commonest warning color (Griffith & Leonard, 1997; Kaufmann & Ramirez-Andreotta, 2020) , so red has high familiarity in the mental representation of working memory. Considering the results, the warm color scheme (vs. the cool and mixed color schemes) has an advantage in terms of visual salience and leads to a shorter time taken by users to grasp the information characteristics needed to complete initial processing. The interaction on average saccade amplitude shows that the warm color scheme in the choropleth maps leads to better reading efficiency than the cool color scheme, while mixed color schemes are more complex to process. However, with the warm color scheme, the graduated symbol maps have better readability than the choropleth maps. This distinctness may be impacted by the avoidance effects of a large area of red (Elliot & Maier, 2014) . Griffith and Leonard's (1997) study found that 98% of subjects bind the red color with noncontext semantics of "stop." Moreover, Soto et al. (2005) proposed that the attention-guiding effect of color is superior to shape in top-down processing, so avoidance effects did not appear in the graduated symbol maps. The interaction and main effect on behavioral metrics indicated that color schemes affect comprehension efficiency and accuracy. The mixed color maps had statistically worse performance on efficiency but best on accuracy versus those of the warm and cool color maps. The response time reveals the complexity of information processing. The disadvantage in terms of complexity of the mixed color scheme is in line with other results discussed above. A number of studies suggested that a single hue with brightness ratio changes is generally appropriate for quantitative data visualization (Kelleher & Wagener, 2011; Sherman-Morris, Antonelli, & Williams, 2015) . However, the strength of understanding of the mixed color scheme on experimentally observed accuracy might result in its distinctive visual salience of magnitudes, which means it is easier to identify and fix (Treisman & Gelade, 1980) . Lewandowsky et al. (1993) used cognitive theory to explain the reasons that contrasting colors in statistical maps having stronger variable perception than single-hue schemes. In the experimental material, the mixed color scheme had distinct characteristics in hue and luminance, intact for cognitive identification (Shortridge, 1982) . Besides, some research could provide possible mechanisms and high-consistency explanations to support our results. Gao, Guan, and Wang (2008) concluded that thematic maps with mixed colors have prominent features but can easily cause visual fatigue. Cheong et al. (2020) suggested that a visually complex representation may consume more time and attention to understand, but may promote more detailed considerations and better decision-making. The results show that data representation forms significantly affect the risk perception of COVID-19 maps. The choropleth maps had better risk expressiveness than the graduated symbol maps. According to the holistic model of Gestalt theory proposed by Koffka (1935) , such difference might be due to distinct diagrams. As the outlines of diagrams in choropleth maps use specific regional shapes, this makes it easier for users to bring their experience of real and integral administrative areas into the maps. Graduated symbol maps use separate circles of varying sizes as diagrams, and their risk expression via geometrical features is inferior to that of choropleth maps (Mehta & Zhu, 2009) . Moreover, the studies of Xie, Li, and Yu (2008) and López-Vazquez and Marvan (2003) mention that vividness of information is one determining factor evoking and improving risk perception. Choropleth maps with concrete visual features of "figure" have a higher level of risk perception. The first fixation duration of the graduated symbol maps was significantly shorter than that of the choropleth maps in both interaction and main effect, showing the better attractiveness of the graduated symbol maps. The average saccade amplitude of choropleth maps was significantly greater than that of graduated symbol maps in both interaction and main effect, indicating the wider searching scope and higher efficiency of choropleth maps. The fixation count of choropleth maps was significantly lower than that of graduated symbol maps, which also shows that choropleth maps have better readability and less cognitive load than graduated symbol maps. These three eye-tracking metrics consistently indicate that the information of choropleth maps (vs. graduated symbol maps) is easier to process. Shortridge (1982) summarized the process of map reading into three basic steps: identifying symbols by visual search, comparing symbol differences, and integrating information to understand. They also mentioned that the size, color, and shape of diagrams in a map should be regarded as separate dimensions that are sequentially processed. So, it takes more time to process objects with more visual dimensions. In this study, color is the main dimension to recognize units and magnitudes of choropleth maps, while graduated symbol maps using more dimensions (color and size, along with similar shape) need more time to be processed. In addition, Leclercq (2002) proposed the same sensory process consuming the same type of cognitive resources, leading to competition. Therefore, diagrams in graduated symbol maps need more resources of attention than those in choropleth maps. The response time of graduated symbol maps was significantly shorter than that of choropleth maps for the interaction with mixed color, that is, the comprehension efficiency of the graduated map was higher. Some cartographers pointed out that using a choropleth map for visualizing the absolute magnitude, like the mass of existing COVID-19 maps available in the media, would distort or mislead the interpretation of the map, because map readers automatically consider the size of the areas (Dent et al., 2008) . Compared to graduated symbol maps, this may result in lower comprehension efficiency of choropleth maps. Moreover, the fact of visually salient color performing better in relatively small-sized maps supports this result (Severtson & Vatovec, 2012) . The discussion above has referred to the mixed color scheme having the highest cognitive load among the three types of color schemes (Gao et al., 2008; Kelleher & Wagener, 2011; Lewandowsky et al., 1993; Sherman-Morris et al., 2015) . So, larger color-filled areas in choropleth maps would enlarge the visual interference between adjacent units. The choropleth maps have better satisfaction than the graduated symbol maps. Consistent with the cartographic studies referred to in Section 2.2, a possible explanation for this result is the characteristics of the data. Choropleth maps and graduated symbol maps each have their own advantages in expressing different types of dataset (Calka et al., 2017) . Chen and Jiang (2001) , Liao (2003) , and Wang et al. (2003) arrived at a consensus that choropleth maps perform better when datasets reflect a distribution range and quantity difference. Considering the datasets used in the experimental materials display the static distribution of the infection on a single day, the choropleth maps are more intuitive than the graduated symbol maps on expressiveness. Moreover, Lewandowsky et al. (1993) suggested that graduated symbol maps cannot establish "Gestalt" for their independent symbols in cognition. Meihoefer (1973) implied that it is difficult for readers to accurately identify magnitudes by similar-sized symbols in a small area, so graduated symbol maps have lower readability than choropleth maps when the size of the device display is relatively small. For the reasons above, choropleth maps are more suitable for presenting numbers of COVID-19 infections. This study explored whether color schemes and data representation forms influence risk communication of COVID-19 maps. Through a compound task containing measurement of eye-tracking, behavior, and scale, the analyzed results revealed the relationship between variables. The main conclusions are as discussed: the warm color scheme (yellow/red) has significant strengths in visual cognition and understanding, and the choropleth map has significant strengths in risk expression. But in terms of participants' satisfaction, the combination of mixed color scheme (blue/yellow/red) and choropleth map scored highest. What was unexpected in the experiment was the warm scheme not having a significantly higher risk perception than the cool or mixed schemes. Based on the conclusions above, certain design strategies can be proposed: to determine which combination of color scheme and data presentation form to use for visualizing public health hazard or risk, the main function of the maps should be considered. For example, in the early and medium terms of disease transmission, risk perception is the primary function, so the design should give priority to choropleth maps with warm or cool color schemes. In urgent situations, warm color schemes can perform best in attracting readers' attention. When the situation of the epidemic is on the upturn, a better combination of choropleth maps with mixed color can be considered. Although it has become common to use maps for risk communication in all kinds of hazards, systematic methodologies for validity evaluation are still insufficient. This study aimed to improve the risk communication of COVID-19 maps, trying to find suitable solutions in terms of cartographic design. The eye-tracking method was used to capture visual cognition patterns and provide cognitive explanations of users' strategies that supported the hypotheses. Through discussing the further mechanism behind the results, relationships between visual presentation and usability were observed. This study enriches the perspectives and theories in the general area of hazard and risk mapping, especially with themes of public health events. Moreover, this study could help build the context of hazard scenarios in virtual geographic environments (VGEs) as well. As a VGE is an integration of geographic information and human behavioral models, it should consider the usability of its user interface settings to determine users' strategies, especially user effectiveness and efficiency (Ugwitz et al., 2019) . It contributes by providing an empirical multidimensional approach to evaluate hazard visualization, but also provides practical strategies for the design and optimization of related infectious disease maps. However, this study also has some limitations. First, a conventional principle of thematic map design divides the data into four to seven levels (Field, 2018; Olson, 1972) , and the data in the experimental materials have been divided into five levels. Whether the number of data levels affects the performance of information transmission needs further study. Second, readers' perception and cognition are also related to the perceivers' age, cultural background, individual knowledge, and graphic literacy (Freedman & Shah, 2002; Sherman-Morris et al., 2015) . Graphic literacy refers to one's ability to understand graphically presented information and includes general knowledge about making inferences from different graphic formats (Freedman & Shah, 2002) . Higher graphic literacy is generally related to higher education level or graphic experience (Galesic & Garcia-Retamero, 2011 ); therefore, different readers reading the same map may have different cognitive patterns. As the majority of subjects recruited in this study have a Batchelor degree, the sample may not be sufficiently representative. Third, maps used in the experiment were displayed statically on a computer screen and could only be manipulated with a mouse. In a real scenario, COVID-19 maps are usually presented using a web-based mapping application with interactive manipulation and other kinds of visual features, such as 3D, textures and hatching, animation, zoom in and out. Therefore, the participants may have different performance and emotion in a real scenario compared to viewing maps in the experiment. In follow-up research, the scope of the subjects will be expanded to make the research more general. We are deeply grateful for the support of Tobii Innovation Work Plan with the eye-tracking device "Tobii Pro Fusion" and the software "Tobii Pro Lab." We appreciate advice and guidance on the perspective of risk from Drs. Cao Yinghui and Shi Qiushuang, helping us find our direction in this article. Thanks go to Dr. Yu Junlei for sharing his experience and suggestions, so that the experimental design could be optimized in a short time. Special thanks to Kuang Huolingxiao of Durham University for assistance with experiments. Thanks to our colleagues Wang Zhong and Zhao Ying for patiently answering questions and providing ideas. Finally, we would like to thank Professors Wu Lei, Hou Guanhua, and Huo Faren for reading previous versions of the article and patiently suggesting revisions from start to finish. The datasets that support the findings of this study are available at the links below: • http://121.199.52.77:300/Data-of-COVID -19-proje ct.zip • https://www.dropb ox.com/s/8e71r thoqy jqkiy/ Data%20of%20COV ID-19%20pro ject.zip?dl=0 • https://pan.baidu.com/s/1KxwF C34cY y1189 8qWmSl8w; keyword r4b7. Nai Yang https://orcid.org/0000-0001-5306-1163 Information visualisation as a resource for popularising the technical-biomedical aspects of the last Ebola virus epidemic: The case of the Spanish reference press Low-level components of analytic activity in information visualization Tornado warning trade-offs: Evaluating choices for visually communicating risk Changing spatial perception: Dasymetric mapping to improve analysis of health outcomes in a megacity Affective color in visualization Semiology of graphics Visualizing seismic risk and uncertainty Geographical tracking and mapping of coronavirus disease COVID-19/severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) epidemic and associated events around the world: How 21st century GIS technologies are supporting the global fight against outbreaks and epidemics Cognitive models of the influence of color scale on data visualization tasks Evaluation of methods for classifying epidemiological data on choropleth maps in series Applying eye-movement tracking for the study of map perception and map design Web-design evaluation of the crisis map of the Czech Republic using eye-tracking Choropleth symbol design Comment on the actual state of using psychological mixed factorials designs in China Fine scale population density data and its application in risk assessment. Geomatics, Natural Hazards and Risk Is a picture worth a thousand words? Evaluating the effectiveness of maps for delivering wildfire warning information Graphing in depth: Perspectives on the use of three-dimensional graphs to represent lower-dimensional data Principles of map design Evaluating the impact of visualization of wildfire hazard upon decision-making under uncertainty Evaluating the impact of visualization of risk upon emergency route-planning Exploring the efficiency of users' visual analytics strategies based on sequence analysis of eye movement recordings Colour appeal in website design within and across cultures: A multi-method evaluation Thematic map design New progress in the study of eye movement and visual cognition in cartography since Color psychology: Effects of perceiving color on psychological functioning in humans Color and psychological functioning: The effect of red on performance attainment Novel method to measure inference affordance in static small-multiple map displays representing dynamic processes Representation and its relationship with cartographic visualization Toward a model of knowledge-based graph comprehension Statistical methods Evaluating cartographic design in flood risk mapping Graph literacy: A cross-cultural comparison Research on color design for thematic maps Evaluating the effectiveness and efficiency of visual variables for geographic information visualization For your eyes only? Evaluating a coordinated and multiple views tool with a map, a parallel coordinated plot and a table using an eye-tracking approach Association of colors with warning signal words Constructing mental models of machines from text and diagrams Mapping: Inventories, susceptibility, hazard and risk Cognitive map or medium materiality? Reading on paper and screen Eye tracking in human-computer interaction and usability research: Ready to deliver the promises Attention capture by novel stimuli Communicating the environmental health risk assessment process: Formative evaluation and increasing comprehension through visual design Ten guidelines for effective data visualization in scientific publications The impact of information transmission on epidemic outbreaks Principles of Gestalt psychology Geovisualizing attribute uncertainty of interval and ratio variables: A framework and an implementation for vector data How are map-based decisions influenced by uncertainty visualization in risky and time-critical situations? A cartographic framework for visualizing risk Evaluating the usability of visualization methods in an exploratory geovisualization environment Effects of office interior color on workers' mood and productivity Theoretical aspects of the main components and functions of attention Visualization approaches for communicating real-time flood forecasting level and inundation information Perception of clusters in statistical maps Cartography eye movements study and the experimental parameters analysis Contemporary cartography Classification and influence of symbols in static maps Risk perception, stress and coping strategies in two catastrophe risk situations Modeling tools for dengue risk mapping-A systematic review Managing images in different cultures: A cross-national study of color meanings and preferences Blue or red? Exploring the effect of color on cognitive task performances The visual perception of the circle in thematic maps/experimental results. The Canadian Cartographer Colour and thematic map design: The role of colour scheme and map complexity in choropleth map communication Recommendations for the user-specific enhancement of flood maps Representing climate change futures: A critique on the use of images for visual communication. Computers, Environment and Urban Systems Feature-based memory-driven attentional capture: Visual working memory content affects visual attention The effects of class interval systems on choropleth map correlation. The Canadian Cartographer Risk of MERS importation and onward transmission: A systematic review and analysis of cases reported to WHO Elements of cartography The theory-based influence of map features on risk beliefs: Self-reports of what is seen and understood for maps depicting an environmental health hazard Measuring the effectiveness of the graphical communication of hurricane storm surge threat Stimulus processing models from psychology: Can we use them in cartography? The American Cartographer Thematic cartography and visualization Early, involuntary top-down guidance of attention from working memory Severe acute respiratory syndrome epidemic and change of people's health behavior in China Geographic information systems: The microcomputer and modern cartography The influence of probabilistic volcanic hazard map properties on hazard communication A feature-integration theory of attention Spatial analysis of navigation in virtual geographic environments Cartography. Zhengzhou, China: School of Surveying and Mapping How can a risk be increased? An analysis of risk communication channels Review of eye-movement measures in reading research Visualization of data regarding infections using eye tracking techniques The procedure of the design of psychological experiments The matic map of symbolic representation. Modern Surveying and Mapping A review of eye movement indicators in current reading studies A study of emergency map design based on eye movement experiment and template technology Evaluating the effectiveness and efficiency of risk communication for maps depicting the hazard of COVID-19