key: cord-0607649-zs7h1sci authors: Gordon, Gemma; Holcomb, Amelia; Kelly, Tom; Keshav, Srinivasan; Ludlum, Jon; Madhavapeddy, Anil title: How Computer Science Can Aid Forest Restoration date: 2021-08-12 journal: nan DOI: nan sha: 93868db37a2c39470f499c078a73200a20894502 doc_id: 607649 cord_uid: zs7h1sci The world faces two interlinked crises: climate change and loss of biodiversity. Forest restoration on degraded lands and surplus croplands can play a significant role both in sequestering carbon and re-establishing bio-diversity. There is a considerable body of research and practice that addresses forest restoration. However, there has been little work by computer scientists to bring powerful computational techniques to bear on this important area of work, perhaps due to a lack of awareness. In an attempt to bridge this gap, we present our vision of how techniques from computer science, broadly speaking, can aid current practice in forest restoration. We are emerging into a post-COVID world where attention is keenly focused on the next two global crises: climate change and biodiversity loss. Recognizing that some critical aspects of modern society can never be fully decarbonised, a shift to a net zero-carbon economy will necessarily require some level of carbon capture and sequestration (CCS). Many approaches to CCS have been proposed, including pumping CO 2 into oil wells [56] and trapping it as chalk precipitate [40] . These solutions, however, do not address the biodiversity crisis. In contrast,the conservation of existing forests and the restoration of forests to degraded lands and surplus croplands can play a significant role both in sequestering carbon and re-establishing biodiversity. In this paper, we focus only on forest restoration. To give an idea of the scale involved, the median carbon sequestration potential of an agroforestry system, which combines agriculture and forestry, is estimated to be about 95 metric tons (T) per hectare [1] and, if left undisturbed, can trap this CO 2 for decades or even centuries [52] . This is roughly the equivalent of the emissions from 10 UK households in one year [29] . Recent work suggests that the earth has the potential to sustain a trillion more trees planted in approximately 900 million hectares of currently denuded land [7] , with a carbon sequestration potential of 205 GT, which is comparable to anthropogenic emissions of 1500 GT to date [46, 8] . Thus, technological interventions to promote reforestation is an attractive approach to mitigate anthropogenic climate change. In addition to carbon sequestration, native forests provide many other benefits both to the environment and to humans. Environmental benefits include increased biodiversity, air purification, flood prevention, and slope stabilization. Human benefits include jobs in forestry and agroforestry, improving public health, providing food, and providing a source of sustainable building materials. For these reasons, reforestation has become a topic of great current interest. For example, the World Economic Forum announced in January 2020 an initiative to plant 1 trillion trees [49] . Unfortunately, reforestation is not simply a matter of planting saplings or seeds in the ground and walking away [12, 28, 49] . • Reforestation guided purely by economic criteria results in monoculture plantations that can neither sequester carbon for the long term nor support biodiversity [28, 34] . They also lack resilience to disease, pests, and climate change. • Successful tree-planting, but of fast-growing species such as pine or eucalypts, can result in not only draining of the water [51] . • There is ongoing debate about the accuracy of the models used to project the CCS benefits from forests [59] . Fortunately, over the past two decades, practitioners in the discipline of Forest and Landscape Restoration (FLR) have developed a deeper understanding of reforestation approaches that balance the needs of the environment and local actors [28, 13, 9, 38, 12] . At the risk of oversimplification, this work can be summarized as follows. First, for ecological restoration and carbon sequestration, as opposed to mere tree harvesting, it is important to combine appropriate policies (such as a guarantee of long-term land tenure) with institutional frameworks (such as participatory management of communal forests). Importantly, reforestation outcomes must be economically sustainable for all stakeholders. To ensure this, local actors must be engaged participants in the planning process. Second, there is a continuing need for the improved monitoring of trees, other vegetation, soil, and wood products at all spatial scales. This monitoring can not only help identify forests under threat of degradation and deforestation but also help evaluate forest health, raise the alarm for interventions and provide a basis for reliable administration of carbon credit payments. These two areas are precisely where computer science, broadly speaking, can help FLR practitioners, especially when there is a need for very large scale and resources are limited. In the remainder of this paper, we present an outline of the some potential areas for computer scientists to be involved in forest restoration (Section 2), along with a handful of near-term research opportunities (Section 3) to demonstrate the great potential for work in this area. Note that some interventions can reuse existing computational tools, while others will require fundamentally new techniques. A forest restoration project goes through three distinct phases [12] ( Figure 2 ). In the planning phase ( §2.1), critical decisions need to be made about where to plant trees (location) and what species to plant (species mix) so as to restore woodlands in a way that best mimics pre-existing ecosystems. In the deployment phase ( §2.2) trees are planted, potentially along with ground-based monitoring equipment such as sensors. Finally, in the sustenance phase ( §2.3), the health of both the trees themselves as well as the FLR organization need to maintained. We discuss these phases in more detail next, outlining how computer science can help in each phase, and subsequently cover research opportunities ( §3). In the planning phase, forestry experts and local actors collaborate to plan the tree locations, species mix, and time of planting. Choice of location: Trees would ideally be planted to rehabilitate degraded lands that are otherwise unsuitable for agriculture or human habitation, making best use of existing rootstock and undisturbed pockets of mature forest that can provide animal, plant, and fungal colonizers [28] . At a macroscopic level, AI-based analysis of satellite data and LiDAR can be used to create 3D elevation maps to identify potential areas of forest degradation [60, 3] . Once suitable land has been identified, the local climate and micro-climate must be considered in choosing exact planting locations. Significant factors include the land slope, levels of rainfall and groundwater flow, wind speed and flow, different types of soil and different native species of vegetation [24] . Given the complexity, participation by local actors can help avert poor decisions (Section 3.2). At a microscopic level, careful consideration of the existing vegetation and soil state is required, since dis-turbing soil actually releases existing stores of greenhouse gases, as well as increasing the risk of erosion, particularly if protective soil crusts are destroyed [23] . Moreover, forests that soak up rainwater for their own growth may result in shriking water supplies downstream. Hence, mathematical optimization can be used to choose locations that balance the cost of breaking ground and loss of water supply with long-term benefits from planting in a particular location. Note that successful FLR projects need to consider the human element of land function, social participation, and natural resources management [18, 6] . Optimization models must, therefore, include the socio-environmental factors attached to the physical locations, such as the type of land tenure, the current functions of the land and potential impacts associated with the land in question. Choice [43] . Species diversity complicates the planning process due to the need for more sophisticated analysis of mapping data [17] . There is some evidence that the correlation between biodiversity above ground and soil carbon sequestration rates is dependent on the landscape and climate context [45] . Hence, there is a need for accurate modelling of multi-species forests (discussed in §3.4). In this phase, both trees and monitoring equipment must be deployed safely across the restoration area. Planting: Trees can be planted as saplings, seeds, or pelletised seeds, i.e., seeds embedded in a pellet that contains soil, fertilizer, anti-grazing chemicals, and sometimes a dye for visibility [28] . Planting can be done by humans, or by helicopters and planes. In many cases, planting must be preceded by burning weed cover or scarifying the earth using tractors or other equipment. Given the labour intensive nature of planting, it is natural to consider the use of automation. Handling, storage and delivery practices can affect seedling establishment, and environmental factors and the planting microsite directly affect the physiological response of saplings immediately after planting [22] . If the species is one that can be grown from seed, drone fleets can be used to seed large areas rapidly [16] . It is conceivable that a form of robotics may be used to plant either seeds or saplings, as well as for weeding in the first few years 1 ; this is a potential topic for future research. Sensor co-deployment: In addition to planting the trees, sensor technology should be simultaneously deployed. Remote sensing technologies such as RFID and LoRA can be used to gather microscopic level data such as soil salinity, which can aid the identification of vulnerable areas or vegetation that requires immediate intervention but also play a vital role in ongoing data collection, feedback and extrapolation. We discuss the types of sensors that might be deployed in this forested environment later ( §3.3). While many socio-economic and socio-environmental factors determine the long-term success or failure of an FLR project [31, 32], it is clear that planted forests need to be monitored for several decades to ensure long-term carbon sequestration. Organizations responsible for forest management also need to be financially sustainable. Forest sustainability: Once seeds or saplings are established and thriving, we need to ensure their continued health for maximum carbon storage, as well as broader markers such as biodiversity. Environmental success indicators include: a suitably high survival rate; the structure of the planted vegetation (canopy, ground, litter and shrub cover); species diversity; and ecosystem functions (overall soil health, quantities of groundwater and of course, carbon sequestration) [31] . All of these indicators need to be appropriately monitored ( §3.3). Organization viability: Organizations responsible for forest management need to be financially sustainable and free of fraud. Creating a system of trusted proofs for carbon certification may be a viable source of funding for the long term. Local sensor data and global satellite sweeps combined with digitally verified contracts and transactions could go some way to eliminate the fraud present in existing carbon certification programs [19] . We discuss some options later ( §3.1). Computer Science has an important role to play in scaling FLR programs to the global scale required to plan, deploy and sustain a trillion trees over the coming decades. We now discuss several research opportunities based on current understanding of FLR ( §2). Forest-based carbon sequestration has recently started to attract substantial funding. For example, the UK Forestry Commission announced the Woodland Carbon Guarantee [58] in 2019, a £50m scheme to provide landowners with the option to sell sequestered CO2 from newly planted trees back to the government for a guaranteed price for the next 30 years. As schemes such as this grow worldwide, the opportunity for fraud has also followed [19] . Hence, there is an opportunity to use distributed ledger technology to inject trust into the system [54] . Automation via trustworthy distributed ledgers [4, 2] would ensure that such schemes are economically sustainable over the decades needed to realise the carbon sequestration potential of newly planted forests. A distributed ledger for carbon sequestration credits would need to register ownership of a a reforestation project and then reliably track its carbon sequestration over time. The project owner would be permitted to generate annual sequestration certificates for the prior year of forest regrowth; if the project were to fail, then revenues from the sale of these certificates would dry up, incentivizing the appropriate behaviour [52] . Satellite observations could be used to reliably observe project outcomes. Moreover, cloud-based data archiving, curation, and analysis of satellite observations would provide ongoing detailed information to certificate purchasers about the progress of the scheme. The design of such a global, scalable, and trustworthy system is an open area for research. It is critical to the long-term success of an FLR project that local actors and policy makers are happy with the eventual outcomes. FLR practitioners must also elicit knowledge of local conditions and constraints from local stakeholders, including indigenous communities, during the planning phase [20]. These two objectives can be met by presenting visualizations of potential outcomes and using collaboration technology to allow planning participants to contribute their knowledge. While technologies for collaboration are well known [48] , there are research opportunities in developing innovative visualization technology specifically for FLR. For example, state-of-the-art visualization using Augmented Reality might be beneficial in this context [57] . We believe that AR can help in creating common ground and make the outcomes more real to participants, policy makers, and stakeholders. Once local stakeholders are involved in the process, it is natural to crowdsource data from them to scale our data collection. Mobile apps that encourage citizen science (such as Seek by iNaturalist and WWF [50]) could provide ongoing crowdsourced data feeds to augment the planning and sustenance processes, as well as educate participants about the progress and benefits they are realizing as the newly planted forests grow. Visualizations can also be used to present large amounts of data that has been gathered from satelliteand ground-based observations. One useful direction may be for computer scientists to construct an online knowledge elicitation, communication, and collaboration platform to involve all the stakeholders and work across disciplines [30]. Monitoring forests using sensor systems is critical from many perspectives: deciding on interventions, measuring forest health, and validating carbon credits. Macroscopic monitoring from satellites in well known, and more recently mesoscopic monitoring, such as LiDAR scans from planes or drones, provide an excellent source of data for recognising individual trees [33], which is vital for trustable sequestration credits and estimating current and potential carbon sequestration. Further data can be gathered at the microscopic scale from on-the-ground sensors helping us to determine the health of individual trees (e.g. identifying disease, insect or animal damage, or failure to thrive) and enabling rapid intervention if required. As trees grow, the micro-climate changes, strongly affecting the plant communities [5], therefore ongoing measurements of temperature, humidity and other environmental parameters are also needed [61] . However, many sensors in use today in agricultural contexts are not suitable for long-term mass-scale forest deployment. Forests environments are less controlled and consistent than agricultural ones, and access to deployed sensors is comparatively limited. Thus, the sensors need to be self-managing, robust and long lived. Sensors may make different trade-offs between energy usage and lifespan. For example, active medium-lifespan sensors (e.g. spectroscopy for measuring soil quality) used during sapling establishment could be deployed alongside passive long-lived sensors designed to last for the expected lifetime of the trees (e.g. temperature and humidity via chipless RFID [11]). Additional sensors may also be considered outside of those normally associated with agriculture, for example to account for dead organic matter, estimate carbon content of soil and litter, and to monitor for poaching, theft, vandalism and possible fraud. Thus, there is ample scope for research into appropriate sensor system design. Another direction of sensor system design is for moni-toring, reporting, and verification (MRV) of forest-based carbon projects. The existing protocol to measure forest carbon sequestration, for example in the UK Woodland Carbon Code (WCC) [55] , is almost entirely manual, with a suggested material list that includes a smartphone, GPS or compass, printed paper spreadsheets, and measuring tape. The process is time-intensive and may be quite costly, with surveyors recommended to begin their work 6-12 months before the scheduled verification deadline. Moreover, the UK is not alone: both a 2016 review of research into REDD+ programs and a 2018 report on forest-based carbon projects highlighted MRV as a key challenge [41, 21] . In the short term, camera-mounted, GPS-enabled drones with a connection to the cloud, working alongside human surveyors, might offer a dramatic improvement for data collection, management, and analysis. Longer term, these labelled data sets could further the development of machine learning and vision algorithms to measure tree growth or infer species and health. Eventually, this data might be integrated with existing satellite or LiDAR imagery, and could be collected completely autonomously. An agent-based model (ABM) can simulate the growth of a multi-species forest from seed to maturity, taking into account the local soil condition, environmental variability, and inter-species interaction [39, 44] . It can play a key role throughout the lifecycle of tree restoration projects, from planning to sustenance. In the planning phase ( §2.1), an ABM allows competing forest restoration approaches to be evaluated ahead of planting. It may help determine what species mix to plant in a given area or which land area is most likely to be successful in reforestation, incorporating concerns spanning multiple disciplines, such as the level of stakeholder engagement and the soil characteristics. Robustness stress testing could also be conducted for fire risks and potential climate changes. Such a simulation tool can also be used to help sustain a forest project ( §2.3), for example by allowing foresters to simulate interventions for disease management or understand how a forest will respond to extreme weather events. It can also play a key role in the financial element of project sustenance. Many carbon credit programs require demonstrating increased sequestration against a hypothetical baseline [42] , which can be more easily and reliably established through trusted, peer-reviewed models. There are existing agent-based modelling packages that allow large-scale forest simulation, for example TROLL [39], but new CS techniques will be needed to scale up the simulations and utilize cloud-scale clusters. An ideal ABM would also allow large scale evaluation of emergent system behaviour while automatically integrating macroscopic (satellite), mesoscopic (LiDAR) and microscopic (soil condition) data to calibrate and verify the simulations. For example, satellite images could be used to identify species types in existing forest and re-forested areas. These areas could have their growth from saplings simulated, using our knowledge of growth processes [27, 15, 53] , while controlling error propagation by feeding in high-res satellite imagery [47] . By adjusting the model's parameters, the simulation should show the same crown cover (for some metric of similarity) as reality. This combination of growth processes with multiple data scales would allow new reforestation techniques to be accurately developed. The role of reforestation in carbon sequestration is well recognized. A naive approach to reforestation, however, can lead to numerous undesirable consequences, which can be averted by relying on proven methodologies from the discipline of Forest and Landscape Restoration. We have identified several broad areas where computer science tools and techniques can help FLR practitioners. We also identify CS research problems in the specific areas of forest simulation, trustable carbon credits, visualization, and monitoring. We hope that the new tools and techniques created from this research will give FLR practitioners new degrees of freedom in restoring degraded lands and combating climate change. We conclude by quoting the UK's Committee on Climate Change, which in its recent assessment criterion for new potential landuse policies asks "Is the [reforestation] policy feasible to implement, monitor and verify over time?" [29] . By extending the scope of what is feasible, computer scientists can aid in developing effective reforestation policies not just in the UK but around the world. [3] Hans-Erik Andersen, Robert J. McGaughey, and Stephen E. Reutebuch. "Estimating forest canopy fuel parameters using LIDAR data". en. In : : http : / / www . sciencedirect . com / science / article / pii / S0959652616301147 (visited on 02/05/2020). : 10.1126/science. aad8132. : https://science.sciencemag. org / content / 352 / 6291 / 1312 (visited on 02/04/2020). Carbon sequestration in tropical agroforestry systems Introduction to the Tezos Blockchain REDD+ research: Reviewing the literature, limitations and ways forward". en Methodology for Improved Forest Management Through Extension of Rotation Age (IFM ERA).pdf. en. Tech. rep. Version 1.2, Sectoral Scope 14. Ecotrust Likelihood of changes in forest species suitability, distribution, and diversity under future climate: The case of Southern Large scale agentbased modelling: A review and guidelines for model scaling Growing biodiverse carbon-rich forests". en CO2 and Greenhouse Gas Emissions Upscaling Forest Biomass from Field to Satellite Measurements: Sources of Errors and Ways to Reduce Them". en Reviewing the E-Collaboration marketplace: A survey of electronic collaboration systems Getting the message right on nature-based solutions to climate change An (even more) inconvenient truth: why carbon credits for forest preservation may be worse than nothing Rate of tree carbon accumulation increases continuously with tree size Algorithms for simulating thinning and harvesting in five European individual-tree growth simulators: a review". en How blockchain will change organizations Footprint and detectability of a well leaking CO2 in the Central North Sea: Implications from a field experiment and numerical modelling". en MetaTree: augmented reality narrative explorations of urban forests Woodland Carbon Guarantee Improving uncertainty in forest carbon accounting for REDD+ mitigation efforts Tree species classification and estimation of stem volume and DBH based on single tree extraction by exploiting airborne full-waveform LiDAR data". en Advances in Microclimate Ecology Arising from Remote Sensing". en