GeoPl@ntNet: A Platform for
Exploring Essential Biodiversity Variables

Lukas Picek César Leblanc Alexis Joly Pierre Bonnet Rémi Palard Maximilien Servajean ^aINRIA, ^bCIRAD, ^cLIRMM

Abstract

This paper describes GeoPl@ntNet, an interactive web application designed to make Essential Biodiversity Variables accessible and understandable to everyone through dynamic maps and fact sheets. Its core purpose is to allow users to explore high-resolution AI-generated maps of species distributions, habitat types, and biodiversity indicators across Europe. These maps, developed through a cascading pipeline involving convolutional neural networks and large language models, provide an intuitive yet information-rich interface to better understand biodiversity, with resolutions as precise as 50×50 meters. The website also enables exploration of specific regions, allowing users to select areas of interest on the map (e.g., urban green spaces, protected areas, or riverbanks) to view local species and their coverage. Additionally, GeoPl@ntNet generates comprehensive reports for selected regions, including insights into the number of protected species, invasive species, and endemic species.

\paperid

1 Introduction

Global changes rapidly transform ecosystems, and their local impacts are context-dependent and hard to predict [joly2024lifeclef, joly2025lifeclef]. Monitoring species composition, biodiversity indicators and habitat types at high spatial resolution is crucial for understanding ecosystem responses and aiding decision-making, but it has proven to be very challenging [guisan2013predicting]. Deep learning-based distribution models offer a promising venue by allowing to use high-resolution geographic predictors and remote sensing data to address sampling gaps [deneu2021convolutional].

This work aims to leverage deep learning models for the high-resolution mapping of plant species [leblanc2022species], habitat types [leblanc2024deep], and biodiversity indicators [estopinan2024mapping] across Europe. The three types of maps, developed through a multimodal cascading pipeline, provide critical insights into biodiversity patterns and ecosystem dynamics.

By leveraging an extensive dataset [geoplant2024picek] and combining both surveys made by vegetation scientists [braun1932plant] and observations made by citizens [contini2025seatizen], the species distribution model [joly2023overview, joly2024overview] produced species maps for over 10,000 plant species. Derived biodiversity indicators offer critical information for conservation efforts (i.e., species richness, presence of endangered or invasive species, and other conservation-relevant metrics). Furthermore, the development of high-resolution maps for habitats, generated by coupling species distribution models with habitat classification models, provides a strong foundation for understanding landscape dynamics.

In summary, this work represents a major step forward in producing high-resolution and large-scale biodiversity maps to support conservation and land-use planning [bellard2012impacts, elith2009species]. Making them openly accessible and highly interactive through the GeoPl@ntNet web application empowers decision-makers and practitioners (see Figure 1).

Refer to caption — Figure 1: Home page of the GeoPl@ntNet web application.

2 Methodology

The whole cascading pipeline [leblanc2025mapping] is illustrated in Figure 2.

Dataset. We use GeoPlant [geoplant2024picek, botella2023geolifeclef], covering most of European flora by aggregating 5M presence‐only [bonnet2023synergizing] and 90K presence‐absence [chytry2016european] records. For each 50 $\times$ 50m cell (5.5B in total), we use Sentinel tiles, Landsat time series, and a 20-year climate record. To reflect habitat suitability, no coordinates are used [cole2023spatial]. We merge data into binary occupancy labels and use target‐group background [barber2022target, phillips2009sample] for bias.

Species. We use a multimodal ensemble, which outperforms classical SDMs [botella2023overview, picek2024overview]. We use three ResNet-like encoders [he2016deep] to process modalities independently. Embeddings are concatenated and fed to a sigmoid‐activated classifier. Training uses SGD with BCE loss. To produce maps, Europe was divided into 25×25km meta-tiles and we used the year 2021 (environmental data was averaged between March 21 and December 1, 2021), snapping water-centered cells to the nearest land. Probabilities were thresholded to yield binary maps.

Indicators. Biodiversity indicators [butchart2010global] are extracted from the predicted species assemblages. They summarize ecological properties such as species richness [gotelli2001quantifying]. To derive these assemblages, the probabilities are thresholded using a conformal prediction approach [fontana2023conformal], ensuring a low probability of omitting truly present species, even if it results in some false positives. This strategy prioritizes minimizing omission errors. Seven indicators (i.e., species richness, EU directive species, threatened species, most threatened, tree species, invasive species, and specialist species) have been successfully produced.

Habitats. We use Pl@ntBERT [leblanc2024pl, leblanc2025learning], a domain-specific language model [marcos2025fully], to assign habitat types to predicted assemblages. Pl@ntBERT performs multi-class classification into EUNIS habitat types [chytry2020eunis], leveraging its ability to capture co-occurrence patterns. The model operates purely on species presence data (i.e., no environmental variables), making it robust to sampling bias and suitable for generalization across regions. The model predicts EUNIS Level 3 habitats directly, while Levels 1 and 2 are derived from the classification hierarchy. In total, over 200 Level 3 habitat maps have been generated (keeping only the most likely label for each pixel).

Website. The front-end is developed as a single-page application using VueJS. It uses the Leaflet library to display the maps, all available as WMS streams, the standard communication protocol of the Open GeoSpatial Consortium. Thus, they can easily be integrated into any other cartographic tool. As the amount of data to be stored is huge ( $\approx$ 15TB), an NFS mounting point is used on a Storage Virtual Machine, which we also use to store cached raster tiles provided by the MapProxy cache server and COGs used by TiTiler. The website is not just for visualization but also to get insights about any Area of Interest and export them in JSON-like format (see Figure 3).

3 Use cases

GeoPl@ntNet is designed to support a wide range of biodiversity-related use cases across Europe by offering an intuitive and interactive interface for exploring spatial data. It features four main modules (see Figure 4), Home, Species, Habitats, and Indicators, each tailored to help users gain insights from high-resolution biodiversity maps. Whether the user is a conservation practitioner, urban planner, researcher, policymaker, or educator, the application facilitates exploration, analysis, and reporting with no technical overhead. A demonstration video is available on YouTube.

Home module. This module offers a comprehensive overview of the selected area, aggregating insights from the three other modules. It includes a summary of the most common and conservation-relevant species, dominant habitats, and biodiversity scores. It is the starting point for users who want a quick, readable biodiversity report.

Species module. This module allows users to explore the predicted distribution of plant species within any selected area. After choosing a region of interest, users are presented with a ranked list of species likely to be found there (see Table 1). Each species entry is enriched with coverage statistics and taxonomic information, helping users assess which species are most representative or dominant in the area. The tool supports use cases such as drafting species inventories, identifying candidate species for restoration projects, and detecting potential newcomers in changing environments.

Indicators module. This module provides a snapshot of ecological health through a set of biodiversity indicators computed from the local species assemblage. Users can instantly see how an area scores in terms of different metrics. This functionality supports monitoring programs and conservation prioritization. For instance, users can quickly compare areas based on their conservation value, while NGOs can identify high-priority zones for habitat protection.

Habitat module. This module allows users to visualize the most probable habitat types in the selected region. It allows filtering and sorting by habitat level and coverage, enabling a focused inspection of habitat diversity and dominance. Users can explore the landscape composition, change hierarchical levels (see Table 2), and download information relevant for land-use planning, environmental education, or ecological research. This is especially useful for assessing the ecological impact of land transformation.

Across all modules, users benefit from a seamless map-based interface where they can pan, zoom, and draw custom regions of interest. Reports are generated in real time, allowing for rapid exploration and comparison of multiple areas (see Figure 5). GeoPl@ntNet is not just a visualization tool, it is a decision-support system that turns large-scale biodiversity predictions into accessible insights.

4 Conclusion

This work presents GeoPl@ntNet, an interactive web application based on a multimodal deep learning framework. Its main principle is to enable users to explore high-resolution (50m), AI-generated, maps of plant species, habitat type, and biodiversity indicator distribution. The whole of Europe can be explored this way, at any scale and for the majority of vascular plants, seven different biodiversity indicators and all terrestrial habitats. It can also interactively explore the biodiversity in a user-defined region of interest, bridging the gap between large-scale ecological modeling and real-world decision-making.

Table 1: Evaluation of the deep-SDM (with different metrics).

Branch	AUC	F₁	Recall@50	Recall@250
All (Sen+Bio+Lan)	0.931	0.338	0.639	0.908

Table 2: Evaluation of Pl@ntBERT (at different levels).

Top-SDM predictions	Level 1	Level 2	Level 3
Keeping first 100 species	76.30%	62.68%	44.72%

{ack}

The research described in this paper was funded by the European Commission through the GUARDEN (safeGUARDing biodivErsity aNd critical ecosystem services across sectors and scales) and MAMBO (Modern Approaches to the Monitoring of BiOdiversity) projects. These projects received funding from the European Union’s Horizon Europe research and innovation programme under grant agreements 101060693 and 101060639, respectively. Further models developed based on this methodology will meet the needs of the European biodiversity strategy for 2030 through those projects. They will be used in particular to enhance the biodiversity maps at the European scale. The content of this paper reflects the views only of the authors, and the European Commission cannot be held responsible for any use which may be made of the information contained therein.