In the domain of geospatial data management, accurately defining the extent of a resource is crucial for effective search and retrieval. Traditionally, Minimum Bounding Boxes (MBB) have been the go-to method for this purpose. However, in this paper we conducted an empirical study to reveal that MBBs often fall short in terms of precision (returning false positive results such as Figure 1), leading to less effective spatial searches.
To adress these limitations, we compared three methods for defining the extent of geospatial resources: Minimum Bounding Boxes (MBB), Convex Hulls, and Discrete Global Grid Systems (DGGS). The study utilized a rich dataset from major European geospatial data catalogs to evaluate the performance of these methods The results showed that the MBB method, while easy to implement, suffer a significant gap in precision, with accuracy ranging between 73% and 97%. This variability can lead to a high number of false positives, making spatial searches less reliable. However, our proposed method, the DGGS (Figure 3) demonstrated a remarkable precision rate between 96% and 99%.
The transition to DGGS represents a significant leap forward for geospatial data management. We also propose a pragmatic roadmap for integrating DGGS into existing infrastructures, highlighting the potential for reduced false positives and improved search accuracy. This study showcases the potential of DGGS to transform geospatial data retrieval. By adopting these advanced techniques, we can enhance the accuracy and reliability of spatial searches, paving the way for better data discovery.
An empirical study of the limitations of minimum bounding boxes for defining the extent of geospatial resources: the use of DGGS and other alternatives for improving the performance of spatial searches. S. MARTIN-SEGURA, F. J. LOPEZ-PELLICER, R. BÉJAR, J. NOGUERAS-ISO F. J. ZARAZAGA-SORIA. International Journal of Geographical Information Science, pp. 1–20.