Not All Datasets Are Created Equal
Image Credit: Chinmaysk, CC BY-SA 3.0, Image Cropped
Species data for understanding biodiversity dynamics: The what, where and when of species occurrence data collection (2021) Petersen et al., Ecological Solutions and Evidence, https://doi.org/10.1002/2688-8319.12048
With the rise of the internet, GPS’ and smartphones, the amount of openly available species occurrence data has reached previously unfathomable numbers. This increase is mostly due to the engagement of the citizen scientist – regular people getting out there in nature and taking part in data collection and research. From people taking photos of flowers in their backyard to organised salamander spotting safaris, citizen scientists have opened up data that previously would have cost massive amounts to produce.
The Global Biodiversity Information Facility (GBIF) is the largest hub of such data, collating data ranging from amateur observation to museum specimens to professional surveys. It is well-known, however, that this kind of openly available data comes with a myriad of caveats: some species groups are reported much more than others (I am looking at you, bird-watchers), and “roadside bias” (see Did You Know?) haunts the records. But how are the records distributed among different land-cover types on a country-scale, does it differ between groups of conservation concern, and does it depend on who the reporters are?Read more