Eugene Yu Hin Yau, Emily E. Jones, Toby Pak Nok Tsang, Shuang Xing, Richard T. Corlett, Patrick Roehrdanz, David J. Lohman, Adam Kai Chi Lee, Catherine Wai Ching Hai, Shawan Chowdhury, Jane K. Hill, Jade A. T. Badon, Cheong Weei Gan, Yves Basset, I-Ching Chen, Suzan Benedick, Anuj Jain, Tiffany L.T. Ki, Krushnamegh Kunte, Akihiro Nakamura, Lien Van Vu, Sarah A. Scriven, Alice C. Hughes, Timothy C. Bonebrake*
*E-mail: tbone@hku.hk
Insect biogeography is poorly documented globally, particularly in the tropics. Recent intensive research in tropical Asia, combined with increasingly available records from citizen science, provides an opportunity to map the distributions of tropical Asian butterflies. We compiled a dataset of 730,190 occurrences of 3,752 tropical Asian butterfly species by aggregating records from GBIF (651,285 records), published literature (27,217), published databases (37,695), and unpublished data (13,993). Here, we present this dataset and single-species distribution maps of 1,576 species. Using these maps, along with records of the 2,176 remaining species, we identified areas of limited sampling (e.g., the Philippines, Myanmar, and New Guinea) and predicted areas of high diversity (Peninsular Malaysia and Borneo). This dataset can be leveraged for a range of studies on Asian and tropical butterflies, including 1) species biogeography, 2) sampling prioritization to fill gaps, 3) biodiversity hotspot mapping, and 4) conservation evaluation and planning. We encourage the continued development of this dataset and the associated code as a tool for the conservation of tropical Asian insects. For more details on our dataset and SDMs, please refer to our data paper
.
- Cleaned butterfly occurrence records from GBIF, Borneo Butterfly Distribution Database (B2D2), Spatial occurrence data for the animals of Bangladesh derived from Facebook, and other credible published sources.
- Distribution maps of tropical Asian butterflies as predicted by species distribution models(SDMs) can be downloaded as separate raster files or one single PDF file from our
Figshare repository
. - R script used to construct SDMs:
- Please download files essential for running our R scripts from our
Figshare repository (SDMsupp_files.zip)
before running our codes. - Additional R scripts used to clean our dataset, prepare for distribution modeling, analyze SDM outputs, and buffer occurrence points of unmodelled species:
Code/Supplementary
Harmonize species names
Clean dataset family names
Identify possible biogeographic range of dispersal for each species
Calculate spatial sampling effort
Map diversity patterns by stacking single species distributions
SDM performance evaluation
SDM variable importance
Buffer occurrence points of unmodelled species
Compare SDM results with that of Daru (2024)
- JavaScript code used in Google Earth Engine to extract and filter Landsat data for use as SDM variable (NDVImean), view in
Google Earth Engine
or download code from here:Code/Variables