Abstract
The 17 Sustainable Development Goals (SDGs) are a “shared blueprint for peace and prosperity for people and the planet, now and into the future”. Since 2015, they help pointing out pathways to solve interlinked challenges being faced globally. The monitoring of SDGs is essential to assess progress and obstacles to realise such shared goals. Streams of SDG-related documents produced by governments, academia, private and public entities are assessed by United Nations teams to measure such progress according to each SDG, requiring labelling to proceed to more in-depth analyses. Such laborious task is usually done by the experts, and rely on personal knowledge of the links between the documents contents and the SDGs. While UNEP has experts in many fields, links to the SDGs that are outside their expertise may be overlooked. In this context, we propose to solve this problem with a multi-label classification of texts using Bidirectional Encoder Representations from Transformers (BERT). Based on this method, we designed the SDG-Meter, an online tool able to indicate to the user in a fully automatic way the SDGs linked to their input text but also to quantify the degree of membership of these SDGs.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
Notes
- 1.
- 2.
- 3.
- 4.
- 5.
- 6.
Overfitting occurs when the algorithm over-learns (overfit.)in other words, when it learns from data but also from patterns (diagrams, structures) which are not related to the problem, such as noise, thus degrading the performance of the algorithm.
- 7.
- 8.
- 9.
The original version of BERT is no longer available for the moment because its improved version “SMITH” is under development.
- 10.
- 11.
- 12.
References
Blei, D.M., Ng, A.Y., Jordan, M.I.: Latent Dirichlet allocation. J. Mach. Learn. Res. 3, 993–1022 (2003)
Cer, D., et al.: Universal sentence encoder for english, pp. 169–174 (2018). https://aclanthology.org/D18-2029, https://doi.org/10.18653/v1/D18-2029
Chen, T., Guestrin, C.: XGBoost. In: Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (2016). https://doi.org/10.1145/2939672.2939785
Devlin, J., Chang, M.W., Lee, K., Toutanova, K.: BERT: pre-training of deep bidirectional transformers for language understanding. In: NAACL (2019)
Ding, M., Zhou, C., Yang, H., Tang, J.: CogLTX: Applying BERT to long texts. In: NeurIPS (2020)
Guisiano, J., Chiky, R.: Automatic classification of multilabel texts related to sustainable development goals (SDGs). In: TECHENV EGC2021. Montpellier, France (2021). https://hal.archives-ouvertes.fr/hal-03154261
Howard, J., Ruder, S.: Universal language model fine-tuning for text classification (2018)
Joshi, A.: A knowledge organization system for the united nations sustainable development goals. In: Verborgh, R., et al. (eds.) ESWC 2021. LNCS, vol. 12731, pp. 548–564. Springer, Cham (2021). https://doi.org/10.1007/978-3-030-77385-4_33
Körfgen, A.: It’s a hit! mapping Austrian research contributions to the sustainable development goals. Sustainability 10, 3295 (2018)
LaFleur, M.: Art is long, life is short: an SDG classification system for DESA publications (2019). https://doi.org/10.2139/ssrn.3400135
Matsui, T., et al.: A natural language processing model for supporting sustainable development goals: translating semantics, visualizing nexus, and connecting stakeholders. Sustain. Sci (2022). https://doi.org/10.1007/s11625-022-01093-3
OCDE: Industrial policy for the sustainable development goals (2021). https://www.oecd-ilibrary.org/content/publication/2cad899f-en, https://doi.org/10.1787/2cad899f-en
Pennington, J., Socher, R., Manning, C.: GloVe: global vectors for word representation. In: EMNLP vol. 14, pp. 1532–1543 (2014). https://doi.org/10.3115/v1/D14-1162
Pincet, A., Okabe, S., Pawelczyk, M.: Linking aid to the sustainable development goals –a machine learning approach. In: OECD Development Co-operation Working Papers, vol. 52 (2019)
Pukelis, L., Puig, N., Srynik, M., Stanciauskas, V.: OSDG –open-source approach to classify text data by un sustainable development goals (SDGs) (2020)
Sovrano, F., Palmirani, M., Vitali, F.: Deep learning based multi-label text classification of UNGA resolutions. CoRR abs/2004.03455 (2020). https://arxiv.org/abs/2004.03455
Wang, Z., Ng, P., Ma, X., Nallapati, R., Xiang, B.: Multi-passage BERT: a globally normalized BERT model for open-domain question answering. In: Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP), pp. 5878–5882. Association for Computational Linguistics, Hong Kong, China (2019). https://aclanthology.org/D19-1599, https://doi.org/10.18653/v1/D19-1599
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Guisiano, J.E., Chiky, R., De Mello, J. (2022). SDG-Meter: A Deep Learning Based Tool for Automatic Text Classification of the Sustainable Development Goals. In: Nguyen, N.T., Tran, T.K., Tukayev, U., Hong, TP., Trawiński, B., Szczerbicki, E. (eds) Intelligent Information and Database Systems. ACIIDS 2022. Lecture Notes in Computer Science(), vol 13757. Springer, Cham. https://doi.org/10.1007/978-3-031-21743-2_21
Download citation
DOI: https://doi.org/10.1007/978-3-031-21743-2_21
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-21742-5
Online ISBN: 978-3-031-21743-2
eBook Packages: Computer ScienceComputer Science (R0)