LGCVNov 8, 2021

SustainBench: Benchmarks for Monitoring the Sustainable Development Goals with Machine Learning

arXiv:2111.04724v191 citations
Originality Synthesis-oriented
AI Analysis

This addresses the problem of inconsistent evaluation in SDG monitoring for researchers and policymakers, though it is incremental as it builds on existing data and methods.

The authors tackled the lack of standardized benchmarks for using machine learning to monitor the UN Sustainable Development Goals (SDGs), introducing SustainBench with 15 tasks across 7 SDGs and publicly releasing datasets for 11 of them to provide consistent evaluation metrics and lower barriers for ML researchers.

Progress toward the United Nations Sustainable Development Goals (SDGs) has been hindered by a lack of data on key environmental and socioeconomic indicators, which historically have come from ground surveys with sparse temporal and spatial coverage. Recent advances in machine learning have made it possible to utilize abundant, frequently-updated, and globally available data, such as from satellites or social media, to provide insights into progress toward SDGs. Despite promising early results, approaches to using such data for SDG measurement thus far have largely evaluated on different datasets or used inconsistent evaluation metrics, making it hard to understand whether performance is improving and where additional research would be most fruitful. Furthermore, processing satellite and ground survey data requires domain knowledge that many in the machine learning community lack. In this paper, we introduce SustainBench, a collection of 15 benchmark tasks across 7 SDGs, including tasks related to economic development, agriculture, health, education, water and sanitation, climate action, and life on land. Datasets for 11 of the 15 tasks are released publicly for the first time. Our goals for SustainBench are to (1) lower the barriers to entry for the machine learning community to contribute to measuring and achieving the SDGs; (2) provide standard benchmarks for evaluating machine learning models on tasks across a variety of SDGs; and (3) encourage the development of novel machine learning methods where improved model performance facilitates progress towards the SDGs.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes