Maxwell B. Joseph

h-index30

3papers

3,140citations

3 Papers

2.1QMJun 9

Continuous biome representations from Earth observation embeddings

Maxwell B. Joseph, Flávia De Souza Mendes, Dieu My T. Nguyen et al.

Biotic communities vary continuously across space, yet biome maps impose categorical boundaries that compress this variation, particularly at ecotones where transitional communities are ecologically distinct. Could Earth observation (EO) foundation models, which encode spectral, spatial, and temporal information with dense embeddings, convert discrete biome maps into continuous representations that better capture ecological variation? Here, we fit a linear classifier on Clay v1.5 satellite image embeddings to predict biome labels from a categorical map. The softmax output yields a continuous probability vector whose dimensions correspond to named biome classes. We evaluate this approach using six Brazilian biomes, 1.3 million embeddings, and 10,015 withheld forest inventory plots spanning 4,672 plant species. The continuous biome representation outperforms discrete biome labels for predicting species occurrence (mean per-species AUC 0.618 vs. 0.570 across 10 spatial cross-validation folds). Decomposing this gain shows that continuity in the graded probability output, rather than label reassignment, accounts for the improvement; the pattern holds across all distances from biome boundaries. The raw 1024-dimensional embedding remains the strongest predictor we tested (mean AUC 0.646 vs. 0.618), but the continuous representation recovers most of the embedding's gain over discrete labels. This simple approach provides a probabilistic replacement for categorical map labels, preserving their meaning while encoding graded variation that discrete maps suppress.

1.2LGOct 16, 2020

Predicting Playa Inundation Using a Long Short-Term Memory Neural Network

Kylen Solvik, Anne M. Bartuszevige, Meghan Bogaerts et al.

In the Great Plains, playas are critical wetland habitats for migratory birds and a source of recharge for the agriculturally-important High Plains aquifer. The temporary wetlands exhibit complex hydrology, filling rapidly via local rain storms and then drying through evaporation and groundwater infiltration. Using a long short-term memory (LSTM) neural network to account for these complex processes, we modeled playa inundation for 71,842 playas in the Great Plains from 1984-2018. At the level of individual playas, the model achieved an F1-score of 0.538 on a withheld test set, displaying the ability to predict complex inundation patterns. When averaging over all the playas in the entire region, the model is able to very closely track inundation trends, even during periods of drought. Our results demonstrate potential for using LSTMs to model complex hydrological dynamics. Our modeling approach could be used to model playa inundation into the future under different climate scenarios to better understand how wetland habitats and groundwater will be impacted by changing climate.

1.9MLJul 10, 2018Code

Predicting property damage from tornadoes with zero-inflated neural networks

Jeremy Diaz, Maxwell Joseph

Tornadoes are the most violent of all atmospheric storms. In a typical year, the United States experiences hundreds of tornadoes with associated damages on the order of one billion dollars. Community preparation and resilience would benefit from accurate predictions of these economic losses, particularly as populations in tornado-prone areas increase in density and extent. Here, we use a zero-inflated modeling approach and artificial neural networks to predict tornado-induced property damage using publicly available data. We developed a neural network that predicts whether a tornado will cause property damage (out-of-sample accuracy = 0.821 and area under the receiver operating characteristic curve, AUROC, = 0.872). Conditional on a tornado causing damage, another neural network predicts the amount of damage (out-of-sample mean squared error = 0.0918 and R2 = 0.432). When used together, these two models function as a zero-inflated log-normal regression with hidden layers. From the best-performing models, we provide static and interactive gridded maps of monthly predicted probabilities of damage and property damages for the year 2019. Two primary weaknesses include (1) model fitting requires log-scale data which leads to large natural-scale residuals and (2) beginning tornado coordinates were utilized rather than tornado paths. Ultimately, this is the first known study to directly model tornado-induced property damages, and all data, code, and tools are publicly available. The predictive capacity of this model along with an interactive interface may provide an opportunity for science-informed tornado disaster planning.