Downscaling Model Evaluation

Out-of-Bag Performance, Variable Importance, and Predictor Collinearity

Author

Akash B V

Published

May 5, 2026

This page summarizes model diagnostics for the Random Forest downscaling models used to predict SIPNET outputs from 100 anchor sites to approximately 132,000 annual cropland fields. For ecological interpretation of predictor effects (ALE/ICE plots), see the Model Drivers page.

Validation Status

These results are from a proof-of-concept modeling framework that has not been validated against field observations. Interpret all values as illustrative projections, not empirical estimates.

Out-of-Bag Performance

Out-of-bag (OOB) R² measures how well each Random Forest predicts held-out training sites. It is computed separately for each of 20 ensemble members; the table shows the median and interquartile range. OOB R² is a conservative estimate of spatial predictive accuracy because it uses only the approximately one-third of training sites not included in each bootstrap sample (each bootstrap draws n with replacement from n, leaving ~37% out-of-bag).

Downscaling model performance summary. Values from 20-member ensemble Random Forests trained at 100 annual crop anchor sites.
Variable	Median OOB R²	IQR	Top 2 Predictors	Assessment
N₂O Flux	0.52	0.38–0.72	vapr, temp	Best spatial model
TotSoilCarb	0.34	0.26–0.41	ocd, vapr	Moderate
AGB	0.09	-0.04–0.52	precip, vapr	Poor – high variance across ensembles
CH₄ Flux	-0.15	-0.17 to -0.13	none meaningful	No spatial signal (near-zero baseline)

CH₄ Model Has No Predictive Skill

Non-flooded annual croplands produce negligible CH₄ in SIPNET because methanogenesis requires sustained anaerobic conditions (continuous flooding) that do not occur under standard irrigation. The statewide baseline is approximately 14,000 kg CH₄ yr⁻¹ – effectively zero. With no spatial variation in the training data, the Random Forest cannot distinguish signal from noise. The negative R² confirms the model performs worse than predicting the mean. CH₄ importance rankings and ALE/ICE plots reflect noise, not ecology. Meaningful CH₄ fluxes are expected when rice paddies are included in future model runs.

AGB Model Quality

The AGB model’s wide IQR (-0.04 to 0.52) across ensembles means some ensemble members have no spatial predictive power while others are moderate. This inconsistency should be considered when interpreting AGB spatial maps. The low and variable R² likely reflects the fact that annual crop AGB at the modeled time point is small (post-harvest residue) and may not vary systematically with the environmental covariates used here.

Variable Importance

Variable importance is measured as the increase in mean squared error (MSE) when each predictor is randomly permuted while all others are held constant. Higher values indicate the model depends more on that variable for accurate predictions. Importance is summarized as the median across 20 ensemble members; error bars in the figure show the IQR.

Top 4 predictors by median increase in MSE for each variable. N₂O and CH₄ importance values are near-zero in absolute terms due to the scale of the response; relative rankings are meaningful for N₂O but not for CH₄.
Rank	TotSoilCarb	AGB	N₂O Flux	CH₄ Flux
1	ocd (18.4)	precip (9.7)	vapr	all near 0
2	vapr (12.8)	vapr (8.7)	temp	all near 0
3	temp (12.0)	ocd (7.9)	precip	all near 0
4	precip (6.9)	temp (7.4)	srad	all near 0

Interpretation

Soil carbon is most strongly driven by existing organic carbon density (ocd), which is both a predictor and correlated with the output. This reflects the well-known positive feedback between soil organic matter and further C retention – SOM persistence is controlled by environmental and biological interactions rather than molecular recalcitrance alone (Schmidt et al. 2011, Nature). Climate variables (vapr, temp) rank second and third, capturing the temperature-driven decomposition gradient across California.

N₂O flux is driven by climate variables (vapr, temp, precip), consistent with the well-established temperature and moisture sensitivity of soil nitrification and denitrification processes (Butterbach-Bahl et al. 2013, Philosophical Transactions of the Royal Society B). The dominance of vapor pressure likely reflects its strong correlation with temperature (r = 0.86); together they capture the north-south climate gradient that controls soil moisture availability for N-cycling microbes.

AGB shows precipitation and vapor pressure as top predictors, consistent with water availability limiting annual crop productivity in California’s Mediterranean climate. However, the low and variable R² means these relationships are not reliably captured.

CH₄ flux shows no meaningful importance for any predictor, confirming the training data contains no spatial signal for the model to learn.

Predictor Collinearity

The environmental predictors are correlated due to California’s climate gradients. High collinearity does not bias Random Forest predictions, but it does affect the interpretability of individual predictor importance: when two predictors are correlated, their importance is split between them, and their individual rankings become unstable. This is why we use ALE plots (which handle correlated predictors correctly) rather than Partial Dependence Plots for ecological interpretation.

Pairwise Pearson correlations among the seven environmental predictors with |r| > 0.30. Computed from 377,807 cropland fields (all crop types) in the CADWR dataset. Values are similar for the approximately 132,000 annual cropland subset because the correlations reflect California’s climate geography, not crop type.
Predictor Pair	Pearson r	Severity
temp – vapr	+0.86	Strong
precip – srad	-0.86	Strong
srad – vapr	+0.58	Moderate
temp – precip	-0.56	Moderate
ocd – vapr	-0.55	Moderate
temp – srad	+0.54	Moderate
precip – vapr	-0.52	Moderate
ocd – srad	-0.53	Moderate
ocd – precip	+0.49	Moderate

The two strongest correlations – temp/vapr and precip/srad – reflect California’s dominant climate gradients:

Temperature and vapor pressure (r = 0.86): Both increase from north to south along the Central Valley. This means the downscaling model cannot fully distinguish whether soil carbon losses or N₂O increases in warmer areas are driven by temperature per se or by the correlated increase in atmospheric moisture demand.
Precipitation and solar radiation (r = -0.86): Wetter areas (north coast, Sierra foothills) receive less direct solar radiation. This inverse relationship reflects the basic climate physics of cloud cover and California’s coastal-inland gradient.

These collinearities are an inherent feature of California’s climate geography, not a modeling artifact. They should be kept in mind when interpreting individual predictor effects in the Model Drivers plots.