A million kernels of truth: Insights into scalable satellite maize yield mapping and yield gap analysis from an extensive ground dataset in the US Corn Belt

Crop yield maps estimated from satellite data increasingly are used to understand drivers of yield trends and variability, yet satellite-derived regional maps are rarely compared with location-specific yields due to the difficulty of acquiring sub-field ground truth data at scale. In commercial agricultural systems, combine harvesters with onboard yield monitors collect real-time yield data during harvest with high spatial resolution, generating a large ground dataset. Here, we leveraged a yield monitor dataset of over one million maize field observations across the United States Corn Belt from 2008 to 2018 to evaluate the Scalable Crop Yield Mapper (SCYM). SCYM uses region-specific crop model simulations and climate data to interpret vegetation indices from satellite observations, thus enabling efficient sub-field yield estimation across large regions and multiple years without reliance on ground data calibration. We used the ground dataset to compare alternative SCYM model implementations, define minimum required satellite observation criteria, and evaluate the sensitivity of satellite-based yield estimates to on-the-ground variation in management, soil, and annual weather. We found that smoothing annual time series data with harmonic regression increased 30 m pixel-level accuracy from r2 = 0.31 to 0.40 and reduced dependency on specific satellite observation timing, enabling robust yield estimation on 97% of annual maize area using only Landsat data. Agreement improved as the assessment was aggregated to field-level (r2 = 0.45) and county-level (r2 = 0.69) scales, demonstrating the need for fine-resolution ground truth data to better assess sub-field level accuracy in high resolution products. We found that SCYM and ground data showed a similar yield response to management and environmental variation, particularly capturing linear and nonlinear responses to sowing density, soil water holding capacity, and growing season precipitation. However, sensitivity to factors like soil quality and planting date was muted for SCYM estimates compared to ground-based yields. Random forest models were able to match SCYM performance when trained on at least 1000 ground observations, but performed poorly when tested on years and locations not represented in the training data. Our results demonstrate that satellite yield maps can provide much-needed information on multidecadal yield trends and inform yield gap analyses.