During the past two decades, numerous studies have used simulation models to assess crop yield gaps (quantified as the difference between potential and actual farm yields), the impact of climate change on future crop yields, and land-use change. However, the climate and soil data underpinning these studies vary widely in quality and in spatial and temporal scale and resolution, and the studies make widely differing assumptions about cropping-system context and crop model calibration. Here we present an explicit rationale and methodology for selecting data sources for simulating crop yields and estimating yield gaps at specific locations that can be applied across widely different levels of data availability and quality. The method...