Technical note: How many models do we need to simulate hydrologic processes across large geographical domains?

Robust large-domain predictions of water availability and threats require models that work well across different basins in the model domain. It is currently common to express a model's accuracy through aggregated efficiency scores such as the Nash–Sutcliffe efficiency and Kling–Gupta efficiency (KGE), and these scores often form the basis to select among competing models. However, recent work has shown that such scores are subject to considerable sampling uncertainty: the exact selection of time steps used to calculate the scores can have large impacts on the scores obtained. Here we explicitly account for this sampling uncertainty to determine the number of models that are needed to simulate hydrologic processes across large spatial domains. Using a selection of 36 conceptual models and 559 basins, our results show that model equifinality, the fact that very different models can produce simulations with very similar accuracy, makes it very difficult to unambiguously select one model over another. If models were selected based on their validation KGE scores alone, almost every model would be selected as the best model in at least some basins. When sampling uncertainty is accounted for, this number drops to 4 models being needed to cover 95 % of investigated basins and 10 models being needed to cover all basins. We obtain similar conclusions for an objective function focused on low flows. These results suggest that, under the conditions typical of many current modelling studies, there is limited evidence that using a wide variety of different models leads to appreciable differences in simulation accuracy compared to using a smaller number of carefully chosen models.

To Access Resource:

Questions? Email Resource Support Contact:

  • opensky@ucar.edu
    UCAR/NCAR - Library

Resource Type publication
Temporal Range Begin N/A
Temporal Range End N/A
Temporal Resolution N/A
Bounding Box North Lat N/A
Bounding Box South Lat N/A
Bounding Box West Long N/A
Bounding Box East Long N/A
Spatial Representation N/A
Spatial Resolution N/A
Related Links

Related Dataset #1 : Catchment attributes for large-sample studies

Related Dataset #2 : A large-sample watershed-scale hydrometeorological dataset for the contiguous USA

Related Dataset #3 : Data from "A brief analysis of conceptual model structure uncertainty using 36 models and 559 catchments"

Related Software #1 : CH-Earth/multi-model-mosaic-paper: Peer review release

Additional Information N/A
Resource Format PDF
Standardized Resource Format PDF
Asset Size N/A
Legal Constraints

Copyright author(s). This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.


Access Constraints None
Software Implementation Language N/A

Resource Support Name N/A
Resource Support Email opensky@ucar.edu
Resource Support Organization UCAR/NCAR - Library
Distributor N/A
Metadata Contact Name N/A
Metadata Contact Email opensky@ucar.edu
Metadata Contact Organization UCAR/NCAR - Library

Author Knoben, W. J. M.
Raman, A.
Gründemann, G. J. ORCID icon
Kumar, M.
Pietroniro, A.
Shen, C.
Song, Y.
Thébault, C. ORCID icon
van Werkhoven, K.
Wood, Andrew ORCID icon
Clark, M. P.
Publisher UCAR/NCAR - Library
Publication Date 2025-06-04T00:00:00
Digital Object Identifier (DOI) Not Assigned
Alternate Identifier N/A
Resource Version N/A
Topic Category geoscientificInformation
Progress N/A
Metadata Date 2025-12-24T17:47:44.398424
Metadata Record Identifier edu.ucar.opensky::articles:43782
Metadata Language eng; USA
Suggested Citation Knoben, W. J. M., Raman, A., Gründemann, G. J., Kumar, M., Pietroniro, A., Shen, C., Song, Y., Thébault, C., van Werkhoven, K., Wood, Andrew, Clark, M. P.. (2025). Technical note: How many models do we need to simulate hydrologic processes across large geographical domains?. UCAR/NCAR - Library. https://n2t.net/ark:/85065/d73r0zbq. Accessed 03 February 2026.

Harvest Source