On using AI‐based large‐sample emulators for land/hydrology model calibration and regionalization

AI‐based model emulators have emerged as a pragmatic strategy for calibrating Earth System models or their components (e.g., land, atmosphere, ocean), circumventing the previously insurmountable hurdle of the process‐heavy models' computational expense. Such emulators require large, spatially diverse data sets for training, however, which—in the land/hydrology context—contrasts with parameter estimation approaches that have traditionally emphasized optimizing model performance for individual basins, followed by similarity‐based transfer schemes for parameter regionalization. Compared to calibrating basins individually, direct land/hydrology process model calibration approaches typically perform worse when trained jointly on large collections of basins. Building on insights from large‐sample deep learning hydrologic modeling, this study introduces a Large‐Sample Emulator (LSE) approach that unifies and streamlines process model parameter calibration and regionalization. Tested across 627 basins in the continental United States using the Community Terrestrial Systems Model (CTSM), the LSE approach consistently improves runoff predictions in all basins, outperforming the Single‐Site Emulator (SSE) in both single‐objective and multi‐objective calibration tasks. Moreover, LSE‐based regionalization in unseen basins, evaluated through spatial cross‐validation, achieves better results than the default parameters in most cases. This LSE framework offers a promising strategy for effective large‐domain process‐based model calibration and regionalization.

Questions? Email Resource Support Contact:

  • opensky@ucar.edu
    UCAR/NCAR - Library

Resource Type publication
Temporal Range Begin N/A
Temporal Range End N/A
Temporal Resolution N/A
Bounding Box North Lat N/A
Bounding Box South Lat N/A
Bounding Box West Long N/A
Bounding Box East Long N/A
Spatial Representation N/A
Spatial Resolution N/A
Related Links

Related Dataset #1 : ERA5-Land hourly data from 1950 to present

Related Dataset #2 : Data for LSE-CTSM parameter calibration and regionalization paper

Additional Information N/A
Resource Format PDF
Standardized Resource Format PDF
Asset Size N/A
Legal Constraints

Copyright author(s). This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.


Access Constraints None
Software Implementation Language N/A

Resource Support Name N/A
Resource Support Email opensky@ucar.edu
Resource Support Organization UCAR/NCAR - Library
Distributor N/A
Metadata Contact Name N/A
Metadata Contact Email opensky@ucar.edu
Metadata Contact Organization UCAR/NCAR - Library

Author Tang, Guoqiang
Wood, Andrew
Swenson, Sean
Publisher UCAR/NCAR - Library
Publication Date 2025-07-01T00:00:00
Digital Object Identifier (DOI) Not Assigned
Alternate Identifier N/A
Resource Version N/A
Topic Category geoscientificInformation
Progress N/A
Metadata Date 2025-12-24T17:45:58.388674
Metadata Record Identifier edu.ucar.opensky::articles:43840
Metadata Language eng; USA
Suggested Citation Tang, Guoqiang, Wood, Andrew, Swenson, Sean. (2025). On using AI‐based large‐sample emulators for land/hydrology model calibration and regionalization. UCAR/NCAR - Library. https://n2t.net/ark:/85065/d7zs31zv. Accessed 10 February 2026.
