Identification

Title

On using AI‐based large‐sample emulators for land/hydrology model calibration and regionalization

Abstract

AI‐based model emulators have emerged as a pragmatic strategy for calibrating Earth System models or their components (e.g., land, atmosphere, ocean), circumventing the previously insurmountable hurdle of the process‐heavy models' computational expense. Such emulators require large, spatially diverse data sets for training, however, which—in the land/hydrology context—contrasts with parameter estimation approaches that have traditionally emphasized optimizing model performance for individual basins, followed by similarity‐based transfer schemes for parameter regionalization. Compared to calibrating basins individually, direct land/hydrology process model calibration approaches typically perform worse when trained jointly on large collections of basins. Building on insights from large‐sample deep learning hydrologic modeling, this study introduces a Large‐Sample Emulator (LSE) approach that unifies and streamlines process model parameter calibration and regionalization. Tested across 627 basins in the continental United States using the Community Terrestrial Systems Model (CTSM), the LSE approach consistently improves runoff predictions in all basins, outperforming the Single‐Site Emulator (SSE) in both single‐objective and multi‐objective calibration tasks. Moreover, LSE‐based regionalization in unseen basins, evaluated through spatial cross‐validation, achieves better results than the default parameters in most cases. This LSE framework offers a promising strategy for effective large‐domain process‐based model calibration and regionalization.

Resource type

document

Resource locator

Unique resource identifier

code

https://n2t.net/ark:/85065/d7zs31zv

codeSpace

Dataset language

eng

Spatial reference system

code identifying the spatial reference system

Classification of spatial data and services

Topic category

geoscientificInformation

Keywords

Keyword set

keyword value

Text

originating controlled vocabulary

title

Resource Type

reference date

date type

publication

effective date

2016-01-01T00:00:00Z

Geographic location

West bounding longitude

East bounding longitude

North bounding latitude

South bounding latitude

Temporal reference

Temporal extent

Begin position

End position

Dataset reference date

date type

publication

effective date

2025-07-01T00:00:00Z

Frequency of update

Quality and validity

Lineage

Conformity

Data format

name of format

version of format

Constraints related to access and use

Constraint set

Use constraints

<span style="font-family:Arial;font-size:10pt;font-style:normal;" data-sheets-root="1">Copyright author(s). This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.</span>

Limitations on public access

None

Responsible organisations

Responsible party

contact position

OpenSky Support

organisation name

UCAR/NCAR - Library

full postal address

PO Box 3000

Boulder

80307-3000

email address

opensky@ucar.edu

web address

http://opensky.ucar.edu/

name: homepage

responsible party role

pointOfContact

Metadata on metadata

Metadata point of contact

contact position

OpenSky Support

organisation name

UCAR/NCAR - Library

full postal address

PO Box 3000

Boulder

80307-3000

email address

opensky@ucar.edu

web address

http://opensky.ucar.edu/

name: homepage

responsible party role

pointOfContact

Metadata date

2025-12-24T17:45:58.388674

Metadata language

eng; USA