Toward a Scientific Discovery Engine for Weather and Climate Data: A Visual Analytics Workbench for Embedding-Based Exploration

Weather and climate science is producing increasingly large, high-dimensional datasets from numerical simulations, Earth system models, and AI-based weather and climate models. Embedding-based representations can make these data searchable through similarity search and analog retrieval, but nearest neighbors in latent space are not automatically scientifically meaningful. Researchers need tools to inspect how embeddings organize meteorological data, compare representation models, develop retrieval strategies, and verify results against physical evidence. We present an open-source visual analytics workbench for inspectable, configurable, and scalable embedding-based search over weather and climate data. The system links embedding experiments to source data, metadata, spatial context, model configurations, and retrieval parameters, allowing users to explore latent spaces, construct global or localized queries, and inspect retrieved analogs through meteorological views. We demonstrate the workbench through tropical-cyclone retrieval using ERA5 derived embeddings and IBTrACS metadata, and evaluate its out-of-core retrieval backend to show that large embedding collections can be searched beyond in-memory limits on commodity workstation hardware.

To Access Resource:

Questions? Email Resource Support Contact:

  • datahelp@ucar.edu

Keywords

Resource Type dataset
Temporal Range Begin N/A
Temporal Range End N/A
Temporal Resolution N/A
Bounding Box North Lat N/A
Bounding Box South Lat N/A
Bounding Box West Long N/A
Bounding Box East Long N/A
Spatial Representation N/A
Spatial Resolution N/A
Related Links N/A
Additional Information N/A
Resource Format Binary
Standardized Resource Format Binary
Asset Size 2.226 MB
Legal Constraints

Creative Commons Attribution 4.0 International License


Access Constraints None
Software Implementation Language N/A

Resource Support Name N/A
Resource Support Email datahelp@ucar.edu
Resource Support Organization
Distributor NSF NCAR Geoscience Data Exchange

Metadata Contact Name N/A
Metadata Contact Email datahelp@ucar.edu
Metadata Contact Organization NSF NCAR Geoscience Data Exchange

Author Cherukuru, Nihanth ORCID icon
Publisher NSF National Center for Atmospheric Research

Publication Date 2026-04-30
Digital Object Identifier (DOI) https://doi.org/10.5065/12ZJ-ZZ25
Alternate Identifier d041308
Resource Version N/A
Topic Category climatologyMeteorologyAtmosphere
Progress completed
Metadata Date 2026-04-30T21:15:10Z
Metadata Record Identifier edu.ucar.gdex::d041308
Metadata Language eng; USA
Suggested Citation Cherukuru, Nihanth. (2026). Toward a Scientific Discovery Engine for Weather and Climate Data: A Visual Analytics Workbench for Embedding-Based Exploration. NSF National Center for Atmospheric Research. https://doi.org/10.5065/12ZJ-ZZ25. Accessed 04 May 2026.

Harvest Source