OceanTACO

OceanTACO#

OceanTACO is a multi-source sea surface variable dataset with cloud-native dataloaders for machine learning workflows.

HuggingFace License Dataset License


OceanTACO provides co-located observations of sea surface height (SSH), sea surface temperature (SST), sea surface salinity (SSS), ocean currents, wind, and Argo float profiles — organized as regional NetCDF tiles and hosted on HuggingFace.

Key features:

  • Quickstart workflow: install, query a regional tile, and plot your first map in Getting Started

  • Data access workflow: browse remote files, subset by region/date, and work cloud-first via Dataset Workflows

  • Machine learning workflow: build training/evaluation query sets and use OceanTACODataset with PyTorch in Dataset ML Loader

  • Data generation workflow: reproduce formatting, tiling, and statistics with the Dataset Generation Pipeline

  • Hands-on tutorials: run complete examples for mapping, coupling, and case studies in Tutorials

OceanTACO overview figure

Contents