OceanTACO#
OceanTACO is a multi-source sea surface variable dataset with cloud-native dataloaders for machine learning workflows.
OceanTACO provides co-located observations of sea surface height (SSH), sea surface temperature (SST), sea surface salinity (SSS), ocean currents, wind, and Argo float profiles — organized as regional NetCDF tiles and hosted on HuggingFace.
Key features:
Quickstart workflow: install, query a regional tile, and plot your first map in Getting Started
Data access workflow: browse remote files, subset by region/date, and work cloud-first via Dataset Workflows
Machine learning workflow: build training/evaluation query sets and use
OceanTACODatasetwith PyTorch in Dataset ML LoaderData generation workflow: reproduce formatting, tiling, and statistics with the Dataset Generation Pipeline
Hands-on tutorials: run complete examples for mapping, coupling, and case studies in Tutorials

Contents
- Getting Started
- OceanTACO Dataset Workflows
- 1) Install Profiles
- 2) Main Retrieval APIs
- 3) End-to-End Retrieval Example
- 4) Retrieval Pattern: One Exact Tile
- 5) Retrieval Pattern: Stream Records
- 6) Retrieval Pattern: Local Full Snapshot
- 7) Retrieval Pattern: Multiple Sources, Same Region, Date Range
- 8) Choosing Retrieval Mode
- 9) Next Step: ML Loader Workflows
- OceanTACO ML Loader Workflows
- Dataset Description
- Dataset Generation Pipeline
- Tutorials
- API Reference