NOAA Enterprise Data Management Workshop May 2024
As archives grow, users can no longer run their analyses on a single machine.
image source: https://www.ncei.noaa.gov/news/ncei-archive-growth-and-change
image source: ui.josiahparry.com/spatial-analysis.html
image credit: https://wiki.earthdata.nasa.gov/display/ESO/Zarr+Format
Format | Data Type | Standard Status |
---|---|---|
Cloud-Optimized GeoTIFF (COG) | Raster | OGC standard for comment |
Zarr, Kerchunk | Multi-dimensional raster | ESDIS and OGC standards in development |
Cloud-Optimized Point Cloud (COPC), Entwine Point Tiles (EPT) | Point Clouds* | no known ESDIS or OGC standard |
FlatGeobuf, GeoParquet | Vector | no known ESDIS standard; draft OGC standard |
Format | Adoption | Standard Status |
---|---|---|
Cloud-Optimized GeoTIFF (COG) | Widely adopted | OGC standard for comment |
Zarr, Kerchunk | Less widely adopted overall; common in specific communities | ESDIS and OGC standards in development |
Entwine Point Tiles (EPT), Cloud-Optimized Point Cloud (COPC) | Less common (PDAL Supported) | no known ESDIS or OGC standard |
GeoParquet, FlatGeobuf | Less common (OGR Supported) | no known ESDIS standard; draft OGC standard |
image source: https://www.kitware.com/deciphering-cloud-optimized-geotiffs/
image source: https://medium.com/devseed/cog-talk-part-1-whats-new-941facbcd3d1
image source: https://xarray.dev/
image source: https://fsspec.github.io/kerchunk/detail.html
image source: https://copc.io/
image source: https://worace.works/2022/02/23/kicking-the-tires-flatgeobuf/
image source: https://www.wherobots.ai/post/spatial-data-parquet-and-apache-sedona
Why Cloud Native and Analysis-Ready-Cloud-Optimized (ARCO) Data for Scalable Cloud Computing and Data Analytics to Support Open Science (PART I):
Prior presentations and studies discussing multiple formats
Format Homepages and Explainers