NOAA ISD data documentation #103
Replies: 1 comment
I think the PDF linked from the dataset page gives the information you need, but this seems like a pretty complicated format to parse. @gadomski, who was working on the STAC metadata for this dataset before its priority was bumped down, put together pyisd to help parse the data, if you're using Python.
pip install -q git+https://github.com/gadomski/pyisd/
import isd.io
import urllib.request
filename, _ = urllib.request.urlretrieve("https://noaaisd.blob.core.windows.net/noaa-isd/data/2022/723656-23049-2022.gz", filename="/tmp/isd.gz")
df = isd.io.read_to_data_frame(filename)
df.head()
The files under Once we get back to this dataset we'll make sure the documentation is much improved.
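If it is unclear what one of these files contains, note that the .gz suffix indicates gzip compression, so one quick check is to decompress it and look at the first few lines. This is only a minimal sketch using the standard library: the helper name peek_gzip_lines is made up for illustration, and it assumes the decompressed content is ASCII text.

```python
import gzip
from itertools import islice


def peek_gzip_lines(path, n=5, encoding="ascii"):
    """Return the first n text lines of a gzip-compressed file.

    Hypothetical helper for illustration; it is not part of pyisd.
    Bytes that cannot be decoded are replaced rather than raising.
    """
    with gzip.open(path, mode="rt", encoding=encoding, errors="replace") as handle:
        # islice stops after n lines, so large files are not read in full.
        return [line.rstrip("\n") for line in islice(handle, n)]
```

For example, peek_gzip_lines("/tmp/isd.gz") would show the first five raw records of the file downloaded above, which can then be compared against the format PDF.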
I am trying to use the NOAA Integrated Surface Data from the Data Catalog, and I haven't been able to find the documentation necessary to work with it. There aren't any example notebooks, and there isn't any information about how the data is stored within the blob container provided.
I was able to explore the file storage structure within the storage blob using the Storage Explorer, and I am still not clear on where to find the files that I need.
The specific file that I have been looking for in this example corresponds to station number 72365623049.
These are the files I've reached so far, that looked promising:
https://noaaisd.blob.core.windows.net/noaa-isd/data/2022/723656-23049-2022.gz
This file has no extension and it isn't clear what format I am supposed to read it as.
https://noaaisd.blob.core.windows.net/noaa-isd/isd-lite/data/2022/723656-23049-2022.gz
This file is also missing an extension, but appears to be a .tsv file with no headers. While I was able to find some information about what the columns 'may' mean from one of the technical documents, the data is only available for one month of 2022, even though NOAA updates it daily, so it doesn't seem to have been updated since January.
Neither of the files I found on Planetary Computer matches the hourly data files available directly from NOAA. Could the documentation be updated for this dataset to make it possible to navigate the storage blob, and to understand how this data differs from the data that is available directly from the source?
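For reference, a headerless isd-lite record could be parsed roughly as follows. This is a sketch under stated assumptions, not a confirmed description of the files in the blob container: the column names, the scale factors of 10, and the -9999 missing-value marker are taken from my reading of NOAA's ISD-Lite format document and should be checked against it. It also assumes the fields are whitespace-separated, and the names ISD_LITE_COLUMNS and parse_isd_lite_line are made up for illustration.

```python
# ASSUMPTION: column order and scale factors follow NOAA's ISD-Lite format
# document as I understand it; verify before relying on the values.
ISD_LITE_COLUMNS = [
    ("year", 1),
    ("month", 1),
    ("day", 1),
    ("hour", 1),
    ("air_temperature_c", 10),
    ("dew_point_temperature_c", 10),
    ("sea_level_pressure_hpa", 10),
    ("wind_direction_deg", 1),
    ("wind_speed_m_per_s", 10),
    ("sky_condition_code", 1),
    ("precipitation_1h_mm", 10),
    ("precipitation_6h_mm", 10),
]

# ASSUMPTION: -9999 marks a missing observation.
MISSING = -9999


def parse_isd_lite_line(line):
    """Parse one whitespace-separated record into a dict of named values."""
    tokens = line.split()
    if len(tokens) != len(ISD_LITE_COLUMNS):
        raise ValueError(
            f"expected {len(ISD_LITE_COLUMNS)} fields, got {len(tokens)}"
        )
    record = {}
    for (name, scale), token in zip(ISD_LITE_COLUMNS, tokens):
        value = int(token)
        if value == MISSING:
            record[name] = None  # missing observation
        elif scale == 1:
            record[name] = value  # stored unscaled
        else:
            record[name] = value / scale  # undo the integer scaling
    return record
```

Each decompressed line of the isd-lite file would be passed to parse_isd_lite_line, giving one dict per hourly observation with missing values as None.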