US Government

From Archiveteam
Revision as of 01:27, 1 February 2025 by Nulldata (talk | contribs) (In progress...)

Discovery

Official lists of all registered .gov domains and of federal .gov domains are available. The raw CSV files and the .gov zone file are also available on GitHub.
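As a rough illustration of how such a domain list could feed an archival job, the sketch below turns CSV text into a de-duplicated list of seed URLs. The column name "Domain name", the sample rows, and the function name domains_to_seed_urls are assumptions made for this example and are not taken from the published files; check the header of the real CSV before relying on it.

```python
import csv
import io

# Hypothetical sample data; the real CSV's column names and values may differ.
SAMPLE_CSV = """Domain name,Domain type,Agency
example-agency.gov,Federal - Executive,Example Agency
example-city.gov,City,Non-Federal Agency
EXAMPLE-AGENCY.GOV,Federal - Executive,Example Agency
"""


def domains_to_seed_urls(csv_text, domain_column="Domain name"):
    """Return one https:// seed URL per unique domain, in first-seen order."""
    reader = csv.DictReader(io.StringIO(csv_text))
    seen = set()
    urls = []
    for row in reader:
        # Normalise case and whitespace so duplicates collapse.
        domain = (row.get(domain_column) or "").strip().lower()
        if domain and domain not in seen:
            seen.add(domain)
            urls.append(f"https://{domain}/")
    return urls
```

Calling domains_to_seed_urls(SAMPLE_CSV) on the sample yields two URLs, since the third row repeats the first domain in upper case.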

Content at risk

Site Name Reason Archival Notes Status
https://data.gov data.gov There have been reports of datasets disappearing from the website,[1] though this may be normal behavior, since the site collects datasets from other locations. https://catalog.data.gov
https://inventory.data.gov
https://resources.data.gov
https://strategy.data.gov
https://sdg.data.gov
GitHub
job:4hb15f3ijn846c1dw0w58k4fe
job:4qlh2ol2vq2i525747l0yq6a4
job:25o494lfnnlxtobegl9grx7tt
job:e1ioqt5kilh8l4irihid8sqoq
job:79u49omgtqkj83cnpyuhx0xr8
job:akwvpyvnzeuhrvgh51tokrmsv
https://cdc.gov Centers for Disease Control and Prevention Directed to pause communications,[2] along with other health agencies. https://data.cdc.gov/
https://ftp.cdc.gov/
GitHub
https://cdc.gov -> job:hd3tvx4w14ybj2al0peewcv
https://ftp.cdc.gov/ -> job:8zn8f6a2620t1tnje3f1cyr2o
https://data.cdc.gov/ -> job:1u2ougx4kn6ueaiqddwjfeib7
https://www.ncei.noaa.gov/ National Centers for Environmental Information (Some?) data is linked to from data.gov.
Datasets appear to be enumerable by 7-digit, zero-padded integer IDs starting at 0000001, e.g. https://www.ncei.noaa.gov/access/metadata/landing-page/bin/iso?id=gov.noaa.nodc:0000001. The legacy URL format, which redirects, appears to be http://accession.nodc.noaa.gov/0000001
https://www.nccs.nasa.gov/services/data-collections NASA Center for Climate Simulation Data
https://www.ipcc-data.org IPCC Data Distribution Centre Appears to have sequential IDs
https://www.bco-dmo.org/ Biological and Chemical Oceanography Data Management Office Appears to have sequential IDs
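
The NCEI enumeration noted in the table above could be sketched as follows. This only builds candidate URLs from the two patterns quoted in that row; it makes no requests, and it is not verified that every ID in a range resolves or where the ID space ends. The function name ncei_candidate_urls and the default range are hypothetical choices for illustration.

```python
from typing import Iterator

# URL patterns copied from the NCEI row; coverage of the ID space is unverified.
LANDING = "https://www.ncei.noaa.gov/access/metadata/landing-page/bin/iso?id=gov.noaa.nodc:{id}"
LEGACY = "http://accession.nodc.noaa.gov/{id}"


def ncei_candidate_urls(start: int = 1, stop: int = 10,
                        template: str = LANDING) -> Iterator[str]:
    """Yield candidate URLs for IDs start..stop (inclusive), zero-padded to 7 digits."""
    for n in range(start, stop + 1):
        # 07d pads with leading zeros, so 1 becomes 0000001.
        yield template.format(id=f"{n:07d}")
```

For example, list(ncei_candidate_urls(1, 3)) produces the landing-page URLs for 0000001 through 0000003, and passing template=LEGACY produces the legacy form instead. The resulting list could be written to a file and handed to whatever crawler is in use.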