AI for Earth Data Sets
The Microsoft AI for Earth program hosts geospatial data on Azure that is important to environmental sustainability and Earth science. This repo hosts documentation and demonstration notebooks for all the data that is managed by AI for Earth. It also serves as a “staging ground” for the Planetary Computer Data Catalog.
If you have feedback about any of this data, or want to request additions to our data program, email aiforearthdatasets@microsoft.com
.
Table of contents
- AI for Earth Data Sets
- Data sets
- ALOS World 3D
- ASTER L1T (2000-2006)
- Copernicus DEM
- Daymet
- Deltares Global Flood Maps
- Deltares Global Water Availability
- Esri 10m Land Cover
- Global Biodiversity Information Facility (GBIF)
- Harmonized Global Biomass
- Harmonized Landsat Sentinel-2
- High Resolution Electricity Access (HREA)
- High Resolution Ocean Surface Wave Hindcast
- Labeled Information Library of Alexandria: Biology and Conservation (LILA BC)
- Landsat TM/MSS Collection 2
- Landsat 7 Collection 2 Level-2
- Landsat 8 Collection 2 Level-2
- MODIS (40 individual products)
- Monitoring Trends in Burn Severity Mosaics
- National Solar Radiation Database
- NASADEM
- NREL Puerto Rico 100 (PR100)
- NREL PV Rooftop Database
- NOAA Climate Data Records (CDR)
- NOAA Climate Forecast System (CFS)
- NOAA Digital Coast Imagery
- NOAA GFS Warm Start Initial Conditions
- NOAA GOES-R
- NOAA Global Ensemble Forecast System (GEFS)
- NOAA Global Forecast System (GFS)
- NOAA Global Hydro Estimator (GHE)
- NOAA High-Resolution Rapid Refresh (HRRR)
- NOAA Integrated Surface Data (ISD)
- NOAA Monthly US Climate Gridded Dataset (NClimGrid)
- NOAA National Water Model
- NOAA Rapid Refresh (RAP)
- NOAA US Climate Normals
- National Agriculture Imagery Program
- National Land Cover Database
- NatureServe Map of Biodiversity Importance (MoBI)
- Ocean Observatories Initiative CamHD
- Sentinel-1 GRD
- Sentinel-2 L2A
- Sentinel-3 L2
- Sentinel-5P
- TerraClimate
- UK Met Office CSSP China 20CRDS
- UK Met Office Global Weather Data for COVID-19 Analysis
- University of Miami Coupled Model for Hurricanes Ike and Sandy
- USFS Forest Inventory and Analysis
- USGS 3DEP Seamless DEMs
- USGS Gap Land Cover
- Legal stuff
Data sets
ALOS World 3D
Global topographic information from the JAXA ALOS PRISM instrument.
ASTER L1T (2000-2006)
The ASTER instrument, launched on-board NASA’s Terra satellite in 1999, provides multispectral images of the Earth at 15m-90m resolution. This data set represents ASTER data from 2000-2006.
Copernicus DEM
Global topographic information from the Copernicus program.
Daymet
Estimates of daily weather parameters in North America on a one-kilometer grid, with monthly and annual summaries.
Deltares Global Flood Maps
Global estimates of coastal inundation under various sea level rise conditions and return periods at 90m, 1km, and 5km resolutions. Also includes estimated coastal inundation caused by named historical storm events going back several decades.
Deltares Global Water Availability
Simulations of historical daily reservoir variations for 3,236 locations across the globe for the period 1970-2020 using the distributed wflow_sbm model. The model outputs long-term daily information on reservoir volume, inflow and outflow dynamics, as well as information on upstream hydrological forcing.
Esri 10m Land Cover
Global estimates of 10-class land use/land cover (LULC) for 2020, derived from ESA Sentinel-2 imagery at 10m resolution, produced by Impact Observatory.
Global Biodiversity Information Facility (GBIF)
Exports of global species occurrence data from the GBIF network.
Harmonized Global Biomass
Global maps of aboveground and belowground biomass carbon density for the year 2010 at 300m resolution.
Harmonized Landsat Sentinel-2
Satellite imagery from the Landsat 8 and Sentinel-2 satellites, aligned to a common grid and processed to compatible color spaces.
High Resolution Electricity Access (HREA)
Settlement-level measures of electricity access, reliability, and usage derived from VIIRS satellite imagery.
High Resolution Ocean Surface Wave Hindcast
Long-term wave hindcast data for the U.S. Exclusive Economic Zone (EEZ), developed by the U.S. Department of Energy’s Water Power Technologies Office.
Labeled Information Library of Alexandria: Biology and Conservation (LILA BC)
AI for Earth and partners have assembled a repository of labeled information related to wildlife conservation, particularly wildlife imagery.
Landsat TM/MSS Collection 2
Global optical imagery from the Landsat MSS and TM instruments, which imaged the Earth from 1972 to 2013, aboard the Landsat 1-5 satellites.
Landsat TM/MSS data are in preview; access is granted by request.
Landsat 7 Collection 2 Level-2
Global optical imagery from the Landsat 7 satellite, which has imaged the Earth since 1999.
Landsat 7 data are in preview; access is granted by request.
Landsat 8 Collection 2 Level-2
Global optical imagery from the Landsat 8 satellite, which has imaged the Earth since 2013.
MODIS (40 individual products)
Satellite imagery from the Moderate Resolution Imaging Spectroradiometer (MODIS).
Monitoring Trends in Burn Severity Mosaics
Annual burn severity mosaics for the continental United States and Alaska.
National Solar Radiation Database
Hourly and half-hourly values of the three most common measurements of solar radiation – global horizontal, direct normal, and diffuse horizontal irradiance - along with meteorological data.
NASADEM
Global topographic information from the NASADEM program.
NREL Puerto Rico 100 Dataset (PR100)
A collection of geospasial data useful for renewable energy development in Puerto Rico. The dataset is curated by the National Renewable Energy Laboratory.
NREL PV Rooftop Database
A lidar-derived, geospatially-resolved dataset of suitable roof surfaces and their PV technical potential for 128 metropolitan regions in the United States.
NOAA Climate Data Records (CDR)
Historical global climate information.
NOAA Climate Forecast System (CFS)
Model output data from the NOAA NCEP Climate Forecast System Version 2.
NOAA Digital Coast Imagery
High resolution (1 meter or less) imagery collected by a number of sources and contributed to the NOAA Digital Coast
NOAA GFS Warm Start Initial Conditions
Warm start initial conditions for the NOAA Global Forecast System.
NOAA GOES-R
Weather imagery from the GOES-16, GOES-17, and GOES-18 satellites.
NOAA Global Ensemble Forecast System (GEFS)
Model output data from the NOAA Global Ensemble Forecast System.
NOAA Global Forecast System (GFS)
Model output data from the NOAA Global Forecast System.
NOAA Global Hydro Estimator (GHE)
Global rainfall estimates in 15-minute intervals.
NOAA High-Resolution Rapid Refresh (HRRR)
Weather forecasts for North America at 3km spatial resolution and 15 minute temporal resolution.
NOAA Integrated Surface Data (ISD)
Historical global weather information.
NOAA Monthly US Climate Gridded Dataset (NClimGrid)
Gridded climate data for the US from 1895 to the present.
NOAA National Water Model
Data from the National Water Model.
NOAA Rapid Refresh (RAP)
Weather forecasts for North America at 13km resolution.
NOAA US Climate Normals
Typical climate conditions for the United States from 1981 to the present.
National Agriculture Imagery Program
NAIP provides US-wide, high-resolution aerial imagery. This data set includes NAIP images from 2010 to the present.
National Land Cover Database
US-wide data on land cover and land cover change at a 30m resolution with a 16-class legend.
NatureServe Map of Biodiversity Importance (MoBI)
Habitat information for 2,216 imperiled species occurring in the conterminous United States.
Ocean Observatories Initiative CamHD
Video data from the Ocean Observatories Initiative seafloor camera deployed at Axial Volcano on the Juan de Fuca Ridge.
Sentinel-1 GRD
Global synthetic aperture radar (SAR) data from 2017-present, projected to ground range.
Sentinel-1 GRD data are in preview; access is granted by request.
Sentinel-2 L2A
Global optical imagery at 10m resolution from 2016-present.
Sentinel-3 L2
Global multispectral imagery at 300m resolution, with a revisit rate of less than two days, from 2016-present.
Sentinel-3 data are in preview; access is granted by request.
Sentinel-5P
Global atmospheric data from 2018-present.
Sentinel-5P data are in preview; access is granted by request.
TerraClimate
Monthly climate and climatic water balance for global terrestrial surfaces from 1958-2019.
UK Met Office CSSP China 20CRDS
Historical climate data for China, from 1851-2010.
UK Met Office Global Weather Data for COVID-19 Analysis
Data for COVID-19 researchers exploring relationships between COVID-19 and environmental factors.
University of Miami Coupled Model for Hurricanes Ike and Sandy
Modeled wind, wave, and current data for Hurricanes Ike and Sandy, produced by the National Renewable Energy Laboratory.
USFS Forest Inventory and Analysis
Status and trends on U.S. forest location, health, growth, mortality, and production, from the US Forest Service’s Forest Inventory and Analysis (FIA) program.
USGS 3DEP Seamless DEMs
USGS Gap Land Cover
Legal stuff
Contributing
This project welcomes contributions and suggestions. Most contributions require you to agree to a Contributor License Agreement (CLA) declaring that you have the right to, and actually do, grant us the rights to use your contribution. For details, visit https://cla.opensource.microsoft.com.
When you submit a pull request, a CLA bot will automatically determine whether you need to provide a CLA and decorate the PR appropriately (e.g., status check, comment). Simply follow the instructions provided by the bot. You will only need to do this once across all repos using our CLA.
This project has adopted the Microsoft Open Source Code of Conduct. For more information see the Code of Conduct FAQ or contact opencode@microsoft.com with any additional questions or comments.
Trademarks
This project may contain trademarks or logos for projects, products, or services. Authorized use of Microsoft trademarks or logos is subject to and must follow Microsoft’s Trademark & Brand Guidelines. Use of Microsoft trademarks or logos in modified versions of this project must not cause confusion or imply Microsoft sponsorship. Any use of third-party trademarks or logos are subject to those third-party’s policies.