Skip to main content
ORNL DAAC HomeNASA Home

DAAC Home > Get Data > NASA Projects > Arctic-Boreal Vulnerability Experiment (ABoVE) > User guide

ABoVE: Land Cover, Methane Flux, and Environmental Data, Big Trail Lake, Fairbanks AK

Documentation Revision Date: 2025-03-03

Dataset Version: 1

Summary

This dataset provides gridded land cover classifications for the area north of Big Trail Lake, an active thermokarst lake in Goldstream Valley (near Fairbanks, Alaska, USA) at 10 cm resolution. Land cover classifications were derived from a supervised random forest classification using RedEdgeMX and multiSPEC4C multispectral drone imagery, normalized difference vegetation index (NDVI), and the USGS 3D Elevation Program (3DEP) digital surface model (DSM) elevation data. RedEdgeMX data was collected in July 2021 on and multiSPEC4C on 2019-08-04. Additionally, gridded microtopography and slope estimates were derived at 1m resolution. Field samples were collected at 15 sites across the study area in June 2021 and August 2022. Field data include methane (CH4) and carbon dioxide (CO2) fluxes, gross primary production (GPP), SIMPER microbial community analysis, methanogen abundance, soil characteristics, and meteorological characteristics. Gridded products are provided in cloud-optimized GeoTIFF format, field data are provided in comma-separated values format, and sample location photos in JPEG format are contained within a compressed file.

This dataset includes 14 data files: 10 in comma-separated values format (*.csv), three in cloud-optimized GeoTIFF (*.tif) format, and JPEG images in one compressed (*.zip) file.

Figure 1. Examples of flux sample locations across the study area. Photographs are included in the dataset, and are from site 2 ("20210715_160144_west_facing.jpg"), site 3 ("20210716_133333_east_facing.jpg"), and site 4 ("20210716_134946_west_facing.jpg").

Citation

Farina, M.K., W. Christian, J.D. Watts, T. Mcdermott, R. Hatzenpichler, G. Larue, S. Powell, H. Webb, N. Barnes, and K. Okano. 2025. ABoVE: Land Cover, Methane Flux, and Environmental Data, Big Trail Lake, Fairbanks AK. ORNL DAAC, Oak Ridge, Tennessee, USA. https://doi.org/10.3334/ORNLDAAC/2393

Table of Contents

  1. Dataset Overview
  2. Data Characteristics
  3. Application and Derivation
  4. Quality Assessment
  5. Data Acquisition, Materials, and Methods
  6. Data Access
  7. References

Dataset Overview

This dataset provides gridded land cover classifications for the area north of Big Trail Lake, an active thermokarst lake in Goldstream Valley (near Fairbanks, Alaska, USA) at 10 cm resolution. Land cover classifications were derived from a supervised random forest classification using RedEdgeMX and multiSPEC4C multispectral drone imagery, normalized difference vegetation index (NDVI), and the USGS 3D Elevation Program (3DEP) digital surface model (DSM) elevation data. RedEdgeMX data was collected in July 2021 on and multiSPEC4C on 2019-08-04. Additionally, gridded microtopography and slope estimates were derived at 1m resolution. Field samples were collected at 15 sites across the study area in June 2021 and August 2022. Field data include methane (CH4) and carbon dioxide (CO2) fluxes, gross primary production (GPP), SIMPER microbial community analysis, methanogen abundance, soil characteristics, and meteorological characteristics. 

Project:  Arctic-Boreal Vulnerability Experiment (ABoVE)

The Arctic-Boreal Vulnerability Experiment (ABoVE) is a NASA Terrestrial Ecology Program field campaign being conducted in Alaska and western Canada, for 8 to 10 years, starting in 2015. Research for ABoVE links field-based, process-level studies with geospatial data products derived from airborne and satellite sensors, providing a foundation for improving the analysis, and modeling capabilities needed to understand and predict ecosystem responses to, and societal implications of, climate change in the Arctic and Boreal regions.

Related Publication:

Farina, M., W. Christian, N. Hasson, T. McDermott, S. Powell, R. Hatzenpichler, H. Webb, G. LaRue, K. Okano, E. Sproles, J. Watts. 2025. Methane flux spatial heterogeneity driven by surface and subsurface conditions in boreal forest-fen mosaic. [Manuscript submitted to Environmental Research Letters for publication.]

Related Data:

Barnes, N., H. Webb, M.K. Farina, S. Powell, and J.D. Watts. 2021. Multispectral Imagery, NDVI, and Terrain Models, Big Trail Lake, Fairbanks, AK, 2019. ORNL DAAC, Oak Ridge, Tennessee, USA. https://doi.org/10.3334/ORNLDAAC/1834

  • The multiSPEC4C imagery and NDVI data from Barnes et al. (2021) data were used to produce the land cover classification data in this dataset.

Data Characteristics

Spatial Coverage: Site approximately 150 m north of Big Trail Lake, located in Goldstream Valley near Fairbanks, Alaska, US

Spatial Resolution: 

  • DTM_DSM_Microtopography_Slope_1m.tif: 1 m
  • Land_Classification_Inputs_10cm.tif and Land_Classification_Outputs_10cm.tif: 10 cm
  • Field samples: point locations

Temporal Coverage: 2019-08-04 to 2022-08-23

Temporal Resolution: Flux samples were collected 1-2 times during the July 2021 and August 2022 field surveys. Drone imagery was collected twice, on 2019-08-04 (RedEdgeMX) and in July 2021 (multiSPEC4C). Meteorological variables were collected hourly from 2021-07-07 to 2021-07-27. Six soil loggers collected soil temperature data half-hourly from 2021-07-10 to 2021-07-28.

Study Areas: Latitude and longitude are given in decimal degrees.

Site Westernmost Longitude Easternmost Longitude Northernmost Latitude Southernmost Latitude
Goldstream Valley, Alaska, US -147.82 -147.82 64.92 64.92

Data File Information

This dataset includes 14 data files: 10 in comma-separated values format (*.csv), three in cloud-optimized GeoTIFF (*.tif) format, and JPEG images in one compressed (*.zip) file.

Missing numeric data are indicated by -9999 and missing text data are represented by N/A.

GeoTIFF Files

This dataset contains three cloud-optimized GeoTIFF (*.tif) files:

  • DTM_DSM_Microtopography_Slope_1m.tif contains four bands of lidar-derived or estimated elevation characteristics over the study area at one meter resolution. Projection: UTM zone 6N (EPSG: 32606). Map units: meters.
    • Band 1 (USGS_3DEP_2017_Fairbanks_Lidar_DTM_aggregated): Digital terrain model (DTM) showing the elevation of the underlying terrain of the earth’s surface in meters. Data are from USGS 3D Elevation Program DTM (USGS, 2017).
    • Band 2 (USGS_3DEP_2017_Fairbanks_Lidar_DSM_aggregated): Digital surface model (DSM) showing elevation values of landscape features on the earth's surface in meters. Data are from USGS 3D Elevation Program DSM (USGS, 2017).
    • Band 3 (Microtopography_meters): Gridded estimates of microtopography
    • Band 4 (Slope_8pixel_neighborhood_degrees): Gridded estimates of slope calculated from 8-pixel neighborhood
  • Land_Classification_Inputs_10cm.tif contains 12 bands of land classification inputs used to produce land cover gridded outputs (Land_Classification_Outputs_10cm.tif) at 10 cm resolution. Projection: WGS 84 (EPSG: 4326). Map units: decimal degrees
    • Band 1 (Blue_REMX): Blue band from drone RedEdge-MX sensor.
    • Band 2 (Green_REMX): Green band from drone RedEdge-MX sensor.
    • Band 3 (Red_REMX): Red band from drone RedEdge-MX sensor.
    • Band 4 (RedEdge_REMX): Red-edge band from drone RedEdge-MX sensor.
    • Band 5 (NIR_REMX): Near-infrared (NIR) band from drone RedEdge-MX sensor.
    • Band 6 (DSM_REMX): Digital surface model (DSM; ellipsoidal elevation in meters) from drone RedEdge-MX sensor.
    • Band 7 (Green_MS4C): Green band from drone multiSPEC-4C sensor (resampled to match pixel grid of RedEdge-MX imagery).
    • Band 8 (Red_MS4C): Red band from drone multiSPEC-4C sensor (resampled to match pixel grid of RedEdge-MX imagery).
    • Band 9 (NIR_MS4C): Near-infrared (NIR) band from drone multiSPEC-4C sensor (resampled to match pixel grid of RedEdge-MX imagery).
    • Band 10 (DSM_MS4C): Digital surface model (DSM; orthometric elevation in meters) from drone multiSPEC-4C sensor (resampled to match pixel grid of RedEdge-MX imagery).
    • Band 11 (DTM_MS4C): Digital terrain model (DTM; orthometric elevation in meters) from drone multiSPEC-4C sensor (resampled to match pixel grid of RedEdge-MX imagery).
    • Band 12 (USGS_3DEP_DSM_resampled): USGS 3DEP digital surface model (DSM; orthometric elevation in meters), resampled to match pixel grid of RedEdge-MX imagery.
  • Land_Classification_Outputs_10cm.tif contains three bands of land cover classifications derived using supervised random forest classification at 10 cm resolution. Land cover classifications are represented by integer values where 1=Graminoid; 2=Moss; 3=Shadow; 4=Water; and 5=Woody. Projection: WGS 84 (EPSG: 4326). Map units: decimal degrees.
    • Band 1 (Combined_landcover_prediction): The combined land cover classification from the models using RedEdgeMX and multiSPEC4C multispectral drone imagery.
    • Band 2 (Predicted_landcover_using_RedEdgeMX_imagery): Land cover classification based on RedEdge-MX multispectral drone imagery.
    • Band 3 (Predicted_landcover_using_multiSPEC4C_imagery): Land cover classification based on multiSPEC-4C multispectral drone imagery.

CSV Files

This dataset contains 10 comma-separated values (CSV) files:

  • Table_1_Chamber_Fluxes_Insitu_Data_July_2021.csv: in situ methane flux observations collected in July 2021 and modeled variables.
  • Table_2_Detailed_Chamber_Flux_Estimates_July_2021_August_2022.csv: Contains chamber methane (CH4) and carbon dioxide (CO2) flux estimates and GPP estimates from field surveys conducted in July 2021 and August 2022.
  • Table_3_Methanogen_Methanotroph_Relative_Abundance.csv: relative abundance observations of methanogens and methanotrophs at for four soil depths (0-12.5 cm, 12.5-25 cm, 25-37.5 cm, 37.5 - 50 cm).
  • Table_4_Soil_Sample_Chemical_Analysis.csv: Contains soil chemical analyses that were conducted on soil samples from 17 locations at various depths.
  • Table_5_Simper_Microbial_Community_Analysis.csv: SIMPER microbial community analysis results for each amplicon sequence variant (ASV).
  • Table_6_Qiime2_taxonomy_output.csv: the raw output from Qiime2 which includes the total number of reads assigned to each ASV in each sample. The header rows of the file are structured such that row 1 contains the Sample_ID, row 2 contains the sample depth in cm (depth_cm), row 3 is blank, and row 4 contains the Soil sample ID number (#OTU ID). The following rows contain the taxaon in column 1 and total count of reads assigned to each ASV in each soil sample in subsequent columns.
  • Table_7_Hourly_Meteorology_Weather_Sensor.csv: meteorology data from weather station at hourly resolution from 2021-07-07- to 2021-07-27. The weather station was located at 64.91984738, -147.81891737.
  • Table_8_Halfhourly_Soil_Temperature_Logger_Data.csv: Contains soil temperature data at half-hourly resolution 2021-07-10 to 2021-07-28.
  • Table_9_Training_Testing_Data_RedEdgeMX_Classification.csv: training and testing datasets used to calibrate and validate the supervised random forest land cover classification based on RedEdge-MX multispectral drone imagery.
  • Table_10_Training_Testing_Data_multiSPEC4C_Classification.csv: training and testing data used to calibrate and validate the supervised random forest land cover classification based on RedEdge-MX multispectral drone imagery.

Table 1. Data dictionary for Table_1_Chamber_Fluxes_Insitu_Data_July_2021.csv

Variable Units Description
Sample_ID   Sample location ID (56 unique IDs)
Site_ID   Site ID (15 unique IDs)
Latitude degrees north Sample location latitude in decimal degrees
Longitude degrees east Sample location longitude in decimal degrees
Vegetation_class   Vegetation class of sample: 1. Moss, sedge, field horsetail; 2. Sedge, grass, moss; 3. Moss, field horsetail, grass, sedge; 4. Variegated horsetail, moss, sedge; 5. Grass, sedge, moss; 6. Field horsetail, moss, grass, sedge; 7. Moss, field horsetail; 8. Dwarf shrub, moss, field horsetail, grass, sedge; 9. Moss.
Vegetation_details   Vegetation details at individual sample locations.
Mean_CH4_flux mg m-2 d-1 Mean CH4 flux across three repetitions
CH4_flux_category   CH4 flux category (VH=Very High; High=High; Int=Intermediate; Low=Low; Neg=Negative)
VWC_6cm

percent

Soil volumetric water content at 6 cm depth
VWC_12cm percent Soil volumetric water content at 12 cm depth
VWC_20cm percent Soil volumetric water content at 20 cm depth
SoilT_5cm degrees C Soil temperature at 5 cm depth
SoilT_10cm degrees C Soil temperature at 10 cm depth
Soil_pH_3cm   Soil pH at 2.5 cm depth

 

Table 2. Data dictionary for Table_2_Detailed_Chamber_Flux_Estimates_July_2021_August_2022.csv

Variable Units Description
Sample_ID   ID of sample location (56 unique sample locations). An asterisks (*) indicates flux samples that were taken in exactly the same locations as the July 2021 samples. Double asterisks (**) indicates flux samples that were taken in approximately the same locations as the July 2021 samples.
Chamber   LI-COR 8200-01S Smart Chamber (SC); Polycarbonate chamber with opaque cover (PC_cover); Polycarbonate chamber without opaque cover (PC_no_cover)
Rep   Repetition (1-3)
Date YYYY-MM-DD Date of flux sampling.
Time_AKDT   Time of flux sampling (Alaska Daylight Time).
Orig_CH4_flux nmol m-2 s-1 CH4 flux before adjusting for chamber headspace volume filled by vegetation.
Orig_CO2_flux umol m-2 s-1 CO2 flux before adjusting for chamber headspace volume filled by vegetation.
Veg_vol percent Estimated percent headspace volume filled by vegetation.
Adj_CH4_flux nmol m-2 s-1 CH4 flux after adjusting for chamber headspace volume filled by vegetation.
Adj_CO2_flux umol m-2 s-1 CO2 flux after adjusting for chamber headspace volume filled by vegetation.
GPP umol m-2 s-1 Gross Primary Productivity.

 

Table 3. Data dictionary for Table_3_Methanogen_Methanotroph_Relative_Abundance.csv

Variable Units Description
Sample_ID   Sample location ID (56 unique IDs)
Depth_cm cm Soil sample depth
Methanogen_Rel_Abundance percent Methanogen relative abundance
Methanotroph_Rel_Abundance percent Methanotroph relative abundance

 

Table 4. Data dictionary for Table_4_Soil_Sample_Chemical_Analysis.csv

Variable Units Description
Sample_ID   Sample location ID (56 unique IDs)
Depth_cm cm Soil sample depth
pH   Soil pH
OM percent Organic matter
EC mmhos cm-1 Electrical conductivity
Nitrate_N ppm Nitrate-nitrogen concentration
NH4_N ppm Ammonium-nitrogen concentration
P_Olsen ppm Phosphorus (Olsen test) concentration
K ppm Potassium concentration
Ca ppm Calcium concentration
Mg ppm Magnesium concentration
Na ppm Sodium concentration
S ppm Sulfate-sulfur concentration
CEC meq (100 g)-1 Cation exchange capacity in milliequivalents per 100 grams of soil
K_base_sat percent Base saturation K
Ca_base_sat percent Base saturation Ca
Mg_base_sat percent Base saturation Mg
Na_base_sat percent Base saturation Na
CCE percent Calcium carbonate equivalent

 

Table 5. Data dictionary for Table_5_Simper_Microbial_Community_Analysis.csv

Variable Units Description
SIMPER_Order   Simper Order
ASV   Amplicon sequence variant (ASV)
More_abundant_in_Intermediate_CH4_flux_samples   Was the AVS more abundant in Intermediate CH4 flux samples? (yes/no)
Domain   Domain
Phyla   Phyla
Class   Class
Order   Order
Family   Family
Genus   Genus
Cumulative_Sum 1 The cumulative contribution of each ASV to the difference between two groups (samples from the intermediate CH4 flux category, and samples from all other CH4 flux categories). The column is ordered with the highest contributors at the top.Thus the cumulative sum in the first row is the contribution from the 1st most distinguishing organism, the second row is the sum of rows 1 and 2, and so on. Units = proportion.
p_value 1 Permutation-based p-value. The probability of getting a larger or equal average contribution for each ASV if the grouping factor (CH4 flux category) was randomly permuted

 

Table 6. Table_6_Qiime2_taxonomy_output

Variable Units Description
Sample_ID   Sample location ID (56 unique IDs)
Depth_cm cm Soil sample depth (cm)
OTUID   Soil sample ID number. Note that AK64 , AK65, AK66, AK67 are DNA extraction blanks and PCRB1, PCRB1D, PCRB1Z, PCRB5, PCRB5Z PCR blanks

 

Table 7. Data dictionary for Table_7_hourly_meteorology_weather_sensor.csv

Variable Units Description
Date_time_AKDT YYYY-MM-DD hh:mm:ss Local date and time (Alaska Daylight Time, UTC−08:00)
RECORD   Record number
BattV_Avg V Battery voltage (Avg)
BattV_Max V Battery voltage (Max)
BattV_Min V Battery voltage (Min)
BattV V Battery voltage (Smp - the current/instantaneous value at the time of measurement)
BattV_Std V Battery voltage (Std)
PTemp_C_Avg degrees C Average Wiring Panel Temperature of the Campbell Scientific CR300 data logger (Avg)
PTemp_C_Max degrees C Average Wiring Panel Temperature of the Campbell Scientific CR300 data logger (Max)
PTemp_C_Min degrees C Average Wiring Panel Temperature of the Campbell Scientific CR300 data logger (Min)
PTemp_C degrees C Average Wiring Panel Temperature of the Campbell Scientific CR300 data logger (Smp - the current/instantaneous value at the time of measurement)
PTemp_C_Std degrees C Average Wiring Panel Temperature of the Campbell Scientific CR300 data logger (Std)
SlrFD_W_Avg W m-2 Solar lux density (Avg)
SlrFD_W_Max W m-2 Solar flux density (Max)
SlrFD_W_Min W m-2 Solar flux density (Min)
SlrFD_W W m-2 Solar flux density (Smp - the current/instantaneous value at the time of measurement)
SlrFD_W_Std W m-2 Solar flux density (Std)
Rain_mm_Tot mm Total rainfall (Tot)
Strikes_Tot 1 Count of lightning strikes (Tot)
Dist_km_Avg km Distance to lightning strikes (Avg)
Dist_km_Max km Distance to lightning strikes (Max)
Dist_km_Min km Distance to lightning strikes (Min)
Dist_km km Distance to lightning strikes (Smp - the current/instantaneous value at the time of measurement)
Dist_km_Std km Distance to lightning strikes (Std)
WS_ms_Avg m s-1 Wind speed (Avg)
WS_ms_Max m s-1 Wind speed (Max)
WS_ms_Min m s-1 Wind speed (Min)
WS_ms m s-1 Wind speed (Smp - the current/instantaneous value at the time of measurement)
WS_ms_Std m s-1 Wind speed (Std)
WS_ms_S_WVT m s-1 Wind speed (WVc)
WindDir_D1_WVT degrees Wind direction (WVc)
WindDir_SD1_WVT degrees Wind direction (WVc)
WindDir degrees Wind direction (Smp)
MaxWS_ms_Avg m s-1 Max wind speed (Avg)
MaxWS_ms_Max m s-1 Max wind speed (Max)
MaxWS_ms_Min m s-1 Max wind speed (Min)
MaxWS_ms m s-1 Max wind speed (Smp - the current/instantaneous value at the time of measurement)
MaxWS_ms_Std m s-1 Max wind speed (Std)
AirT_C_Avg degrees C Air temperature (Avg)
AirT_C_Max degrees C Air temperature (Max)
AirT_C_Min degrees C Air temperature (Min)
AirT_C degrees C Air temperature (Smp - the current/instantaneous value at the time of measurement)
AirT_C_Std degrees C Air temperature (Std)
VP_mbar_Avg mbar Vapor pressure (Avg)
VP_mbar_Max mbar Vapor pressure (Max)
VP_mbar_Min mbar Vapor pressure (Min)
VP_mbar mbar Vapor pressure (Smp - the current/instantaneous value at the time of measurement)
VP_mbar_Std mbar Vapor pressure (Std)
BP_mbar_Avg mbar Barometric pressure (Avg)
BP_mbar_Max mbar Barometric pressure (Max)
BP_mbar_Min mbar Barometric pressure (Min)
BP_mbar mbar Barometric pressure (Smp - the current/instantaneous value at the time of measurement)
BP_mbar_Std mbar Barmetric pressure (Std)
RH_Max percent Relative humidity (Max)
RH_Min percent Relative humidity (Min)
RH percent Relative humidity (Smp - the current/instantaneous value at the time of measurement)
RHT_C_Avg degrees C Temperature reading from the relative humidity & temperature sensor (Avg)
RHT_C_Max degrees C Temperature reading from the relative humidity & temperature sensor (Max)
RHT_C_Min degrees C Temperature reading from the relative humidity & temperature sensor (Min)
RHT_C degrees C Temperature reading from the relative humidity & temperature sensor (Smp - the current/instantaneous value at the time of measurement)
RHT_C_Std degrees C Temperature reading from the relative humidity & temperature sensor (Std)
TiltNS_deg_Avg degrees North/South orientation of the ClimaVue instrument (Avg)
TiltNS_deg_Max degrees North/South orientation of the ClimaVue instrument Tilt (Max)
TiltNS_deg_Min degrees North/South orientation of the ClimaVue instrument Tilt (Min)
TiltNS_deg degrees North/South orientation of the ClimaVue instrument Tilt (Smp - the current/instantaneous value at the time of measurement)
TiltNS_deg_Std degrees North/South orientation of the ClimaVue instrument Tilt (Std)
TiltWE_deg_Avg degrees West/East orientation of the ClimaVue instrument Tilt (Avg)
TiltWE_deg_Max degrees West/East orientation of the ClimaVue instrument Tilt (Max)
TiltWE_deg_Min degrees TWest/East orientation of the ClimaVue instrument Tilt (Min)
TiltWE_deg degrees West/East orientation of the ClimaVue instrument Tilt (Smp - the current/instantaneous value at the time of measurement)
TiltWE_deg_Std degrees West/East orientation of the ClimaVue instrument Tilt (Std)
SlrTF_MJ_Tot MJ m-2 Total solar radiation over the measurement interval
CVMeta   Data logger ID
Invalid_Wind_Avg 1 Count of invalid wind measurements (Avg). 
Invalid_Wind_Max 1 Count of invalid wind measurements (Max).
Invalid_Wind_Min 1 Count of invalid wind measurements (Min).
Invalid_Wind 1 Count of invalid wind measurements (Smp - the current/instantaneous value at the time of measurement).
Invalid_Wind_Std 1 Count of invalid wind measurements (Std).
Invalid_Wind_Tot 1 Count of invalid wind measurements (Tot).

 

Table 8. Data dictionary for Table_8_Halfhourly_Soil_Temperature_Logger_Data.csv

Variable Units Description Logger Latitude Logger Longitude
Date_time_AKDT YYYY-MM-DD hh:mm:ss Date and time (Alaska Daylight Time)    
SoilT_C_logger1 degrees C Soil temperature logger-1 64.92185221 -147.8203832
SoilT_C_logger2 degrees C Soil temperature logger-2 64.92252362 -147.8204773
SoilT_C_logger3 degrees C Soil temperature logger-3 64.9219876 -147.8192036
SoilT_C_logger4 degrees C Soil temperature logger-4 64.92197142 -147.8212274
SoilT_C_logger5 degrees C Soil temperature logger-5 64.92243131 -147.8210107
SoilT_C_logger6 degrees C Soil temperature logger-6 64.9223092 -147.8198968

 

Table 9. Data dictionary for Table_9_Training_Testing_Data_RedEdgeMX_Classification.csv

Variable Units Description
Latitude degrees north Latitude of training/testing point in decimal degrees.
Longitude degrees east Longitude of training/testing point in decimal degrees.
train_test   Indicates training or testing point.
Class   Ground-truth based on visual assessment of high-resolution drone imagery (1=Graminoid; 2=Moss; 3=Shadow; 4=Water; 5=Woody).
remx_blue 1 RedEdge-MX blue band; brightness value as a digital number.
remx_green 1 RedEdge-MX green band; brightness value.
remx_red 1 RedEdge-MX red band; brightness value..
remx_rededge 1 RedEdge-MX rededge band; brightness value.
remx_NIR 1 RedEdge-MX near-infrared (NIR) band; brightness value.
remx_NDVI 1 Normalized vegetation difference index (NDVI) calculated from RedEdge-MX imagery.
USGS_DSM m USGS 3DEP digital surface model (DSM; orthometric elevation)

 

Table 10. Data dictionary for Table_10_Training_Testing_Data_multiSPEC4C_Classification.csv

Variable Units Description
Latitude degrees north Latitude of training/testing point in decimal degrees.
Longitude degrees east Longitude of training/testing point in decimal degrees.
train_test   Indicates training or testing point.
Class   Ground-truth based on visual assessment of high-resolution drone imagery (1=Graminoid; 2=Moss; 3=Shadow; 4=Water; 5=Woody).
ms4c_NIR 1 multiSPEC-4C green band; reflectance
ms4c_red 1 multiSPEC-4C red band; reflectance.
ms4c_green 1 multiSPEC-4C near-infrared (NIR) band; reflectance.
ms4c_NDVI 1 Normalized vegetation difference index (NDVI) calculated from multiSPEC-4C imagery.
USGS_DSM m USGS 3DEP digital surface model (DSM; orthometric elevation)

Compressed Zip File

The dataset contains one compressed file, Sample_Location_Photos_July_2021.zip. Enclosed are 326 photos in JPEG (*jpg) format that were taken at soil sampling locations in 2021. There are 15 subdirectories, one for each Site_ID. The JPEG files are named YYYYMMDD_hhmmss_ Sample_ID.jpg, where YYYYMMDD_hhmmss is the timestamp and Sample_ID­ is the sample location ID.

Application and Derivation

This dataset provides high-resolution multispectral drone imagery (spatial resolution ~10 cm) and land cover maps (spatial resolution 1 m) over a boreal forest-fen mosaic ecosystem. Most variables were collected (or modeled) over multiple points in space (single time point), which may offer insight in the spatial relationships between carbon fluxes and driver variables (e.g., spatial relationships between CH4 flux and soil moisture, or between CH4 flux and soil microbial community composition). Additionally, soil microbial community analyses provide information about the community structure across space and depth.

Please see Farina et al. 2025 for additional details

Quality Assessment

Chamber flux estimates: Three of the 168 samples had poor model fits with either the CH4 or CO2 concentration data and were removed from the dataset. Among the remaining 165 CH4 flux estimates, 115 were fitted with exponential models, and 50 were fitted with linear models. The majority of these fitted models (96%) had R2 values greater than 0.9, and the remaining 4% of fitted models had R2 values between 0.77 and 0.87. The three replicate CH4 fluxes were averaged to calculate a mean flux per sample location. Among the 165 CO2 flux (ecosystem respiration; Reco) estimates, 158 samples were fitted with exponential models, and seven were fitted with linear models (R2 values ranging 0.94 – 0.99).

Soil moisture measurements: Across the 56 sampling locations, the Hydrosense II measurements of period (τ) ranged 2.339 – 3.832 μs for the 20 cm rod inserted vertically; 1.273 – 2.402 μs for the 12 cm rod inserted vertically; and 1.172 – 2.367 μs for the 12 cm rod inserted at a 30° angle. These measurements fell within valid ranges of τ. For the 20 cm rod, the valid range of τ is 1.38 μs (in air) to 3.89 μs (in water); for the 12 cm rod, τ ranges from approximately 1.0 μs (in air) to 2.47 μs (in water) (Campbell Scientific).

Other in situ measurements: The HI98331 probe has a soil temperature accuracy of ±1 °C, and the HI981030 Soil pH tester has an accuracy of ±0.05 pH.

Please see Farina et al. 2025 for additional details.

Data Acquisition, Materials, and Methods

GeoTIFF files

The USGS 3DEP DTM and DSM rasters (tile ID 4520; 0.5 m spatial resolution; USGS, 2017) were georeferenced to match the RedEdge-MX drone imagery (~10 cm spatial resolution; collected July 2021) as precisely as possible. Georeferencing was done using a third degree polynomial transformation algorithm and nearest neighbor resampling. The georeferenced DTM raster was aggregated to 1 m spatial resolution and projected to the WGS 84 / UTM Zone 6N coordinate reference system (EPSG:32606). The aggregated raster was used for subsequent modeling. The local mean DTM elevation was calculated for a 31 by 31 m moving window, and per-grid cell microtopography was calculated as the local mean subtracted from the 1-m DTM values. Gridded slope estimates were calculated using 8-pixel neighborhoods.

There are limitations to the approach used to model microtopography. USGS 3DEP digital terrain model (DTM) data was smoothed by calculating the moving window (31 m x 31 m) mean value. Per-grid cell microtopography was calculated as the local mean subtracted from the original DTM. However, the optimal degree of smoothing (i.e., moving window size) is uncertain. The 31-m window size was selected because the resulting smoothed DTM seemed to capture the broad scale gradient, but a larger or smaller window size could have a substantial impact on modeled topography (Erdbrügger et al., 2021).

The multiSPEC-4C imagery is described in more detail by Barnes et al. (2021).

Land cover classes (graminoid, moss, woody vegetation, shadow, and surface water) were mapped across the study area using supervised random forest classification informed by the drone multispectral imagery, normalized difference vegetation index (NDVI), and the USGS 3D Elevation Program (3DEP) digital surface model (DSM) elevation. Separate random forest classification models were calibrated and validated using the multiSPEC-4C imagery collected in August 2019 and the RedEdge-MX imagery collected in July 2021. In the multiSPEC-4C classification, predictor variables included the green, red, near-infrared (NIR), and NDVI layers from the multiSPEC-4C sensor, and 3DEP DSM. In the RedEdge-MX classification, predictor variables included the blue, green, red, red edge, NIR, and NDVI layers from the RedEdge-MX sensor, and 3DEP DSM.

Each classification model was calibrated using a dataset of 90 observations (20 graminoid, 10 moss, 20 woody vegetation, 20 shadow, 20 surface water), and validated using an independent dataset of 90 observations. The training and testing datasets were generated using visual assessment of the high-resolution imagery in QGIS mapping software (QGIS,2024). The supervised random forest classifications were run using the randomForest package in R (Liaw and Wiener, 2002). The two resulting classifications were combined to minimize the number of pixels classified as shadow. Because the RedEdge-MX imagery was collected during the same field campaign as the chamber flux and in situ data collection (July 2021), land cover predictions from the RedEdge-MX classification were selected as the baseline. Pixels that were predicted as shadow by the RedEdge-MX classification, and predicted as non-shadow by the multiSPEC-4C classification, were assigned the multiSPEC-4C prediction in the combined classification.

CSV Files:

For 56 methane flux sample locations, in situ observations (Table_1_Chamber_Fluxes_Insitu_Data_July_2021.csv) were collected in July 2021 (3 replicates per location, totaling 168 samples). Mean CH4 flux is the mean of three repetitions of chamber flux estimates. Modeled microtopography (m) and slope (degrees) were extracted from the gridded datasets for each flux sample location. Flux data were collected with a LI-COR Smart Chamber.

A Hydrosense II sensor with 20 cm and 12 cm probes was used to sample soil moisture (Campbell Scientific, Logan, UT, USA). To measure soil moisture at 20 cm and 12 cm depths, the 20 cm and 12 cm probes were inserted into the ground vertically. To measure soil moisture at 6 cm depth, the 12 cm probe was inserted into the ground at a 30° angle. To estimate soil moisture as volumetric water content (VWC), the Hydrosense II measurements of period (τ) were converted to percent VWC (Table_1_Chamber_Fluxes_Insitu_Data_July_2021.csv) using a calibration equation developed for organic boreal soils. The Soil Type A equation developed by Bourgeau-Chavez and Loboda (2018) was used for non-burned, organic soils. After calibration to percent VWC, the soil moisture values ranged from 29.0% to 88.2% (20 cm depth), 24.8% to 54.6% (12 cm depth), and 14.8% to 54.6% (6 cm depth).

Chamber methane (CH4) and carbon dioxide (CO2) flux estimates from field surveys conducted in July 2021 and August 2022 (Table_2_Detailed_Chamber_Flux_Estimates_July_2021_August_2022.csv) are included. For each flux estimate repetition, the original flux estimate, the estimated chamber headspace occupied by vegetation (percent volume), and the adjusted flux estimate, which accounts for the vegetation volume, are reported. The percent volume of chamber headspace filled by vegetation was estimated visually. CH4 and CO2 flux data were collected with either a LI-COR 8200-01S Smart Chamber (SC); Polycarbonate chamber with opaque cover (PC_cover); or Polycarbonate chamber without opaque cover (PC_no_cover). Each method used three replicates per sample. CO2 flux estimates without the opaque cover represent net ecosystem exchange (NEE), whereas CO2 flux estimates with the opaque cover represent ecosystem respiration (Reco). Gross primary productivity (GPP) was estimated as Reco - NEE. If the estimated GPP was a negative value, then the GPP was set to zero; and (c) CH4 and CO2 flux data collected with the LI-COR Smart Chamber at a subset of sample locations during the August 2022 field survey.

Relative abundance observations of methanogens and methanotrophs (Table_3_Methanogen_Methanotroph_Relative_Abundance.csv) were conducted at for four soil depths (0-12.5 cm, 12.5-25 cm, 25-37.5 cm, 37.5 - 50 cm) for subset of sample locations for which soil samples were collected (n = 17).

Soil chemical analyses (Table_4_Soil_Sample_Chemical_Analysis.csv) were conducted on soil samples from 17 locations at various depths. Measured variables included: 1.) general soil properties including pH, percent organic matter (OM), soluble salts (electrical conductivity, EC 1:1), sodium concentration (Na); 2.) macronutrient concentrations including nitrate-nitrogen, ammonium-nitrogen (NH4), phosphorous (P; Olsen method), potassium (K), calcium (Ca), magnesium (Mg), and sulfate-sulfur (S); and 3.) base saturation (K, Ca, Mg, Na).

SIMPER Microbial Community Analysis (Table_5_Simper_Microbial_Community_Analysis.csv) was used to compare microbial communities from the medium methane flux sites to the microbial communities from the high, low, and negative methane flux sites.

Meteorological data (Table_7_Hourly_Meteorology_Weather_Sensor.csv) were collected in July 2021 from a weather station that was located at 64.91985 latitude, -147.8189 longitude. Meteorological data were collected with a Campbell Scientific ClimaVue50 digital weather sensor, which collected measurements at 5 second resolution. Values in the data file are summarized in hourly increments.

At each sample location, soil temperature at 5 cm and 10 cm depth (Table_8_Halfhourly_Soil_Temperature_Logger_Data.csv), and soil pH at 2.5 cm and 6.4 cm depth (Table_1_Chamber_Fluxes_Insitu_Data_July_2021.csv), were collected using HI98331 and HI981030 GroLine soil probes (Hanna Instruments, Woonsocket, RI, USA). The soil pH probe was calibrated using pH 4.01 and 7.01 calibration buffer solutions.

Please see Farina et al. 2025 for additional details

Data Access

These data are available through the Oak Ridge National Laboratory (ORNL) Distributed Active Archive Center (DAAC).

ABoVE: Land Cover, Methane Flux, and Environmental Data, Big Trail Lake, Fairbanks AK

Contact for Data Center Access Information:

References

Barnes, N., H. Webb, M.K. Farina, S. Powell, and J.D. Watts. 2021. Multispectral Imagery, NDVI, and Terrain Models, Big Trail Lake, Fairbanks, AK, 2019. ORNL DAAC, Oak Ridge, Tennessee, USA. https://doi.org/10.3334/ORNLDAAC/1834

Bourgeau-Chavez, L., and T. Loboda. 2018. Organic soil calibration algorithms for the Campbell Scientific handheld Hydrosense-I and II units.

Erdbrügger, J., I. van Meerveld, K. Bishop, and J. Seibert. 2021. Effect of DEM-smoothing and-aggregation on topographically-based flow directions and catchment boundaries. Journal of Hydrology 602:126717. https://doi.org/10.1016/j.jhydrol.2021.126717

Farina, M., W. Christian, N. Hasson, T. McDermott, S. Powell, R. Hatzenpichler, H. Webb, G. LaRue, K. Okano, E. Sproles, J. Watts. 2025. Methane flux spatial heterogeneity driven by surface and subsurface conditions in boreal forest-fen mosaic. [Manuscript submitted to Environmental Research Letters for publication.]

Liaw, Andy, and Matthew Wiener. 2002. Classification and regression by randomForest. R News 2:18-22. https://journal.r-project.org/articles/RN-2002-022/RN-2002-022.pdf

U.S. Geological Survey (USGS). 2017. 2017 USGS 3DEP Lidar: Fairbanks, AK (QL1 & QL2) from 2010-06-15 to 2010-08-15. NOAA National Centers for Environmental Information, https://www.fisheries.noaa.gov/inport/item/55358

QGIS. 2024. QGIS Geographic Information System. Open Source Geospatial Foundation Project. http://qgis.org