Skip to main content
ORNL DAAC HomeNASA Home

DAAC Home > Submit Data > CSV Standards

CSV Standards

Back to Submit Data

General Formatting

  • Use ".csv" as the file extension
  • File names in snake case; i.e., lowercase with underscores like "example_file_name.csv" instead of spaces
  • Single header row with descriptive column names
  • Include one column that can be used as the unique identifier; the data type of that column can be either text or number
  • Do not include URLs in the data file
  • Columns delimited by commas
  • No empty lines or rows
  • Each row should have the same number of columns

Spatial Data

  • Latitude and longitude in separate columns with names "latitude" and "longitude" or "lat" and "lon"
  • Latitude and longitude given as decimal degrees
  • If a data file is not based on WGS 84 geographic coordinate system (e.g., EPSG:4326), use CoordX and CoordY as the column names
  • An optional *.prj file can be used to explicitly specify the projected coordinate system

Variables and Values

  • Column names in snake case; i.e., lowercase with underscores like "example_column_name" instead of spaces
  • Missing data for numeric columns coded as "-9999" and uses the appropriate level of precision
  • Missing data for text columns coded as "NA"
  • If a text field includes commas and quotes, save the file with the option of placing quotes around all text fields
  • Dates and times in 24 hour UTC; use local time zones only in addition to UTC
  • Dates in YYYY-MM-DD format
  • Times in hh:mm:ss format (or hh:mm:ss+nn if a time zone needs to be included)
  • Named sites and locations should have an associated geographic location
  • Columns should contain only text values or only numeric values, not a mixture of both. While many tools (such as Excel) will let you mix text values and numeric values in a single column, such as in the example below, this makes the data much more difficult to work with.

    The following is an example of what you should not do -- mixing numeric and text values in a single column:

    estimated_depth shrub_cover
    4 30
    6 > 75
    4 to 5 25
    5 65

See Best Practices for Data Management for additional guidance.