Prioritizing Datasets for Archival at the ORNL DAAC
This policy provides guidance on setting the order in which datasets submitted to the ORNL DAAC are archived. It starts with the characteristics of the highest-priority datasets and works down to the lowest-priority datasets.
- Data derived from NASA Terrestrial Ecology and related programs and projects
Generally, datasets from the NASA Terrestrial Ecology and related programs (Carbon Monitoring Systems (CMS), Interdisciplinary Science (IDS), Carbon Cycle Science, etc.) are processed in the order they are received (first in, first out). The order in which datasets are processed may be adjusted based on the condition and quality of the data and documentation when received and on how quickly investigators respond to questions. For example, datasets that require more work and iteration with the investigators or compilers will be given lower priority than datasets that require less checking and fewer iterations.
- Data associated with a published manuscript
Many journals now require investigators to have data archived, with a citation and DOI, when an article is published. For the ORNL DAAC, archiving data associated with a manuscript means that, based on funding source and relevance to the ORNL DAAC, we establish a process that begins when we receive the data and documentation and the manuscript enters peer review. In short, we will work with the investigator to develop the dataset (data files, metadata, and documentation) so that, within a few (2-3) weeks of the authors being notified that the manuscript has been accepted, we can archive the data and provide a citation and DOI. We will also provide a temporary URL to the preliminary data files for peer reviewers.
- Other data (UWG suggestions, self-nominations, etc.)
For data that do not fall into one of the above two categories, we will evaluate their relevance to the ORNL DAAC's mission and their funding source to determine which, if any, data will be archived at the DAAC. These data will be processed after data in the two categories above.
Datasets are assigned to batches of 10 to 15 and are typically processed within a few weeks, depending on their complexity and condition.
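The ordering described above can be illustrated with a short sketch. This is only one possible reading of the policy, not the DAAC's actual tooling: the names `Category`, `Submission`, `needs_extra_iteration`, `processing_order`, and `batches` are hypothetical, the yes/no flag is a simplification of the "condition and quality" adjustment, and the fixed batch size of 15 is an assumption taken from the upper end of the stated 10 to 15 range.

```python
from dataclasses import dataclass
from datetime import date
from enum import IntEnum


class Category(IntEnum):
    # Lower value = processed earlier, following the order of the categories above.
    NASA_TE_PROGRAM = 1   # NASA Terrestrial Ecology and related programs
    MANUSCRIPT = 2        # data associated with a published manuscript
    OTHER = 3             # UWG suggestions, self-nominations, etc.


@dataclass
class Submission:
    name: str
    category: Category
    received: date
    # Hypothetical flag: True when the dataset needs extra checking or
    # more iterations with the investigators (an assumed simplification).
    needs_extra_iteration: bool = False


def processing_order(submissions):
    """Sort by category, then by the rework flag, then first in, first out."""
    return sorted(
        submissions,
        key=lambda s: (s.category, s.needs_extra_iteration, s.received),
    )


def batches(ordered, size=15):
    """Split an ordered list into consecutive batches of at most `size` items."""
    return [ordered[i:i + size] for i in range(0, len(ordered), size)]
```

In this sketch the rework flag only reorders datasets within a category; whether a slow-moving dataset should ever drop below a lower-priority category is a judgment the policy leaves to DAAC staff.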