đź§ 
Open Data Commons
  • Open Data Commons
    • Principles of the ODC
  • How to get help
    • General help
    • FAQ
    • Let us know
  • đź“’Tutorials & Documentation
    • Getting started
    • Demo site
    • Getting your data ready
      • Required variables
      • Data format
      • Data dictionary
      • Common errors for Dataset and Data Dictionary
    • Upload to ODC
      • Upload new data
      • Upload a dictionary
      • Upload a Methodology file
      • Supplementary files
      • Common errors during upload
    • Manage a dataset
      • Update a dataset
      • Add metadata
      • Share data
    • Publish your dataset with a DOI
      • Request a DOI
      • Data Quality Checks for DOI
      • Publication in ODC-SCI
        • Summary of review process
      • Publication in ODC-TBI
    • Adding an experimental protocol to a dataset
    • How to cite ODC and dataset
    • Manage a lab
    • Get a reviewer token
    • Estimating costs for data management and sharing
    • Sample DMS
    • ODC Standards
      • Data formatting specifications
      • Common Terminology
        • ODC-SCI CoDEs
      • Metadata standards
        • ODC data dictionary
        • ODC Narrative and Metadata
    • Glossary
  • 🛠️ODC Tool Sandbox
    • Tool Sandbox
      • ODC quality control app
    • For developers
  • đź“—Fundamentals
    • Why share data with ODC?
    • What are the different account types on the ODC?
    • How does privacy work on the ODC?
    • FAIR data
  • âž•Extras
    • The ODC team
      • About ODC-SCI
      • About ODC-TBI
    • Funding and support
      • ODC-SCI funding
      • ODC-TBI funding
    • Publications
    • Our blogs
    • Workshops and Outreach
    • What people are saying
    • Terms of use and policies
Powered by GitBook
On this page

Was this helpful?

Edit on GitHub
Export as PDF
  1. Tutorials & Documentation

Estimating costs for data management and sharing

Information and tools for understanding and estimating costs involved in managing and sharing data

PreviousGet a reviewer tokenNextSample DMS

Last updated 2 years ago

Was this helpful?

Investigators may include costs involved managing and sharing data according to their DMS. Currently, there is no fee for sharing data through the Open Data Commons for data under 30 Gb in size.

Guidance from NIH on allowable costs: Reasonable, allowable costs may be included in NIH budget requests when associated with:

  1. Curating data and developing supporting documentation, including formatting data according to accepted community standards; de-identifying data; preparing metadata to foster discoverability, interpretation, and reuse; and formatting data for transmission to and storage at a selected repository for long-term preservation and access.

  2. Local data management considerations, such as unique and specialized information infrastructure necessary to provide local management and preservation (e.g., before deposit into an established repository).

  3. Preserving and sharing data through established repositories, such as data deposit fees necessary for making data available and accessible. For example, if a Data Management and Sharing Plan proposes preserving and sharing scientific data for 10 years in an established repository with a deposition fee, the cost for the entire 10-year period must be paid prior to the end of the period of performance. If the Plan proposes deposition to multiple repositories, costs associated with each proposed repository may be included.

The question is, how to estimate these costs? In our experience, researchers tend to underestimate the amount of time and effort involved in managing and sharing data. Depending on the institution and the size, nature and complexity of the data, the major costs are usually not storage or access, but rather personnel. If you do not have a dedicated data steward in your lab, you will have to ensure that you budget the required personnel to manage and share the data.

Some resources that can help:

Cost drivers for data adapted from the

  • : (of course, costs at your institution will be different, but it is a pretty complete guide to factors that should be considered and some cost saving tips).

  • Tool for estimating data submission costs. Some aspects are specific for this repository but many aspects are generic and can apply to sharing through any repository.

  • Estimating costs for using commercial clouds. Some good resources and advice can be found here: . Additional resources can be found under “Resources and Tools”.

  • Guidance for data management and sharing costs on NIH budget requests, adapted from materials developed by

đź“’
NOT-OD-21-015
National Academies of Science report on Lifecycle Decisions for Biomedical Data
Utrecht University Data Management Cost Estimation Tool
NIMH Data Archive (NDA) cost estimation tool:
https://training.incf.org/cloud-based-computer-matrix/costs
UCSF
21KB
Cost drivers for data management and submission (1).docx
Cost drivers for data adapted from the
82KB
Guidance on costs (1).pdf
pdf
Advice on cost drivers and sample language for budget statements
National Academies of Science report on Lifecycle Decisions for Biomedical Data