You can get a digital object identifier (DOI) for your dataset when you are ready to release your data to the public. DOIs are persistent identifiers that help to make your dataset findable and for other to cite your work!
Issuing a DOI for your data requires some extra information beyond getting your data ready. You will need:
Only PI and Managers can request DOIs since at the end of the process the data will be ready to be released to the public.
Make sure you have your dataset, data dictionary and metadata ready for DOI! This will reduce the time and work to do.
Make sure you are logged in as the PI or Lab Manager for the ODC lab and check that your Current Lab is correct. If you need to switch labs, use the “Switch Lab” option on the navigation menu.
Select “Current Lab” on the navigation menu as the PI or Lab Manager.
In the Dataset window, find the dataset of interest. Click the "Request DOI" button. Fill out the form, confirm you understand that the Data Team will review your dataset, and submit the DOI request. This will initiate the publication and DOI request process.
In ODC, the dataset and the data dictionary undergo quality checks for proper formatting. These checks ensure that the data is Interoperable and Reusable with other datasets. Some quality checks are performed during uploading datasets, ensuring minimal quality to all private and public datasets in the ODC. The check during the upload process is automatic without human oversight since data upload is handled privately within the account of the data owner. When data is released for publication, further checks will be conducted to ensure that the released dataset meets FAIR standards:
Source checks
(Checked at upload): ODC can not read the data file. Possible reasons include:
The data file is not a *.csv. The ODC only accepts the upload of *.csv data files.
Reserved special characters were used in the column headers (first row with the variable names). Check our recommendations for How to upload data.
Structure checks
Blank-header (Checked at upload): There is a blank variable name. All cells in the header row (first row) must have a value.
Duplicate-header (Checked at upload): Multiple columns with the same name. All column names must be unique.
Blank-row (Checked at upload): Rows must have at least one non-blank cell.
Duplicate-row: Rows can not be duplicated.
Schema checks
In ODC, the schema is marked by the data dictionary. These errors reflect conflicts between the data dictionary and the dataset.
Extra-header: The dataset contains at least one variable name not defined in the data dictionary.
Missing-header: The dataset is missing at least one variable name defined in the data dictionary.
Missing-definition: The definition of a variable in the data dictionary is missing.
Required-constraint (Checked at upload): A required field for the dataset contains no values or is not assigned to the dataset. Currently, the only required value in the datasets is the subject identifier. As ODC develops additional data standards, more variables may be required on all datasets.
Value-constraint: The values of a variable should be equal to one of the permitted values enumerated in the data dictionary or within the limits of the permitted values.
The ODC-SCI community has established a unique process and requirements to release a dataset to the public with a DOI. The process may take a few days, but ensures higher quality of the shared data and that data is FAIR.
In addition to having your data ready for DOI, the ODC-SCI community has created a minimal set of variables required for publication. Make sure you include them before you start the process!
The final publishing step involves a two-step review process by the ODC-SCI Editorial Board and Data Team. The length of the review process depends on various factors including:
Whether the Dataset is within the scope of ODC-SCI
Dataset formatting
Completeness of the dataset metadata
Completeness of dataset-associated documents (i.e. data dictionary).
Your dataset will first be sent to ODC-SCI editors to determine whether the content is appropriate and within the scope of ODC-SCI. You will be contacted by the Editor in Chief within 3-5 business days if the editors have any concerns regarding your dataset.
If the Editorial Board decides that your dataset fits within the scope of the ODC-SCI, the dataset and any comments from the editors will be sent to the ODC-SCI Data Team for the dataset review process.
The ODC-SCI Data Team will review your dataset, metadata, and data dictionary to ensure the formatting and contents conform to the ODC-SCI data structure and meet the minimum dataset standards for publication. The ODC-SCI Data Team will will contact you if any revisions are required.
The full dataset publication process can take a few weeks depending on the revision process. You can track any changes to the status of your publication/DOI request on the Current Lab page as a PI/Lab Manager.
Once the dataset is fully approved by both the Editorial Board and Data Team, the DOI will be reserved and the corresponding PI will be notified in an email. The PI has control over the last step of the publication process: final approval for publishing the dataset and the associated landing page. Once the PI approves, the landing page and dataset will be made public and accessible to all ODC-SCI registered users within 24 hours. The dataset DOI will be accessible on the published landing page.
The PI can find the final approval button on the Current Lab page in the Dataset window once the dataset has completed the ODC-SCI review process.
Releasing your data through the ODC-TBI requires a DOI request. Publication in ODC-TBI is overseen by and Editorial Board and the ODC Data Team. The Editorial Board reviews each dataset to determine if:
The dataset is relevant to TBI
The metadata are complete, clear and promote FAIR
The data dictionary is complete and helps a user to understand the data
The ODC Data Team performs various data quality checks for DOI and works with the submitters to resolve any issues.
The ODC-TBI has adopted the same review process as the ODC-SCI. The process is collegial and designed to increase the Findability, Accessibility, Interoperability and Reusability (FAIR) of each dataset. Briefly, the steps involve:
Initial checks for completeness of files by the Managing Editor
In depth curatorial review for quality checks and data formatting
Review of metadata by 2 Editorial Board members for relevance, completeness, FAIRness and clarity.
Feedback to the submitter on any required or recommended changes.