Sharing Research Data via the Rice Research Repository

Service Description

The Rice Research Repository (R-3) will host data associated with Rice research projects.


Overview

The Rice Research Repository (R-3) will host data associated with Rice research projects. Features include:

  • Receiving a DOI for your dataset, which facilitates citation.

  • Fondren Library will preserve your data using best practices such as file integrity checks and redundant, geographically distributed storage.

  • Your data will be backed up in geographically distributed locations, including through DuraCloud and AWS Glacier.

  • Fondren Library’s Digital Scholarship Services will help you describe and make available your data. 

  • By using a Rice-sponsored repository, you will be assured of support and will associate your work with a well-regarded institution.

  • You can publish your research data for free.

Why share your research data?

  • Comply with funder or journal requirements.

  • Get cited.

  • Support open science and the growth of knowledge.

  • Ensure the long-term preservation of and access to your data.

Policies

  1. Data must be generated by a Rice faculty member, postdoc, staff member, or graduate student for research purposes; it should be original or unique data. Data created by Rice undergraduates can be published with the support of a faculty or staff sponsor.

  2. Due to storage and technical limitations, the maximum size of a file is 1 GB, while the maximum size of the total dataset is 10 GB. (If you use the online submission form, the maximum file size is 100 MB due to technical issues, but we can use other methods to transfer the data file.) For cases that do not fit these parameters, please email researchdata@rice.edu.

  3. Datasets should be finalized, and researchers should ensure that they are ready to be published. If it is essential to provide access to a new version, the original will be retained. 

    Datasets should be complete, and researchers should ensure that they are ready to be published. Deposits may only be removed in exceptional circumstances. If it is essential to provide access to a new version, the original will be retained.

  4. Making the datasets publicly available must not violate privacy, intellectual property, or other laws or guidelines. You should have the rights to share the data or have obtained permissions from others with rights over the data to share it. Research that contains human subjects data must be properly anonymized, and sharing this data should generally have been cleared (or envisioned) through an approved Institutional Review Board (IRB) protocol.

  5. Datasets must be accompanied with sufficient documentation and metadata so that others can find and understand them. You should provide basic metadata for the dataset as a whole (such as title and creator) along with a readme file describing the contents of the data package.

  6. Datasets must be made publicly available. To facilitate data reuse, we recommend that researchers use a public domain license such as the CC-0 license or Open Data Commons Public Domain Dedication and License (PDDL). If you need to wait before making your data publicly available, we can embargo the data for a defined period of time (such as 6 months or 1 year). 

  7. Datasets must not contain corrupted files or malware.

  8. We strongly suggest that you convert your data to an open format (such as csv or txt) to facilitate usage and preservation.  We can’t accept encrypted or password-protected data.

  9. If you want to share datasets from multiple projects, please create a separate submission for each one. 

Recommended Workflow

Fondren Library is happy to assist with any step in this workflow. Contact researchdata@rice.edu or Lisa Spiro at 713-348-2480.

  1. Review your data to ensure that it can be shared publicly. You should aim to share all files required to replicate the analysis in your publication. Are there any privacy or intellectual property issues to consider? 

  2. Make sure that your data is well-organized. Is the dataset finalized and well-documented? Are the files well-named? Are there missing data, unlabeled variables, corrupted files, or other issues? 

  3. Determine whether R-3 is the best repository for your data. In some cases, a repository associated with your discipline may be a better choice, since it may offer more appropriate metadata templates or display features and be better recognized in your discipline. To identify an appropriate repository, please see:

    1. Nature’s Recommended Data Repositories

    2. PLOS Recommended Repositories

    3. Assessing General Data Repositories 

    4. Re3data

  4. If there are more than a few files, consider packaging your dataset, such as by creating a zip file.

  5. Make a readme file to document your data; this file may contain information about methods, files, and variables. 

  6. Review the terms of use for sharing your data through R-3 and determine which license to use for your dataset. Choices include

For more information on data licensing, please see “How to License Research Data” and “Who ‘owns’ your data?”

  1. Fill out our data sharing request form and provide the Digital Scholarship Services team access to your data (typically via Box, Google Drive, or a similar service).  

  2. Fondren’s Digital Scholarship Services will review the data sharing request. We will then work with you to create a metadata record for the dataset, using our application profile.

  3. Once the dataset is published, you will receive a DOI for your dataset so that it can be cited.

Resources