13 July - Webinar Slides
13 July - Webinar Slides
@chochrphd
Topics Of Discussion
Lifecycle
Collect &
Store Organize
Describe Quality
Stage 1 Things to consider:
• Policies
• Type of data
PLAN • Versions
• Backup
• Describing and labeling
• Access and Sharing
• Rights and Permissions
• Roles and Responsibilities
• Budget
Stage 1: Plan
https://old.dataone.org/data-management-planning
https://dmptool.org/
EXAMPLE: National Science Foundation (NSF) DMP
Length
• The data management plan is a supplementary document.
• Plans should be no longer than two pages.
Components
• Types of data produced
• Data and metadata standards
• Data access and sharing
• Data re-use and re-distribituion
• Archiving and preservation
EXAMPLE: National Science Foundation (NSF) DMP
Questions To Ask
1. What types of data will be produced for your project?
2. How will the data be created or captured?
3. What software programs will be used to generate your data?
4. How much data will be produced?
5. How big will your digital files be and how many will there be?
6. Will you be using existing data? If so, what is the source of that data?
EXAMPLE: National Science Foundation (NSF) DMP
The standards to be used for data and metadata format and content
(where existing standards are absent or deemed inadequate, this
should be documented along with any proposed solutions or
remedies).
EXAMPLE: National Science Foundation (NSF) DMP
Questions to Ask
1. How will you document your data and project?
2. What file formats will you be using in your project and why?
3. How will you organize your files into directories, and what naming
conventions will you use?
4. How often will your data change or be updated, and will versions need to be
tracked?
5. What types of metadata do you need to collect in order for someone else to
fully understand your data?
EXAMPLE: National Science Foundation (NSF) DMP
Policies for access and sharing, including provisions for the appropriate protection
of privacy, confidentiality, security, intellectual property, or other rights or
requirements. Policies and provisions for reuse, redistribution, and the production
of derivatives.
EXAMPLE: National Science Foundation (NSF) DMP
Plans for archiving data, samples, and other research products, and for
preservation of access to them.
EXAMPLE: National Science Foundation (NSF) DMP
Questions to Ask
1. What is your strategy for data storage and backup?
2. What data will be preserved for the long term?
3. Are extra steps required to prepare the data for preservation?
4. What related information or metadata will be preserved along with the data?
5. Where and how will the data be preserved?
6. What procedures does the archive have in place to ensure preservation and
backup?
7. How long will the data be kept after the project is completed?
Stage 2 Things to consider:
• Locate existing data
Cite datasets
Follow standards
Stage 3 Things to consider:
• Finding and reusing data
Collect & • Choosing a file format
• Naming data files
Organize • Data versioning
Stage 3: Collect and Organize
• Name
• Format
• Version
Stage 3: Collect and Organize 1. Meaningful
2. Length
File name 3. Underscores & Hyphens
4. YYYYMMDD
File Format
• Choose one and stick to it
• Consider the software that will be used to
access data
• Repository requirements
• Lost features during conversion
• Stanford University Libraries - Data Management
Services
• Cornell University Research Data Management
Service Group
• Cambridge University Libraries - Data Management
Stage 3: Collect and Organize
Data Versioning
Saving new copies of your files when you make changes so that you can go
back and retrieve specific versions of your files later.
image1_v1.jpg
image1_v2.jpg
image2_v1.jpg
image2_v2.jpg
Data Versioning
Style 2: Dated
image1_20151021
image1_20151214
image1_20160123
Data Versioning
dataset1_20160402_KES
dataset1_20160301_WTC
dataset1_20160814_GSC
Stage 4 Things to consider:
• Assurance (QA)
Methods
Description
Stage 5: Data Description
Components of data
description
• Describe scientific context
Metadata
• Lab notebooks
• Plain text README files
• Within data file
• Web forms
Stage 5: Data Description
Data Dictionary
• Enhance Impact
• Support Validation
• Encourage Collaboration
• Reduce redundancy
Stage 7: Data Sharing
Locations
• Disciplinary repository
• Data journal
• Supplementary File
• Web-based tools
Stage 7: Data Sharing
Adobe Bridge Adobe Bridge is free software for locally organizing images.
https://guides.library.yale.edu/rdm_healthsci/home
https://pitt.libguides.com/managedata/understanding#s-lg-box-
4890536
https://data.research.cornell.edu/content/readme
https://ukdataservice.ac.uk/deposit-data/preparing-data.aspx
https://dmptool.org/
https://www.dataone.org/
https://datadryad.org/stash
Thank you
Questions?
Pre-submitted questions
Online resources
Qualitative Data Management
https://guides.library.vcu.edu/health-sciences-lit-review/organize