
Georgia Southern University

Digital Commons@Georgia Southern


Data Management Services Instructional Materials Promotional and Instructional Material

2016

Preparing Data for Sharing and Archiving


Data Management Services, Zach S. Henderson Library

Follow this and additional works at: https://digitalcommons.georgiasouthern.edu/lib-promo-dms-instr

Recommended Citation
Data Management Services, Zach S. Henderson Library, "Preparing Data for Sharing and Archiving" (2016). Data Management Services Instructional Materials. 7.
https://digitalcommons.georgiasouthern.edu/lib-promo-dms-instr/7

This presentation is brought to you for free and open access by the Promotional and Instructional Material at Digital Commons@Georgia Southern. It
has been accepted for inclusion in Data Management Services Instructional Materials by an authorized administrator of Digital Commons@Georgia
Southern. For more information, please contact [email protected].
Preparing Data for Sharing
and Archiving
Significant portions of this presentation are adapted from the Cornell University Research Data
Management Services Group website under a Creative Commons Attribution 4.0 International License,
and from ICPSR’s Guide to Social Science Data Preparation and Archiving: Best Practice Throughout
the Data Life Cycle (2012, 5th ed.). Ann Arbor, MI.

http://georgiasouthern.libguides.com/data
Agenda
• Why share and archive data?
• What should I share and archive?
• Data collection, file creation, and management
• Metadata creation
• Protecting subjects
• Copyright and re-use licensing
Why share and archive data?
• Many research funders now require PIs to maximize open
public access to data products.
• Many publishers now require open access to replication data
as a condition of publication.
• It benefits you, your collaborators, and
your research community.
What should I share and archive?

Content + Metadata

• Content = The bit-level data, the “target” of preservation.

• Metadata = The information needed to find, access, understand, use, and preserve the content.
Metadata
• Descriptive Information
Makes the data discoverable

• Packaging Information
Explains how the data is organized

• Representation Information
Makes the data understandable; renders the
bit-level content into something meaningful.

• Preservation Information
Provides information to support long-term preservation and use
Data collection, file creation, and management
• Data and file structure
What is the data file going to look like and how will it be organized? What file types
will be used?
• Naming conventions
How will files and variables be named? What conventions will be used?
• Data integrity
How will the data be captured and checked for accuracy and integrity? How will
versions be checked?
• Dataset documentation
When and how will documentation be produced? What standards should be
used/applied to make it usable by others?
• Variable construction
What variables will be constructed following data collection? According to what
standards and how will they be documented?
• Project documentation
How will decisions be documented over the course of the research (e.g., field
procedures, coding decisions, variable construction)?
Data collection, file creation, and management
The likelihood of long-term preservation of content and
functionality is higher when file formats possess the
following characteristics:

– complete and open documentation
– platform-independence
– non-proprietary (vendor-independent)
– no “lossy” or proprietary compression
– no embedded files, programs or scripts
– no full or partial encryption
– no password protection
– uncompiled
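
A minimal sketch of moving data out of a proprietary, binary format into an open, delimited one, assuming pandas is available (the file names are hypothetical, and reading .xlsx files requires the optional openpyxl dependency):

import pandas as pd

# Hypothetical file names; the point is the move from a proprietary,
# binary format (.xlsx) to an open, platform-independent one (.csv).
df = pd.read_excel("field_measurements.xlsx", sheet_name=0)
df.to_csv("field_measurements.csv", index=False)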
Data collection, file creation, and management
• Make file names unique, including the most important identifying information.
Elements of a good file name may include:
– project name, acronym, or research data name
– study title
– location information
– researcher initials
– date (consistently formatted, e.g., YYYYMMDD)
– version number

• Use leading zeros to enable sorting (“1-100” should be numbered 001-100).
• Use underscores to separate elements; avoid special characters, spaces, and periods.
• File names should be short enough to be readable, while still being meaningful. For
example:
DryValleySoil_ICPOES_20101115_JDS.dat
DryValleySoil is the project name, ICPOES is the instrument from which the data
originated, 20101115 is the date of the sample run on the instrument, and JDS the
initials of Jane Doe Scientist (a scripted version of this convention is sketched below).
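
A minimal sketch of this naming convention as a small helper function (the function name and fields are illustrative, not part of the original guidance):

from datetime import date

def build_file_name(project, instrument, run_date, initials, extension="dat"):
    # Elements are joined with underscores; the date is formatted as YYYYMMDD
    # so alphabetical and chronological order coincide; no spaces or periods.
    return f"{project}_{instrument}_{run_date:%Y%m%d}_{initials}.{extension}"

# Reproduces the example above:
print(build_file_name("DryValleySoil", "ICPOES", date(2010, 11, 15), "JDS"))
# -> DryValleySoil_ICPOES_20101115_JDS.dat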
Data collection, file creation, and management
• Keep track of versions when working with data.
• Some research management tools, like Electronic Lab Notebooks
(ELN), Open Science Framework, and Box support version
control. Other options include using a naming scheme or version
control software.
• Best practices include:
– Save an untouched copy of the raw data and leave it that way. Always
work on copies of the "safe" untouched copy.
– Avoid ambiguous labels, such as 'revision', 'final', 'final2', etc. Instead,
use a file naming convention (like v001, v002 or v1_0, v1_2, v2_0).
– Use a directory structure naming convention that includes version
information.
– Consider using version control software (e.g., Git, hosted on GitHub). However, test any
versioning tool to make sure that it supports retention of and/or
reversion to previous versions.
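
As one way to implement a v001, v002, ... naming scheme without dedicated versioning software, a small helper like the following could locate the next available version number (a sketch only; the stem and directory layout are assumptions):

import re
from pathlib import Path

def next_version_name(directory, stem, extension="csv"):
    # Find files named '<stem>_v###.<extension>' and increment the highest number.
    pattern = re.compile(rf"{re.escape(stem)}_v(\d{{3}})\.{re.escape(extension)}$")
    versions = []
    for path in Path(directory).glob(f"{stem}_v*.{extension}"):
        match = pattern.match(path.name)
        if match:
            versions.append(int(match.group(1)))
    return f"{stem}_v{max(versions, default=0) + 1:03d}.{extension}"

# e.g. next_version_name("analysis", "survey_clean") -> "survey_clean_v001.csv"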
Data collection, file creation, and management
• Document and Use Directory Structure Naming Conventions
• Top-level folders in the directory should include the project title, a unique
identifier, and the date (year), but the files themselves should be well described
independent of the directory structure.
• Consider creating a brief description of the contents of major
folders and providing an overview of the directory structure.
This can be a text document or readme file that is stored in a top-
level folder or shared space.
• Provide enough information to help someone else understand
the contents and organization of your files in your absence.
Data collection, file creation, and management
• When preparing tabular data for description and archiving:
– Only include the data; do not include figures or analyses.
– Consider aggregating data into fewer, larger files, rather than many small ones. Compress
cautiously.
– If a repository has no file format requirements, consider tab- or comma-delimited text
(*.txt or *.csv).
– Column headings should be meaningful, but not overly long.
– Use only alphanumeric characters, underscores, or hyphens in column headings.
– Some programs expect the first character to be a letter; start column headings with a letter.
– Indicate units of measurement in column headings and specify units in the metadata.
– Use only the first row to identify a column heading.
– Examples of good column headings:
max_temp_celsius - not max temp celsius (includes spaces)
airport_faa_code - not airport/faa code (includes special characters)
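
A minimal check of these column-heading rules (the regular expression and function are illustrative):

import re

# Start with a letter; use only letters, digits, underscores, or hyphens.
HEADING_OK = re.compile(r"^[A-Za-z][A-Za-z0-9_-]*$")

def bad_headings(headings):
    # Return the headings that violate the convention.
    return [h for h in headings if not HEADING_OK.match(h)]

print(bad_headings(["max_temp_celsius", "airport_faa_code",
                    "max temp celsius", "airport/faa code"]))
# -> ['max temp celsius', 'airport/faa code']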
Data collection, file creation, and management
– Use standard codes or names when possible. Examples include using Federal
Information Processing Standards (FIPS) codes for geographic entities and the Integrated
Taxonomic Information System (ITIS) for authoritative species names.
– When using non-standard codes, an alternative to defining the codes in the metadata
is to create a supplemental table with code definitions (e.g., in a codebook, the
instrument, or a “readme” file).
– Avoid using special characters such as commas, semicolons, or tabs in the data itself
if the data file is in (or will be exported to) a delimited format.
– Do not rely on special formatting that is available in spreadsheet programs, such as
Excel.
– Indicate date information in an appropriate machine-readable format, such as
yyyymmdd or yyyy-mm-dd. Indicate the time zone (including daylight saving time, if
relevant) and the use of 12-hour or 24-hour notation in the metadata.
– Alternatively, use the ISO 8601 standard for formatting date and time strings. The
standard accommodates time zone information and uses 24-hour notation.
– Use a standard method to identify missing data; do not use zeroes or leave a cell
blank. -999 or -9999 is a common convention. Indicate the code for missing data in
the metadata (a short example of reading such a file appears below).
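
A hedged example of reading a delimited file that follows these conventions with pandas (the file and column names are hypothetical):

import pandas as pd

df = pd.read_csv(
    "DryValleySoil_ICPOES_20101115_JDS.csv",
    na_values=[-999, -9999],        # missing-data codes become NaN
    parse_dates=["sample_date"],    # ISO 8601 date strings become datetimes
)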
Data collection, file creation, and management
• Consider performing basic data quality assurance to detect errors or
inconsistencies in data. Some common techniques are:
– Spot check some values in the data to ensure accuracy.
– Enter data twice and compare both versions to catch errors.
– Sort data by different fields to spot outliers and empty cells.
– Calculate summary statistics, or plot data to catch erroneous or extreme values.

• Provide summary information about the data and include it in the metadata. For example:
– number of columns
– max, min, or mean of parameters
– number of missing values
– total file size
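
A minimal sketch of collecting this summary information for a delimited file, assuming pandas is available (the function and file layout are illustrative):

import os
import pandas as pd

def summarize(path):
    # Gather the summary items listed above for inclusion in the metadata.
    df = pd.read_csv(path)
    return {
        "columns": df.shape[1],
        "rows": df.shape[0],
        "missing_values": int(df.isna().sum().sum()),
        "file_size_bytes": os.path.getsize(path),
        "numeric_summary": df.describe().to_dict(),  # min, max, mean, etc.
    }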
Metadata creation
• Metadata is documentation that describes data.
• In a lab setting, much of the content used to describe data is initially
collected in a notebook; metadata is a more formal, sharable
expression of this information.
• Metadata can include content such as contact information, geographic
locations, details about units of measure, abbreviations or codes used
in the dataset, instrument and protocol information, survey tool
details, provenance and version information, and much more.
• Metadata can take many different forms, from free text to
standardized, structured, machine-readable, extensible content.
• Specific disciplines, repositories or data centers may guide or even
dictate the content and format of metadata.
Metadata creation
Example metadata standards:
– Dublin Core - a domain-agnostic, basic, and widely used metadata
standard
– DDI (Data Documentation Initiative) - common standard for
social, behavioral and economic sciences, including survey data
– EML (Ecological Metadata Language) - specific for ecology
disciplines
– ISO 19115 and FGDC-CSDGM (Federal Geographic Data
Committee's Content Standard for Digital Geospatial Metadata) -
for describing geospatial information
– MINSEQE (MINimal information about high-throughput
SEQuencing Experiments) - genomics standard
– FITS (Flexible Image Transport System) - Astronomy digital file
standard that includes structured, embedded metadata
Metadata creation
Important metadata elements may include:
– Principal investigator(s) [Dublin Core -- Creator]
– Title [Dublin Core -- Title]
– Funding sources
– Data collector/producer
– Project description [Dublin Core -- Description]
– Sample and sampling procedures
– Weighting
– Substantive, temporal, and geographic coverage [Dublin Core -- Coverage]
– Data source(s) [Dublin Core -- Source]
– Unit(s) of analysis/observation
– Variables
– Related publications
– Technical information on files
– Data collection instruments
– Flowchart of the data collection instrument
– Index or table of contents
– List of abbreviations and other conventions
– Interviewer guide
– Coding instrument
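
As an informal sketch, some of these elements could be recorded as a simple structured file keyed by Dublin Core terms where a mapping exists (the values below are placeholders, not drawn from a real study, and this is not a formal DDI or Dublin Core serialization):

import json

record = {
    "dc:creator": "Doe, Jane (Principal Investigator)",
    "dc:title": "Example Soil Chemistry Survey",
    "dc:description": "ICP-OES measurements of field soil samples.",
    "dc:coverage": "Example region; 2010-2011",
    "dc:source": "Field samples and instrument runs",
    "funding_sources": ["Example funding agency"],
    "unit_of_analysis": "soil sample",
    "related_publications": [],
}

with open("metadata.json", "w") as f:
    json.dump(record, f, indent=2)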
Metadata creation
Metadata typically is provided via codebooks, XML files (e.g.,
DDI), manifests, etc.
[Figure: example of DDI-encoded metadata]
Protecting subjects
Subjects’ privacy and confidentiality must be protected consistent with
IRB requirements and disciplinary ethics. Variables that endanger
privacy and confidentiality include:
• Direct identifiers
Variables that point explicitly to particular individuals or groups.

• Indirect identifiers
Variables that make unique cases visible when combined with other
identifiers.

• Geographic identifiers
Direct geographic identifiers may include specific addresses. Indirect
geographic identifiers may include census tracts, area codes, place of
birth or education, etc.
Protecting subjects
Common techniques for treating identifiers to protect subjects:
• Removal: Eliminate the variable.
• Top-Coding: Restrict the upper range of a variable.
• Collapsing and/or Combining: Combine values into a summary variable.
• Sampling: Release a random sample of sufficient size to yield reasonable
inferences.
• Swapping: Match unique cases on the indirect identifier, then exchange the
values of key variables between the cases. This retains the analytic utility and
covariate structure of the dataset while protecting subject confidentiality.
• Disturbing: Add random variation or stochastic error to the variable. This
retains the statistical relationships between the variable and its covariates
while preventing the variable from being used to link records.
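
A minimal sketch of two of these treatments, top-coding and disturbing, using pandas and NumPy (the variable, ceiling, and noise scale are illustrative):

import numpy as np
import pandas as pd

rng = np.random.default_rng(42)

def top_code(series, ceiling):
    # Top-coding: restrict the upper range of the variable.
    return series.clip(upper=ceiling)

def disturb(series, scale):
    # Disturbing: add random (stochastic) noise to the variable.
    return series + rng.normal(0, scale, size=len(series))

incomes = pd.Series([28000, 54000, 61000, 1250000])
print(top_code(incomes, 250000))   # the extreme value no longer stands out
print(disturb(incomes, 1000))      # values shift slightly but keep their overall distribution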
Protecting subjects
Alternatives to altering the data:
• Restricted-use datasets
A dataset released only to approved researchers who agree to
abide by rules ensuring that subjects’ privacy and confidentiality
are maintained. Researchers are usually given access to the
data for a limited time, at the end of which they must return
or destroy the data.
• Data Enclaves
A physical or virtual environment that allows access, but
prevents researchers from retaining any of the data.
Copyright and re-use licensing
• Unmediated factual data cannot be copyrighted because it
is not possible to copyright facts (e.g., a temperature
reading). However:
– Some data may be protected, such as photographs.
– Organized data (e.g., a database) has a thin layer of copyright
protection because of the researcher’s creative input into creating
it.
• Copyright may govern the use of databases and some kinds of
original data content, but contract law, trademarks, and
other mechanisms are required to regulate factual data.
Copyright and re-use licensing
Creative Commons (http://www.creativecommons.org/) offers a library of
standardized licenses, some of which may be used with data. Creative Commons
recommends only the following three licenses for data sharing:
• CC Zero (“CC0”)
Waive all copyright and database rights, including your right to attribution.
This license effectively places the database and data into the public domain,
and maximizes the likelihood of reuse.
• CC Attribution 4.0 International (“CC BY 4.0”)
Waive all copyright and database rights except the right to attribution. This
license balances your right to be acknowledged with encouraging reuse.
• CC Attribution-ShareAlike 4.0 International (“CC BY-SA 4.0”)
Protect your right to attribution and require that any derivative work be
shared under the same licensing conditions. This license may result in
confusing “license chaining” and discourage some reuse and citation.
The library recommends using the CC BY 4.0 license in most cases.
Copyright and re-use licensing
Data Ownership at Georgia Southern University:
• Ownership of works produced by Georgia Southern faculty,
students, and non-academic staff is governed by the University
System of Georgia's Policy on the Use of Copyrighted Works in
Education and Research and Georgia Southern
University's Intellectual Property and Technology Transfer Policy.
• The precise answer to who owns your data depends on whether
the project was created as part of sponsored research; the
employment status of the creator; whether the work was
conducted pursuant to a specific direction or assigned duty; and
whether substantial university resources were used in the creation
of the work.
• Consult with the Office of Research and Economic Development.
ICPSR Guide to Social Science Data
Preparation and Archiving

http://www.icpsr.umich.edu/files/ICPSR/access/dataprep.pdf
Digital Commons for Data
Comprehensive hosted solution for storing, managing, securing,
and sharing data:

• Unlimited storage (no additional cost).
• Support for all file types and formats.
• Authorization and access-control tools.
• Fully hosted: no IT resources required.
• Multiple backups, cloud storage, and quarterly archives.

http://digitalcommons.bepress.com/cgi/viewcontent.cgi?article=1000&context=promotional
Archiving and Publication
• Once ready for publication, data objects are re-described,
assigned DOIs if needed, and released to the public website.
• Update versions and file types as needed.
• Host on a university, departmental, or project-related data structure.
• Backup and archiving are automatic.
• Links are permanent.
• Link to SelectedWorks profile.

http://georgiasouthern.libguides.com/data
Data Management Services
@ Henderson Library

http://georgiasouthern.libguides.com/data
