Best Practices: Sorted by Tag
Access
- Advertise your data using datacasting tools
- Assign descriptive file names
- Backup your data
- Check data and other outputs for print and web accessibility
- Create, manage, and document your data storage system
- Define the data model
- Describe format for spatial location
- Describe measurement techniques
- Ensure integrity and accessibility when making backups of data
- Ensure the reliability of your storage media
- Identify data sensitivity
- Identify suitable repositories for the data
- Provide identifier for dataset used
- Recognize stakeholders in data ownership
- Sharing data: legal and policy considerations
- Use appropriate field delimiters
Analyze
- Consider the compatibility of the data you are integrating
- Describe method to create derived data products
- Document steps used in data processing
- Ensure datasets used are reproducible
- Identify most appropriate software
- Identify outliers
- Identify values that are estimated
- Store data with appropriate precision
- Understand the geospatial parameters of multiple data sources
Annotation
- Describe the research project
- Identify outliers
- Provide capabilities for tagging and annotation of your data by the community
- Separate data values from annotations
Assure
- Communicate data quality
- Confirm a match between data and their description in metadata
- Consider the compatibility of the data you are integrating
- Develop a quality assurance and quality control plan
- Double-check the data you enter
- Ensure basic quality control
- Ensure datasets used are reproducible
- Identify missing values and define missing value codes
- Identify outliers
- Identify values that are estimated
- Mark data with quality control flags
- Provide version information for use and discovery
Backup
- Create and document a data backup policy
- Ensure integrity and accessibility when making backups of data
- Ensure the reliability of your storage media
- Plan data management early in your project
Calibration
Citation
- Document the integration of multiple datasets
- Provide a citation and document provenance for your dataset
- Sharing data: legal and policy considerations
Coding
- Ensure basic quality control
- Identify missing values and define missing value codes
- Mark data with quality control flags
- Use consistent codes
Collect
Controlled vocabulary
- Choose and use standard terminology to enable discovery
- Create a data dictionary
- Identify and use relevant metadata standards
- Provide capabilities for tagging and annotation of your data by the community
- Use consistent codes
Data archives
- Decide what data to preserve
- Ensure datasets used are reproducible
- Ensure flexible data services for virtual datasets
- Ensure integrity and accessibility when making backups of data
- Ensure the reliability of your storage media
- Identify data sensitivity
- Identify data with long-term value
- Identify suitable repositories for the data
Data consistency
- Confirm a match between data and their description in metadata
- Develop a quality assurance and quality control plan
- Document the integration of multiple datasets
- Double-check the data you enter
- Preserve information: Keep raw data raw
- Provide identifier for dataset used
Data creators
- Describe the research project
- Provide a citation and document provenance for your dataset
- Sharing data: legal and policy considerations
Data management plan
- Define roles and assign responsibilities for data management
- Document your data organization strategy
- Provide budget information for your data management plan
- Revisit data management plan throughout the project life cycle
Data model
- Define expected data outcomes and types
- Define the data model
- Describe the overall organization of your dataset
- Document your data organization strategy
Data normalization
Data processing
- Describe method to create derived data products
- Document steps used in data processing
- Ensure datasets used are reproducible
- Identify most appropriate software
Data quality
Data services
- Advertise your data using datacasting tools
- Ensure flexible data services for virtual datasets
- Identify most appropriate software
Data source
- Define expected data outcomes and types
- Provide a citation and document provenance for your dataset
- Sharing data: legal and policy considerations
Database
- Consider the compatibility of the data you are integrating
- Describe the overall organization of your dataset
- Document your data organization strategy
- Maintain consistent data typing
Date
Describe
- Assign descriptive file names
- Choose and use standard terminology to enable discovery
- Confirm a match between data and their description in metadata
- Create a data dictionary
- Define the data model
- Define the parameters
- Describe format for spatial location
- Describe formats for date and time
- Describe method to create derived data products
- Describe the contents of data files
- Describe the overall organization of your dataset
- Describe the research project
- Describe the spatial extent and resolution of your dataset
- Describe the temporal extent and resolution of your dataset
- Describe the units of measurement for each observation
- Document steps used in data processing
- Describe the overall organization of your dataset
- Document your data organization strategy
- Ensure flexible data services for virtual datasets
- Identify and use relevant metadata standards
- Maintain consistent data typing
- Provide a citation and document provenance for your dataset
- Provide capabilities for tagging and annotation of your data by the community
- Provide identifier for dataset used
- Separate data values from annotations
- Use appropriate field delimiters
- Use consistent codes
Disaster recovery
- Backup your data
- Create and document a data backup policy
- Decide what data to preserve
- Ensure the reliability of your storage media
Discover
- Advertise your data using datacasting tools
- Assign descriptive file names
- Check data and other outputs for print and web accessibility
- Ensure datasets used are reproducible
Documentation
- Choose and use standard terminology to enable discovery
- Confirm a match between data and their description in metadata
- Create a data dictionary
- Create, manage, and document your data storage system
- Define the parameters
- Describe the contents of data files
- Describe the overall organization of your dataset
- Describe the spatial extent and resolution of your dataset
- Describe the temporal extent and resolution of your dataset
- Document and store data using stable file formats
- Document the integration of multiple datasets
- Identify and use relevant metadata standards
- Maintain consistent data typing
- Plan for effective multimedia management
- Provide capabilities for tagging and annotation of your data by the community
- Provide version information for use and discovery
- Revisit data management plan throughout the project life cycle
- Separate data values from annotations
- Understand the geospatial parameters of multiple data sources
File system
Flag
- Communicate data quality
- Develop a quality assurance and quality control plan
- Identify values that are estimated
- Mark data with quality control flags
- Separate data values from annotations
Format
- Assign descriptive file names
- Define expected data outcomes and types
- Describe format for spatial location
- Describe formats for date and time
- Describe the contents of data files
- Document and store data using stable file formats
- Identify and use relevant metadata standards
- Maintain consistent data typing
- Plan for effective multimedia management
- Preserve information: Keep raw data raw
- Separate data values from annotations
- Use appropriate field delimiters
Geography
Geospatial
- Describe format for spatial location
- Describe the research project
- Describe the spatial extent and resolution of your dataset
- Understand the geospatial parameters of multiple data sources
Here!
Image
Integrate
- Consider the compatibility of the data you are integrating
- Document steps used in data processing
- Document the integration of multiple datasets
- Provide budget information for your data management plan
- Understand the geospatial parameters of multiple data sources
Location
Measurement
- Describe the research project
- Describe the spatial extent and resolution of your dataset
- Describe the temporal extent and resolution of your dataset
- Describe the units of measurement for each observation
- Develop a quality assurance and quality control plan
- Store data with appropriate precision
Metadata
- Choose and use standard terminology to enable discovery
- Confirm a match between data and their description in metadata
- Create a data dictionary
- Create, manage, and document your data storage system
- Define the parameters
- Describe the contents of data files
- Describe the overall organization of your dataset
- Describe the spatial extent and resolution of your dataset
- Describe the temporal extent and resolution of your dataset
- Document and store data using stable file formats
- Describe the overall organization of your dataset
- Document the integration of multiple datasets
- Identify and use relevant metadata standards
- Maintain consistent data typing
- Plan for effective multimedia management
- Provide capabilities for tagging and annotation of your data by the community
- Provide version information for use and discovery
- Separate data values from annotations
- Understand the geospatial parameters of multiple data sources
Missing values
Ontologies
Parameter
Plan
- Create and document a data backup policy
- Create, manage, and document your data storage system
- Define expected data outcomes and types
- Define roles and assign responsibilities for data management
- Define the data model
- Identify data sensitivity
- Identify suitable repositories for the data
- Plan data management early in your project
- Plan for effective multimedia management
- Provide budget information for your data management plan
- Revisit data management plan throughout the project life cycle
Preserve
- Backup your data
- Choose and use standard terminology to enable discovery
- Create and document a data backup policy
- Decide what data to preserve
- Document and store data using stable file formats
- Ensure flexible data services for virtual datasets
- Ensure integrity and accessibility when making backups of data
- Ensure the reliability of your storage media
- Identify and use relevant metadata standards
- Identify data sensitivity
- Identify data with long-term value
- Identify suitable repositories for the data
- Plan data management early in your project
- Plan for effective multimedia management
- Preserve information: Keep raw data raw
- Provide a citation and document provenance for your dataset
- Provide identifier for dataset used
- Recognize stakeholders in data ownership
- Store data with appropriate precision
Provenance
- Describe method to create derived data products
- Document steps used in data processing
- Document the integration of multiple datasets
- Ensure datasets used are reproducible
- Provide a citation and document provenance for your dataset
- Provide identifier for dataset used
- Provide version information for use and discovery
- Recognize stakeholders in data ownership
- Understand the geospatial parameters of multiple data sources
Qualify
Quality
- Confirm a match between data and their description in metadata
- Consider the compatibility of the data you are integrating
- Develop a quality assurance and quality control plan
- Double-check the data you enter
- Ensure basic quality control
- Ensure integrity and accessibility when making backups of data
- Identify outliers
- Identify values that are estimated
- Provide version information for use and discovery
Replicable data
- Document steps used in data processing
- Ensure datasets used are reproducible
- Provide identifier for dataset used
Restore
- Ensure integrity and accessibility when making backups of data
- Ensure the reliability of your storage media
Standards
- Check data and other outputs for print and web accessibility
- Choose and use standard terminology to enable discovery
- Describe formats for date and time
- Describe the overall organization of your dataset
Storage
- Create and document a data backup policy
- Decide what data to preserve
- Document and store data using stable file formats
- Ensure the reliability of your storage media
- Identify data with long-term value
- Identify suitable repositories for the data
- Plan for effective multimedia management
- Store data with appropriate precision
Tabular
- Consider the compatibility of the data you are integrating
- Document and store data using stable file formats
Taxonomy
Terminology
Time
Units
- Create a data dictionary
- Describe the contents of data files
- Describe the units of measurement for each observation