Data Management Skillbuilding Hub

Best Practice: Communicate data quality


Data Life Cycle stage(s): Assure

Information about quality control and quality assurance is an important component of the metadata:

  • Qualify (flag) data that have been identified as questionable by including a flagging_column next to the column of data values. The two columns should be properly associated through a naming convention such as Temperature, flag_Temperature.
  • Describe the quality control methods applied and their assumptions in the metadata. Describe any software used when performing the quality analysis, including code where practical. Include in the metadata who did the quality control analysis, when it was done, and what changes were made to the dataset.
  • Describe standards or test data used for the quality analysis. For instance, include, when practical, the data used to make a calibration curve.
  • If data with qualifier flags are summarized to create a derived data set, include the percentage of flagged data and the percentage of missing data in the metadata of the derived data file. High-frequency observations are often downsampled, and it is critical to know how much of the primary data was rejected.
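
The flagging and summarizing practices above can be sketched in a few lines of Python. This is a minimal illustration, not a prescribed implementation: the flag code "Q", the sample values, the `summarize` function, and the summary field names are all hypothetical. Only the Temperature / flag_Temperature naming convention comes from the text.

```python
from statistics import mean

# Hypothetical flag vocabulary: an empty string means "no quality concern",
# "Q" means "questionable". The source does not prescribe specific codes.
FLAG_OK = ""
FLAG_QUESTIONABLE = "Q"

# Made-up sample data. Each value column is paired with a flag column
# through the naming convention Temperature, flag_Temperature.
records = [
    {"Temperature": 12.1, "flag_Temperature": FLAG_OK},
    {"Temperature": 12.3, "flag_Temperature": FLAG_OK},
    {"Temperature": 45.0, "flag_Temperature": FLAG_QUESTIONABLE},
    {"Temperature": None, "flag_Temperature": FLAG_OK},  # missing value
    {"Temperature": 12.0, "flag_Temperature": FLAG_OK},
]


def summarize(rows, column):
    """Summarize one column and report how much primary data was rejected."""
    flag_column = f"flag_{column}"
    total = len(rows)
    if total == 0:
        raise ValueError("cannot summarize an empty data set")

    missing = [r for r in rows if r[column] is None]
    flagged = [
        r for r in rows
        if r[column] is not None and r[flag_column] != FLAG_OK
    ]
    accepted = [
        r[column] for r in rows
        if r[column] is not None and r[flag_column] == FLAG_OK
    ]

    # These fields would travel in the metadata of the derived data file.
    return {
        "column": column,
        "n_total": total,
        "mean": mean(accepted) if accepted else None,
        "percent_flagged": 100.0 * len(flagged) / total,
        "percent_missing": 100.0 * len(missing) / total,
    }


summary = summarize(records, "Temperature")
print(summary)
```

In this sketch the derived mean uses only unflagged, non-missing values, and the two percentages are computed against the full primary record count so that a reader of the derived file can judge how much data was excluded.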

Description Rationale

Data quality and any methods used for quality control should be communicated so others can assess the data independently.

Additional Information

Hook, L.A., Beaty, T.W., Santhana-Vannan, S., Baskaran, L. and Cook, R.B. 2007. Best practices for preparing environmental data sets to share and archive. Oak Ridge National Laboratory Distributed Active Archive Center, Oak Ridge, Tennessee, U.S.A. (daac.ornl.gov/PI/bestprac.html)

Sheldon, W., Henshaw, D. and Ramsey, K. 2007. Final Report: Workshop to define quality management standards for data completeness in derived data products. Long Term Ecological Research Network Document Archive, University of New Mexico, Albuquerque, NM.

Additional Information (Biblio)

Best Practices for Preparing Ecological and Ground-Based Data Sets to Share and Archive

Cite this best practice:

DataONE Best Practices Working Group, DataONE (Aug 30, 2011) "Best Practice: Communicate data quality". Accessed through the Data Management Skillbuilding Hub at https://dataoneorg.github.io/Education/bestpractices/communicate-data-quality on May 24, 2019.

