Data Management Skillbuilding Hub

Best Practice: Use consistent codes

BEST PRACTICE

Best Practices by Data Life Cycle




Use consistent codes

Data Life Cycle stage(s): Collect   Describe

Be consistent in the use of codes to indicate categorical variables, for example species names, sites, or land cover types. Codes should always be the same within one data set. Pay particular attention to spelling and case; most frequent problems are with abbreviations for species names and sites.

Consistent codes can be achieved most easily by defining standard categorical variables (codes) and using drop down lists (excel, database). Frequently a code is needed for ‘none of the above’ or ‘unknown’ or ‘other’ to avoid imprecise code assignment.

Description Rationale

Inconsistencies in coding schemes can create confusion on the part of data users. For example, a typical user might not understand why “T”, “temp”, and “t” are used interchangeable within a single data file.

Tags

 
 
 
 

Cite this best practice:

DataONE Best Practices Working Group, DataONE  (September 01, 2011) "Best Practice: Use consistent codes". Accessed through the Data Management Skillbuilding Hub at https://dataoneorg.github.io/Education/bestpractices/use-consistent-codes on May 24, 2019


Home

Hosted by DataONE

In collaboration with the community, DataONE has developed high quality resources for helping educators and librarians with training in data management, including teaching materials, webinars and a database of best-practices to improve methods for data sharing and management.

Question If you have a question or concern, please open an Issue in this repository on GitHub.