Data Management Skillbuilding Hub

Best Practice: Choose and use standard terminology to enable discovery

BEST PRACTICE

Best Practices by Data Life Cycle




Choose and use standard terminology to enable discovery

Data Life Cycle stage(s): Describe   Preserve

Terms and phrases that are used to represent categorical data values or for creating content in metadata records should reflect appropriate and accepted vocabularies in your community or institution. Methods used to identify and select the proper terminology include:

  • Identify the relevant descriptive terms used as categorical values in your community prior to start of the project (e.g., standard terms describing soil horizons, plant taxonomy, sampling methodology or equipment, etc.)
  • Identify locations in metadata where standardized terminology should be used and sources for the terms. Terminology should reflect both data type/content and access methods.
  • Review existing thesauri, ontologies, and keyword lists for your use before making up a new terms. Potential sources include: Semantic Web for Earth and Environmental Terminology (SWEET), Planetary Ontologies, and NASA Global Change Master Directory (GCMD)
  • Enforce use of standard terminology in your workflow, including:
    • Use of lookup tables in data-entry forms
    • Use of field-level constraints in databases (restrict data import to match accepted domain values)
    • Use XML validation
    • Do manual review
  • Publish metadata using Open Standards, for example:
    • z39.50
    • OGC Catalog Services for Web (CSW)
    • Web Accessible Directory (WAD)

If you must use an unconventional or unique vocabulary, it should be identified in the metadata and fully defined in the data documentation (attribute name, values, and definitions).

Description Rationale

The consistent use of well-defined, referenced terminology in describing data products, their parameters, and access methods improves the ability to discover those products for specific uses and access via well-known methods. Determine if there are controlled vocabulary terms, scientific taxonomies, and ontologies used by your community, and use those when creating metadata for your dataset.

Additional Information

Examples

Ecological Informatics, Volume 2, Issue 3, October 2007, Pages 279-296 Meta-information systems and ontologies. A Special Feature from the 5th International Conference on Ecological Informatics ISEI5, Santa Barbara, CA, Dec. 4–7, 2006 - Novel Concepts of Ecological Data Management S.I.

Guidelines for the Construction, Format, and Management of Monolingual Controlled Vocabularies. 2010. ANSI/NISO Z39.19. https://www.niso.org/publications/ansiniso-z3919-2005-r2010

Olsen, L.M., G. Major, K. Shein, J. Scialdone, R. Vogel, S. Leicester, H. Weir, S. Ritz, T. Stevens, M. Meaux, C.Solomon, R. Bilodeau, M. Holland, T. Northcutt, R. A. Restrepo, 2007 . NASA/Global Change Master Directory

(GCMD) Earth Science Keywords. Version 6.0.0.0.0 Citing GCMD Keywords https://earthdata.nasa.gov/about/gcmd/global-change-master-directory-gcmd-keywords

Tags

 
 
 
 
 
 

Cite this best practice:

DataONE Best Practices Working Group, DataONE  (July 01, 2010) "Best Practice: Choose and use standard terminology to enable discovery". Accessed through the Data Management Skillbuilding Hub at https://dataoneorg.github.io/Education/bestpractices/choose-and-use on Mar 01, 2024


Home

Hosted by DataONE

In collaboration with the community, DataONE has developed high quality resources for helping educators and librarians with training in data management, including teaching materials, webinars and a database of best-practices to improve methods for data sharing and management.

Question If you have a question or concern, please open an Issue in this repository on GitHub.