Data Management Skillbuilding Hub

Best Practice: Understand the geospatial parameters of multiple data sources

BEST PRACTICE

Best Practices by Data Life Cycle




Understand the geospatial parameters of multiple data sources

Data Life Cycle stage(s): Analyze   Integrate

Understand the input geospatial data parameters, including scale, map projection, geographic datum, and resolution, when integrating data from multiple sources. Care should be taken to ensure that the geospatial parameters of the source datasets can be legitimately combined. If working with raster data, consider the data type of the raster cell values as well as if the raster data represent discrete or continuous values. If working with vector data, consider feature representation (e.g., points, polygons, lines). It may be necessary to re-project your source data into one common projection appropriate to your intended analysis. Data product quality degradation or loss of data product utility can result when combining geospatial data that contain incompatible geospatial parameters. Spatial analysis of a dataset created from combining data having considerably different scales or map projections may result in erroneous results.

Document the geospatial parameters of any output dataset derived from combining multiple data products. Include this information in the final data product’s metadata as part of the product’s provenance or origin.

Description Rationale

Awareness and proper use of geospatial data parameters can affect the ability to interpret or produce sound analysis results.

Additional Information

Burley, T.E., and Peine, J.D., 2009’ NBII-SAIN Data Management Toolikit, U.S. Geological Survey Open-File Report 2009-1170, 96p. Available from: http://pubs.usgs.gov/of/2009/1170/

Examples

Combining geospatial data created at a 1:24,000 scale with data that were created at a 1:100,000 scale results in data that are only as accurate as the least accurate input (essentially resolution is lost). Combining geospatial data that have considerably differing map projections can also result in spatial errors and potentially erroneous results. Combining raster data with a 10m resolution with raster data that have a 100m resolution will result in data that are only as accurate as the least accurate input (essentially resolution is lost).

Tags

 
 
 
 
 
 

Cite this best practice:

Eric Lind, Steve Aulenbach, Tom Burley, DataONE  (May 11, 2011) "Best Practice: Understand the geospatial parameters of multiple data sources". Accessed through the Data Management Skillbuilding Hub at https://dataoneorg.github.io/Education/bestpractices/understand-the-geospatial on Mar 01, 2024


Home

Hosted by DataONE

In collaboration with the community, DataONE has developed high quality resources for helping educators and librarians with training in data management, including teaching materials, webinars and a database of best-practices to improve methods for data sharing and management.

Question If you have a question or concern, please open an Issue in this repository on GitHub.