Spatial ETL explained

Spatial extract, transform, load (spatial ETL), also known as geospatial transformation and load (GTL), is a process for managing and manipulating geospatial data, for example map data. It is a type of extract, transform, load (ETL) process, with software tools and libraries specialised for geographical information.

A common use of spatial ETL is to convert geographical information from a data source into another format that can be more easily used, for example by importing it into GIS software.[1] A tool may translate data directly from one format to another, or via an intermediate format. Intermediate formats are often used when data transformation must be carried out.

Origins and history

Although ETL tools for processing non-spatial data have existed for some time, ETL tools that can manage the unique characteristics of spatial data only emerged in the early 1990s.

Spatial ETL tools emerged in the GIS industry to enable interoperability (or the exchange of information) between the industry's diverse array of mapping applications and associated proprietary formats. However, spatial ETL tools are also becoming increasingly important in the realm of management information systems as a tool to help organizations integrate spatial data with their existing non-spatial databases, and also to leverage their spatial data assets to develop more competitive business strategies.

Traditionally, GIS applications have had the ability to read or import a limited number of spatial data formats, but with few specialist ETL transformation tools; the concept being to import data then carry out step-by-step transformation or analysis within the GIS application itself. Conversely, spatial ETL does not require the user to import or view the data, and generally carries out its tasks in a single predefined process.

With the push to achieve greater interoperability within the GIS industry, many existing GIS applications are now incorporating spatial ETL tools within their products; the ArcGIS Data Interoperability Extension being an example of this.[2]

Transformation

The transformation phase of a spatial ETL process allows a variety of functions; some of these are similar to standard ETL, but some are unique to spatial data.[3] Spatial data commonly consists of a geographic element and related attribute data; therefore spatial ETL transformations are often described as being either geometric transformations – transformation of the geographic element – or attribute transformations – transformations of the related attribute data.

Common geospatial transformations

the ability to convert spatial data between one coordinate system and another.

the ability to convert attributes of tabular data into spatial data

Additional features

Desirable features of a spatial ETL application are:

Uses

Spatial ETL has a number of distinct uses:

The removal of errors within a dataset

The bringing together of multiple datasets into a common framework – conflation is a good example of this

The comparison of multiple datasets for verification and quality assurance purposes

Conversion between different data formats.

Examples of spatial ETL tools

See also

Notes and References

  1. Web site: What is ETL… and How Can it Turn You into a Geospatial Rock Star? . XYHt . 4 June 2020 . 5 November 2022 . 6 November 2022 . https://web.archive.org/web/20221106001414/https://www.xyht.com/spatial-itgis/what-is-etl-and-how-can-it-turn-you-into-a-geospatial-rock-star/ . live .
  2. Web site: Spatial ETL tools . . 11 April 2023 . 11 April 2023 . https://web.archive.org/web/20230411154100/https://pro.arcgis.com/en/pro-app/latest/help/data/data-interoperability/spatial-etl-tools.htm . live .
  3. Book: Miller . Harvey . Han . Jiawei . Geographic Data Mining and Knowledge Discovery . 27 May 2009 . . 9781420073980 . 63 .