Apache Sqoop | |
Developer: | Apache Software Foundation |
Discontinued: | yes |
Latest Release Version: | 1.4.7 |
Operating System: | Cross-platform |
Programming Language: | Java |
Genre: | Data management |
License: | Apache License 2.0 |
Sqoop is a command-line interface application for transferring data between relational databases and Hadoop.[1]
The Apache Sqoop project was retired in June 2021 and moved to the Apache Attic.[2]
Sqoop supports incremental loads of a single table or a free form SQL query as well as saved jobs which can be run multiple times to import updates made to a database since the last import. Imports can also be used to populate tables in Hive or HBase.[3] Exports can be used to put data from Hadoop into a relational database. Sqoop got the name from "SQL-to-Hadoop".[4] Sqoop became a top-level Apache project in March 2012.[5]
Informatica provides a Sqoop-based connector from version 10.1.Pentaho provides open-source Sqoop based connector steps, Sqoop Import and Sqoop Export, in their ETL suite Pentaho Data Integration since version 4.5 of the software. Microsoft uses a Sqoop-based connector to help transfer data from Microsoft SQL Server databases to Hadoop.[6] Couchbase, Inc. also provides a Couchbase Server-Hadoop connector by means of Sqoop.[7]