Wednesday 20 April 2022

Introduction to Apache Sqoop

Apache Sqoop is a tool designed for bulk data transfer between structured, semi-structured and unstructured data sources.

 

a.   Structure Database: Relational databases like MySQL

b.   Semi structured database: MongoDB, Cassnadra, HBase

c.    Unstructured database: File System, HDFS, Amazon S3

 

Many developers use Sqoop to transfer the data from a RDBMS database to Hadoop environment and vice versa.

 

 


At the time of writing this article, Sqoop support following connectors.

 

a.   FTP connector

b.   JDBC connector

c.    HDFS connector

d.   Kafka connector

e.   Kite connector

f.     SFTP connector

 

 

References

https://sqoop.apache.org/

https://sqoop.apache.org/docs/1.99.7/user/Connectors.html


Previous                                                    Next                                                    Home

No comments:

Post a Comment