Storage Systems (Distributed File Systems)
By default, hms-mirror
is built with native support to communicate with 'hdfs', and 'ozone' distributed file systems. The hms-mirror
can be extended to support other distributed file systems by implementing an hcfs
(hadoop compatible filesystem) interface. These include Amazon S3, Google Cloud Storage, Azure Blob Storage, etc.
To support communication with any of these cloud platforms or other distributed files systems, you'll need to ensure the libraries needed to support that communication are available in the classpath for 'hms-mirror'.
Make sure these are available in the classpath for hms-mirror
BEFORE you start the application (Web and CLI).
We'll cover the libraries in the following sections. These libraries need to be copied to the $HOME/. hms-mirror/aux_libs
directory.
All these libraries are available in the Cloudera distribution. The below file listings have been sourced through our community testing and may not be exhaustive. If you find we've missed any, please let us know by logging an issue at hms-mirror Github Issues
Amazon S3
This obviously includes Amazon S3, but also includes other S3 compatible storage systems like Minio, etc. that are compatible with the S3 API.
Microsoft Azure
abfs