Setup
Binary Package
Download the latest version of hms-mirror-<version>-dist.tar.gz
HMS-Mirror Setup from Binary Distribution
On the edgenode:
Remove previous install directory
rm -rf hms-mirror-install-<version>
If you don't remove the previous install directory, the default
tar
behaviour will NOT overwrite the existing directory, hence you won't get the new version.
Expand the tarball
tar zxvf hms-mirror-<version>-dist.tar.gz
.Make sure you STOP any previous release of
hms-mirror
before attempting install.hms-mirror --stop
Two options for installation:
As the root user (or
sudo
), runhms-mirror-install-<version>/setup.sh
. This will install thehms-mirror
packages in/usr/local/hms-mirror
and create symlinks for the executables in/usr/local/bin
. At this point,hms-mirror
should be available to all user and in the default path.As the local user, run
hms-mirror-install-<version>/setup.sh
. This will install thehms-mirror
packages in$HOME/.hms-mirror
and create symlink in$HOME/bin
. Ensure$HOME/bin
is in the users path and runhms-mirror
.
DO NOT RUN hms-mirror
from the installation directory.
If you install both options, your environment PATH will determine which one is run. Make note of this because an upgrade may not be reachable.
Quick Start
After installation (above), run hms-mirror
for the particular interface that interests you.
Run:
hms-mirror --service
Open a browser to http://hostname:8090/hms-mirror
to access the Web UI.
`hms-mirror` requires a configuration file describing the LEFT (source) and RIGHT (target) cluster connections. There are two ways to create the config:
Use the default config template) as a starting point. Edit and place a copy here
$HOME/.hms-mirror/cfg/default.yaml
.hms-mirror --setup
- Prompts a series of questions about the LEFT and RIGHT clusters to build the default configuration file.
If either or both clusters are Kerberized, please review the detailed configuration guidance here and here.
General Guidance
Run
hms-mirror
from the RIGHT cluster on an Edge Node.
If running from the LEFT cluster, note that the
-ro/--read-only
feature examines HDFS on the RIGHT cluster. The HDFS client on the LEFT cluster may not support this access.Connecting to HS2 through KNOX (in both clusters, if possible) reduces the complexities of the connection by removing Kerberos from the picture.
The libraries will only support a Kerberos connection to a 'single' version of Hadoop at a time. This is relevant for 'Kerberized' connections to Hive Server 2. The default libraries will support a kerberized connection to a CDP clusters HS2 and HDFS. If the LEFT (source) cluster is Kerberized, including HS2, you will need to make some adjustments.
The LEFT clusters HS2 needs to support any auth mechanism BUT Kerberos.
Use an Ambari Group to setup an independent HS2 instance for this exercise or use KNOX.