You can use the Commvault software to back up and restore Hadoop (HDFS, HBase, Hive, and Kudu).
Backups
What Is Backed Up
-
HDFS data from all nodes in the Hadoop cluster.
-
For HBase, all HBase namespaces in the Hadoop cluster.
-
For Hive, all Hive databases and Hive tables in the Hadoop cluster.
-
For Kudu, all Kudu tables in the Hadoop cluster.
Backups You Can Perform
-
Full backups
-
Incremental backups
-
Synthetic full backups (only for HDFS)
When You Can Perform Backups
-
On a schedule: The server backup plan that you assign manages scheduled backups
-
On demand: You can perform on-demand backups at any time
Restores
Data You Can Restore
-
Files and folders
-
HBase tables
-
Hive databases and tables
-
Kudu tables
Backups You Can Use for Restores
- Backups from any date/time, including the most recent backup
Destinations You Can Restore To
-
The current location (in place)
-
A different location on the same cluster or a different cluster (out of place)
-
A file server (only for HDFS)