BACK to main page
Backup and restore metadata
In Sherlock, almost all the data is kept in S3. This also means, that you can stop and start Presto any time, you won’t lose any data. But Presto is using the Metastore service to keep metadata about table and view structures. It means, that if you restart Sherlock, although you still have all the old data, but Presto will not remember where to find this data in S3, because the Metastore lost all the schema related info.
You can backup the state of the Metastore to S3, using the
metastore_backup.sh command. You also have
to specify a backup path as a parameter. It will create a backup file in the same S3 bucket you
specified in your sherlock config file. Similarly, you can use the
metastore_restore.sh command to
restore a backup with a given name from S3 after you restarted the Sherlock cluster.
If you are opening now the
sherlock_metastore_backups folder on an S3 browser (e.g. in case of using the local
sandbox S3: http://localhost:9000/minio/sherlock/sherlock_metastore_backups/), then you can
see the dump file generated:
You can restore the backup later (e.g. after restarting the cluster), using the following command:
© 2018, 2019 Earlham Institute (License)