Backup Solution - Rsnapshot¶
Prerequisites¶
- Know how to install additional repositories and snapshots from the command-line
- Know about mounting filesystems external of your machine (external hard drive, remote filesystem, etc.)
- Know how to use an editor (
vi
is used here, but you can use your favorite editor) - Know a little BASH scripting
- Know how to modify the crontab for the root user
- Knowledge of SSH public and private keys (only if you plan to run remote backups from another server)
Introduction¶
rsnapshot is a very powerful backup utility that can be installed on any Linux-based machine. It can either back up a machine locally, or you can back up multiple machines, say servers for instance, from a single machine.
rsnapshot uses rsync
and is written entirely in perl with no library dependencies, so there are no weird requirements to install it. In the case of Rocky Linux, you should generally be able to install rnapshot by using the EPEL repository. However, the EPEL for 9.0 does not yet have rsnapshot available. For this reason, on Rocky Linux 9.0, we've included the instructions for installing from source.
This documentation covers the installation of rsnapshot on Rocky Linux only.
Rocky Linux 9.0 - Installing Rsnapshot¶
Since the release of Rocky Linux 9.0, the EPEL repository has not yet built rsnapshot in RPM form. There are currently bugzilla entries referencing others who would like to have this happen, but as of now, that has not happened. The workaround is to install rsnapshot from source. Because this is going to require build utilities, you'll need to install the 'Development Tools' group so that you have everything you need.
Installing Development Tools and Downloading the Source¶
As stated, the first step here is to install the 'Development Tools' group:
dnf groupinstall 'Development Tools'
You also need a few other packages:
dnf install wget unzip rsync openssh-server
Next we will need to download the source files from the GitHub repository. You can do this multiple ways, but the easiest in this case is probably just to download the ZIP file from the repository.
- Go to https://github.com/rsnapshot/rsnapshot
- Click on the Green "Code" button on the right
- Right-click on the "Download ZIP" and copy the link location
- Use
wget
orcurl
to download the copied link. Example:wget https://github.com/rsnapshot/rsnapshot/archive/refs/heads/master.zip
- Unzip the
master.zip
fileunzip master.zip
Building the Source¶
Now that we've got everything on our machine, the next step is to build. When we unzipped the master.zip
file, we ended up with an rsnapshot-master
directory. We will need to change into this for our build procedure. Note that our build is using all of the package defaults, so if you want something else, you'll need to do a little investigation. Also, these steps are directly taken from the GitHub Installation page:
cd rsnapshot-master
Run the authogen.sh
script to generate the configure script:
./autogen.sh
Hint
You may get several lines that look like this:
fatal: not a git repository (or any of the parent directories): .git
These are not fatal.
Next, we need to run configure
with the configuration directory set:
./configure --sysconfdir=/etc
Finally, run make install
:
sudo make install
During all of this, the rsnapshot.conf
file will be created as rsnapshot.conf.default
. We need to copy this over to rsnapshot.conf
and then edit it to fit what we need on our system.
sudo cp /etc/rsnapshot.conf.default /etc/rsnapshot.conf
This covers copying the configuration file over. The section below on "Configuring rsnapshot" will cover the changes needed in this configuration file.
Rocky Linux 8.6 - Installing Rsnapshot¶
All commands shown here are from the command-line on your server or workstation unless otherwise noted.
Installing The EPEL repository¶
We need the EPEL software repository from Fedora to install rsnapshot. To install the repository, just use this command:
sudo dnf install epel-release
The repository should now be active.
Install the Rsnapshot Package¶
Next, install rsnapshot itself:
sudo dnf install rsnapshot
If there are any missing dependencies, those will show up and you simply need to answer the prompt to continue. For example:
dnf install rsnapshot
Last metadata expiration check: 0:00:16 ago on Mon Feb 22 00:12:45 2021.
Dependencies resolved.
========================================================================================================================================
Package Architecture Version Repository Size
========================================================================================================================================
Installing:
rsnapshot noarch 1.4.3-1.el8 epel 121 k
Installing dependencies:
perl-Lchown x86_64 1.01-14.el8 epel 18 k
rsync x86_64 3.1.3-9.el8 baseos 404 k
Transaction Summary
========================================================================================================================================
Install 3 Packages
Total download size: 543 k
Installed size: 1.2 M
Is this ok [y/N]: y
Mounting A Drive or Filesystem For Backup¶
In this step, we show how to mount a hard drive, such as an external USB hard drive, that will be used to back up your system. This particular step is only necessary if you are backing up a single machine or server, as seen in our first example below.
- Plug in the USB drive.
- Type
dmesg | grep sd
which should show you the drive you want to use. In this case, it'll be called sda1.
Example:EXT4-fs (sda1): mounting ext2 file system using the ext4 subsystem
. - Unfortunately (or fortunately depending on your opinion) most modern Linux desktop operating systems automount the drive if they can. This means that, depending on various factors, rsnapshot might lose track of the hard drive. We want the drive to "mount" or make its files available in the same place every time.
To do that, take the drive information revealed in the dmesg command above and typemount | grep sda1
, which should show something like this:/dev/sda1 on /media/username/8ea89e5e-9291-45c1-961d-99c346a2628a
- Type
sudo umount /dev/sda1
to unmount your external hard drive. - Next, create a new mount point for the backup:
sudo mkdir /mnt/backup
- Now mount the drive to your backup folder location:
sudo mount /dev/sda1 /mnt/backup
- Now type
mount | grep sda1
again, and you should see something like this:/dev/sda1 on /mnt/backup type ext2 (rw,relatime)
- Next create a directory that must exist for the backup to continue on the mounted drive. We are using a folder called "storage" for this example:
sudo mkdir /mnt/backup/storage
Note that for a single machine, you will have to either repeat the umount and mount steps each time the drive is plugged in again, or each time the system reboots, or automate these commands with a script.
We recommend automation. Automation is the sysadmin way.
Configuring rsnapshot¶
This is the most important step. It's easy to make a mistake when making changes to the configuration file. The rsnapshot configuration requires tabs for any separation between elements, and a warning to that effect is at the very top of the configuration file.
A space character will cause the entire configuration—and your backup—to fail. For instance, near the top of the configuration file is a section for the # SNAPSHOT ROOT DIRECTORY #
. If you were adding this in from scratch, you would type snapshot_root
then TAB and then type /whatever_the_path_to_the_snapshot_root_will_be/
The best thing is that the default configuration that comes with rsnapshot only needs minor changes to make it work for a backup of a local machine. It's always a good idea, though, to make a backup copy of the configuration file before you start editing:
cp /etc/rsnapshot.conf /etc/rsnapshot.conf.bak
Basic Machine or Single Server Backup¶
In this case, rsnapshot is going to be run locally to back up a particular machine. In this example, we'll break down the configuration file, and show you exactly what you need to change.
You will need to use vi
(or edit with your favorite editor) to open the /etc/rsnapshot.conf file.
The first thing to change is the snapshot_root setting which by default has this value:
snapshot_root /.snapshots/
We need to change this to our mount point that we created above plus the addition of "storage".
snapshot_root /mnt/backup/storage/
We also want to tell the backup NOT to run if the drive is not mounted. To do this, remove the "#" sign (also called a remark, pound sign, number sign, hash symbol, etc.) next to no_create_root so that it looks like this:
no_create_root 1
Next go down to the section titled # EXTERNAL PROGRAM DEPENDENCIES #
and remove the comment (again, the "#" sign) from this line:
#cmd_cp /usr/bin/cp
So that it now reads:
cmd_cp /usr/bin/cp
While we do not need cmd_ssh for this particular configuration, we will need it for our other option below and it doesn't hurt to have it enabled. So find the line that says:
#cmd_ssh /usr/bin/ssh
And remove the "#" sign so that it looks like this:
cmd_ssh /usr/bin/ssh
Next we need to skip down to the section titled # BACKUP LEVELS / INTERVALS #
This has been changed from earlier versions of rsnapshot from hourly, daily, monthly, yearly
to alpha, beta, gamma, delta
. Which is a bit confusing. What you need to do is add a remark to any interval that you won't be using. In the configuration, delta is already remarked out.
For this example, we aren't going to be running any other increments other than a nightly backup, so just add a remark to alpha and gamma so that the configuration looks like this when you are done:
#retain alpha 6
retain beta 7
#retain gamma 4
#retain delta 3
Now skip down to the logfile line, which by default should read:
#logfile /var/log/rsnapshot
And remove the remark so that it is enabled:
logfile /var/log/rsnapshot
Finally, skip down to the ### BACKUP POINTS / SCRIPTS ###
section and add any directories that you want to add in the # LOCALHOST
section, remember to use TAB rather than SPACE between elements!
For now write your changes (SHIFT :wq!
for vi
) and exit the configuration file.
Checking The Configuration¶
We want to make sure that we didn't add spaces or any other glaring errors to our configuration file while we were editing it. To do this, we run rsnapshot against our configuration with the configtest option:
rsnapshot configtest
will show Syntax OK
if there are no errors in the configuration.
You should get into the habit of running configtest against a particular configuration. The reason for that will be more evident when we get into the Multiple Machine or Multiple Server Backups section.
To run configtest against a particular configuration file, run it with the -c option to specify the configuration:
rsnapshot -c /etc/rsnapshot.conf configtest
Running The Backup The First Time¶
Everything has checked out, so it's time to go ahead and run the backup for the first time. You can run this in test mode first if you like, so that you can see what the backup script is going to do.
Again, to do this you don't necessarily have to specify the configuration in this case, but you should get into the habit of doing so:
rsnapshot -c /etc/rsnapshot.conf -t beta
Which should return something like this, showing you what will happen when the backup is actually run:
echo 1441 > /var/run/rsnapshot.pid
mkdir -m 0755 -p /mnt/backup/storage/beta.0/
/usr/bin/rsync -a --delete --numeric-ids --relative --delete-excluded \
/home/ /mnt/backup/storage/beta.0/localhost/
mkdir -m 0755 -p /mnt/backup/storage/beta.0/
/usr/bin/rsync -a --delete --numeric-ids --relative --delete-excluded /etc/ \
/mnt/backup/storage/beta.0/localhost/
mkdir -m 0755 -p /mnt/backup/storage/beta.0/
/usr/bin/rsync -a --delete --numeric-ids --relative --delete-excluded \
/usr/local/ /mnt/backup/storage/beta.0/localhost/
touch /mnt/backup/storage/beta.0/
Once you are satisfied with the test, go ahead and run it manually the first time without the test:
rsnapshot -c /etc/rsnapshot.conf beta
When the backup finishes, navigate to /mnt/backup and take a look at the directory structure that was created there. There will be a storage/beta.0/localhost
directory, followed by the directories that you specified to backup.
Further Explanation¶
Each time the backup is run, it will create a new beta increment, 0-6, or 7 days worth of backups. The newest backup will always be beta.0 whereas yesterday's backup will always be beta.1.
The size of each of these backups will appear to take up the same amount (or more) of disk space, but this is because of rsnapshot's use of hard links. To restore files from yesterday's backup, you would simply copy them back from beta.1's directory structure.
Each backup is only an incremental backup from the previous run, BUT, because of the use of hard links, each backup directory, contains either the file or the hard-link to the file in whichever directory it was actually backed up in.
So to restore files, you don't have to pick and choose which directory or increment to restore them from, just what time stamp the backup should have that you are restoring. It's a great system and uses far less disk space than many other backup solutions.
Setting The Backup To Run Automatically¶
Once everything has been tested and we know that things will work without issue, the next step is to set up the crontab for the root user, so that all of this can be done automatically every day:
sudo crontab -e
If you haven't run this before, choose vim.basic as your editor or your own editor preference when the Select an editor
line comes up.
We are going to set our backup to automatically run at 11 PM, so we will add this to the crontab:
## Running the backup at 11 PM
00 23 * * * /usr/bin/rsnapshot -c /etc/rsnapshot.conf beta`
Multiple Machine or Multiple Server Backups¶
Doing backups of multiple machines from a machine with a RAID array or large storage capacity, either on premises or from across the Internet works very well.
If running these backups from across the Internet, you need to make sure that both locations have adequate bandwidth for the backups to occur. You can use rsnapshot to synchronize an on-site server with an off-site backup array or backup server to improve data redundancy.
Assumptions¶
We are assuming that you are running rsnapshot from a machine remotely, on-premise. This exact configuration can be duplicated, as indicated above, remotely off-premise as well.
In this case, you will want to install rsnapshot on the machine that is doing all of the backups. We are also assuming:
- That the servers you will be backing up to, have a firewall rule that allows the remote machine to SSH into it
- That each server that you will be backing up has a recent version of
rsync
installed. For Rocky Linux servers, rundnf install rsync
to update your system's version ofrsync
. - That you've either connected to the machine as the root user, or that you have run
sudo -s
to switch to the root user.
SSH Public / Private Keys¶
For the server that will be running the backups, we need to generate an SSH key-pair for use during the backups. For our example, we will be creating RSA keys.
If you already have a set of keys generated, you can skip this step. You can find out by doing an ls -al .ssh
and looking for an id_rsa and id_rsa.pub key pair. If none exists, use the following link to set up keys for your machine and the server(s) that you want to access:
Rsnapshot Configuration¶
The configuration file needs to be just like the one we created for the Basic Machine or Single Server Backup above, except that we want to change some of the options.
The snapshot root can be reverted back to the default like so:
snapshot_root /.snapshots/
And this line:
no_create_root 1
... can be commented out again:
#no_create_root 1
The other difference here is that each machine will have its very own configuration. Once you get used to this, you'll simply copy one of your existing configuration files over to a new name and then modify it to fit any additional machines that you want to backup.
For now, we want to modify the configuration file just like we did above, and then save it. Then copy that file as a template for our first server:
cp /etc/rsnapshot.conf /etc/rsnapshot_web.conf
We want to modify the new configuration file and create the log and lockfile with the machine's name:
logfile /var/log/rsnapshot_web.log
lockfile /var/run/rsnapshot_web.pid
Next, we want to modify rsnapshot_web.conf so that it includes the directories we want to back up. The only thing that is different here is the target.
Here's an example of the web.ourdomain.com configuration:
### BACKUP POINTS / SCRIPTS ###
backup root@web.ourourdomain.com:/etc/ web.ourourdomain.com/
backup root@web.ourourdomain.com:/var/www/ web.ourourdomain.com/
backup root@web.ourdomain.com:/usr/local/ web.ourdomain.com/
backup root@web.ourdomain.com:/home/ web.ourdomain.com/
backup root@web.ourdomain.com:/root/ web.ourdomain.com/
Checking The Configuration and Running The Initial Backup¶
Just like before, we can now check the configuration to make sure it is syntactically correct:
rsnapshot -c /etc/rsnapshot_web.conf configtest
And just like before, we are looking for the Syntax OK
message. If all is well, we can execute the backup manually:
/usr/bin/rsnapshot -c /etc/rsnapshot_web.conf beta
Assuming that everything works alright, we can then create the configuration files for the mail server (rsnapshot_mail.conf) and portal server (rsnapshot_portal.conf), test them, and do a trial backup.
Automating The Backup¶
Automating backups for the multiple machine/server version is slightly different. We want to create a bash script to call the backups in order. When one finishes the next will start. This script will look something like this and be stored in /usr/local/sbin:
vi /usr/local/sbin/backup_all
With the content:
#!/bin/bash
# script to run rsnapshot backups in succession
/usr/bin/rsnapshot -c /etc/rsnapshot_web.conf beta
/usr/bin/rsnapshot -c /etc/rsnapshot_mail.conf beta
/usr/bin/rsnapshot -c /etc/rsnapshot_portal.conf beta
chmod +x /usr/local/sbin/backup_all
Then create the crontab for root to run the backup script:
crontab -e
And add this line:
## Running the backup at 11 PM
00 23 * * * /usr/local/sbin/backup_all
Reporting The Backup Status¶
To make sure that everything is backing up according to plan, you might want to send the backup log files to your email. If your are running multiple machine backups using rsnapshot, each log file will have its own name, which you can then send to your email for review by Using the postfix For Server Process Reporting procedure.
Restoring a Backup¶
Restoring a backup, either a few files or a complete restore, involves copying the files you want from the directory with the date that you want to restore from back to your machine. Simple!
Conclusions and Other Resources¶
Getting the setup right with rsnapshot is a little daunting at first, but can save you loads of time backing up your machines or servers.
rsnapshot is very powerful, very fast, and very economical on disk space usage. You can find more information on Rsnapshot, by visiting rsnapshot.org
Author: Steven Spencer
Contributors: Ezequiel Bruni