Remus provides transparent high availability to ordinary virtual machines running on Xen. It does this by continually live migrating a copy of a running VM to a backup server, which automatically activates if the primary server fails. Key features:
- The backup VM is an exact copy of the primary VM (disk/memory/network). When failure occurs, the VM continues running on the backup host as if failure had never occurred.
- The backup is completely up-to-date: even active TCP sessions are maintained without interruption.
- Protection is transparent: existing guests can be protected without modifying them in any way.
Host (dom0) requirements
- Xen hypervisor with remus support and tools (included with Xen 4.0+)
- Note: Remus is not included with XCP, XenServer, or with some of the Linux pre-packaged versions of Xen, so please check your distribution or you may need to build Xen from source
- Xen dom0 kernel that meets the Remus dom0 requirements
- Shared storage is not required
- DRBD shared storage is supported, allowing faster and automatic re synchronization after a failed host is brought back online
- Otherwise to bring a failed node back online, the VM must be turned off
Guest (domU) requirements
- Xen PV guests that meet the Remus PV domU requirements
- Xen HVM guests don't require any changes for Remus
Installation varies slightly depending upon the host platform, so please see the guides below for examples.
- Install Xen 4.2.1 with Remus and DRBD on Ubuntu 12.10
- Install Xen 4.1.4 with Remus and DRBD on Ubuntu 12.10
Using DRBD instead of blktap2 for storage replication allows for quick resynchronization of the disk backend after failed host is back online. Since storage (re)synchronization is done online - while the VM is operational, there is no need to shutdown the VM. Once storage is synchronized, one can start, stop and restart Remus on a running VM anytime.
However, DRBD must be custom built with support for protocol D (see the above install guides), so the normal packaged versions of DRBD are not suitable.
Note that DRBD will be operated in dual primary mode, which carries a number of risks and management issues. Please research this topic to be aware of the potential complications.
- For PV domUs, Remus requires "suspend event channel" kernel support. Otherwise Remus can run most any PV domU in a degraded performance mode. This kernel support is not widely available (not currently available in Ubuntu, for example), but is available with OpenSUSE. See Remus PV domU requirements for more information.
In Xen 4.0.0:
- Xen hypervisor and tools have Remus support.
- Only linux-2.6.18-xen is supported as Xen dom0 kernel with Remus.
- If using a PV domU you need to run linux-2.6.18-xen as domU kernel.
In Xen 4.0.1:
- Pvops dom0 kernel support for Remus has been added in Xen 4.0.1-rc4, so it's available in Xen 4.0.1 final release. You can use Linux 2.6.32 based pvops dom0 kernel with Remus.
- PV domU kernel still needs to be linux-2.6.18-xen.
In Xen 4.2:
- Many bugfixes to Remus.
- Remus support for pvops domU kernels: Linux 22.214.171.124 and later upstream kernel.org versions are now supported as PV domU kernels, in addition to Jeremy's xen.git xen/stable-2.6.32.x branch.
- For better Remus performance you should use a domU kernel with "suspend event channel" support, which means linux-2.6.18-xen, or any of the xenlinux forwardports (novell sles11sp1 2.6.32 kernel, for example). pvops domU kernels don't have suspend event channel support yet.
- Checkpoint compression for less data to transfer between hosts.
In Xen 4.4:
- Experimental support for xenlight (xl). Per the man page, there is no support for network or disk buffering at the moment.
Xen 4.5 plans: as per 4.5 Development Update
- Significant improvements to Remus in Xen are currently reviewed, such as better XL integration, better DRDB support, stability and performance improvements and enablement of COLO support
- COLO support in Xen (see COLO XPDS13 video, COLO XPDS13 presentation)
Note that if using linux-2.6.18-xen kernel it needs to be new enough to include Remus support/patches! It's recommended to download the latest version from linux-2.6.18-xen.hg mercurial repository for use with Remus.
(Note: the Remus links at nss.cs.ubc.ca are broken. If you have current links, please edit those below)
- The Remus project web site was at http://nss.cs.ubc.ca/remus/ but is no longer present as of June 2014. You can find the site via this Wayback Machine link.
- Remus documentation was at http://nss.cs.ubc.ca/remus/doc.html but is no longer present. You can use this Wayback Machine link.
Rcpavlicek 18:49, 19 June 2014 (UTC): It appears that the 4.4 Man Pages only have the "xl remus" command syntax, not the "remus" command used if you are still using xm. The "remus" command appears to have a similar, but not identical, syntax to "xl remus". See Remus_Toolkit_Differences. Also, "xl remus" still has limitations in the 4.4 release, per the man page.
- Research/design paper about Remus: https://www.usenix.org/legacy/event/nsdi08/tech/full_papers/cully/cully.pdf
- Configuring and installing Remus tutorial: http://remusha.wikidot.com/
- Video demonstration of Remus: http://joburg.eu/en/video/2jV4lOgFJMY/Screen-cast-Remus-High-Availability-based-on-Xen-Hypervisor
- Latest linux-2.6.18-xen kernel is available from Mercurial tree at http://xenbits.xen.org/linux-2.6.18-xen.hg .
- Remus version of DRBD git://aramis.nss.cs.ubc.ca/drbd-8.3-remus