Xen User Manual
DISCLAIMER: This documentation is currently under active development and as such there may be mistakes and omissions; watch out for these and please report any you find to the developers' mailing list. Contributions of material, suggestions and corrections are welcome.
Contents

Part I: Introduction and Tutorial

1 Introduction
  1.1 Structure of a Xen-Based System
  1.2 Hardware Support
  1.3 History

2 Installation
  2.1 Prerequisites
  2.2 Installing from Binary Tarball
  2.3 Installing from Source
    2.3.1 Obtaining the Source
    2.3.2 Building from Source
    2.3.3 Custom XenLinux Builds
    2.3.4 Installing the Binaries
  2.4 Configuration
    2.4.1 GRUB Configuration
    2.4.2 Serial Console (optional)
    2.4.3 TLS Libraries
  2.5 Booting Xen

3 Starting Additional Domains
  3.1 Creating a Domain Configuration File
  3.2 Booting the Domain
  3.3 Example: ttylinux
  3.4 Starting / Stopping Domains Automatically

4 Domain Management Tools
  4.1 Command-line Management
    4.1.1 Basic Management Commands
    4.1.2 xm list
  4.2 Domain Save and Restore
  4.3 Live Migration
  4.4 Managing Domain Memory
    4.4.1 Setting memory footprints from dom0
    4.4.2 Setting memory footprints from within a domain
    4.4.3 Setting memory limits

5 Domain Filesystem Storage
  5.1 Exporting Physical Devices as VBDs
  5.2 Using File-backed VBDs
  5.3 Using LVM-backed VBDs
  5.4 Using NFS Root

Part II: Reference

6 Control Software
7 Domain Configuration
8 Build, Boot and Debug Options
9 Further Support
  9.1 Other Documentation
  9.2 Online References
  9.3 Mailing Lists

A Installing Xen / XenLinux on Debian
Part I

Introduction and Tutorial
Chapter 1
Introduction
Xen is a paravirtualising virtual machine monitor (VMM), or hypervisor, for the x86 processor architecture. Xen can securely execute multiple virtual machines on a single physical system with close-to-native performance. The virtual machine technology facilitates enterprise-grade functionality, including:

- Virtual machines with performance close to native hardware.
- Live migration of running virtual machines between physical hosts.
Paravirtualisation permits very high performance virtualisation, even on architectures like x86 that are traditionally very hard to virtualise. The drawback of this approach is that it requires operating systems to be ported to run on Xen. Porting an OS to run on Xen is similar to supporting a new hardware platform; however, the process is simplified because the paravirtual machine architecture is very similar to the underlying native hardware. Even though operating system kernels must explicitly support Xen, a key feature is that user space applications and libraries do not require modification.

Xen support is available for increasingly many operating systems: right now, Linux 2.4, Linux 2.6 and NetBSD are available for Xen 2.0. A FreeBSD port is undergoing testing and will be incorporated into the release soon. Other OS ports, including Plan 9, are in progress. We hope that the arch-xen patches will be incorporated into the mainstream releases of these operating systems in due course (as has already happened for NetBSD).

Possible usage scenarios for Xen include:

Kernel development. Test and debug kernel modifications in a sandboxed virtual machine; no need for a separate test machine.

Multiple OS configurations. Run multiple operating systems simultaneously, for instance for compatibility or QA purposes.
Server consolidation. Move multiple servers onto a single physical host with performance and fault isolation provided at virtual machine boundaries.

Cluster computing. Management at VM granularity provides more flexibility than separately managing each physical host, but better control and isolation than single-system image solutions, particularly by using live migration for load balancing.

Hardware support for custom OSes. Allow development of new OSes while benefiting from the wide-ranging hardware support of existing OSes such as Linux.
1.2 Hardware Support

Xen itself contains only the code required to detect and start secondary processors, set up interrupt routing, and perform PCI bus enumeration. Device drivers run within a privileged guest OS rather than within Xen itself. This approach provides compatibility with the majority of device hardware supported by Linux. The default XenLinux build contains support for relatively modern server-class network and disk hardware, but you can add support for other hardware by configuring your XenLinux kernel in the normal way.
1.3 History
Xen was originally developed by the Systems Research Group at the University of Cambridge Computer Laboratory as part of the XenoServers project, funded by the UK-EPSRC. XenoServers aim to provide a public infrastructure for global distributed computing, and Xen plays a key part in that, allowing us to efficiently partition a single machine to enable multiple independent clients to run their operating systems and applications in an environment providing protection, resource isolation and accounting. The project web page contains further information along with pointers to papers and technical reports:

http://www.cl.cam.ac.uk/xeno

Xen has since grown into a fully-fledged project in its own right, enabling us to investigate interesting research issues regarding the best techniques for virtualising resources such as the CPU, memory, disk and network. The project has been bolstered by support from Intel Research Cambridge and HP Labs, who are now working closely with us.

Xen was first described in a paper presented at SOSP in 2003 [1], and the first public release (1.0) was made that October. Since then, Xen has significantly matured and is now used in production scenarios on many sites. Xen 2.0 features greatly enhanced hardware support, configuration flexibility, usability and a larger complement of supported operating systems. This latest release takes Xen a step closer to becoming the definitive open source solution for virtualisation.
[1] http://www.cl.cam.ac.uk/netos/papers/2003-xensosp.pdf
Chapter 2
Installation
The Xen distribution includes three main components: Xen itself, ports of Linux 2.4 and 2.6 and NetBSD to run on Xen, and the user-space tools required to manage a Xen-based system. This chapter describes how to install the Xen 2.0 distribution from source. Alternatively, there may be pre-built packages available as part of your operating system distribution.
2.1 Prerequisites
The following is a full list of prerequisites. Some items are required by the xend control tools, and hence required if you want to run more than one virtual machine; others are only required if you wish to build from source.
- A working Linux distribution using the GRUB bootloader and running on a P6-class (or newer) CPU.
- The Linux bridge-utils (e.g., /sbin/brctl).
- The iproute2 package.
- An installation of Twisted v1.3 or above. There may be a binary package available for your distribution; alternatively it can be installed by running make install-twisted in the root of the Xen source tree.
- Development installation of libcurl (e.g., libcurl-devel).
- Build tools (gcc v3.2.x or v3.3.x, binutils, GNU make).
Once you have satisfied the relevant prerequisites, you can now install either a binary or source distribution of Xen.
2.3.1 Obtaining the Source
The Xen source tree is available as either a compressed source tarball or as a clone of our master BitKeeper repository.

Obtaining the Source Tarball

Stable versions (and daily snapshots) of the Xen source tree are available as compressed tarballs from the Xen download page:

http://xen.sf.net

Using BitKeeper

If you wish to install Xen from a clone of our latest BitKeeper repository then you will need to install the BitKeeper tools. Download instructions for BitKeeper can be obtained by filling out the form at:

http://www.bitmover.com/cgi-bin/download.cgi

The public master BK repository for the 2.0 release lives at:

bk://xen.bkbits.net/xen-2.0.bk

You can use BitKeeper to download it and keep it updated with the latest features and fixes. Change to the directory in which you want to put the source code, then run:
# bk clone bk://xen.bkbits.net/xen-2.0.bk

Under your current directory, a new directory named xen-2.0.bk has been created, which contains all the source code for Xen, the OS ports, and the control tools. You can update your repository with the latest changes at any time by running:

# cd xen-2.0.bk   # to change into the local repository
# bk pull         # to update the repository
2.3.2 Building from Source
The top-level Xen Makefile includes a target world that will do the following:

- Build Xen.
- Build the control tools, including xend.
- Download (if necessary) and unpack the Linux 2.6 source code, and patch it for use with Xen.
- Build a Linux kernel to use in domain 0 and a smaller unprivileged kernel, which can optionally be used for unprivileged virtual machines.

After the build has completed you should have a top-level directory called dist/ in which all resulting targets will be placed; of particular interest are the two XenLinux kernel images, one with a -xen0 extension which contains hardware device drivers and drivers for Xen's virtual devices, and one with a -xenU extension that just contains the virtual ones. These are found in dist/install/boot/ along with the image for Xen itself and the configuration files used during the build.

The NetBSD port can be built using:

# make netbsd20

The NetBSD port is built using a snapshot of the netbsd-2-0 CVS branch. The snapshot is downloaded as part of the build process, if it is not yet present in the NETBSD_SRC_PATH search path. The build process also downloads a toolchain which includes all the tools necessary to build the NetBSD kernel under Linux.

To customize further the set of kernels built you need to edit the top-level Makefile. Look for the line:

KERNELS ?= mk.linux-2.6-xen0 mk.linux-2.6-xenU

You can edit this line to include any set of operating system kernels which have configurations in the top-level buildconfigs/ directory, for example mk.linux-2.4-xenU to build a Linux 2.4 kernel containing only virtual device drivers.
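Putting this together, a complete from-source build and install might look like the following sketch, assuming the source tree was unpacked into xen-2.0/ (the install target is described in Section 2.3.4):

# cd xen-2.0
# make world
# make install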
2.3.3 Custom XenLinux Builds
If you wish to build a customized XenLinux kernel (e.g. to support additional devices or enable distribution-required features), you can use the standard Linux configuration mechanisms, specifying that the architecture being built for is xen, e.g.:

# cd linux-2.6.11-xen0
# make ARCH=xen xconfig
# cd ..
# make
You can also copy an existing Linux configuration (.config) into linux-2.6.11-xen0 and execute:

# make ARCH=xen oldconfig

You may be prompted with some Xen-specific options; we advise accepting the defaults for these options. Note that the only difference between the two types of Linux kernel that are built is the configuration file used for each. The "U" suffixed (unprivileged) versions don't contain any of the physical hardware device drivers, leading to a 30% reduction in size; hence you may prefer these for your non-privileged domains. The "0" suffixed privileged versions can be used to boot the system, as well as in driver domains and unprivileged domains.
2.3.4 Installing the Binaries
The files produced by the build process are stored under the dist/install/ directory. To install them in their default locations, do:

# make install

Alternatively, users with special installation requirements may wish to install them manually by copying the files to their appropriate destinations. The dist/install/boot directory will also contain the config files used for building the XenLinux kernels, and also versions of Xen and XenLinux kernels that contain debug symbols (xen-syms-2.0.6 and vmlinux-syms-2.6.11.11-xen0) which are essential for interpreting crash dumps. Retain these files as the developers may wish to see them if you post on the mailing list.
2.4 Configuration
Once you have built and installed the Xen distribution, it is simple to prepare the machine for booting and running Xen.
2.4.1 GRUB Configuration
An entry should be added to grub.conf (often found under /boot/ or /boot/grub/) to allow Xen / XenLinux to boot. This file is sometimes called menu.lst, depending on your distribution. The entry should look something like the following:
title Xen 2.0 / XenLinux 2.6
    kernel /boot/xen-2.0.gz dom0_mem=131072
    module /boot/vmlinuz-2.6-xen0 root=/dev/sda4 ro console=tty0
The kernel line tells GRUB where to find Xen itself and what boot parameters should be passed to it (in this case, setting domain 0's memory allocation in kilobytes). For more details on the various Xen boot parameters see Section 8.2.

The module line of the configuration describes the location of the XenLinux kernel that Xen should start and the parameters that should be passed to it (these are standard Linux parameters, identifying the root device and specifying it be initially mounted read only and instructing that console output be sent to the screen). Some distributions such as SuSE do not require the ro parameter.

If you want to use an initrd, just add another module line to the configuration, as usual:
module /boot/my_initrd.gz
As always when installing a new kernel, it is recommended that you do not delete existing menu options from menu.lst; you may want to boot your old Linux kernel in future, particularly if you have problems.
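Putting the pieces together, a complete entry with an initrd might look like the sketch below; the kernel version, paths and root device are illustrative and should be adjusted for your system:

title Xen 2.0 / XenLinux 2.6 (with initrd)
    kernel /boot/xen-2.0.gz dom0_mem=131072
    module /boot/vmlinuz-2.6-xen0 root=/dev/sda4 ro console=tty0
    module /boot/my_initrd.gz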
2.4.2 Serial Console (optional)
In order to configure Xen serial console output, it is necessary to add a boot option to your GRUB config; e.g. replace the above kernel line with:
kernel /boot/xen.gz dom0_mem=131072 com1=115200,8n1
This configures Xen to output on COM1 at 115,200 baud, 8 data bits, 1 stop bit and no parity. Modify these parameters for your setup. One can also configure XenLinux to share the serial console; to achieve this append console=ttyS0 to your module line.

If you wish to be able to log in over the XenLinux serial console it is necessary to add a line into /etc/inittab, just as per regular Linux. Simply add the line:
c:2345:respawn:/sbin/mingetty ttyS0
and you should be able to log in. Note that logging in as root over the serial line will require adding ttyS0 to /etc/securetty in most modern distributions.
2.4.3 TLS Libraries
Users of the XenLinux 2.6 kernel should disable Thread Local Storage (e.g. by doing a mv /lib/tls /lib/tls.disabled) before attempting to run with a XenLinux kernel. You can always re-enable it by restoring the directory to its original location (i.e. mv /lib/tls.disabled /lib/tls).

The reason for this is that the current TLS implementation uses segmentation in a way that is not permissible under Xen. If TLS is not disabled, an emulation mode is used within Xen which reduces performance substantially. We hope that this issue can be resolved by working with Linux distribution vendors to implement a minor backward-compatible change to the TLS library.
If you boot without first disabling TLS, you will get a warning message during the boot process. In this case, simply perform the rename after the machine is up and then run /sbin/ldconfig to make it take effect.
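For convenience, the full disable / re-enable sequence described above is (paths exactly as given in the text):

# mv /lib/tls /lib/tls.disabled     # disable TLS before running a XenLinux kernel
# mv /lib/tls.disabled /lib/tls     # restore TLS later, if desired
# /sbin/ldconfig                    # needed if the rename is done on a running system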
Chapter 3

Starting Additional Domains
memory Set this to the size of the domain's memory in megabytes (e.g. memory = 64).

disk Set the first entry in this list to calculate the offset of the domain's root partition, based on the domain ID. Set the second to the location of /usr if you are sharing it between domains. For example:

disk = ['phy:your_hard_drive%d,sda1,w' % (base_partition_number + vmid),
        'phy:your_usr_partition,sda6,r' ]

dhcp Uncomment the dhcp variable, so that the domain will receive its IP address from a DHCP server (e.g. dhcp="dhcp").

You may also want to edit the vif variable in order to choose the MAC address of the virtual ethernet interface yourself. For example:

vif = ['mac=00:06:AA:F6:BB:B3']

If you do not set this variable, xend will automatically generate a random MAC address from an unused range.
kernel = "/boot/vmlinuz-2.6-xenU" memory = 64 name = "ttylinux" nics = 1 ip = "1.2.3.4" disk = [file:/path/to/ttylinux/rootfs,sda1,w] root = "/dev/sda1 ro" 3. Now start the domain and connect to its console: xm create configfile -c 4. Login as root, password root.
Chapter 4

Domain Management Tools
4.1.1 Basic Management Commands
The most important xm commands are:

# xm list: Lists all domains running.
# xm consoles: Gives information about the domain consoles.
# xm console: Opens a console to a domain (e.g. # xm console myVM).
4.1.2 xm list
The output of xm list is in rows of the following format:

name  domid  memory  cpu  state  cputime  console

name    The descriptive name of the virtual machine.
domid   The number of the domain ID this virtual machine is running in.
memory  Memory size in megabytes.
cpu     The CPU this domain is running on.
state   Domain state, consisting of 5 fields:
          r  running
          b  blocked
          p  paused
          s  shutdown
          c  crashed
cputime How much CPU time (in seconds) the domain has used so far.
console TCP port accepting connections to the domain's console.

The xm list command also supports a long output format when the -l switch is used. This outputs the full details of the running domains in xend's SXP configuration format.

For example, suppose the system is running the ttylinux domain as described earlier. The list command should produce output somewhat like the following:

# xm list
Name      Id  Mem(MB)  CPU  State  Time(s)  Console
Domain-0   0      251    0  r----    172.2
ttylinux   5       63    0  -b---      3.0     9605
Here we can see the details for the ttylinux domain, as well as for domain 0 (which, of course, is always running). Note that the console port for the ttylinux domain is 9605. This can be connected to by TCP using a terminal program (e.g. telnet or, better, xencons). The simplest way to connect is to use the xm console command, specifying the domain name or ID. To connect to the console of the ttylinux domain, we could use any of the following:

# xm console ttylinux
# xm console 5
# xencons localhost 9605
# xm restore ttylinux.xen

This will restore the state of the domain and restart it. The domain will carry on as before and the console may be reconnected using the xm console command, as above.
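The matching save step does not survive in this copy of the manual; assuming xm save takes the domain name followed by a destination file (mirroring the restore invocation above), the state file would have been produced by something like:

# xm save ttylinux ttylinux.xen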
4.4.1 Setting memory footprints from dom0
The machine administrator can request that a domain alter its memory footprint using the xm set-mem command. For instance, we can request that our example ttylinux domain reduce its memory footprint to 32 megabytes:

# xm set-mem ttylinux 32

We can now see the result of this in the output of xm list:

# xm list
Name      Id  Mem(MB)  CPU  State  Time(s)  Console
Domain-0   0      251    0  r----    172.2
ttylinux   5       31    0  -b---      4.3     9605
The domain has responded to the request by returning memory to Xen. We can restore the domain to its original size using the command line:

# xm set-mem ttylinux 64
4.4.2 Setting memory footprints from within a domain
The virtual file /proc/xen/balloon allows the owner of a domain to adjust their own memory footprint. Reading the file (e.g. cat /proc/xen/balloon) prints out the current memory footprint of the domain. Writing the file (e.g. echo <new target> > /proc/xen/balloon) requests that the kernel adjust the domain's memory footprint to a new value.
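As a concrete sketch of this interface (the units of the target value are not stated here, so verify them against your kernel before relying on this; kilobytes are assumed below):

# cat /proc/xen/balloon              # print the current memory footprint
# echo 32768 > /proc/xen/balloon     # request a 32 MB footprint, assuming kB units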
4.4.3 Setting memory limits
Xen associates a memory size limit with each domain. By default, this is the amount of memory the domain is originally started with, preventing the domain from ever growing beyond this size. To permit a domain to grow beyond its original allocation, or to prevent a domain you've shrunk from reclaiming the memory it relinquished, use the xm maxmem command.
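A hypothetical invocation, assuming xm maxmem follows the same domain-then-megabytes argument convention as xm set-mem, would be:

# xm maxmem ttylinux 128    # allow the domain to grow to at most 128 MB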
Chapter 5

Domain Filesystem Storage
If you want read-write sharing, export the directory to other domains via NFS from domain 0 (or use a cluster file system such as GFS or ocfs2).
This limit can be statically increased by using the max_loop module parameter if CONFIG_BLK_DEV_LOOP is compiled as a module in the dom0 kernel, or by using the max_loop=n boot option if CONFIG_BLK_DEV_LOOP is compiled directly into the dom0 kernel.
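For example, to allow 64 loop devices (an illustrative number) when the driver is modular:

# modprobe loop max_loop=64

For a built-in driver, append max_loop=64 to the dom0 kernel's module line in GRUB instead.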
Each of these can grow to have 1GB of differences from the master volume. You can grow the amount of space for storing the differences using the lvextend command, e.g.:

# lvextend -L+100M /dev/vg/myclonedisk1

Don't let the differences volume ever fill up, otherwise LVM gets rather confused. It may be possible to automate the growing process by using dmsetup wait to spot the volume getting full and then issue an lvextend.

In principle, it is possible to continue writing to the volume that has been cloned (the changes will not be visible to the clones), but we wouldn't recommend this: have the cloned volume as a pristine file system install that isn't mounted directly by any of the virtual machines.
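The automation idea might be sketched as below; this is untested, and assumes that dmsetup wait blocks until the snapshot device reports an event and that the device-mapper name for /dev/vg/myclonedisk1 is vg-myclonedisk1:

while true; do
    dmsetup wait vg-myclonedisk1            # block until the device signals an event
    lvextend -L+100M /dev/vg/myclonedisk1   # then grow the differences volume
done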
Finally, configure the domain to use NFS root. In addition to the normal variables, you should make sure to set the following values in the domain's configuration file:
root = "/dev/nfs"
nfs_server = "2.3.4.5"      # substitute IP address of server
nfs_root = "/path/to/root"  # path to root FS on the server
The domain will need network access at boot time, so either statically configure an IP address (using the config variables ip, netmask, gateway, hostname) or enable DHCP (dhcp="dhcp").

Note that the Linux NFS root implementation is known to have stability problems under high load (this is not a Xen-specific problem), so this configuration may not be appropriate for critical servers.
Part II

Reference
Chapter 6
Control Software
The Xen control software includes the xend node control daemon (which must be running), the xm command line tools, and the prototype xensv web interface.
A SysV init script called xend is provided to start xend at boot time. make install installs this script in /etc/init.d. To enable it, you have to make symbolic links in the appropriate runlevel directories or use the chkconfig tool, where available. Once xend is running, more sophisticated administration can be done using the xm tool (see Section 6.2) and the experimental Xensv web interface (see Section 6.3). As xend runs, events will be logged to /var/log/xend.log and, if the migration assistant daemon (xfrd) has been started, /var/log/xfrd.log. These may be of use for troubleshooting problems.
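For example, on a distribution that provides chkconfig the script can be enabled with the first command below; on other systems, create the runlevel symlinks by hand (the runlevel and priority shown are illustrative):

# chkconfig xend on

or, manually:

# ln -s ../init.d/xend /etc/rc3.d/S97xend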
Chapter 7
Domain Conguration
The following contains the syntax of the domain configuration files and a description of how to further specify networking, driver domain and general scheduling behaviour.
dhcp      Set to "dhcp" if you want to use DHCP to configure networking.
netmask   Manually configured IP netmask.
gateway   Manually configured IP gateway.
hostname  Set the hostname for the virtual machine.
root      Specify the root device parameter on the kernel command line.
nfs_server  IP address for the NFS server (if any).
nfs_root    Path of the root filesystem on the NFS server (if any).
extra     Extra string to append to the kernel command line (if any).
restart   Three possible options:
            always    Always restart the domain, no matter what its exit code is.
            never     Never restart the domain.
            onreboot  Restart the domain iff it requests reboot.

For additional flexibility, it is also possible to include Python scripting commands in configuration files. An example of this is the xmexample2 file, which uses Python code to handle the vmid variable.
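To illustrate, a fragment of a domain configuration file exercising several of these variables might read as follows (all values are examples only):

hostname = "vm1"
dhcp     = "dhcp"          # or set ip / netmask / gateway manually
root     = "/dev/sda1 ro"
extra    = "4"             # e.g. append a runlevel to the kernel command line
restart  = "onreboot"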
7.2.1 Xen virtual network topology
Each domain network interface is connected to a virtual network interface in dom0 by a point-to-point link (effectively a "virtual crossover cable"). These devices are named vif<domid>.<vifid> (e.g. vif1.0 for the first interface in domain 1, vif3.1 for the second interface in domain 3).

Traffic on these virtual interfaces is handled in domain 0 using standard Linux mechanisms for bridging, routing, rate limiting, etc. Xend calls on two shell scripts to perform initial configuration of the network and configuration of new virtual interfaces. By default, these scripts configure a single bridge for all the virtual interfaces. Arbitrary routing / bridging configurations can be configured by customising the scripts, as described in the following section.
7.2.2 Xen networking scripts
Xen's virtual networking is configured by two shell scripts (by default network and vif-bridge). These are called automatically by xend when certain events occur, with arguments to the scripts providing further contextual information. These scripts are found by default in /etc/xen/scripts. The names and locations of the scripts can be configured in /etc/xen/xend-config.sxp.

network: This script is called whenever xend is started or stopped to respectively initialise or tear down the Xen virtual network. In the default configuration initialisation creates the bridge xen-br0 and moves eth0 onto that bridge, modifying the routing accordingly. When xend exits, it deletes the Xen bridge and removes eth0, restoring the normal IP and routing configuration.

vif-bridge: This script is called for every domain virtual interface and can configure firewalling rules and add the vif to the appropriate bridge. By default, this adds and removes VIFs on the default Xen bridge.

For more complex network setups (e.g. where routing is required or integration with existing bridges is needed) these scripts may be replaced with customised variants for your site's preferred configuration.
domain is allowed to access. The numbers x, y and z may be in either decimal or hexadecimal format.

Flat Format: Include a list of PCI device addresses of the format:

pci = ['x,y,z', ...]

where each element in the list is a string specifying the components of the PCI device address, separated by commas. The components (x, y and z) of the list may be formatted as either decimal or hexadecimal.
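For instance, granting a domain access to a hypothetical device at bus 0x01, device 0x04, function 0, and assuming the x, y and z components denote bus, device and function respectively, would look like:

pci = ['0x01,0x04,0x0']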
7.4.1 Borrowed Virtual Time (BVT)

sched=bvt (the default)

BVT provides proportional fair shares of the CPU time. It has been observed to penalise domains that block frequently (e.g. I/O intensive domains), but this can be compensated for by using warping.

Global parameters:

ctx_allow  The context switch allowance is similar to the quantum in traditional schedulers. It is the minimum time that a scheduled domain will be allowed to run before being pre-empted.

Per-domain parameters:

mcuadv  The MCU (Minimum Charging Unit) advance determines the proportional share of the CPU that a domain receives. It is set inversely proportionally to a domain's sharing weight.
warp    The amount of virtual time the domain is allowed to warp backwards.
warpl   The warp limit is the maximum time a domain can run warped for.
warpu   The unwarp requirement is the minimum time a domain must run unwarped for before it can warp again.
7.4.2 Atropos

sched=atropos

Atropos is a soft real time scheduler. It provides guarantees about absolute shares of the CPU, with a facility for sharing slack CPU time on a best-effort basis. It can provide timeliness guarantees for latency-sensitive domains.

Every domain has an associated period and slice. The domain should receive 'slice' nanoseconds every 'period' nanoseconds. This allows the administrator to configure both the absolute share of the CPU a domain receives and the frequency with which it is scheduled.

Note: don't overcommit the CPU when using Atropos (i.e. don't reserve more CPU than is available; the utilisation should be kept to slightly less than 100% in order to ensure predictable behaviour).

Per-domain parameters:

period    The regular time interval during which a domain is guaranteed to receive its allocation of CPU time.
slice     The length of time per period that a domain is guaranteed to run for (in the absence of voluntary yielding of the CPU).
latency   The latency hint is used to control how soon after waking up a domain should be scheduled.
xtratime  This is a boolean flag that specifies whether a domain should be allowed a share of the system slack time.
7.4.3 Round Robin

sched=rrobin

The round robin scheduler is included as a simple demonstration of Xen's internal scheduler API. It is not intended for production use.

Global parameters:

rr_slice  The maximum time each domain runs before the next scheduling decision is made.
Chapter 8

Build, Boot and Debug Options
watchdog  Enable NMI watchdog which can report certain failures.

noirqbalance  Disable software IRQ balancing and affinity. This can be used on systems such as Dell 1850/2850 that have workarounds in hardware for IRQ-routing issues.

badpage=<page number>,<page number>,...  Specify a list of pages not to be allocated for use because they contain bad bytes. For example, if your memory tester says that byte 0x12345678 is bad, you would place badpage=0x12345 on Xen's command line.

com1=<baud>,DPS,<io base>,<irq>
com2=<baud>,DPS,<io base>,<irq>  Xen supports up to two 16550-compatible serial ports. For example: com1=9600,8n1,0x408,5 maps COM1 to a 9600-baud port, 8 data bits, no parity, 1 stop bit, I/O port base 0x408, IRQ 5. If some configuration options are standard (e.g., I/O base and IRQ), then only a prefix of the full configuration string need be specified. If the baud rate is pre-configured (e.g., by the bootloader) then you can specify auto in place of a numeric baud rate.

console=<specifier list>  Specify the destination for Xen console I/O. This is a comma-separated list of, for example:
  vga    use VGA console and allow keyboard input
  com1   use serial port com1
  com2H  use serial port com2. Transmitted chars will have the MSB set. Received chars must have MSB set.
  com2L  use serial port com2. Transmitted chars will have the MSB cleared. Received chars must have MSB cleared.
The latter two examples allow a single port to be shared by two subsystems (e.g. console and debugger). Sharing is controlled by the MSB of each transmitted/received character. [NB. Default for this option is com1,vga]

sync_console  Force synchronous console output. This is useful if your system fails unexpectedly before it has sent all available output to the console. In most cases Xen will automatically enter synchronous mode when an exceptional event occurs, but this option provides a manual fallback.

conswitch=<switch-char><auto-switch-char>  Specify how to switch serial-console input between Xen and DOM0. The required sequence is CTRL-<switch-char> pressed three times. Specifying the backtick character disables switching. The <auto-switch-char> specifies whether Xen should auto-switch input to DOM0 when it boots; if it is 'x' then auto-switching is disabled. Any other value, or omitting the character, enables auto-switching. [NB. default switch-char is 'a']

nmi=xxx  Specify what to do with an NMI parity or I/O error:
  nmi=fatal:  Xen prints a diagnostic and then hangs.
  nmi=dom0:   Inform DOM0 of the NMI.
  nmi=ignore: Ignore the NMI.

mem=xxx  Set the physical RAM address limit. Any RAM appearing beyond this physical address in the memory map will be ignored. This parameter may be specified with a B, K, M or G suffix, representing bytes, kilobytes, megabytes and gigabytes respectively. The default unit, if no suffix is specified, is bytes.

dom0_mem=xxx  Set the amount of memory to be allocated to domain 0. In Xen 3.x the parameter may be specified with a B, K, M or G suffix, representing bytes, kilobytes, megabytes and gigabytes respectively; if no suffix is specified, the parameter defaults to kilobytes. In previous versions of Xen, suffixes were not supported and the value is always interpreted as kilobytes.

tbuf_size=xxx  Set the size of the per-cpu trace buffers, in pages (default 1). Note that the trace buffers are only enabled in debug builds. Most users can ignore this feature completely.

sched=xxx  Select the CPU scheduler Xen should use. The current possibilities are bvt (default), atropos and rrobin. For more information see Section 7.4.

apic_verbosity=debug,verbose  Print more detailed information about local APIC and IOAPIC configuration.

lapic  Force use of local APIC even when left disabled by uniprocessor BIOS.

nolapic  Ignore local APIC in a uniprocessor system, even if enabled by the BIOS.

apic=bigsmp,default,es7000,summit  Specify NUMA platform. This can usually be probed automatically.

In addition, the following options may be specified on the Xen command line. Since domain 0 shares responsibility for booting the platform, Xen will automatically propagate these options to its command line. These options are taken from Linux's command-line syntax with unchanged semantics.

acpi=off,force,strict,ht,noirq,...  Modify how Xen (and domain 0) parses the BIOS ACPI tables.

acpi_skip_timer_override  Instruct Xen (and domain 0) to ignore timer-interrupt override instructions specified by the BIOS ACPI tables.

noapic  Instruct Xen (and domain 0) to ignore any IOAPICs that are present in the system, and instead continue to use the legacy PIC.
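As an illustration, a GRUB kernel line combining several of the options above (all values are examples only) might read:

kernel /boot/xen-2.0.gz dom0_mem=131072 com1=115200,8n1 console=com1,vga sync_console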
xencons=xxx  Specify the device node to which the Xen virtual console driver is attached. The following options are supported:
  xencons=off:  disable virtual console
  xencons=tty:  attach console to /dev/tty1 (tty0 at boot-time)
  xencons=ttyS: attach console to /dev/ttyS0
The default is ttyS for dom0 and tty for all other domains.
8.4 Debugging
Xen has a set of debugging features that can be useful to try and figure out what's going on. Hit 'h' on the serial line (if you specified a baud rate on the Xen command line) or ScrollLock-h on the keyboard to get a list of supported commands.

If you have a crash you'll likely get a crash dump containing an EIP (PC) which, along with an objdump -d image, can be useful in figuring out what's happened. Debug a XenLinux image just as you would any other Linux kernel.
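For example, to produce a disassembly to match an EIP against, you might run objdump over the debug-symbol kernel built earlier (Section 2.3.4); the path assumes that file was copied to /boot during installation:

# objdump -d /boot/vmlinux-syms-2.6.11.11-xen0 > vmlinux.dis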
Chapter 9
Further Support
If you have questions that are not answered by this manual, the sources of information listed below may be of interest to you. Note that bug reports, suggestions and contributions related to the software (or the documentation) should be sent to the Xen developers' mailing list (address below).
xen-changelog@lists.xensource.com  Changelog feed from the unstable and 2.0 trees; developer oriented. Subscribe at:
http://lists.xensource.com/xen-changelog
Appendix A

Installing Xen / XenLinux on Debian
If you're going to use this filesystem / disk image only as a template for other VM disk images, something like 300 MB should be enough (of course it depends on what kind of packages you are planning to install to the template).

3. Create the filesystem and initialise the swap image:
mkfs.ext3 /path/diskimage
mkswap /path/swapimage
5. Install debootstrap. Make sure you have debootstrap installed on the host. If you are running Debian sarge (3.1 / testing) or unstable you can install it by running apt-get install debootstrap. Otherwise, it can be downloaded from the Debian project website.

6. Install the Debian base system to the disk image:
debootstrap --arch i386 sarge /mnt/disk \
http://ftp.<countrycode>.debian.org/debian
You can use any other Debian http/ftp mirror you want.

7. When debootstrap completes successfully, modify settings:
chroot /mnt/disk /bin/bash
Edit the following files using vi or nano and make needed changes:
/etc/hostname
/etc/hosts
/etc/resolv.conf
/etc/network/interfaces
/etc/networks
9. Create a Xen 2.0 configuration file for the new domain. You can use the example configurations coming with Xen as a template. Make sure you have the following set up:
disk = ['file:/path/diskimage,sda1,w', 'file:/path/swapimage,sda2,w']
root = "/dev/sda1 ro"
11. Attach to the console of the new domain. You should see something like this when starting the new domain:
Started domain testdomain2, console on port 9626
There you can see the ID of the console: 26. You can also list the consoles with xm consoles (ID is the last two digits of the port number.)
or by telnetting to port 9626 of localhost (the xm console program works better).

12. Log in and run base-config. By default there's no password for root. Check that everything looks OK and that the system started without errors. Check that the swap is active and the network settings are correct. Run /usr/sbin/base-config to set up the Debian settings. Set up the password for root using passwd.

13. Done. You can exit the console by pressing Ctrl + ].

If you need to create new domains, you can just copy the contents of the template image to the new disk images, either by mounting the template and the new image and using cp -a or tar, or by simply copying the image file. Once this is done, modify the image-specific settings (hostname, network settings, etc.).
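For instance, the simplest route of copying the image file directly and then adjusting its settings might look like this (paths are illustrative):

cp /path/diskimage /path/newvm-diskimage
mount -o loop /path/newvm-diskimage /mnt/disk
chroot /mnt/disk /bin/bash    # then edit /etc/hostname, /etc/network/interfaces, etc.
umount /mnt/disk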
Appendix B
The one slight complication with the above is that /sbin/portmap is dynamically linked against /usr/lib/libwrap.so.0. Since this is in /usr, it won't work. This can be solved by copying the file (and link) below the /usr mount point and just letting the file be covered when the mount happens.

In some installations, where a shared read-only /usr is being used, it may be desirable to move other large directories over into the read-only /usr. For example, you might replace /bin, /lib and /sbin with links into /usr/root/bin, /usr/root/lib and /usr/root/sbin respectively. This creates other problems for running the /linuxrc script, requiring bash, portmap, mount, ifconfig, and a handful of other shared libraries to be copied below the mount point; a simple statically-linked C program would solve this problem.
Appendix C
Glossary of Terms
Atropos  One of the CPU schedulers provided by Xen. Atropos provides domains with absolute shares of the CPU, with timeliness guarantees and a mechanism for sharing out slack time.

BVT  The BVT scheduler is used to give proportional fair shares of the CPU to domains.

Exokernel  A minimal piece of privileged code, similar to a microkernel but providing a more hardware-like interface to the tasks it manages. This is similar to a paravirtualising VMM like Xen, but was designed as a new operating system structure rather than specifically to run multiple conventional OSs.

Domain  A domain is the execution context that contains a running virtual machine. The relationship between virtual machines and domains on Xen is similar to that between programs and processes in an operating system: a virtual machine is a persistent entity that resides on disk (somewhat like a program). When it is loaded for execution, it runs in a domain. Each domain has a domain ID.

Domain 0  The first domain to be started on a Xen machine. Domain 0 is responsible for managing the system.

Domain ID  A unique identifier for a domain, analogous to a process ID in an operating system.

Full virtualisation  An approach to virtualisation which requires no modifications to the hosted operating system, providing the illusion of a complete system of real hardware devices.

Hypervisor  An alternative term for VMM, used because it means "beyond supervisor", since it is responsible for managing multiple "supervisor" kernels.

Live migration  A technique for moving a running virtual machine to another physical host, without stopping it or the services running on it.

Microkernel  A small base of code running at the highest hardware privilege level. A microkernel is responsible for sharing CPU and memory (and sometimes other devices) between less privileged tasks running on the system. This is similar to a VMM, particularly a paravirtualising VMM, but typically addresses a different problem space and provides a different kind of interface.

NetBSD/Xen  A port of NetBSD to the Xen architecture.

Paravirtualisation  An approach to virtualisation which requires modifications to the operating system in order to run in a virtual machine. Xen uses paravirtualisation but preserves binary compatibility for user space applications.

Shadow pagetables  A technique for hiding the layout of machine memory from a virtual machine's operating system. Used in some VMMs to provide the illusion of contiguous physical memory; in Xen this is used during live migration.

Virtual Machine  The environment in which a hosted operating system runs, providing the abstraction of a dedicated machine. A virtual machine may be identical to the underlying hardware (as in full virtualisation), or it may differ, as in paravirtualisation.

VMM  Virtual Machine Monitor: the software that allows multiple virtual machines to be multiplexed on a single physical machine.

Xen  Xen is a paravirtualising virtual machine monitor, developed primarily by the Systems Research Group at the University of Cambridge Computer Laboratory.

XenLinux  Official name for the port of the Linux kernel that runs on Xen.