Features/Migration: Difference between revisions

From QEMU
No edit summary
Line 17: Line 17:
This is the roadmap, features are integrated upstream as they are done.
This is the roadmap, features are integrated upstream as they are done.


== TODO Short Term ==
== ToDo list ==


* use TLS for communication (Volunteer?)
* use TLS for communication (Volunteer?)
Line 34: Line 34:
** We could change kernel log code to set the bitmap for "used" pages when we start logging, this would allow us to not migration zero pages at all, right now we have to "allocate" the pages, to check that they are zero, and then sent them as zero pages
** We could change kernel log code to set the bitmap for "used" pages when we start logging, this would allow us to not migration zero pages at all, right now we have to "allocate" the pages, to check that they are zero, and then sent them as zero pages


* Abstract QEMUFile use
* Abstract QEMUFile use (Google Summer of Code Project)


We can change QEMUFile use to something like:
We can change QEMUFile use to something like:


<source lang=c>
<pre>
struct MigrationChannel {
struct MigrationChannel {
void *opaque;
    void *opaque;
uint32 get_uint32(struct MigrationChannel *)
    uint32 get_uint32(struct MigrationChannel *)
int put_uint32(structMigrationCHannel *)
    int put_uint32(structMigrationCHannel *)
/* the same for all the basic types */
/* the same for all the basic types */
}
}
</source>
</pre>


And then change all  
And then change all ocurrences of:


<pre>
qemu_get_sbe32(f, &foo);
</pre>


into
<pre>
foo = MC->get_uint32(MC);
</pre>
Where migration channel has been initialized properly for QEMUFile.  This would make trivial to change the protocol format to anything else.
** Continuous VMState testing (GSOC project)
Add a new flag that during normal operation at random intervals:
*** stops the VM
*** saves all device state to a buffer
*** reset all devices
*** load all device state from that buffer
This way, we could test that we can migrate at any moment.


== TODO Long Term ==  
== TODO Long Term ==  

Revision as of 11:18, 23 April 2014

Summary

Migration roadmap.

Owner

  • Name: Juan Quintela
  • Email: quintela@redhat.com

Detailed Summary

This page describes what are the changes planned for migration and who is supposed to do each of the changes. If you want to collaborate on any of the items don't doubt to contact me directly or asking on the qemu mailing list.

Status

This is the roadmap, features are integrated upstream as they are done.

ToDo list

  • use TLS for communication (Volunteer?)

Right now all migration communication are done through clear channels. If you need to encrypt the channel, you need to use an external program. The problem with this is the performance loss. We need to transfer all data to another program, and then to the network.

  • Improve migration bitmap handling (Volunteer?)
    • Split bitmap use. We always use all bitmaps, VGA, CODE & Migration, independently of what we are doing. We could improve it with:
      • VGA: only add it to VGA framebuffers
      • MIGRATION: We only need to allocate/handle it during migration.
      • CODE: Only needed with TCG, no need at all for KVM
  • KVM migration bitmap (Volunteer?)
    • We could use the native bitmap format, and change/improve kernel to only set bits for dirty pages, not cleaning clean ones.
    • We could change kernel log code to set the bitmap for "used" pages when we start logging, this would allow us to not migration zero pages at all, right now we have to "allocate" the pages, to check that they are zero, and then sent them as zero pages
  • Abstract QEMUFile use (Google Summer of Code Project)

We can change QEMUFile use to something like:

struct MigrationChannel {
    void *opaque;
    uint32 get_uint32(struct MigrationChannel *)
    int put_uint32(structMigrationCHannel *)
/* the same for all the basic types */
}

And then change all ocurrences of:

qemu_get_sbe32(f, &foo);

into

foo = MC->get_uint32(MC);

Where migration channel has been initialized properly for QEMUFile. This would make trivial to change the protocol format to anything else.

    • Continuous VMState testing (GSOC project)

Add a new flag that during normal operation at random intervals:

      • stops the VM
      • saves all device state to a buffer
      • reset all devices
      • load all device state from that buffer

This way, we could test that we can migrate at any moment.

TODO Long Term

  • Finish conversion to VMState. Pending things are:
    • send generated fields
    • rebase cpu ports to latest (need previous one)
    • virtio: exist very old version (very old means as of more than 1 year ago). Problem is how to describe lists easily in VMState
    • slirp: some patches exist, same previous problem, how to handle easily lists. Slirp is basically list of lists of lists.
    • misc devices: almost all of them don't work on a migrated platform, so we could change them.
  • Protocol changes
    • Add size + checksum to sections. This is one incompatible change and needs further thought.
    • Make embedded sections real sections, with headers. This will allow us to version internal state.
    • Unit testing. In colaboration with qdev, allow devices to be tested alone with old/new migration versions/subsections.
  • Improve testing
    • How to be sure that ideas we are compatible (or not) with previous versions

An automated way of detecting this is needed.


  • Define target machine in the monitor

This would allow us to sent the configuration through the migration channel. This needs very big changes in qemu, but we are heading on that direction.

Code

The code still not merged is currently kept in several branches of this git repository: