[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
research-article

Post-copy live migration of virtual machines

Published: 31 July 2009 Publication History

Abstract

We present the design, implementation, and evaluation of post-copy based live migration for virtual machines (VMs) across a Gigabit LAN. Post-copy migration defers the transfer of a VM's memory contents until after its processor state has been sent to the target host. This deferral is in contrast to the traditional pre-copy approach, which first copies the memory state over multiple iterations followed by a final transfer of the processor state. The post-copy strategy can provide a "win-win" by reducing total migration time while maintaining the liveness of the VM during migration. We compare post-copy extensively against the traditional pre-copy approach on the Xen Hypervisor. Using a range of VM workloads we show that post-copy improves several metrics including pages transferred, total migration time, and network overhead. We facilitate the use of post-copy with adaptive prepaging techniques to minimize the number of page faults across the network. We propose different prepaging strategies and quantitatively compare their effectiveness in reducing network-bound page faults. Finally, we eliminate the transfer of free memory pages in both pre-copy and post-copy through a dynamic self-ballooning (DSB) mechanism. DSB periodically reclaims free pages from a VM and significantly speeds up migration with negligible performance impact on VM workload.

References

[1]
Barham, P., Dragovic, B., Fraser, K., Hand, S., Harris, T., Ho, A., Neugebauer, R., Pratt, I., and Warfield, A. Xen and the art of virtualization. In Proc. of ACM SOSP 2003 (Oct. 2003).
[2]
Bradford, R., Kotsovinos, E., Feldmann, A., and Schiöberg, H. Live wide-area migration of virtual machines including local persistent state. In Proc. of the International Conference on Virtual Execution Environments (2007), pp. 169--179.
[3]
Clark, C., Fraser, K., Hand, S., Hansen, J., Jul, E., Limpach, C., Pratt, I., and Warfield, A. Live migration of virtual machines. In Network System Design and Implementation (2005).
[4]
Cully, B., Lefebvre, G., and Meyer, D. Remus: High availability via asynchronous virtual machine replication. In NSDI '07: Networked Systems Design and Implementation (2008).
[5]
Denning, P.J. The working set model for program behavior. Communications of the ACM 11, 5 (1968), 323--333.
[6]
Douglis, F. Transparent process migration in the Sprite operating system. Tech. rep., University of California at Berkeley, Berkeley, CA, USA, 1990.
[7]
Hand, S.M. Self-paging in the nemesis operating system. In OSDI'99, New Orleans, Louisiana, USA (1999), pp. 73--86.
[8]
Hansen, J., and Henriksen, A. Nomadic operating systems. In Master's thesis, Dept. of Computer Science, University of Copenhagen, Denmark (2002).
[9]
Hansen, J., and Jul, E. Self-migration of operating systems. In Proc. of ACM SIGOPS Europen Workshop, Leuven, Belgium (2004).
[10]
Hines, M., and Gopalan, K. MemX: Supporting large memory applications in Xen virtual machines. In Second International Workshop on Virtualization Technology in Distributed Computing (VTDC07), Reno, Nevada (2007).
[11]
Hines, M., and Gopalan, K. Post-copy based live virtual machine migration using adaptive pre-paging and dynamic self-ballooning. In Proceedings of ACM SIGPLAN/SIGOPS International Conference on Virtual Execution Environments (VEE), Washington, DC (March 2009).
[12]
Ho, R.S., Wang, C.-L., and Lau, F.C. Lightweight process migration and memory prefetching in OpenMosix. In Proc. of IPDPS (2008).
[13]
Kerrighed. http://www.kerrighed.org.
[14]
Kivity, A., Kamay, Y., and Laor, D. KVM: the linux virtual machine monitor. In Proc. of Ottawa Linux Symposium (2007).
[15]
Lagar-Cavilla, H.A., Whitney, J., Scannel, A., Rumble, S., Brudno, M., de Lara, E., and Satyanarayanan, M. Impromptu clusters for near-interactive cloud-based services. Tech. rep., CSRG-578, University of Toronto, June 2008.
[16]
Magenheimer, D. Add self-ballooning to balloon driver. Discussion on Xen Development mailing list and personal communication, April 2008.
[17]
Milojicic, D., Douglis, F., Paindaveine, Y., Wheeler, R., and Zhou, S. Process migration survey. ACM Computing Surveys 32(3) (Sep. 2000), 241--299.
[18]
MOSIX. http://www.mosix.org.
[19]
Nelson, M., Lim, B.-H., and Hutchins, G. Fast transparent migration for virtual machines. In Usenix, Anaheim, CA (2005), pp. 25--25.
[20]
Noack, M. Comparative evaluation of process migration algorithms. Master's thesis, Dresden University of Technology - Operating Systems Group, 2003.
[21]
OpenVZ. Container-based Virtualization for Linux, http://www.openvz.com/.
[22]
Oppenheimer, G., and Weizer, N. Resource management for a medium scale time-sharing operating system. Commun. ACM 11, 5 (1968), 313--322.
[23]
Osman, S., Subhraveti, D., Su, G., and Nieh, J. The design and implementation of Zap: A system for migrating computing environments. In Proc. of OSDI (2002), pp. 361--376.
[24]
Plank, J., Beck, M., Kingsley, G., and Li, K. Libckpt: Transparent checkpointing under UNIX. In Proc. of Usenix Annual Technical Conference, New Orleans, Louisiana (1998).
[25]
Richmond, M., and Hitchens, M. A new process migration algorithm. SIGOPS Oper. Syst. Rev. 31, 1 (1997), 31--42.
[26]
Roush, E.T. Fast dynamic process migration. In Intl. Conference on Distributed Computing Systems (ICDCS) (1996), p. 637.
[27]
Sapuntzakis, C., Chandra, R., and Pfaff, B. Optimizing the migration of virtual computers. In Proc. of OSDI (2002).
[28]
Satyanarayanan, M., and Gilbert, B. Pervasive personal computing in an internet suspend/resume system. IEEE Internet Computing 11, 2 (2007), 16--25.
[29]
Schmidt, B.K. Supporting Ubiquitous Computing with Stateless Consoles and Computation Caches. PhD thesis, Computer Science Dept., Stanford University, 2000.
[30]
Stellner, G. Cocheck: Checkpointing and process migration for MPI. In IPPS '1996 (Washington, DC, USA), pp. 526--531.
[31]
Thain, D., Tannenbaum, T., and Livny, M. Distributed computing in practice: the Condor experience. Concurr. Comput. : Pract. Exper. 17 (2005), 323--356.
[32]
Trivedi, K. An analysis of prepaging. Journal of Computing 22 (1979), 191--210.
[33]
Trivedi, K. On the paging performance of array algorithms. IEEE Transactions on Computers C-26, 10 (Oct. 1977), 938--947.
[34]
Trivedi, K. Prepaging and applications to array algorithms. IEEE Transactions on Computers C-25, 9 (Sept. 1976), 915--921.
[35]
Waldspurger, C. Memory resource management in VMWare ESX server. In Operating System Design and Implementation (OSDI 02), Boston, MA (Dec 2002).
[36]
Whitaker, A., Cox, R., and Shaw, M. Constructing services with interposable virtual hardware. In NSDI 2004 (2004), pp. 13--13.
[37]
Whitaker, A., Shaw, M., and Gribble, S. Scale and performance in the denali isolation kernel. In OSDI 2002, New York, NY, USA (2002), pp. 195--209.

Cited By

View all
  • (2025)VM consolidation steps in cloud computing: A perspective reviewSimulation Modelling Practice and Theory10.1016/j.simpat.2024.103034138(103034)Online publication date: Jan-2025
  • (2024)Live Data Replication to A Disaster Recovery Site Using XEN HypervisorComputer Science, Engineering and Technology10.46632/cset/1/1/71:1(54-61)Online publication date: 13-Aug-2024
  • (2024)Performance Analysis of Different Hypervisors Using Memory and Workloads in OS VirtualizationIntelligent Technologies for Automated Electronic Systems10.2174/9789815179514124010008(74-86)Online publication date: 4-Mar-2024
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM SIGOPS Operating Systems Review
ACM SIGOPS Operating Systems Review  Volume 43, Issue 3
July 2009
109 pages
ISSN:0163-5980
DOI:10.1145/1618525
Issue’s Table of Contents

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 31 July 2009
Published in SIGOPS Volume 43, Issue 3

Check for updates

Author Tags

  1. operating systems
  2. post-copy
  3. process migration
  4. virtual machines
  5. xen

Qualifiers

  • Research-article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)78
  • Downloads (Last 6 weeks)6
Reflects downloads up to 31 Dec 2024

Other Metrics

Citations

Cited By

View all
  • (2025)VM consolidation steps in cloud computing: A perspective reviewSimulation Modelling Practice and Theory10.1016/j.simpat.2024.103034138(103034)Online publication date: Jan-2025
  • (2024)Live Data Replication to A Disaster Recovery Site Using XEN HypervisorComputer Science, Engineering and Technology10.46632/cset/1/1/71:1(54-61)Online publication date: 13-Aug-2024
  • (2024)Performance Analysis of Different Hypervisors Using Memory and Workloads in OS VirtualizationIntelligent Technologies for Automated Electronic Systems10.2174/9789815179514124010008(74-86)Online publication date: 4-Mar-2024
  • (2024)Democratizing Container Live Migration for Enhanced Future Networks - A SurveyACM Computing Surveys10.1145/370443657:4(1-37)Online publication date: 14-Nov-2024
  • (2024)Custom Page Fault Handling With eBPFProceedings of the ACM SIGCOMM 2024 Workshop on eBPF and Kernel Extensions10.1145/3672197.3673432(71-73)Online publication date: 4-Aug-2024
  • (2024)UTwinVM: Reliable hints on the effects of hypervisor updates on VMs in the CloudProceedings of the 25th International Middleware Conference10.1145/3652892.3700752(103-116)Online publication date: 2-Dec-2024
  • (2024)TimeCloth: Fast Point-in-Time Database Recovery in The CloudCompanion of the 2024 International Conference on Management of Data10.1145/3626246.3653382(214-226)Online publication date: 9-Jun-2024
  • (2024)Cost-Efficient Cluster Migration of VNFs for Service Function Chain EmbeddingIEEE Transactions on Network and Service Management10.1109/TNSM.2023.328775721:1(979-993)Online publication date: 1-Feb-2024
  • (2024)Live Migration of Video Analytics Applications in Edge ComputingIEEE Transactions on Mobile Computing10.1109/TMC.2023.324653923:3(2078-2092)Online publication date: 1-Mar-2024
  • (2024)HEXO: Offloading Long-Running Compute- and Memory-Intensive Workloads on Low-Cost, Low-Power Embedded SystemsIEEE Transactions on Cloud Computing10.1109/TCC.2024.348217812:4(1415-1432)Online publication date: Oct-2024
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media