HAVmS: Highly Available Virtual Machine Computer System Fault Tolerant with Automatic Failback and Close to Zero Downtime

Authors

  • Memmo Federici Istituto di Astrosica e Planetologia Spaziali, INAF IAPS Via fosso del Cavaliere 100, 00133 Roma, Italy
  • Carlo Gaibisso Istituto di Analisi dei Sistemi ed Informatica "Antonio Ruberti", IASI-CNR Viale Manzoni 30, 00185 Roma , Italy
  • Bruno L. Martino Associated INAF IAPS

DOI:

https://doi.org/10.14311/APP.2014.01.0278

Abstract

In scientic computing, systems often manage computations that require continuous acquisition of of satellite data and the management of large databases, as well as the execution of analysis software and simulation models (e.g. Monte Carlo or molecular dynamics cell simulations) which may require several weeks of continuous run. These systems, consequently, should ensure the continuity of operation even in case of serious faults. HAVmS (High Availability Virtual machine System) is a highly available, "fault tolerant" system with zero downtime in case of fault. It is based on the use of Virtual Machines and implemented by two servers with similar characteristics. HAVmS, thanks to the developed software solutions, is unique in its kind since it automatically failbacks once faults have been fixed. The system has been designed to be used both with professional or inexpensive hardware and supports the simultaneous execution of multiple services such as: web, mail, computing and administrative services, uninterrupted computing, data base management. Finally the system is cost effective adopting exclusively open source solutions, is easily manageable and for general use.

Downloads

Download data is not yet available.

Downloads

Published

2014-12-04

How to Cite

HAVmS: Highly Available Virtual Machine Computer System Fault Tolerant with Automatic Failback and Close to Zero Downtime. (2014). Acta Polytechnica CTU Proceedings, 1(1), 278-282. https://doi.org/10.14311/APP.2014.01.0278