Predictive auto-scaling with OpenStack Monasca

Lanciano, Giacomo; Galli, Filippo; Cucinotta, Tommaso; Bacciu, Davide; Passarella, Andrea

doi:10.1145/3468737.3494104

Cloud auto-scaling mechanisms are typically based on reactive automation rules that scale a cluster whenever some metric, e.g., the average CPU usage among instances, exceeds a predefined threshold. Tuning these rules becomes particularly cumbersome when scaling-up a cluster involves non-negligible times to bootstrap new instances, as it happens frequently in production cloud services. To deal with this problem, we propose an architecture for auto-scaling cloud services based on the status in which the system is expected to evolve in the near future. Our approach leverages on time-series forecasting techniques, like those based on machine learning and artificial neural networks, to predict the future dynamics of key metrics, e.g., resource consumption metrics, and apply a threshold-based scaling policy on them. The result is a predictive automation policy that is able, for instance, to automatically anticipate peaks in the load of a cloud application and trigger ahead of time appropriate scaling actions to accommodate the expected increase in traffic. We prototyped our approach as an open-source OpenStack component, which relies on, and extends, the monitoring capabilities offered by Monasca, resulting in the addition of predictive metrics that can be leveraged by orchestra- tion components like Heat or Senlin. We show experimental results using a recurrent neural network and a multi-layer perceptron as predictor, which are compared with a simple linear regression and a traditional non-predictive auto-scaling policy. However, the proposed framework allows for the easy customization of the prediction policy as needed

Predictive auto-scaling with OpenStack Monasca

Lanciano, Giacomo;Galli, Filippo;Cucinotta, Tommaso;Bacciu, Davide;Passarella, Andrea

2021

Abstract

Cloud auto-scaling mechanisms are typically based on reactive automation rules that scale a cluster whenever some metric, e.g., the average CPU usage among instances, exceeds a predefined threshold. Tuning these rules becomes particularly cumbersome when scaling-up a cluster involves non-negligible times to bootstrap new instances, as it happens frequently in production cloud services. To deal with this problem, we propose an architecture for auto-scaling cloud services based on the status in which the system is expected to evolve in the near future. Our approach leverages on time-series forecasting techniques, like those based on machine learning and artificial neural networks, to predict the future dynamics of key metrics, e.g., resource consumption metrics, and apply a threshold-based scaling policy on them. The result is a predictive automation policy that is able, for instance, to automatically anticipate peaks in the load of a cloud application and trigger ahead of time appropriate scaling actions to accommodate the expected increase in traffic. We prototyped our approach as an open-source OpenStack component, which relies on, and extends, the monitoring capabilities offered by Monasca, resulting in the addition of predictive metrics that can be leveraged by orchestra- tion components like Heat or Senlin. We show experimental results using a recurrent neural network and a multi-layer perceptron as predictor, which are compared with a simple linear regression and a traditional non-predictive auto-scaling policy. However, the proposed framework allows for the easy customization of the prediction policy as needed

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno di pubblicazione
	
				2021
			
	Settore Scientifico Disciplinare (validi fino a 24/06/2024)
	
				Settore ING-INF/05 - Sistemi di Elaborazione delle Informazioni
			
	Titolo del Convegno
	
				UCC '21: 2021 IEEE/ACM 14th International Conference on Utility and Cloud Computing
			
	Luogo del Convegno
	
				Leicester
			
	Periodo del Convegno
	
				6-9 dicembre 2021
			
	Titolo del Volume
	
				Proceedings of the 14th IEEE/ACM International Conference on Utility and Cloud Computing
			
	Editore
	
				Association for Computing Machinery
			
	ISBN
	
				9781450385640
			
	DOI
	
				https://dx.doi.org/10.1145/3468737.3494104
			
	Parole chiave
	
				Elasticity auto-scaling; Time-series forecasting; Predictive operations; OpenStack
			
	Appare nelle tipologie:
	
				4.1 Contributo in Atti di convegno

File in questo prodotto:

File	Dimensione	Formato
Lanciano et al. - 2021 - Predictive auto-scaling with OpenStack Monasca.pdf Accesso chiuso Tipologia: Published version Licenza: Non pubblico Dimensione 2.06 MB Formato Adobe PDF Richiedi una copia	2.06 MB	Adobe PDF	Richiedi una copia