TY - GEN
T1 - Hierarchical and frequency-aware model predictive control for bare-metal cloud applications
AU - Ogawa, Yukio
AU - Hasegawa, Go
AU - Murata, Masayuki
N1 - Publisher Copyright:
©2018 IEEE
PY - 2018/7/2
Y1 - 2018/7/2
N2 - Bare-metal cloud provides a dedicated set of physical machines (PMs) and enables both PMs and virtual machines (VMs) on the PMs to be scaled in/out dynamically. However, to increase efficiency of the resources and reduce violations of service level agreements (SLAs), resources need to be scaled quickly to adapt to workload changes, which results in high reconfiguration overhead, especially for the PMs. This paper proposes a hierarchical and frequency-aware auto-scaling based on Model Predictive Control, which enable us to achieve an optimal balance between resource efficiency and overhead. Moreover, when performing high-frequency resource control, the proposed technique improves the timing of reconfigurations for the PMs without increasing the number of them, while it increases the reallocations for the VMs to adjust the redundant capacity among the applications; this process improves the resource efficiency. Through trace-based numerical simulations, we demonstrate that when the control frequency is increased to 16 times per hour, the VM insufficiency causing SLA violations is reduced to a minimum of 0.1% per application without increasing the VM pool capacity.
AB - Bare-metal cloud provides a dedicated set of physical machines (PMs) and enables both PMs and virtual machines (VMs) on the PMs to be scaled in/out dynamically. However, to increase efficiency of the resources and reduce violations of service level agreements (SLAs), resources need to be scaled quickly to adapt to workload changes, which results in high reconfiguration overhead, especially for the PMs. This paper proposes a hierarchical and frequency-aware auto-scaling based on Model Predictive Control, which enable us to achieve an optimal balance between resource efficiency and overhead. Moreover, when performing high-frequency resource control, the proposed technique improves the timing of reconfigurations for the PMs without increasing the number of them, while it increases the reallocations for the VMs to adjust the redundant capacity among the applications; this process improves the resource efficiency. Through trace-based numerical simulations, we demonstrate that when the control frequency is increased to 16 times per hour, the VM insufficiency causing SLA violations is reduced to a minimum of 0.1% per application without increasing the VM pool capacity.
KW - Autoscaling
KW - Bare-metal cloud
KW - Frequency-aware
KW - Model Predictive Control
KW - Resource reconfiguration
UR - http://www.scopus.com/inward/record.url?scp=85061707055&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85061707055&partnerID=8YFLogxK
U2 - 10.1109/UCC.2018.00010
DO - 10.1109/UCC.2018.00010
M3 - Conference contribution
AN - SCOPUS:85061707055
T3 - Proceedings - 11th IEEE/ACM International Conference on Utility and Cloud Computing, UCC 2018
SP - 11
EP - 20
BT - Proceedings - 11th IEEE/ACM International Conference on Utility and Cloud Computing, UCC 2018
A2 - Sill, Alan
A2 - Spillner, Josef
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 11th IEEE/ACM International Conference on Utility and Cloud Computing, UCC 2018
Y2 - 17 December 2018 through 20 December 2018
ER -