The field of cloud computing is moving towards more efficient and dynamic resource management, with a focus on optimizing performance, reducing energy consumption, and ensuring quality of service. Recent developments have explored the use of machine learning, deep reinforcement learning, and genetic algorithms to achieve these goals. Notably, research has concentrated on improving load balancing, autoscaling, and virtual machine scheduling in cloud environments. Additionally, there is a growing interest in addressing the challenges of transactional cloud applications and ensuring state consistency, durability, and lifecycle management. Overall, the field is advancing towards more intelligent and adaptive resource management systems. Noteworthy papers include: Symmetry-Preserving Architecture for Multi-NUMA Environments, which proposes a novel deep reinforcement learning approach for dynamic VM scheduling. Scalability Optimization in Cloud-Based AI Inference Services, which presents a comprehensive framework for real-time load balancing and autoscaling. Optimized Cloud Resource Allocation Using Genetic Algorithms, which introduces a GA-based approach for VM placement and consolidation to minimize power usage while maintaining QoS constraints.
Advances in Cloud Resource Management and Optimization
Sources
Joint Optimization of Controller Placement and Switch Assignment in SDN-based LEO Satellite Networks
Symmetry-Preserving Architecture for Multi-NUMA Environments (SPANE): A Deep Reinforcement Learning Approach for Dynamic VM Scheduling
Is Intelligence the Right Direction in New OS Scheduling for Multiple Resources in Cloud Environments?
Scalability Optimization in Cloud-Based AI Inference Services: Strategies for Real-Time Load Balancing and Automated Scaling