Performance and Portability in Computational Models and Frameworks

Report on Current Developments in the Research Area

General Direction of the Field

The recent advancements in the research area are primarily focused on enhancing the performance, scalability, and portability of computational models and frameworks across diverse hardware and software environments. There is a strong emphasis on leveraging modern programming paradigms, such as Domain-Specific Languages (DSLs) and heterogeneous computing, to achieve better performance and scalability in complex simulations and workloads. Additionally, there is a growing interest in integrating high-performance computing (HPC) with cloud-native technologies, enabling more flexible and efficient execution of workloads across different platforms.

One of the key trends is the development of performance-portable mini-applications (mini-apps) that abstract complex simulations, such as Lattice-Boltzmann Method (LBM) simulations, to evaluate and optimize performance across various hardware architectures. These mini-apps are designed to be portable across different computing systems, including GPUs and CPUs, and can provide valuable insights for optimizing large-scale frameworks.

Another significant development is the exploration of container orchestration frameworks that enable the deployment of parallel computing workloads in a portable, scalable, and secure manner. These frameworks aim to address the challenges of portability, scalability, and security in parallel computing by leveraging containerization technologies and integrating them with existing HPC job management systems.

Overall, the field is moving towards more integrated and flexible solutions that can adapt to different hardware and software environments, while also ensuring high performance, scalability, and security.

Noteworthy Papers

  • miniLB: A Performance Portability Study of Lattice-Boltzmann Simulations: Introduces the first SYCL-based LBM mini-app, demonstrating performance portability across diverse hardware.
  • Syndeo: Portable Ray Clusters with Secure Containerization: Presents a framework for container orchestration of Ray on Slurm, addressing portability, scalability, and security in parallel computing.

Sources

Performance and scaling of the LFRic weather and climate model on different generations of HPE Cray EX supercomputers

miniLB: A Performance Portability Study of Lattice-Boltzmann Simulations

Running Cloud-native Workloads on HPC with High-Performance Kubernetes

Syndeo: Portable Ray Clusters with Secure Containerization

Built with on top of