Page 2 – Center for Computing Research (CCR)

For the FY15 ASC L2 Trilab Codesign milestone, Sandia National Laboratories performed two main studies. The first study investigated three topics (performance, cross-platform portability, and programmer productivity) when using OpenMP directives and the RAJA and Kokkos programming models, available from LLNL and SNL respectively. The focus of this first study was the LULESH mini-application developed and maintained by LLNL. In the coming sections of the report, the reader will find performance comparisons (and a demonstration of portability) for a variety of mini-application implementations produced during this study with varying levels of optimization. Of note, the implementations used include optimizations across a number of programming models, to help ensure that claims that Kokkos can provide native-class application performance are valid. The second study performed during FY15 is a performance assessment of the MiniAero mini-application developed by Sandia. This mini-application was developed by the SIERRA Thermal-Fluid team at Sandia for the purpose of learning the Kokkos programming model, and so is available in only a single implementation. For this report, we studied its performance and scaling on a number of machines, with the intent of providing insight into potential performance issues that may be experienced when similar algorithms are deployed on the forthcoming Trinity ASC ATS platform.

TYPE SAND Report YEAR 2015

OSTI DOI

ASCR Computer Architecture Lab

Hammond, Simon D.

Abstract not provided.

TYPE Presentation YEAR 2015

OSTI

ASCR Computer Architecture Laboratory

Hammond, Simon D.; Ang, James A.; Rodrigues, Arun; Hemmert, Karl S.; Voskuilen, Gwendolyn R.; Cook, Jeanine C.

Abstract not provided.

TYPE Presentation YEAR 2015

OSTI

Assessing the predictive capabilities of mini-applications

Barrett, Richard F.; Crozier, Paul C.; Doerfler, Douglas W.; Hammond, Simon D.; Heroux, Michael A.; Lin, Paul L.; Trucano, Timothy G.; Vaughan, Courtenay T.; Williams, Alan B.

Abstract not provided.

TYPE Conference YEAR 2012

OSTI

Astra

Laros, James H.; Pedretti, Kevin P.; Hammond, Simon D.; Alvin, Kenneth F.

Abstract not provided.

TYPE Conference Poster YEAR 2019

OSTI

Astra: The World's First Petascale Arm Supercomputer

Pedretti, Kevin P.; Laros, James H.; Hammond, Simon D.

Abstract not provided.

TYPE Presentation YEAR 2019

OSTI

Asynchronous Many-Task Programming Models for Next Generation Platforms

Wilke, Jeremiah J.; Bettencourt, Matthew T.; Bova, S.W.; Franko, Ken F.; Gamell, Marc G.; Grant, Ryan E.; Hammond, Simon D.; Hollman, David S.; Knight, Samuel K.; Kolla, Hemanth K.; Lin, Paul L.; Olivier, Stephen L.; Sjaardema, Gregory D.; Slattengren, Nicole S.; Teranishi, Keita T.; Bennett, Janine C.; Clay, Robert L.

Abstract not provided.

TYPE Presentation YEAR 2015

OSTI

Balancing Productivity Portability and Performance - The Challenge for Programming Models at Exascale?

Hammond, Simon D.

Abstract not provided.

TYPE Conference Poster YEAR 2016

OSTI

Balar: A SST GPU Component for Performance Modeling and Profiling

Hughes, Clayton H.; Hammond, Simon D.; Khairy, Mahmoud K.; Zhang, Mengchi Z.; Green, Roland G.; Rogers, Timothy R.; Hoekstra, Robert J.

Programmable accelerators have become commonplace in modern computing systems. Advances in programming models and the availability of massive amounts of data have created a space for massively parallel accelerators capable of maintaining context for thousands of concurrent threads resident on-chip. These threads are grouped and interleaved on a cycle-by-cycle basis among several massively parallel computing cores. One path for the design of future supercomputers relies on an ability to model the performance of these massively parallel cores at scale. The SST framework has been proven to scale up to run simulations containing tens of thousands of nodes. A previous report described the initial integration of the open-source, execution-driven GPU simulator, GPGPU-Sim, into the SST framework. This report discusses the results of the integration and how to use the new GPU component in SST. It also provides examples of what the component can be used to analyze and a correlation study showing how closely the simulated execution matches that of an NVIDIA V100 GPU when running kernels and mini-apps.

TYPE SAND Report YEAR 2019

OSTI DOI

Bowman and a Path to Trinity

Hammond, Simon D.

Abstract not provided.

TYPE Conference Poster YEAR 2017

OSTI

Building 725 Astra and Vanguard

Lacy, Susan L.; Noe, John P.; Ogden, Jeffry B.; Hammond, Simon D.

Abstract not provided.

TYPE Other Report YEAR 2018

OSTI DOI

Challenges of Codesign

Hammond, Simon D.

Abstract not provided.

TYPE Presentation YEAR 2015

OSTI

Characterizing Mini-App Workloads

Hammond, Simon D.

Abstract not provided.

TYPE Presentation YEAR 2014

OSTI

Chronicles of Astra: Challenges and Lessons from the First Petascale Arm Supercomputer

International Conference for High Performance Computing, Networking, Storage and Analysis, SC

Pedretti, Kevin P.; Younge, Andrew J.; Hammond, Simon D.; Laros, James H.; Curry, Matthew J.; Aguilar, Michael J.; Hoekstra, Robert J.; Brightwell, Ronald B.

Arm processors have been explored in HPC for several years; however, there has not yet been a demonstration of viability for supporting large-scale production workloads. In this paper, we offer a retrospective on the process of bringing up Astra, the first petascale supercomputer based on 64-bit Arm processors, and validating its ability to run production HPC applications. Through this process, several technology gaps arising from immaturity were addressed, including software stack enablement, Linux bugs at scale, thermal management issues, power management capabilities, and advanced container support. From this experience, we formulate several lessons learned that contributed to the successful deployment of Astra. These insights can help accelerate the deployment and maturation of other first-seen HPC technologies. With Astra now supporting many users running a diverse set of production applications at multi-thousand-node scales, we believe this constitutes strong supporting evidence that Arm is a viable technology for even the largest-scale supercomputer deployments.

TYPE Conference Poster YEAR 2020

Scopus OSTI

Coarse-Grain Simulation of Networks-on-Chip using SST/Macro

Hendry, Gilbert H.; Hammond, Simon D.

Abstract not provided.

TYPE Conference YEAR 2011

OSTI

Codesign at Sandia: LULESH and MiniAero

Trott, Christian R.; Hammond, Simon D.

Abstract not provided.

TYPE Presentation YEAR 2015

OSTI

Codesign for Production Applications

Hammond, Simon D.; Trott, Christian R.; Vaughan, Courtenay T.; Dinge, Dennis D.; Lin, Paul L.; Pase, Douglas M.; Benner, R.E.; Cook, Jeanine C.; Hoekstra, Robert J.

Abstract not provided.

TYPE Conference Poster YEAR 2017

OSTI

Codesign for the Masses

Lewis, Cannada L.; Hammond, Simon D.; Wilke, Jeremiah J.

In this position paper, we address challenges and opportunities relating to the design and codesign of application-specific circuits. Given our background as computational scientists, our perspective is that of a highly motivated application developer, as opposed to that of a career computer architect.

TYPE Other Report YEAR 2021

OSTI DOI