Publications Search

Assessing the role of mini-applications in predicting key performance characteristics of scientific and engineering applications

Journal of Parallel and Distributed Computing

Barrett, R.F.; Crozier, Paul; Doerfler, Douglas W.; Heroux, Michael A.; Lin, Paul T.; Thornquist, Heidi K.; Trucano, Timothy G.; Vaughan, Courtenay T.

Computational science and engineering application programs are typically large, complex, and dynamic, and are often constrained by distribution limitations. As a means of making tractable rapid explorations of scientific and engineering application programs in the context of new, emerging, and future computing architectures, a suite of "miniapps" has been created to serve as proxies for full scale applications. Each miniapp is designed to represent a key performance characteristic that does or is expected to significantly impact the runtime performance of an application program. In this paper we introduce a methodology for assessing the ability of these miniapps to effectively represent these performance issues. We applied this methodology to three miniapps, examining the linkage between them and an application they are intended to represent. Herein we evaluate the fidelity of that linkage. This work represents the initial steps required to begin to answer the question, "Under what conditions does a miniapp represent a key performance characteristic in a full app?"

More Details

TYPE Journal Article YEAR 2015

Scopus OSTI DOI

Trinity Benchmarks on the Intel Xeon Phi (Knights Corner)

Rajan, Mahesh; Doerfler, Douglas W.; Hammond, Simon

This report documents the early experiences with porting and performance analysis of the Tri-Lab Trinity benchmark applications on Intel Xeon Phi (Knights Corner) (KNC) processor. KNC, the second generation of the Intel Many Integrated Core (MIC) architectures, uses a large number of small P54C-x86 cores with wide vector units and is deployed as PCI bus attached process accelerators. Sandia has experimental test beds of small InifiniBand clusters and workstations to investigate the performance of the MIC architecture. On these experimental test beds the programming models that may be investigated are "offload", "symmetric" and "native". Among these program usage models our primary interest is in the so called "native" mode, because the planned Trinity system to be deployed in 2016 using the next generation MIC processor architecture called Knights Landing would be self-hosted. Trinity / NERSC-8 benchmark programs cover a variety of scientific disciplines and they were used to guide the procurement of these systems. Architectures such as the Intel MIC are well suited to study evolving processor architectures and a usage model commonly referred to as MPI + X that facilitates migration of our applications to use both coarse grain and fine grain parallelism. Our focus with the applications selected is on the efficacy of algorithms in these applications to take advantage of features like: large number of cores, wide vector units, higher-bandwidth and deeper memory sub-system. This is a first step towards understanding applications, algorithms and programming environments for Trinity and future exascale computing systems.

More Details

TYPE SAND Report YEAR 2014

DOI OSTI

Trinity Advanced Technology System Overview

Bays, Nathan R.; Doerfler, Douglas W.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2014

OSTI

First Experiences with 64-bit ARM Moonshot

Doerfler, Douglas W.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2014

OSTI

Trinity Benchmarks on Xeon Phi (Knights Corner)

Rajan, Mahesh; Doerfler, Douglas W.; Hammond, Simon; Trott, Christian R.; Barrett, Richard F.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2014

OSTI

AN EVALUATION OF 64-BIT ARM FOR USE IN HIGH-PERFORMANCE MODELING AND SIMULATION ARCHITECTURE

Doerfler, Douglas W.; Bradicich, Tom; Chatterjee, Mitrajit

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2014

OSTI

Trinity Advanced Technology System Overview

Doerfler, Douglas W.

Abstract not provided.

More Details

TYPE Presentation YEAR 2014

OSTI OSTI

Experiences with Sandia National Laboratories HPC applications and MPI performance

Rajan, Mahesh; Doerfler, Douglas W.; Barrett, Richard F.; Stevenson, Joel O.; Agelastos, Anthony M.; Shaw, Ryan; Meyer, Harold E.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2014

OSTI

The Role of Advanced Technology Systems in the ASC Platform Strategy

Doerfler, Douglas W.

Abstract not provided.

More Details

TYPE Conference YEAR 2014

OSTI

Trinity: Next-Generation Supercomputer for the ASC Program

Doerfler, Douglas W.

Abstract not provided.

More Details

TYPE Conference YEAR 2014

OSTI

Division 1000 News Notes Submission Form

Doerfler, Douglas W.

Abstract not provided.

More Details

TYPE Report YEAR 2013

OSTI

Trinity and NERSC8 Technical Requirements Review

Doerfler, Douglas W.

Abstract not provided.

More Details

TYPE Presentation YEAR 2013

OSTI

Experiences with Xeon Phi

Hammond, Simon; Rajamanickam, Sivasankaran; Ang, James A.; Barrett, Richard F.; Doerfler, Douglas W.; Heroux, Michael A.; Laros, James H.

Abstract not provided.

More Details

TYPE Conference YEAR 2013

OSTI

Assessing the predictive capabilities of mini-applications

Proceedings - 2012 SC Companion: High Performance Computing, Networking Storage and Analysis, SCC 2012

Barrett, Richard F.; Crozier, Paul; Doerfler, Douglas W.; Hammond, Simon; Heroux, Michael A.; Lin, Paul T.; Trucano, Timothy G.; Vaughan, Courtenay T.; Williams, Alan B.

The push to exascale computing is informed by the assumption that the architecture, regardless of the specific design, will be fundamentally different from petascale computers. The Mantevo project has been established to produce a set of proxies, or 'miniapps,' which enable rapid exploration of key performance issues that impact a broad set of scientific applications programs of interest to ASC and the broader HPC community. Understanding the conditions under which a miniapp can be confidently used as predictive of an applications' behavior must be clearly elucidated. Toward this end, we have developed a methodology for assessing the predictive capabilities of application proxies. Adhering to the spirit of experimental validation, our approach provides a framework for examining data from the application with that provided by their proxies. In this poster we present this methodology, and apply it to three miniapps developed by the Mantevo project. © 2012 IEEE.

More Details

TYPE Conference YEAR 2012

OSTI Scopus

Navigating an evolutionary fast path to exascale

Proceedings - 2012 SC Companion: High Performance Computing, Networking Storage and Analysis, SCC 2012

Barrett, Richard F.; Hammond, Simon; Vaughan, Courtenay T.; Doerfler, Douglas W.; Heroux, Michael A.

The computing community is in the midst of a disruptive architectural change. The advent of manycore and heterogeneous computing nodes forces us to reconsider every aspect of the system software and application stack. To address this challenge there is a broad spectrum of approaches, which we roughly classify as either revolutionary or evolutionary. With the former, the entire code base is re-written, perhaps using a new programming language or execution model. The latter, which is the focus of this work, seeks a piecewise path of effective incremental change. The end effect of our approach will be revolutionary in that the control structure of the application will be markedly different in order to utilize single-instruction multiple-data/thread (SIMD/SIMT), manycore and heterogeneous nodes, but the physics code fragments will be remarkably similar. Our approach is guided by a set of mission driven applications and their proxies, focused on balancing performance potential with the realities of existing application code bases. Although the specifics of this process have not yet converged, we find that there are several important steps that developers of scientific and engineering application programs can take to prepare for making effective use of these challenging platforms. Aiding an evolutionary approach is the recognition that the performance potential of the architectures is, in a meaningful sense, an extension of existing capabilities: vectorization, threading, and a re-visiting of node interconnect capabilities. Therefore, as architectures, programming models, and programming mechanisms continue to evolve, the preparations described herein will provide significant performance benefits on existing and emerging architectures. © 2012 IEEE.

More Details

TYPE Conference YEAR 2012

OSTI Scopus