Publications Search

Two Years of Co-Design

Abstract not provided.

More Details

TYPE Report YEAR 2014

OSTI

SNAP: Strong scaling high fidelity molecular dynamics simulations on leadership-class computing platforms

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

Trott, Christian R.; Hammond, Simon; Thompson, Aidan P.

The rapidly improving compute capability of contemporary processors and accelerators is providing the opportunity for significant increases in the accuracy and fidelity of scientific calculations. In this paper we present performance studies of a new molecular dynamics (MD) potential called SNAP. The SNAP potential has shown great promise in accurately reproducing physics and chemistry not described by simpler potentials. We have developed new algorithms to exploit high single-node concurrency provided by three different classes of machine: the Titan GPU-based system operated by Oak Ridge National Laboratory, the combined Sequoia and Vulcan BlueGene/Q machines located at Lawrence Livermore National Laboratory, and the large-scale Intel Sandy Bridge system, Chama, located at Sandia. Our analysis focuses on strong scaling experiments with approximately 246,000 atoms over the range 1-122,880 nodes on Sequoia/Vulcan and 40-18,630 nodes on Titan. We compare these machine in terms of both simulation rate and power efficiency. We find that node performance correlates with power consumption across the range of machines, except for the case of extreme strong scaling, where more powerful compute nodes show greater efficiency. This study is a unique assessment of a challenging, scientifically relevant calculation running on several of the world's leading contemporary production supercomputing platforms. © 2014 Springer International Publishing.

More Details

TYPE Conference YEAR 2014

DOI DOI OSTI OSTI Scopus Scopus

Reducing the bulk of the bulk synchronous parallel model

Parallel Processing Letters

Barrett, Richard F.; Vaughan, Courtenay T.; Hammond, Simon

For over two decades the dominant means for enabling portable performance of computational science and engineering applications on parallel processing architectures has been the bulk-synchronous parallel programming (BSP) model. Code developers, motivated by performance considerations to minimize the number of messages transmitted, have typically pursued a strategy of aggregating message data into fewer, larger messages. Emerging and future high-performance architectures, especially those seen as targeting Exascale capabilities, provide motivation and capabilities for revisiting this approach. In this paper we explore alternative configurations within the context of a large-scale complex multi-physics application and a proxy that represents its behavior, presenting results that demonstrate some important advantages as the number of processors increases in scale.

More Details

TYPE Journal Article YEAR 2013

OSTI DOI

Application Memory Analysis

Hammond, Simon

Abstract not provided.

More Details

TYPE Report YEAR 2013

OSTI

The Path to Exascale Experiences porEng and debugging for Intel Xeon Phi

Hammond, Simon

Abstract not provided.

More Details

TYPE Report YEAR 2013

OSTI

Performance on Advanced Systems Test Beds

Trott, Christian R.; Hammond, Simon; Kelly, Suzanne M.; Laros, James H.; Ang, James A.

Abstract not provided.

More Details

TYPE Report YEAR 2013

OSTI

I don't wanna grow up... stuck at predictive capability maturity model level zero!

Rider, William J.; Kelly, Suzanne M.; Barrett, Richard F.; Hammond, Simon

Abstract not provided.

More Details

TYPE Conference YEAR 2013

OSTI

NNSA/ASC Test Bed Update

Hammond, Simon; Barrett, Richard F.; Vaughan, Courtenay T.; Trott, Christian R.; Laros, James H.; Kelly, Suzanne M.; Ang, James A.

Abstract not provided.

More Details

TYPE Report YEAR 2013

OSTI

SST and Test-Bed Hack-a-thon

Hammond, Simon; Rodrigues, Arun; Kelly, Suzanne M.; Ang, James A.

Abstract not provided.

More Details

TYPE Report YEAR 2013

OSTI

A Glimpse into the the Next Decade of Supercomputing: An Overview of Sandia's Advanced Test Bed Project

Hammond, Simon

Abstract not provided.

More Details

TYPE Report YEAR 2013

OSTI

I Don't Wanna Grow Up...Stuck at Predictive Capability Maturity Model Level Zero

Kelly, Suzanne M.; Rider, William J.; Barrett, Richard F.; Hammond, Simon

Abstract not provided.

More Details

TYPE Conference YEAR 2013

OSTI

Application Explorations for Future Interconnects

Barrett, Richard F.; Vaughan, Courtenay T.; Hammond, Simon

Abstract not provided.

More Details

TYPE Conference YEAR 2013

OSTI

GPU Acceleration of Data Assembly in Finite Element Methods and Its Energy Implications

Barrett, Richard F.; Hammond, Simon; Hsieh, Mingyu N.

Abstract not provided.

More Details

TYPE Conference YEAR 2013

OSTI

The Impact of Hybrid-Core Processors on MPI Message Rate

Barrett, Brian; Brightwell, Ronald B.; Hemmert, Karl S.; Hammond, Simon

Abstract not provided.

More Details

TYPE Conference YEAR 2013

OSTI

SST (micro) Introduction - Presentation to SST Hack-a-thon Attendees

Rodrigues, Arun; Hammond, Simon; Kelly, Suzanne M.

Abstract not provided.

More Details

TYPE Report YEAR 2013

OSTI

SST Hack-a-thon Component Overview

Hammond, Simon

Abstract not provided.

More Details

TYPE Report YEAR 2013

OSTI

SST and ExMatEx Update

Hammond, Simon; Rodrigues, Arun; Kelly, Suzanne M.; Vandyke, John P.

Abstract not provided.

More Details

TYPE Presentation YEAR 2013

OSTI

Mantevo Suite 1.0

Barrett, Richard F.; Willenbring, James M.; Hammond, Simon

Abstract not provided.

More Details

TYPE Presentation YEAR 2013

OSTI

Experiences with Xeon Phi

Hammond, Simon; Rajamanickam, Sivasankaran; Ang, James A.; Barrett, Richard F.; Doerfler, Douglas W.; Heroux, Michael A.; Laros, James H.

Abstract not provided.

More Details

TYPE Conference YEAR 2013

OSTI

Application Explorations for Future Interconnects

Barrett, Richard F.; Vaughan, Courtenay T.; Hammond, Simon

Abstract not provided.

More Details

TYPE Conference YEAR 2013

OSTI

Using Miniapplications in a Mantevo Framework for Optimizing Sandia's SPARC CFD Code on Multi-Core Many-Core and GPU-Accelerated Platforms

Barrett, Richard F.; Laros, James H.; Hammond, Simon

Abstract not provided.

More Details

TYPE Conference YEAR 2013

OSTI

The impact of hybrid-core processors on MPI message rate

ACM International Conference Proceeding Series

Barrett, Brian; Brightwell, Ronald B.; Hammond, Simon; Hemmert, Karl S.

Power and energy concerns are motivating chip manufacturers to consider future hybrid-core processor designs that combine a small number of traditional cores optimized for single-thread performance with a large number of simpler cores optimized for throughput performance. This trend is likely to impact the way compute resources for network protocol processing functions are allocated and managed. In particular, the performance of MPI match processing is critical to achieving high message throughput. In this paper, we analyze the ability of simple and more complex cores to perform MPI matching operations for various scenarios in order to gain insight into how MPI implementations for future hybrid-core processors should be designed.

More Details

TYPE Conference YEAR 2013

OSTI Scopus

Finding an On--Ramp to the Exascale Highway

Hammond, Simon

Abstract not provided.

More Details

TYPE Report YEAR 2012

OSTI

Using Miniapplications in a Mantevo Framework for Optimizing Sandia's SPARC CFD Code on Multi-Core Many-Core and GPU-Accelerated Compute Platforms

Hammond, Simon; Laros, James H.

Abstract not provided.

More Details

TYPE Conference YEAR 2012

OSTI

Navigating an evolutionary fast path to exascale

Proceedings - 2012 SC Companion: High Performance Computing, Networking Storage and Analysis, SCC 2012

Barrett, Richard F.; Hammond, Simon; Vaughan, Courtenay T.; Doerfler, Douglas W.; Heroux, Michael A.

The computing community is in the midst of a disruptive architectural change. The advent of manycore and heterogeneous computing nodes forces us to reconsider every aspect of the system software and application stack. To address this challenge there is a broad spectrum of approaches, which we roughly classify as either revolutionary or evolutionary. With the former, the entire code base is re-written, perhaps using a new programming language or execution model. The latter, which is the focus of this work, seeks a piecewise path of effective incremental change. The end effect of our approach will be revolutionary in that the control structure of the application will be markedly different in order to utilize single-instruction multiple-data/thread (SIMD/SIMT), manycore and heterogeneous nodes, but the physics code fragments will be remarkably similar. Our approach is guided by a set of mission driven applications and their proxies, focused on balancing performance potential with the realities of existing application code bases. Although the specifics of this process have not yet converged, we find that there are several important steps that developers of scientific and engineering application programs can take to prepare for making effective use of these challenging platforms. Aiding an evolutionary approach is the recognition that the performance potential of the architectures is, in a meaningful sense, an extension of existing capabilities: vectorization, threading, and a re-visiting of node interconnect capabilities. Therefore, as architectures, programming models, and programming mechanisms continue to evolve, the preparations described herein will provide significant performance benefits on existing and emerging architectures. © 2012 IEEE.

More Details

TYPE Conference YEAR 2012

OSTI Scopus

Publications

Search results