Publications

Results 176–200 of 202

Kokkos: Enabling manycore performance portability through polymorphic memory access patterns

Journal of Parallel and Distributed Computing

Trott, Christian R.

The manycore revolution can be characterized by increasing thread counts, decreasing memory per thread, and diversity of continually evolving manycore architectures. High performance computing (HPC) applications and libraries must exploit increasingly finer levels of parallelism within their codes to sustain scalability on these devices. We found that a major obstacle to performance portability is the diverse and conflicting set of constraints on memory access patterns across devices. Contemporary portable programming models address manycore parallelism (e.g., OpenMP, OpenACC, OpenCL) but fail to address memory access patterns. The Kokkos C++ library enables applications and domain libraries to achieve performance portability on diverse manycore architectures by unifying abstractions for both fine-grain data parallelism and memory access patterns. In this paper we describe Kokkos’ abstractions, summarize its application programmer interface (API), present performance results for unit-test kernels and mini-applications, and outline an incremental strategy for migrating legacy C++ codes to Kokkos. Furthermore, the Kokkos library is under active research and development to incorporate capabilities from new generations of manycore architectures, and to address a growing list of applications and domain libraries.

More Details

TYPE Journal Article YEAR 2014

OSTI DOI

SNAP: Strong Scaling High Fidelity Molecular

Trott, Christian R.; Hammond, Simon D.; Thompson, Aidan P.

Abstract not provided.

More Details

TYPE Presentation YEAR 2014

OSTI

LAMMPS-Kokkos: The Tutorial alpha

Trott, Christian R.

Abstract not provided.

More Details

TYPE Presentation YEAR 2014

OSTI

Kokkos: The Tutorial alpha

Trott, Christian R.; Edwards, Harold C.; Sunderland, Daniel S.

Abstract not provided.

More Details

TYPE Presentation YEAR 2014

OSTI

Migrating to Kokkos

Trott, Christian R.; Edwards, Harold C.; Hoemmen, Mark F.

Abstract not provided.

More Details

TYPE Conference YEAR 2014

OSTI

Migrating to Kokkos

Trott, Christian R.; Hoemmen, Mark F.; Edwards, Harold C.

Abstract not provided.

More Details

TYPE Presentation YEAR 2014

OSTI

Kokkos a Manycore Device Performance Portability Library for C++ HPC Applications

Trott, Christian R.; Sunderland, Daniel S.; Edwards, Harold C.

Abstract not provided.

More Details

TYPE Conference YEAR 2014

OSTI

A New Approach for Interatomic Potentials: Application to Tantalum

Foiles, Stephen M.; Thompson, Aidan P.; Swiler, Laura P.; Trott, Christian R.

Abstract not provided.

More Details

TYPE Conference YEAR 2014

OSTI

SNAP: Strong scaling high fidelity molecular dynamics simulations on leadership-class computing platforms

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

Trott, Christian R.; Hammond, Simon D.; Thompson, Aidan P.

The rapidly improving compute capability of contemporary processors and accelerators is providing the opportunity for significant increases in the accuracy and fidelity of scientific calculations. In this paper we present performance studies of a new molecular dynamics (MD) potential called SNAP. The SNAP potential has shown great promise in accurately reproducing physics and chemistry not described by simpler potentials. We have developed new algorithms to exploit high single-node concurrency provided by three different classes of machine: the Titan GPU-based system operated by Oak Ridge National Laboratory, the combined Sequoia and Vulcan BlueGene/Q machines located at Lawrence Livermore National Laboratory, and the large-scale Intel Sandy Bridge system, Chama, located at Sandia. Our analysis focuses on strong scaling experiments with approximately 246,000 atoms over the range 1-122,880 nodes on Sequoia/Vulcan and 40-18,630 nodes on Titan. We compare these machine in terms of both simulation rate and power efficiency. We find that node performance correlates with power consumption across the range of machines, except for the case of extreme strong scaling, where more powerful compute nodes show greater efficiency. This study is a unique assessment of a challenging, scientifically relevant calculation running on several of the world's leading contemporary production supercomputing platforms. © 2014 Springer International Publishing.

More Details

TYPE Conference YEAR 2014

Scopus OSTI DOI

Kokkos Tutorial:A Trilinos package for manycore performance portability

Edwards, Harold C.; Trott, Christian R.; Sunderland, Daniel S.

Abstract not provided.

More Details

TYPE Presentation YEAR 2013

OSTI

Performance on Advanced Systems Test Beds

Trott, Christian R.; Hammond, Simon D.; Kelly, Suzanne M.; Laros, James H.; Ang, James A.

Abstract not provided.

More Details

TYPE Presentation YEAR 2013

OSTI

Kokkos: Enabling Manycore Performance Portable Applications and Libraries

Trott, Christian R.

Abstract not provided.

More Details

TYPE Presentation YEAR 2013

OSTI

Extended abstract for "Kokkos a Manycore Device Performance Portability Library for C++ HPC Applications"

Edwards, Harold C.; Trott, Christian R.; Sunderland, Daniel S.

Abstract not provided.

More Details

TYPE Presentation YEAR 2013

OSTI

NNSA/ASC Test Bed Update

Hammond, Simon D.; Barrett, Richard F.; Vaughan, Courtenay T.; Trott, Christian R.; Laros, James H.; Kelly, Suzanne M.; Ang, James A.

Abstract not provided.

More Details

TYPE Presentation YEAR 2013

OSTI

Kokkos: Enabling performance portability across manycore architectures

Edwards, Harold C.; Trott, Christian R.

Abstract not provided.

More Details

TYPE Conference YEAR 2013

OSTI

Kokkos LibraryEnabling performance portability across manycore architectures

Trott, Christian R.; Sunderland, Daniel S.

Abstract not provided.

More Details

TYPE Conference YEAR 2013

OSTI

KokkosArray:Multidimensional Arrays forManycore Performance Portability

Trott, Christian R.

Abstract not provided.

More Details

TYPE Conference YEAR 2013

OSTI

Kokkos: Enabling performance portability across manycore architectures

Trott, Christian R.

Abstract not provided.

More Details

TYPE Conference YEAR 2013

OSTI

Quantum-Accurae LAMMPS SNAP Simulations on Petascale Platforms

Thompson, Aidan P.; Trott, Christian R.; Swiler, Laura P.; Tucker, Garritt T.; Foiles, Stephen M.; Plimpton, Steven J.

Abstract not provided.

More Details

TYPE Presentation YEAR 2013

OSTI

Automated generation of quantum-accurate classical interatomic potentials for metals and semiconductors

Thompson, Aidan P.; Schultz, Peter A.; Swiler, Laura P.; Trott, Christian R.; Tucker, Garritt T.; Foiles, Stephen M.

Abstract not provided.

More Details

TYPE Conference YEAR 2013

OSTI

Implementing Many-Body Potentials for Molecular Dynamics Simulations

Trott, Christian R.; Thompson, Aidan P.

Abstract not provided.

More Details

TYPE Conference YEAR 2013

OSTI

An examination of content similarity within the memory of HPC applications

Ferreira, Kurt; Thompson, Aidan P.; Trott, Christian R.; Levy, Scott L.

Abstract not provided.

More Details

TYPE SAND Report YEAR 2013

OSTI DOI

Evaluating the Feasibility of Using Memory Content Similarity to Improve System Resilience

Ferreira, Kurt; Thompson, Aidan P.; Trott, Christian R.

Abstract not provided.

More Details

TYPE Conference YEAR 2013

OSTI

miniMD_postdoc_poster

Trott, Christian R.

Abstract not provided.

More Details

TYPE Presentation YEAR 2012

OSTI

Exploring the Future of High Performance Computing with Proxy-Apps

Trott, Christian R.

Abstract not provided.

More Details

TYPE Presentation YEAR 2012

OSTI

Results 176–200 of 202

Results 176–200 of 202