Publications

Results 126–150 of 219

Solving the performance portability issue with Kokkos

Trott, Christian R.; Plimpton, Steven J.; Thompson, A.P.

Abstract not provided.

TYPE Conference Poster YEAR 2017

OSTI

On the Importance of Faster Atomics

Hammond, Simon; Trott, Christian R.; Edwards, Harold C.

Abstract not provided.

TYPE Conference Poster YEAR 2017

OSTI

Revisiting Online Autotuning for Sparse-Matrix Vector Multiplication Kernels on Next-Generation Architectures

Garcia De Gonzalo, Simon; Hammond, Simon; Trott, Christian R.; Hwu, Wen-Mei

Abstract not provided.

TYPE Conference Poster YEAR 2017

OSTI

Prototyping the Next Generation of Aria

Clausen, Jonathan; Brunini, Victor; Forster, Christopher J.; Noble, David R.; Trott, Christian R.; Hammond, Simon; Hoemmen, Mark F.; Lin, Paul T.

Abstract not provided.

TYPE Conference Poster YEAR 2017

OSTI

A Classical MD Primer

Trott, Christian R.

Abstract not provided.

TYPE Presentation YEAR 2017

OSTI

Performance-portable sparse matrix-matrix multiplication for many-core architectures

Proceedings - 2017 IEEE 31st International Parallel and Distributed Processing Symposium Workshops, IPDPSW 2017

Deveci, Mehmet; Trott, Christian R.; Rajamanickam, Sivasankaran

We consider the problem of writing performance portable sparse matrix-sparse matrix multiplication (SPGEMM) kernel for many-core architectures. We approach the SPGEMM kernel from the perspectives of algorithm design and implementation, and its practical usage. First, we design a hierarchical, memory-efficient SPGEMM algorithm. We then design and implement thread scalable data structures that enable us to develop a portable SPGEMM implementation. We show that the method achieves performance portability on massively threaded architectures, namely Intel's Knights Landing processors (KNLs) and NVIDIA's Graphic Processing Units (GPUs), by comparing its performance to specialized implementations. Second, we study an important aspect of SPGEMM's usage in practice by reusing the structure of input matrices, and show speedups up to 3× compared to the best specialized implementation on KNLs. We demonstrate that the portable method outperforms 4 native methods on 2 different GPU architectures (up to 17× speedup), and it is highly thread scalable on KNLs, in which it obtains 101× speedup on 256 threads.

TYPE Conference Poster YEAR 2017

DOI OSTI Scopus

OpenACC for Programmers: Concepts and Strategies

Trott, Christian R.

Abstract not provided.

TYPE Book YEAR 2017

OSTI

Optimizing the Performance of Sparse-Matrix Vector Products on Next-Generation Processors

Hammond, Simon; Trott, Christian R.

Matrix-vector products are ubiquitous in high-performance scientific applications and have a growing set of occurrences in advanced data analysis activities. Achieving high performance for these kernels is therefore paramount, in part, because these operations can consume vast amounts of application execution time. In this report we document the development of several sparse-matrix vector product kernel implementations using a variety of programming models and approaches. Each kernel is run on a broad set of matrices selected to demonstrate the wide variety of matrix structure and sparsity that is possible with a single, generic kernel. For benchmarking and performance analysis, we utilize leading computing architectures for the NNSA/ASC program including Intel's Knights Landing processor and IBM's POWER8.

TYPE SAND Report YEAR 2017

DOI OSTI

Kokkos Tutorial

Edwards, Harold C.; Trott, Christian R.

Abstract not provided.

TYPE Conference Poster YEAR 2017

OSTI

Profiling Kokkos Application

Trott, Christian R.

Abstract not provided.

TYPE Presentation YEAR 2017

OSTI

Kokkos: The C++ Performance Portability Programming Model

Trott, Christian R.; Edwards, Harold C.

Abstract not provided.

TYPE Conference Poster YEAR 2017

OSTI

Kokkos: The C++ Performance Portability Programming Model

Trott, Christian R.

Abstract not provided.

TYPE Conference Poster YEAR 2017

OSTI

Next Generation Science Applications for the Next Generation of Supercomputing

Vaughan, Courtenay T.; Hammond, Simon; Dinge, Dennis; Lin, Paul T.; Pase, Douglas M.; Cook, Jeanine; Trott, Christian R.; Hughes, Clayton; Hoekstra, Robert J.

Abstract not provided.

TYPE Conference Poster YEAR 2017

OSTI

Next Generation Science Applications for the Next Generation of Supercomputing

Vaughan, Courtenay T.; Hammond, Simon; Dinge, Dennis; Lin, Paul T.; Pase, Douglas M.; Trott, Christian R.; Cook, Jeanine; Hughes, Clayton; Hoekstra, Robert J.

Abstract not provided.

TYPE Conference Poster YEAR 2017

OSTI

Enabling Low Mach Fluid Simulations Using Trilinos

Hu, Jonathan J.; Devine, Karen; Hoemmen, Mark F.; Lin, Paul T.; Rajamanickam, Sivasankaran; Roberts, Nathan V.; Siefert, Christopher; Trott, Christian R.; Prokopenko, Andrey

Abstract not provided.

TYPE Conference Poster YEAR 2017

OSTI

Kokkos: Performance Portability Status

Trott, Christian R.; Edwards, Harold C.

Abstract not provided.

TYPE Presentation YEAR 2017

OSTI

Revisiting Online Autotuning for Sparse-Matrix Vector Multiplication Kernels on High-Performance Accelerators

Garcia De Gonzalo, Simon; Hwu, Wen-Mei; Hammond, Simon; Trott, Christian R.

Abstract not provided.

TYPE Conference Poster YEAR 2017

OSTI

Extending Kokkos with Task Parallelism

Sunderland, Daniel; Edwards, Harold C.; Trott, Christian R.

Abstract not provided.

TYPE Conference Poster YEAR 2017

OSTI

KokkosKernels: Compact Layouts for Batched Blas and Sparse Matrix-Matrix multiply

Rajamanickam, Sivasankaran; Bradley, Andrew M.; Kim, Kyungjoo; Deveci, Mehmet; Trott, Christian R.; Hammond, Simon

Abstract not provided.

TYPE Conference Poster YEAR 2017

OSTI

Performance Issues for Modeling Materials via MD on Current and Future Hardware

Plimpton, Steven J.; Moore, Stan G.; Trott, Christian R.

Abstract not provided.

TYPE Conference Poster YEAR 2017

OSTI

Preparing Sandia's Application Portfolio for the Future Using Kokkos

Trott, Christian R.; Edwards, Harold C.; Hammond, Simon; Sunderland, Daniel

Abstract not provided.

TYPE Conference Poster YEAR 2017

OSTI

Codesign for Production Applications

Hammond, Simon; Trott, Christian R.; Vaughan, Courtenay T.; Dinge, Dennis; Lin, Paul T.; Pase, Douglas M.; Benner, Robert E.; Cook, Jeanine; Hoekstra, Robert J.

Abstract not provided.

TYPE Conference Poster YEAR 2017

OSTI

Kokkos: Performance Portability for C++ Codes

Trott, Christian R.; Edwards, Harold C.

Abstract not provided.

TYPE Conference Poster YEAR 2016

OSTI

Prototyping the Next-Generation of Aria

Brunini, Victor; Clausen, Jonathan; Noble, David R.; Forster, Christopher J.; Trott, Christian R.; Hammond, Simon; Hoemmen, Mark F.; Lin, Paul T.

Abstract not provided.

TYPE Presentation YEAR 2016

OSTI

Kokkos: Performance Portability and Productivity for C++ Applications

Edwards, Harold C.; Trott, Christian R.; Sunderland, Daniel

Abstract not provided.

TYPE Conference Poster YEAR 2016

OSTI
