Publications Search

ExaLearn – GenTen Tensor Software ECP Milestone

Kolla, Hemanth; Phipps, Eric T.; Wolf, Michael

The objective of this milestone was to finish integrating GenTen tensor software with combustion application Pele using the Ascent in situ analysis software, partnering with the ALPINE and Pele teams. Also, to demonstrate the usage of the tensor analysis as part of a combustion simulation.

More Details

TYPE Other Report YEAR 2022

DOI OSTI

ExaLearn: Co-Design Center for Exascale Machine Learning Technologies

Alexander, Frank; Foster, Ian; Sweeney, Christine; Nugent, Peter; Van Essen, Brian; Seal, Sudip; Ang, James; Wolf, Michael

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2021

DOI OSTI

ExaLearn: Co-Design Center for Exascale Machine Learning Technologies

Alexander, Frank; Foster, Ian; Hagberg, Aric; Nugent, Peter; Van Essen, Brian; Womble, David; Ang, James; Wolf, Michael

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2021

DOI OSTI

ECP Report: Update on Proxy Applications and Vendor Interactions

Ang, Jim; Sweeney, Christine; Wolf, Michael; Ellis, John A.; Ghosh, Sayan; Kagawa, Ai; Huang, Yunzhi; Rajamanickam, Sivasankaran; Ramakrishnaiah, Vinay; Schram, Malachi; Yoo, Shinjae

The ExaLearn miniGAN team (Ellis and Rajamanickam) have released miniGAN, a generative adversarial network(GAN) proxy application, through the ECP proxy application suite. miniGAN is the first machine learning proxy application in the suite (note: the ECP CANDLE project did previously release some benchmarks) and models the performance for training generator and discriminator networks. The GAN's generator and discriminator generate plausible 2D/3D maps and identify fake maps, respectively. miniGAN aims to be a proxy application for related applications in cosmology (CosmoFlow, ExaGAN) and wind energy (ExaWind). miniGAN has been developed so that optimized mathematical kernels (e.g., kernels provided by Kokkos Kernels) can be plugged into to the proxy application to explore potential performance improvements. miniGAN has been released as open source software and is available through the ECP proxy application website (https://proxyapps.exascaleproject.ordecp-proxy-appssuite/) and on GitHub (https://github.com/SandiaMLMiniApps/miniGAN). As part of this release, a generator is provided to generate a data set (series of images) that are inputs to the proxy application.

More Details

TYPE Other Report YEAR 2020

DOI OSTI

Evolution of the Kokkos Ecosystem

Miles, Jeffery S.; Trott, Christian R.; Brunini, Victor; Bettencourt, Matthew T.; Poliakoff, David; Rajamanickam, Sivasankaran; Hollman, David S.; Wolf, Michael; Glass, Micheal W.

Abstract not provided.

More Details

TYPE Presentation YEAR 2020

OSTI

Scalable Triangle Counting on Distributed-Memory Systems

Acer, Seher; Yasar, Abdurrahman; Rajamanickam, Sivasankaran; Berry, Jonathan; Wolf, Michael; Catalyurek, Umit

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2020

DOI OSTI

The Kokkos C++ Performance Portability EcoSystem

Wolf, Michael; Trott, Christian R.

Abstract not provided.

More Details

TYPE Presentation YEAR 2019

OSTI

Scalable triangle counting on distributed-memory systems

2019 IEEE High Performance Extreme Computing Conference, HPEC 2019

Acer, Seher; Yasar, Abdurrahman; Rajamanickam, Sivasankaran; Wolf, Michael; Catalyurek, Umit V.

Triangle counting is a foundational graph-analysis kernel in network science. It has also been one of the challenge problems for the 'Static Graph Challenge'. In this work, we propose a novel, hybrid, parallel triangle counting algorithm based on its linear algebra formulation. Our framework uses MPI and Cilk to exploit the benefits of distributed-memory and shared-memory parallelism, respectively. The problem is partitioned among MPI processes using a two-dimensional (2D) Cartesian block partitioning. One-dimensional (1D) rowwise partitioning is used within the Cartesian blocks for shared-memory parallelism using the Cilk programming model. Besides exhibiting very good strong scaling behavior in almost all tested graphs, our algorithm achieves the fastest time on the 1.4B edge real-world twitter graph, which is 3.217 seconds, on 1,092 cores. In comparison to past distributed-memory parallel winners of the graph challenge, we demonstrate a speed up of 2.7× on this twitter graph. This is also the fastest time reported for parallel triangle counting on the twitter graph when the graph is not replicated.

More Details

TYPE Conference Poster YEAR 2019

DOI OSTI Scopus

Linear algebra-based triangle counting via fine-grained tasking on heterogeneous environments : ((Update on Static Graph Challenge)

2019 IEEE High Performance Extreme Computing Conference, HPEC 2019

Yasar, Abdurrahman; Rajamanickam, Sivasankaran; Berry, Jonathan; Acer, Seher; Wolf, Michael; Young, Jeffrey S.; Catalyurek, Umit V.

Triangle counting is a representative graph problem that shows the challenges of improving graph algorithm performance using algorithmic techniques and adopting graph algorithms to new architectures. In this paper, we describe an update to the linear-algebraic formulation of the triangle counting problem. Our new approach relies on fine-grained tasking based on a tile layout. We adopt this task based algorithm to heterogeneous architectures (CPUs and GPUs) for up to 10.8x speed up over past year's graph challenge submission. This implementation also results in the fastest kernel time known at time of publication for real-world graphs like twitter (3.7 second) and friendster (1.8 seconds) on GPU accelerators when the graph is GPU resident. This is a 1.7 and 1.2 time improvement over previous state-of-the-art triangle counting on GPUs. We also improved end-to-end execution time by overlapping computation and communication of the graph to the GPUs. In terms of end-to-end execution time, our implementation also achieves the fastest end-to-end times due to very low overhead costs.

More Details

TYPE Conference Poster YEAR 2019

OSTI Scopus

Scalable Triangle Counting on Distributed-Memory Systems

Acer, Seher; Yasar, Abdurrahman; Rajamanickam, Sivasankaran; Wolf, Michael; Catalyurek, Umit

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2019

DOI OSTI

Scalable Triangle Counting on Distributed-Memory Systems

Acer, Seher; Yasar, Abdurrahman; Rajamanickam, Sivasankaran; Wolf, Michael; Catalyurek, Umit

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2019

DOI OSTI

Exploiting Scientific Software to Solve Problems in Data Analytics

Devine, Karen; Boman, Erik G.; Dunlavy, Daniel M.; Kolda, Tamara G.; Wolf, Michael

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2018

OSTI

Fast Triangle Counting Using Cilk

Rajamanickam, Sivasankaran; Yasar, Abdurrahman; Wolf, Michael; Berry, Jonathan; Catlyurek, Umit

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2018

OSTI

Trilinos Framework and Solvers

Rajamanickam, Sivasankaran; Hu, Jonathan J.; Devine, Karen; Wolf, Michael

Abstract not provided.

More Details

TYPE Presentation YEAR 2018

OSTI

Fast Triangle Counting Using Cilk

Rajamanickam, Sivasankaran; Yasar, Abdurrahman; Wolf, Michael; Berry, Jonathan; Catlyurek, Umit

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2018

OSTI

ExaLearn Application Interview

Rajamanickam, Sivasankaran; Wolf, Michael; Phipps, Eric T.; Ebeida, Mohamed; Debusschere, Bert

Abstract not provided.

More Details

TYPE Presentation YEAR 2018

OSTI

High-Performance Portable Data Analytics Software Using the Kokkos Ecosystem

Wolf, Michael

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2018

OSTI

Fast Linear Algebra-Based Triangle Analytics with Kokkos Kernels

Yasar, Abdurrahman; Rajamanickam, Sivasankaran; Wolf, Michael; Berry, Jonathan; Catalyurek, Umit

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2018

OSTI

Exalearn: Plans for Proxy applications and Performance Portabili

Rajamanickam, Sivasankaran; Wolf, Michael

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2018

OSTI

Fast Triangle Counting Using Cilk

Yasar, Abdurrahman; Rajamanickam, Sivasankaran; Wolf, Michael; Berry, Jonathan; Catalyurek, Umit

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2018

DOI OSTI

Sparse Matrix-Matrix Multiplication on Multilevel Memory Architectures: Algorithms and Experiments

Deveci, Mehmet; Hammond, Simon; Wolf, Michael; Rajamanickam, Sivasankaran

Architectures with multiple classes of memory media are becoming a common part of mainstream supercomputer deployments. So called multi-level memories offer differing characteristics for each memory component including variation in bandwidth, latency and capacity. This paper investigates the performance of sparse matrix multiplication kernels on two leading highperformance computing architectures — Intel's Knights Landing processor and NVIDIA's Pascal GPU. We describe a data placement method and a chunking-based algorithm for our kernels that exploits the existence of the multiple memory spaces in each hardware platform. We evaluate the performance of these methods w.r.t. standard algorithms using the auto-caching mechanisms Our results show that standard algorithms that exploit cache reuse performed as well as multi-memory-aware algorithms for architectures such as Ki\iLs where the memory subsystems have similar latencies. However, for architectures such as GPUS where memory subsystems differ significantly in both bandwidth and latency, multi-memory-aware methods are crucial for good performance. In addition, our new approaches permit the user to run problems that require larger capacities than the fastest memory of each compute node without depending on the software-managed cache mechanisms.

More Details

TYPE Other Report YEAR 2018

DOI OSTI

Multi-threaded Sparse Matrix Matrix Multiplication with Applications in Scientific Computing and Graph Analytics

Deveci, Mehmet; Wolf, Michael; Berry, Jonathan; Rajamanickam, Sivasankaran; Boman, Erik G.; Trott, Christian R.; Hammond, Simon; Olivier, Stephen L.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2018

OSTI

ExaGraph at Sandia: Graph Coloring Clustering and Partitioning for Exascale Computing

Boman, Erik G.; Deveci, Mehmet; Devine, Karen; Rajamanickam, Sivasankaran; Wolf, Michael

Abstract not provided.

More Details

TYPE Presentation YEAR 2018

OSTI

TriData: High Performance Linear Algebra-Based Data Analytics

Wolf, Michael; Dunlavy, Daniel M.; Lehoucq, Rich; Berry, Jonathan; Bourgeois, Daniel

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2018

OSTI

ExaGraph: Combinatorial Methods for Enabling Exascale Applications

Author, No; Halappanavar, Mahantesh; Buluc, Aydin; Boman, Erik G.; Pothen, Alex; Tumeo, Antonino; Azad, Ariful; Khan, Arif; Ferdous, Sm; Rajamanickam, Sivasankaran; Wolf, Michael; Deveci, Mehmet; Devine, Karen

Abstract not provided.

More Details

TYPE Presentation YEAR 2018

OSTI

Fast linear algebra-based triangle counting with KokkosKernels

2017 IEEE High Performance Extreme Computing Conference, HPEC 2017

Wolf, Michael; Deveci, Mehmet; Berry, Jonathan; Hammond, Simon; Rajamanickam, Sivasankaran

Triangle counting serves as a key building block for a set of important graph algorithms in network science. In this paper, we address the IEEE HPEC Static Graph Challenge problem of triangle counting, focusing on obtaining the best parallel performance on a single multicore node. Our implementation uses a linear algebra-based approach to triangle counting that has grown out of work related to our miniTri data analytics miniapplication [1] and our efforts to pose graph algorithms in the language of linear algebra. We leverage KokkosKernels to implement this approach efficiently on multicore architectures. Our performance results are competitive with the fastest known graph traversal-based approaches and are significantly faster than the Graph Challenge reference implementations, up to 670,000 times faster than the C++ reference and 10,000 times faster than the Python reference on a single Intel Haswell node.

More Details

TYPE Conference Poster YEAR 2017

DOI OSTI Scopus

KKTri: Fast Linear Algebra-Based Triangle Counting with KokkosKernels

Wolf, Michael; Deveci, Mehmet; Berry, Jonathan; Hammond, Simon; Rajamanickam, Sivasankaran

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2017

OSTI

High Fidelity Simulations of Large-scale Wireless Networks (Part II - FY2017)

Onunkwo, Uzoma; Ganti, Anand; Mitchell, John A.; Scoggin, Michael; Schroeppel, Richard C.; Van Leeuwen, Brian P.; Wolf, Michael

The ability to simulate wireless networks at large-scale for meaningful amount of time is considerably lacking in today's network simulators. For this reason, many published work in this area often limit their simulation studies to less than a 1,000 nodes and either over-simplify channel characteristics or perform studies over time scales much less than a day. In this report, we show that one can overcome these limitations and study problems of high practical consequence. This work presents two key contributions to high fidelity simulation of large-scale wireless networks: (a) wireless simulations can be sped up by more than 100X in runtime using ideas from spatial indexing algorithms and clipping of negligible signals and (b) clustering and task-oriented programming paradigm can be used to reduce inter- process communication in a parallel discrete event simulation resulting in a better scaling efficiency.

More Details

TYPE SAND Report YEAR 2017

DOI OSTI

Fast Linear Algebra-Based Triangle Counting with KokkosKernels

Wolf, Michael; Deveci, Mehmet; Berry, Jonathan; Hammond, Simon; Rajamanickam, Sivasankaran

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2017

DOI OSTI

Performance Portable Sparse Matrix Matrix Multiplication with Applications in Scientific Computing and Graph Analytics

Deveci, Mehmet; Trott, Christian R.; Hammond, Simon; Wolf, Michael; Berry, Jonathan; Rajamanickam, Sivasankaran

Abstract not provided.

More Details

TYPE Presentation YEAR 2017

OSTI

Fast Linear Algebra-Based Triangle Counting with KokkosKernels

Wolf, Michael; Deveci, Mehmet; Berry, Jonathan; Hammond, Simon; Rajamanickam, Sivasankaran

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2017

DOI OSTI

miniTri ? Triangle Based Data Analytics Miniapp

Wolf, Michael

Abstract not provided.

More Details

TYPE Presentation YEAR 2017

OSTI

A computational spectral graph theory tutorial

Lehoucq, Rich; Boman, Erik G.; Devine, Karen; Berry, Jonathan; Dunlavy, Daniel M.; Wolf, Michael; Van HensonVan; Sanders, Geoff

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2017

OSTI

TriData: Applying Trilinos to Data Analysis

Klinvex, Alicia M.; Wolf, Michael; Dunlavy, Daniel M.

Abstract not provided.

More Details

TYPE Presentation YEAR 2016

OSTI

Kokkos/Qthreads Task Parallel Approach to Linear Algebra Based Graph Analytics

Wolf, Michael; Edwards, Harold C.; Olivier, Stephen L.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2016

DOI OSTI

Asynchrony in Task-Parallel Linear Algebra Based Graph Analytics

Wolf, Michael

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2016

OSTI

Hierarchical Task-Data Parallelism using Kokkos and Qthreads

Edwards, Harold C.; Olivier, Stephen L.; Berry, Jonathan; Mackey, Greg E.; Rajamanickam, Sivasankaran; Wolf, Michael; Kim, Kyungjoo; Stelle, George W.

This report describes a new capability for hierarchical task-data parallelism using Sandia's Kokkos and Qthreads, and evaluation of this capability with sparse matrix Cholesky factorization and social network triangle enumeration mini-applications. Hierarchical task-data parallelism consists of a collection of tasks with executes-after dependences where each task contains data parallel operations performed on a team of hardware threads. The collection of tasks and dependences form a directed acyclic graph of tasks - a task DAG. Major challenges of this research and development effort include: portability and performance across multicore CPU; manycore Intel Xeon Phi, and NVIDIA GPU architectures; scalability with respect to hardware concurrency and size of the task DAG; and usability of the application programmer interface (API).

More Details

TYPE SAND Report YEAR 2016

DOI OSTI

Advantages to Modeling Relational Data using Hypergraphs versus Graphs

Wolf, Michael; Klinvex, Alicia M.; Dunlavy, Daniel M.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2016

DOI OSTI

Task Parallel Approach to the Linear Algebra-Based Implementation of miniTri

Wolf, Michael

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2016

OSTI

Advantages to Modeling Relational Data using Hypergraphs versus Graphs

Wolf, Michael; Klinvex, Alicia M.; Dunlavy, Daniel M.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2016

DOI OSTI

Kokkos/Qthreads Task Parallel Approach to Linear Algebra Based Graph Analytics

Wolf, Michael; Edwards, Harold C.; Olivier, Stephen L.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2016

DOI OSTI

Hypergraph Exploitation for Data Sciences

Wolf, Michael; Dunlavy, Daniel M.; Klinvex, Alicia M.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2016

OSTI

Clustering network data through effective use of eigensolvers and hypergraph models

Klinvex, Alicia M.; Wolf, Michael; Dunlavy, Daniel M.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2016

OSTI

A task-based linear algebra Building Blocks approach for scalable graph analytics

2015 IEEE High Performance Extreme Computing Conference, HPEC 2015

Wolf, Michael; Berry, Jonathan; Stark, Dylan T.

It is challenging to obtain scalable HPC performance on real applications, especially for data science applications with irregular memory access and computation patterns. To drive co-design efforts in architecture, system, and application design, we are developing miniapps representative of data science workloads. These in turn stress the state of the art in Graph BLAS-like Graph Algorithm Building Blocks (GABB). In this work, we outline a Graph BLAS-like, linear algebra based approach to miniTri, one such miniapp. We describe a task-based prototype implementation and give initial scalability results.

More Details

TYPE Conference Poster YEAR 2015

DOI OSTI Scopus

A task-based linear algebra Building Blocks approach for scalable graph analytics

2015 IEEE High Performance Extreme Computing Conference Hpec 2015

Wolf, Michael; Berry, Jonathan; Stark, Dylan T.

It is challenging to obtain scalable HPC performance on real applications, especially for data science applications with irregular memory access and computation patterns. To drive co-design efforts in architecture, system, and application design, we are developing miniapps representative of data science workloads. These in turn stress the state of the art in Graph BLAS-like Graph Algorithm Building Blocks (GABB). In this work, we outline a Graph BLAS-like, linear algebra based approach to miniTri, one such miniapp. We describe a task-based prototype implementation and give initial scalability results.

More Details

TYPE Conference Poster YEAR 2015

DOI OSTI Scopus

HPX Applications and Performance Adaptation

Koniges, Alice; Candadai, Jayashree A.; Kaiser, Hartmut; Huck, Kevin; Kemp, Jeremy; Heller, Thomas; Anderson, Matthew; Lumsdaine, Andrew; Serio, Adrian; Wolf, Michael; Lelbach, Bryce; Brightwell, Ronald B.; Sterling, Thomas

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2015

OSTI