Publications Search

Sparse solvers provide essential functionality for a wide variety of scientific applications. Highly parallel sparse solvers are essential for continuing advances in high-fidelity, multi-physics and multi-scale simulations, especially as we target exascale platforms. This paper describes the challenges, strategies and progress of the US Department of Energy Exascale Computing project towards providing sparse solvers for exascale computing platforms. We address the demands of systems with thousands of high-performance node devices where exposing concurrency, hiding latency and creating alternative algorithms become essential. The efforts described here are works in progress, highlighting current success and upcoming challenges. This article is part of a discussion meeting issue 'Numerical algorithms for high-performance computational science'.

More Details

TYPE Journal Article YEAR 2020

DOI OSTI Scopus

Exascale Node-Level Parallel Programming Environments: Overview and Deciding What's Right for You

Rajamanickam, Sivasankaran; Heroux, Michael A.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2020

OSTI

Lightweight Software Process Improvement Using Productivity and Sustainability Improvement Planning (PSIP)

Communications in Computer and Information Science

Milewicz, Reed M.; Heroux, Michael A.; Gonsiorowski, Elsa; Gupta, Rinku; Moulton, J.D.; Watson, Gregory R.; Willenbring, James M.; Zamora, Richard J.; Raybourn, Elaine M.

Productivity and Sustainability Improvement Planning (PSIP) is a lightweight, iterative workflow that allows software development teams to identify development bottlenecks and track progress to overcome them. In this paper, we present an overview of PSIP and how it compares to other software process improvement (SPI) methodologies, and provide two case studies that describe how the use of PSIP led to successful improvements in team effectiveness and efficiency.

More Details

TYPE Conference Poster YEAR 2020

DOI OSTI Scopus

SOFTWARE ENGINEERING BEST PRACTICES: Why What and How

Heroux, Michael A.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2019

OSTI

Trust me. QED

SIAM News

Heroux, Michael A.

Consider a standard SIAM journal article containing theoretical results. Each theorem has a proof that typically builds on previous developments. Since every theorem stems from a firm foundation, the research community can trust a result without further evidence. One could thus argue that a theorem does not require a proof because surely an author would not publish it if no proof existed to back it up. Furthermore, respectable reviewers and editors expect proofs without exception, and papers containing proof-less theorems will likely go unpublished.

More Details

TYPE Journal Article YEAR 2019

OSTI

Trilinos Overview

Heroux, Michael A.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2019

OSTI

The Extreme-Scale Scientific Software Stack (E4S)

Heroux, Michael A.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2019

OSTI

Toward performance portability of the Albany finite element analysis code using the Kokkos library

International Journal of High Performance Computing Applications

Demeshko, Irina; Watkins, Jerry E.; Tezaur, Irina K.; Guba, Oksana; Spotz, William S.; Salinger, Andrew G.; Pawlowski, Roger; Heroux, Michael A.

Performance portability on heterogeneous high-performance computing (HPC) systems is a major challenge faced today by code developers: parallel code needs to be executed correctly as well as with high performance on machines with different architectures, operating systems, and software libraries. The finite element method (FEM) is a popular and flexible method for discretizing partial differential equations arising in a wide variety of scientific, engineering, and industrial applications that require HPC. This article presents some preliminary results pertaining to our development of a performance portable implementation of the FEM-based Albany code. Performance portability is achieved using the Kokkos library. We present performance results for the Aeras global atmosphere dynamical core module in Albany. Numerical experiments show that our single code implementation gives reasonable performance across three multicore/many-core architectures: NVIDIA General Processing Units (GPU’s), Intel Xeon Phis, and multicore CPUs.

More Details

TYPE Journal Article YEAR 2019

DOI OSTI Scopus DOI OSTI Scopus

Community organizations: Changing the culture in which research software is developed and sustained

Computing in Science and Engineering

Katz, Daniel S.; Mcinnes, Lois C.; Bernholdt, David E.; Mayes, Abigail C.; Hong, Neil P.C.; Duckles, Jonah; Gesing, Sandra; Heroux, Michael A.; Hettrick, Simon; Jimenez, Rafael C.; Pierce, Marlon; Weaver, Belinda; Wilkins-Diehr, Nancy

Software is the key crosscutting technology that enables advances in mathematics, computer science, and domain-specific science and engineering to achieve robust simulations and analysis for science, engineering, and other research fields. However, software itself has not traditionally received focused attention from research communities; rather, software has evolved organically and inconsistently, with its development largely as by-products of other initiatives. Moreover, challenges in scientific software are expanding due to disruptive changes in computer hardware, increasing scale and complexity of data, and demands for more complex simulations involving multiphysics, multiscale modeling and outer-loop analysis. In recent years, community members have established a range of grass-roots organizations and projects to address these growing technical and social challenges in software productivity, quality, reproducibility, and sustainability. This article provides an overview of such groups and discusses opportunities to leverage their synergistic activities while nurturing work toward emerging software ecosystems.

More Details

TYPE Journal Article YEAR 2019

DOI OSTI Scopus

Recent Trends and Challenges for High Performance Sparse Linear Algebra

Heroux, Michael A.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2019

OSTI

Making Reproducibility Indispensable

Heroux, Michael A.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2019

OSTI

ECP Software Technologies Software Development Kits (SDKs)

Willenbring, James M.; Heroux, Michael A.; Mcinnes, Lois

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2018

OSTI

Toward a Compatible Reproducibility Taxonomy for Computational and Computing Sciences

Heroux, Michael A.; Barba, Lorena A.; Parashar, Manish; Stodden, Victoria; Taufer, Michela

Reproducibility is an essential ingredient of the scientific enterprise. The ability to reproduce results builds trust that we can rely on the results as foundations for future scientific exploration. Presently, the fields of computational and computing sciences provide two opposing definitions of reproducible and replicable. In computational sciences, reproducible research means authors provide all necessary data and computer codes to run analyses again, so others can re-obtain the results (J. Claerbout et al., 1992). The concept was adopted and extended by several communities, where it was distinguished from replication: collecting new data to address the same question, and arriving at consistent findings (Peng et al. 2006). The Association of Computing Machinery (ACM), representing computer science and industry professionals, recently established a reproducibility initiative, adopting essentially opposite definitions. The purpose of this report is to raise awareness of the opposite definitions and propose a path to a compatible taxonomy.

More Details

TYPE SAND Report YEAR 2018

DOI OSTI

Local Failure Local Recovery (LFLR) Resiliency for Asynchronous Many Task (AMT) Programming and Execution Models

Heroux, Michael A.

Certification of successful completion of ASC CSSE Level 2 Milestone 6362: Local Failure Local Recovery (LFLR) Resiliency for Asynchronous Many Task (AMT) Programming and Execution Models

More Details

TYPE Other Report YEAR 2018

DOI OSTI