Publications Search

Demonstrate multi-turbine simulation with hybrid-structured / unstructured-moving-grid software stack running primarily on GPUs and propose improvements for successful KPP-2

Bidadi, Shreyas; Brazell, Michael; Brunhart-Lupo, Nicholas; Henry De Frahan, Marc T.; Lee, Dong H.; Hu, Jonathan J.; Melvin, Jeremy; Mullowney, Paul; Vijayakumar, Ganesh; Moser, Robert D.; Rood, Jon; Sakievich, Philip; Sharma, Ashesh; Williams, Alan B.; Sprague, Michael A.

The goal of the ExaWind project is to enable predictive simulations of wind farms composed of many megawatt-scale turbines situated in complex terrain. Predictive simulations will require computational fluid dynamics (CFD) simulations for which the mesh resolves the geometry of the turbines, captures the thin boundary layers, and captures the rotation and large deflections of blades. Whereas such simulations for a single turbine are arguably petascale class, multi-turbine wind farm simulations will require exascale-class resources.

More Details

TYPE Other Report YEAR 2022

DOI OSTI

Harnessing exascale for whole wind farm high-fidelity simulations to improve wind farm efficiency

Crozier, Paul; Adcock, Christiane; Ananthan, Shreyas; Berger-Vergiat, Luc; Brazell, Michael; Brunhart-Lupo, Nicholas; Henry De Frahan, Marc T.; Hu, Jonathan J.; Knaus, Robert C.; Melvin, Jeremy; Moser, Bob; Mullowney, Paul; Rood, Jon; Sharma, Ashesh; Thomas, Stephen; Vijayakumar, Ganesh; Williams, Alan B.; Wilson, Robert; Yamazaki, Ichitaro; Sprague, Michael A.

Abstract not provided.

More Details

TYPE Conference Presentation YEAR 2021

DOI OSTI

FY2021 Q4: Demonstrate moving-grid multi-turbine simulations primarily run on GPUs and propose improvements for successful KPP-2 [Slides]

Adcock, Christiane; Ananthan, Shreyas; Berger-Vergiat, Luc; Brazell, Michael; Brunhart-Lupo, Nicholas; Hu, Jonathan J.; Knaus, Robert C.; Melvin, Jeremy; Moser, Bob; Mullowney, Paul; Rood, Jon; Sharma, Ashesh; Thomas, Stephen; Vijayakumar, Ganesh; Williams, Alan B.; Wilson, Robert; Yamazaki, Ichitaro; Sprague, Michael

Isocontours of Q-criterion with velocity visualized in the wake for two NREL 5-MW turbines operating under uniform-inflow wind speed of 8 m/s. Simulation performed with the hybrid-Nalu-Wind/AMR-Wind solver.

More Details

TYPE Other Report YEAR 2021

DOI OSTI

Demonstrate moving-grid multi-turbine simulations primarily run on GPUs and propose improvements for successful KPP-2

Adcock, Christiane; Ananthan, Shreyas; Berger-Vergiat, Luc; Brazell, Michael; Brunhart-Lupo, Nicholas; Hu, Jonathan J.; Knaus, Robert C.; Melvin, Jeremy; Moser, Bob; Mullowney, Paul; Rood, Jon; Sharma, Ashesh; Thomas, Stephen; Vijayakumar, Ganesh; Williams, Alan B.; Wilson, Robert; Yamazaki, Ichitaro; Sprague, Michael

The goal of the ExaWind project is to enable predictive simulations of wind farms composed of many megawatt-scale turbines situated in complex terrain. Predictive simulations will require computational fluid dynamics (CFD) simulations for which the mesh resolves the geometry of the turbines, captures the thin boundary layers, and captures the rotation and large deflections of blades. Whereas such simulations for a single turbine are arguably petascale class, multi-turbine wind farm simulations will require exascale-class resources.

More Details

TYPE Other Report YEAR 2021

DOI OSTI

Recent Advances in Maxwell Solvers for PIC Simulations Across Architectures

Hu, Jonathan J.; Glusa, Christian

Abstract not provided.

More Details

TYPE Conference Presentation YEAR 2021

DOI OSTI

An Algebraic Monolithic Method for Volume-coupled Multiphysics Problems

Ohm, Peter; Tuminaro, Raymond S.; Hu, Jonathan J.; Shadid, John N.; Cyr, Eric C.; Wiesner, Tobias

Abstract not provided.

More Details

TYPE Conference Presentation YEAR 2021

DOI OSTI

ExaWind: Exascale Predictive Wind Plant Flow Physics Modeling

Sprague, Michael; Ananthan, Shreyas; Binyahib, Roba; Brazell, Michael; De Frahan, Marc H.; King, Ryan A.; Mullowney, Paul; Rood, Jon; Sharma, Ashesh; Thomas, Stephen A.; Vijayakumar, Ganesh; Crozier, Paul; Berger-Vergiat, Luc; Cheung, Lawrence; Dement, David C.; Develder, Nathaniel; Glaze, David J.; Hu, Jonathan J.; Knaus, Robert C.; Lee, Dong H.; Matula, Neil; Okusanya, Tolulope O.; Overfelt, James R.; Rajamanickam, Sivasankaran; Sakievich, Philip; Smith, Timothy A.; Vo, Johnathan; Williams, Alan B.; Yamazaki, Ichitaro; Turner, William J.; Prokopenko, Andrey; Wilson, Robert V.; Moser, Robert; Melvin, Jeremy; Sitaraman, Jay

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2021

DOI OSTI

An Algebraic Monolithic Method for Volume-coupled Multiphysics Problems

Ohm, Peter; Tuminaro, Raymond S.; Shadid, John N.; Hu, Jonathan J.; Wiesner, Tobias

Abstract not provided.

More Details

TYPE Conference Presentation YEAR 2021

DOI OSTI

Portability of Nalu-Wind using Trilinos and Kokkos Libraries

Berger-Vergiat, Luc; Hu, Jonathan J.; Glusa, Christian; Siefert, Christopher

Abstract not provided.

More Details

TYPE Conference Presentation YEAR 2021

DOI OSTI

Removal of the UVM Requirement from Tpetra: MultiVector and BlockMultiVector

Devine, Karen; Danielson, Geoffrey C.; Fuller, Timothy J.; Hu, Jonathan J.; Kelley, Brian M.; Kim, Kyungjoo; Siefert, Christopher; Smith, Timothy A.

Abstract not provided.

More Details

TYPE Presentation YEAR 2021

OSTI

A Monolithic Algebraic Multigrid Approach for Coupled Multiphysics Problems using the MueLu Framework

Ohm, Peter; Tuminaro, Raymond S.; Shadid, John N.; Hu, Jonathan J.; Wiesner, Tobias; Cyr, Eric C.

Abstract not provided.

More Details

TYPE Conference Presentation YEAR 2021

DOI OSTI

Performance Portability in Sandia Mission Applications and Kokkos Ecosystem Updates

Hu, Jonathan J.

Abstract not provided.

More Details

TYPE Conference Presentation YEAR 2021

DOI OSTI

One slide overviews of the Belos and MueLu Libraries in Trilinos

Hu, Jonathan J.; Glusa, Christian

Abstract not provided.

More Details

TYPE Presentation YEAR 2020

OSTI

State of the Tpetra Linear Solver Stack

Siefert, Christopher; Devine, Karen; Hoemmen, Mark F.; Hu, Jonathan J.; Kelley, Brian M.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2020

OSTI

Linear Solvers for Plasma Simulations on Advanced Architectures

Hu, Jonathan J.; Glusa, Christian; Moore, Stan G.; Lin, Paul; Phillips, Edward; Bettencourt, Matthew T.; Foulk, James W.; Siefert, Christopher; Rajamanickam, Sivasankaran

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2020

OSTI

MueLu's Algorithmic Performance on GPU

Berger-Vergiat, Luc; Hu, Jonathan J.; Tuminaro, Raymond S.; Siefert, Christopher; Glusa, Christian

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2020

OSTI

ExaWind: Exascale Predictive Wind Plant Flow Physics Modeling

Sprague, M.; Ananthan, S.; Brazell, M.; Glaws, A.; De Frahan, M.; King, R.; Natarajan, M.; Rood, J.; Sharma, A.; Swirydowicz, K.; Thomas, S.; Vijayakumar, G.; Yellapantula, S.; Crozier, Paul; Berger-Vergiat, Luc; Cheung, Lawrence; Glaze, David J.; Hu, Jonathan J.; Knaus, Robert C.; Lee, Dong H.; Okusanya, Tolulope O.; Overfelt, James R.; Rajamanickam, Sivasankaran; Sakievich, Philip; Smith, Timothy A.; Vo, Johnathan; Williams, Alan B.; Yamazaki, Ichitaro; Turner, J.; Prokopenko, A.; Wilson, R.; Moser, R.; Melvin, J.; Sitaraman, J.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2019

OSTI

Trilinos linear solvers group update

Berger-Vergiat, Luc; Hu, Jonathan J.; Rajamanickam, Sivasankaran; Yamazaki, Ichitaro

Abstract not provided.

More Details

TYPE Presentation YEAR 2019

OSTI

Krylov Solvers and Algebraic Multigrid

Glusa, Christian; Hu, Jonathan J.

Abstract not provided.

More Details

TYPE Presentation YEAR 2019

OSTI

Scalable Preconditioners to Improve Time to Solution for Magnetohydrodynamics Applications

Tuminaro, Raymond S.; Shadid, John N.; Cyr, Eric C.; Lin, Paul T.; Pawlowski, Roger; Phillips, Edward; Wiesner, Tobias; Adler, James; Benson, Tom; Chacon, Luis; Farrell, Patrick; Rappaport, Ari; Maclachlan, Scott; Hu, Jonathan J.; Berger-Vergiat, Luc; Glusa, Christian; Siefert, Christopher

Abstract not provided.

More Details

TYPE Presentation YEAR 2019

OSTI

MueLu: A Portable High Performance C++ Multigrid Preconditioning Library

Hu, Jonathan J.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2019

OSTI

Kokkos Kernels and Trilinos Solvers in FASTMath

Rajamanickam, Sivasankaran; Hu, Jonathan J.; Yang, Ulrike

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2019

OSTI

FASTMath: Kokkos Kernels and Linear Solvers

Rajamanickam, Sivasankaran; Bogle, Ian; Hu, Jonathan J.; Devine, Karen; Slota, George M.; Perego, Mauro; Kim, Kyungjoo

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2019

OSTI

Overview of the Trilinos packages Belos and MueLu

Hu, Jonathan J.; Glusa, Christian

Abstract not provided.

More Details

TYPE Presentation YEAR 2019

OSTI

Trilinos Multigrid Solvers

Berger-Vergiat, Luc; Hu, Jonathan J.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2019

OSTI

ExaWind

Berger-Vergiat, Luc; Rajamanickam, Sivasankaran; Hu, Jonathan J.; Luchini, Christopher B.

Abstract not provided.

More Details

TYPE Presentation YEAR 2019

OSTI

A multigrid approach to solve hypersonic flow problems

Berger-Vergiat, Luc; Tuminaro, Raymond S.; Hu, Jonathan J.; Siefert, Christopher

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2019

OSTI

Large Scale Parallel Solution Methods for Electromagnetic Simulations

Hu, Jonathan J.; Glusa, Christian; Lin, Paul T.; Phillips, Edward; Foulk, James W.; Siefert, Christopher; Rajamanickam, Sivasankaran

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2019

OSTI

Non-invasive Semi-Structured Multigrid on Advanced Architectures

Tuminaro, Raymond S.; Hu, Jonathan J.; Siefert, Christopher; Berger-Vergiat, Luc; Mayr, Matthias

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2019

OSTI

ExaWind: Exascale Predictive Wind Plant Flow Physics Modeling

Sprague, Michael; Ananthan, Shreyas; Gruchalla, Kenny; Lawson, Michael; Rood, Jon; Swirydowicz, K.; Thomas, Steve; Vijayakumar, Ganesh; Crozier, Paul; Dohrmann, Clark R.; Hu, Jonathan J.; Williams, Alan B.; Turner, John; Prokopenko, A.; Moser, Robert; Melvin, J.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2018

OSTI

Trilinos Scalable Solvers and Kokkos Kernels

Hu, Jonathan J.; Rajamanickam, Sivasankaran

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2018

OSTI

MueLu User's Guide

Berger-Vergiat, Luc; Glusa, Christian; Hu, Jonathan J.; Siefert, Christopher; Tuminaro, Raymond S.; Mayr, Matthias; Prokopenko, Andrey; Wiesner, Tobias

This is the official user guide for the MueLu multigrid library in Trilinos version 12.13 (Dev). This guide provides an overview of MueLu, its capabilities, and instructions for new users who want to start using MueLu with a minimum of effort. Detailed information is given on how to drive MueLu through its XML interface. Links to more advanced use cases are given. This guide gives information on how to achieve good parallel performance, as well as how to introduce new algorithms. Finally, readers will find a comprehensive listing of available MueLu options. Any options not documented in this manual should be considered strictly experimental.

More Details

TYPE SAND Report YEAR 2018

DOI OSTI

Non-invasive Semi-Structured Multigrid on Advanced Architectures

Tuminaro, Raymond S.; Berger-Vergiat, Luc; Hu, Jonathan J.; Siefert, Christopher; Mayr, Matthias

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2018

OSTI

Performance of fully-coupled algebraic multigrid preconditioners for large-scale VMS resistive MHD

Journal of Computational and Applied Mathematics

Lin, Paul T.; Shadid, John N.; Hu, Jonathan J.; Pawlowski, Roger; Cyr, Eric C.

This work explores the current performance and scaling of a fully-implicit stabilized unstructured finite element (FE) variational multiscale (VMS) capability for large-scale simulations of 3D incompressible resistive magnetohydrodynamics (MHD). The large-scale linear systems that are generated by a Newton nonlinear solver approach are iteratively solved by preconditioned Krylov subspace methods. The efficiency of this approach is critically dependent on the scalability and performance of the algebraic multigrid preconditioner. This study considers the performance of the numerical methods as recently implemented in the second-generation Trilinos implementation that is 64-bit compliant and is not limited by the 32-bit global identifiers of the original Epetra-based Trilinos. The study presents representative results for a Poisson problem on 1.6 million cores of an IBM Blue Gene/Q platform to demonstrate very large-scale parallel execution. Additionally, results for a more challenging steady-state MHD generator and a transient solution of a benchmark MHD turbulence calculation for the full resistive MHD system are also presented. These results are obtained on up to 131,000 cores of a Cray XC40 and one million cores of a BG/Q system.

More Details

TYPE Journal Article YEAR 2018

DOI OSTI Scopus

Trilinos Framework and Solvers

Rajamanickam, Sivasankaran; Hu, Jonathan J.; Devine, Karen; Wolf, Michael

Abstract not provided.

More Details

TYPE Presentation YEAR 2018

OSTI

Deploy threading in Nalu solver stack

Prokopenko, Andrey; Thomas, Stephen; Swirydowicz, Kasia; Ananthan, Shreyas; Hu, Jonathan J.; Williams, Alan B.; Sprague, Michael

The goal of the ExaWind project is to enable predictive simulations of wind farms composed of many MW-scale turbines situated in complex terrain. Predictive simulations will require computational fluid dynamics (CFD) simulations for which the mesh resolves the geometry of the turbines, and captures the rotation and large deflections of blades. Whereas such simulations for a single turbine are arguably petascale class, multi-turbine wind farm simulations will require exascale-class resources. The primary code in the ExaWind project is Nalu, which is an unstructured-grid solver for the acoustically-incompressible Navier-Stokes equations, and mass continuity is maintained through pressure projection. The model consists of the mass-continuity Poisson-type equation for pressure and a momentum equation for the velocity. For such modeling approaches, simulation times are dominated by linear-system setup and solution for the continuity and momentum systems. For the ExaWind challenge problem, the moving meshes greatly affect overall solver costs as re-initialization of matrices and re-computation of preconditioners is required at every time step. In this Milestone, we examine the effect of threading on the solver stack performance against flat-MPI results obtained from previous milestones using Haswell performance data from full-turbine simulations. Whereas the momentum equations are solved only with the Trilinos solvers, we investigate two algebraic-multigrid preconditioners for the continuity equations: Trilinos/MueLu and HYPRE/BoomerAMG. These two packages embody smoothed-aggregation and classical Ruge-Stüben AMG methods, respectively. In our FY18 Q2 report, we described our efforts to improve setup and solve of the continuity equations under flat-MPI parallelism. While significant improvement was demonstrated in the solve phase, setup times remained larger than expected.
Starting with the optimized settings described in the Q2 report, we explore here simulation performance where OpenMP threading is employed in the solver stack. For Trilinos, threading is achieved through the Kokkos abstraction, whereas HYPRE/BoomerAMG employs straight OpenMP. We examined results for our mid-resolution baseline turbine simulation configuration (229M DOF). Simulations on 2048 Haswell cores explored the effect of decreasing the number of MPI ranks while increasing the number of threads. Both HYPRE and Trilinos exhibited similar overall solution times, and both showed dramatic increases in simulation time in the shift from MPI ranks to OpenMP threads. This increase is attributed to the large amount of work per MPI rank starting at the single-thread configuration. Decreasing MPI ranks, while increasing threads, may be increasing simulation time due to thread synchronization and start-up overhead contributing to the latency and serial time in the model. These results showed that an MPI+OpenMP parallel decomposition will be more effective as the amount of computation per MPI rank decreases and the communication latency increases. This idea was demonstrated in a strong scaling study of our low-resolution baseline model (29M DOF) with the Trilinos-HYPRE configuration. While MPI-only results showed scaling improvement out to about 1536 cores, engaging threading carried scaling improvements out to 4128 cores (roughly 7000 DOF per core). This is an important result as improved strong scaling is needed for simulations to be executed over sufficiently long simulated durations (i.e., for many timesteps). In addition to the threading work described above, the team examined solver-performance improvements by exploring communication overhead in the HYPRE-GMRES implementation through a communication-optimal GMRES algorithm (CO-GMRES), and offloading compute-intensive solver actions to GPUs.
To those ends, a HYPRE mini-app was developed to allow us to easily test different solver approaches and HYPRE parameter settings without running the entire Nalu code. With GPU acceleration on the Summitdev supercomputer, a 20x speedup was achieved for the overall preconditioner and solver execution time for the mini-app. A study on Haswell processors showed that CO-GMRES provides benefits as one increases MPI ranks.

More Details

TYPE Other Report YEAR 2018

DOI OSTI

ASC ATDM Level 2 Milestone #6358: Assess Status of Next Generation Components and Physics Models in EMPIRE

Bettencourt, Matthew T.; Kramer, Richard M.J.; Cartwright, Keith; Phillips, Edward; Ober, Curtis C.; Pawlowski, Roger; Swan, Matthew S.; Tezaur, Irina K.; Phipps, Eric T.; Conde, Sidafa; Cyr, Eric C.; Ulmer, Craig; Kordenbrock, Todd; Levy, Scott L.N.; Templet, Gary J.; Hu, Jonathan J.; Lin, Paul T.; Glusa, Christian; Siefert, Christopher; Glass, Micheal W.

This report documents the outcome from the ASC ATDM Level 2 Milestone 6358: Assess Status of Next Generation Components and Physics Models in EMPIRE. This Milestone is an assessment of the EMPIRE (ElectroMagnetic Plasma In Realistic Environments) application and three software components. The assessment focuses on the electromagnetic and electrostatic particle-in-cell solutions for EMPIRE and its associated solver, time integration, and checkpoint-restart components. This information provides a clear understanding of the current status of the EMPIRE application and will help to guide future work in FY19 in order to ready the application for the ASC ATDM L1 Milestone in FY20. It is clear from this assessment that performance of the linear solver will have to be a focus in FY19.

More Details

TYPE SAND Report YEAR 2018

DOI OSTI

Deploy threading in Nalu solver stack

Prokopenko, Andrey; Thomas, Stephen; Swirydowicz, Kasia; Ananthan, Shreyas; Hu, Jonathan J.; Williams, Alan B.; Sprague, Michael

The goal of the ExaWind project is to enable predictive simulations of wind farms composed of many MW-scale turbines situated in complex terrain. Predictive simulations will require computational fluid dynamics (CFD) simulations for which the mesh resolves the geometry of the turbines, and captures the rotation and large deflections of blades. Whereas such simulations for a single turbine are arguably petascale class, multi-turbine wind farm simulations will require exascale-class resources. The primary code in the ExaWind project is Nalu, which is an unstructured-grid solver for the acoustically-incompressible Navier-Stokes equations, and mass continuity is maintained through pressure projection. The model consists of the mass-continuity Poisson-type equation for pressure and a momentum equation for the velocity. For such modeling approaches, simulation times are dominated by linear-system setup and solution for the continuity and momentum systems. For the ExaWind challenge problem, the moving meshes greatly affect overall solver costs as re-initialization of matrices and re-computation of preconditioners is required at every time step.

More Details

TYPE Other Report YEAR 2018

DOI OSTI

On node parallel aggregation for AMG

Berger-Vergiat, Luc; Hu, Jonathan J.; Siefert, Christopher

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2018

OSTI

KokkosKernels Overview

Rajamanickam, Sivasankaran; Deveci, Mehmet; Kim, Kyungjoo; Ellingwood, Nathan D.; Trott, Christian R.; Hu, Jonathan J.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2018

OSTI

Decrease time-to-solution through improved linear-system setup and solve (Milestone Report)

Hu, Jonathan J.; Thomas, Stephen; Dohrmann, Clark R.; Ananthan, Shreyas; Domino, Stefan P.; Williams, Alan B.; Sprague, Michael

The goal of the ExaWind project is to enable predictive simulations of wind farms composed of many MW-scale turbines situated in complex terrain. Predictive simulations will require computational fluid dynamics (CFD) simulations for which the mesh resolves the geometry of the turbines, and captures the rotation and large deflections of blades. Whereas such simulations for a single turbine are arguably petascale class, multi-turbine wind farm simulations will require exascale-class resources. We describe in this report our efforts to decrease the setup and solution time for the mass-continuity Poisson system with respect to the benchmark timing results reported in FY18 Q1. In particular, we investigate improving and evaluating two types of algebraic multigrid (AMG) preconditioners: Classical Ruge-Stüben AMG (C-AMG) and smoothed-aggregation AMG (SA-AMG), which are implemented in the Hypre and Trilinos/MueLu software stacks, respectively.

More Details

TYPE Other Report YEAR 2018

DOI OSTI

Multigrid Solvers in Trilinos

Hu, Jonathan J.

Abstract not provided.

More Details

TYPE Presentation YEAR 2018

OSTI

Solvers for Hypersonic Flow Applications

Hu, Jonathan J.; Berger-Vergiat, Luc; Tuminaro, Raymond S.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2018

OSTI

High-fidelity simulation of wind-turbine incompressible flows with classical and aggregation AMG preconditioners

Thomas, Stephen; Ananthan, Shreyas; Yellapantula, Shashank; Hu, Jonathan J.; Sprague, Michael A.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2018

OSTI

Decrease time-to-solution through improved linear-system setup and solve

Hu, Jonathan J.; Thomas, Stephen; Dohrmann, Clark R.; Ananthan, Shreyas; Domino, Stefan P.; Williams, Alan B.; Sprague, Michael

The goal of the ExaWind project is to enable predictive simulations of wind farms composed of many MW-scale turbines situated in complex terrain. Predictive simulations will require computational fluid dynamics (CFD) simulations for which the mesh resolves the geometry of the turbines, and captures the rotation and large deflections of blades. Whereas such simulations for a single turbine are arguably petascale class, multi-turbine wind farm simulations will require exascale-class resources.

More Details

TYPE Other Report YEAR 2018

DOI OSTI

Trilinos Performance on Knights Landing

Foulk, James W.; Siefert, Christopher; Hu, Jonathan J.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2018

OSTI

Leveraging Problem Structure within Multilevel Solvers to Improve Robustness & Performance on Advanced Architectures

Tuminaro, Raymond S.; Hu, Jonathan J.; Siefert, Christopher; Berger-Vergiat, Luc; Mayr, M.

Abstract not provided.

More Details

TYPE Presentation YEAR 2018

OSTI

Ensemble Grouping Strategies for Embedded Stochastic Collocation Methods Applied to Anisotropic Diffusion Problems

SIAM/ASA Journal on Uncertainty Quantification

D'Elia, Marta; Phipps, Eric T.; Edwards, Harold C.; Hu, Jonathan J.; Rajamanickam, Sivasankaran

Previous work has demonstrated that propagating groups of samples, called ensembles, together through forward simulations can dramatically reduce the aggregate cost of sampling-based uncertainty propagation methods [E. Phipps, M. D'Elia, H. C. Edwards, M. Hoemmen, J. Hu, and S. Rajamanickam, SIAM J. Sci. Comput., 39 (2017), pp. C162-C193]. However, critical to the success of this approach when applied to challenging problems of scientific interest is the grouping of samples into ensembles to minimize the total computational work. For example, the total number of linear solver iterations for ensemble systems may be strongly influenced by which samples form the ensemble when applying iterative linear solvers to parameterized and stochastic linear systems. In this paper we explore sample grouping strategies for local adaptive stochastic collocation methods applied to PDEs with uncertain input data, in particular canonical anisotropic diffusion problems where the diffusion coefficient is modeled by truncated Karhunen-Loève expansions. Finally, we demonstrate that a measure of the total anisotropy of the diffusion coefficient is a good surrogate for the number of linear solver iterations for each sample and therefore provides a simple and effective metric for grouping samples.

More Details

TYPE Journal Article YEAR 2018

DOI OSTI

ECP ALCC Quarterly Report (Oct-Dec 2017)

Hu, Jonathan J.

The scientific goal of the ExaWind Exascale Computing Project (ECP) is to advance our fundamental understanding of the flow physics governing whole wind plant performance, including wake formation, complex terrain impacts, and turbine-turbine-interaction effects. Current methods for modeling wind plant performance fall short due to insufficient model fidelity and inadequate treatment of key phenomena, combined with a lack of computational power necessary to address the wide range of relevant length scales associated with wind plants. Thus, our ten-year exascale challenge is the predictive simulation of a wind plant composed of O(100) multi-MW wind turbines sited within a 100 km² area with complex terrain, involving simulations with O(100) billion grid points. The project plan builds progressively from predictive petascale simulations of a single turbine, where the detailed blade geometry is resolved, meshes rotate and deform with blade motions, and atmospheric turbulence is realistically modeled, to a multi-turbine array in complex terrain. The ALCC allocation will be used continually throughout the allocation period. In the first half of the allocation period, small (e.g., for testing Kokkos algorithms) and medium (e.g., 10K cores for highly resolved ABL simulations) sized jobs will be typical. In the second half of the allocation period, we will also have a number of large submittals for our resolved-turbine simulations. A challenge in the latter period is that small time step sizes will require long wall-clock times for statistically meaningful solutions. As such, we expect our allocation-hour burn rate to increase as we move through the allocation period.

More Details

TYPE Other Report YEAR 2018

DOI OSTI

Towards Scalable AMG-based Preconditioners for MHD and Multifluid Plasma Simulations

Lin, Paul T.; Shadid, John N.; Phillips, Edward; Hu, Jonathan J.; Cyr, Eric C.; Pawlowski, Roger

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2017

OSTI

Algorithmic aspects and performance of AMG-based preconditioning for an implicit FE VMS resistive MHD model

Lin, Paul T.; Shadid, John N.; Phillips, Edward; Hu, Jonathan J.; Cyr, Eric C.; Pawlowski, Roger; Tsuji, Paul; Sondak, David

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2017

OSTI

Multigrid for multiphysics using the MueLu framework

Wiesner, Tobias A.; Hu, Jonathan J.; Shadid, John N.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2017

OSTI

Embedded ensemble propagation for improving performance, portability, and scalability of uncertainty quantification on emerging computational architectures

SIAM Journal on Scientific Computing

Phipps, Eric T.; Edwards, Harold C.; Hoemmen, Mark F.; Hu, Jonathan J.; Rajamanickam, Sivasankaran

Quantifying simulation uncertainties is a critical component of rigorous predictive simulation. A key component of this is forward propagation of uncertainties in simulation input data to output quantities of interest. Typical approaches involve repeated sampling of the simulation over the uncertain input data, and can require numerous samples when accurately propagating uncertainties from large numbers of sources. Often simulation processes from sample to sample are similar and much of the data generated from each sample evaluation could be reused. We explore a new method for implementing sampling methods that simultaneously propagates groups of samples together in an embedded fashion, which we call embedded ensemble propagation. We show how this approach takes advantage of properties of modern computer architectures to improve performance by enabling reuse between samples, reducing memory bandwidth requirements, improving memory access patterns, improving opportunities for fine-grained parallelization, and reducing communication costs. We describe a software technique for implementing embedded ensemble propagation based on the use of C++ templates and describe its integration with various scientific computing libraries within Trilinos. We demonstrate improved performance, portability and scalability for the approach applied to the simulation of partial differential equations on a variety of CPU, GPU, and accelerator architectures, including up to 131,072 cores on a Cray XK7 (Titan).

More Details

TYPE Journal Article YEAR 2017

DOI OSTI

Towards a More Algebraic hp-Multigrid

Siefert, Christopher; Hu, Jonathan J.; Roberts, Nathan V.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2017

OSTI

Enabling Low Mach Fluid Simulations Using Trilinos

Hu, Jonathan J.; Devine, Karen; Hoemmen, Mark F.; Lin, Paul T.; Rajamanickam, Sivasankaran; Roberts, Nathan V.; Siefert, Christopher; Trott, Christian R.; Prokopenko, Andrey

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2017

OSTI

MueLu - a multigrid framework for multiphysics preconditioners

Wiesner, Tobias A.; Shadid, John N.; Cyr, Eric C.; Hu, Jonathan J.; Tuminaro, Raymond S.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2017

OSTI

Modernization and optimization of a legacy open-source CFD code for high-performance computing architectures

International Journal of Computational Fluid Dynamics

Gel, Aytekin; Hu, Jonathan J.; El Ould-Ahmed-Vall, Moustapha; Kalinkin, Alexander A.

Legacy codes remain a crucial element of today's simulation-based engineering ecosystem due to the extensive validation process and investment in such software. The rapid evolution of high-performance computing architectures necessitates the modernization of these codes. One approach to modernization is a complete overhaul of the code. However, this could require extensive investments, such as rewriting in modern languages, new data constructs, etc., which will necessitate systematic verification and validation to re-establish the credibility of the computational models. The current study advocates using a more incremental approach and is a culmination of several modernization efforts of the legacy code MFIX, which is an open-source computational fluid dynamics code that has evolved over several decades, is widely used in multiphase flows, and is still being developed by the National Energy Technology Laboratory. Two different modernization approaches, ‘bottom-up’ and ‘top-down’, are illustrated. Preliminary results show up to 8.5x improvement at the selected kernel level with the first approach, and up to 50% improvement in total simulated time with the latter, for the demonstration cases and target HPC systems employed.

More Details

TYPE Journal Article YEAR 2017

DOI OSTI Scopus

Multiphysics Preconditioning with the MueLu Multigrid Library

Wiesner, Tobias A.; Tuminaro, Raymond S.; Cyr, Eric C.; Shadid, John N.; Hu, Jonathan J.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2017

OSTI

Performance of Scalable AMG-based Preconditioners for MHD and Multifluid Plasma Simulations

Lin, Paul T.; Shadid, John N.; Phillips, Edward; Hu, Jonathan J.; Cyr, Eric C.; Pawlowski, Roger; Prokopenko, Andrey; Tsuji, Paul

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2017

OSTI

Trilinos NGP Planning

Rajamanickam, Sivasankaran; Devine, Karen; Hu, Jonathan J.; Hoemmen, Mark F.

Abstract not provided.

More Details

TYPE Presentation YEAR 2016

OSTI

Toward Robust Scalable Algebraic Multigrid Solvers

Tuminaro, Raymond S.; Cyr, Eric C.; Shadid, John N.; Noble, David R.; Berger-Vergiat, Luc; Hu, Jonathan J.; Prokopenko, Andrey; Siefert, Christopher; Wiesner, Tobias A.; Perego, Mauro; Salinger, Andrew G.; Tezaur, Irina K.; Price, Stephen

Abstract not provided.

More Details

TYPE Presentation YEAR 2016

OSTI

Performance and Capability Improvements Towards Industrial Grade Open-Source DEM Simulation Framework with Integrated and Easy-To-Use Uncertainty Quantification

Hu, Jonathan J.; Ellingwood, Nathan D.; Chen, S.; Adepu, M.; Mor, O.; Simmons, R.; Gel, A.; Emady, H.; Jiao, Y.; Tong, C.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2016

OSTI

Aspects of Large-scale Fully-coupled AMG Preconditioned FE Magnetohydrodynamic Simulations Enabled Through Trilinos

Lin, Paul T.; Shadid, John N.; Hu, Jonathan J.; Pawlowski, Roger; Cyr, Eric C.; Prokopenko, Andrey V.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2016

OSTI

Multigrid Methods for Systems Arising from High Order Discretizations

Hu, Jonathan J.; Siefert, Christopher

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2016

OSTI

Multigrid framework for multiphysics problems

Wiesner, Tobias A.; Shadid, John N.; Cyr, Eric C.; Tuminaro, Raymond S.; Hu, Jonathan J.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2016

OSTI

Ifpack2 User's Guide 1.0

Prokopenko, Andrey V.; Siefert, Christopher; Hu, Jonathan J.; Hoemmen, Mark F.; Klinvex, Alicia M.

This is the definitive user manual for the Ifpack2 package in the Trilinos project. Ifpack2 provides implementations of iterative algorithms (e.g., Jacobi, SOR, additive Schwarz) and processor-based incomplete factorizations. Ifpack2 is part of the Trilinos Tpetra solver stack, is templated on index, scalar, and node types, and leverages node-level parallelism indirectly through its use of Tpetra kernels. Ifpack2 can be used to solve matrix systems with greater than 2 billion rows (using 64-bit indices). Any options not documented in this manual should be considered strictly experimental.

More Details

TYPE SAND Report YEAR 2016

DOI OSTI

Performance of smoothers for algebraic multigrid preconditioners for finite element variational multiscale incompressible magnetohydrodynamics

Lin, Paul T.; Shadid, John N.; Hu, Jonathan J.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2016

OSTI

Ensemble Grouping strategies for embedded Stochastic Collocation methods applied to anisotropic diffusion problems

Edwards, Harold C.; Hu, Jonathan J.; Phipps, Eric T.; Rajamanickam, Sivasankaran

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2016

OSTI

Speeding up multi-physics simulations through reuse of multigrid components

Prokopenko, Andrey V.; Hu, Jonathan J.

Abstract not provided.

More Details

TYPE Presentation YEAR 2016

OSTI

Embedded Ensemble Propagation for Improving Performance Portability and Scalability of Uncertainty Quantification on Emerging Computational Architectures

Phipps, Eric T.; Edwards, Harold C.; Hoemmen, Mark F.; Hu, Jonathan J.; Rajamanickam, Sivasankaran

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2016

OSTI

Speeding up multi-physics simulations through reuse of multigrid components

Prokopenko, Andrey V.; Lin, Paul T.; Shadid, John N.; Hu, Jonathan J.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2016

OSTI

Recent Progress in Trilinos Linear Solvers

Hu, Jonathan J.

Abstract not provided.

More Details

TYPE Presentation YEAR 2016

OSTI

A new MATLAB interface to MueLu

Wiesner, Tobias A.; Hu, Jonathan J.; Kelley, Brian; Siefert, Christopher

Abstract not provided.

More Details

TYPE Presentation YEAR 2015

OSTI

Linear and Eigen-Solver Capability Update

Hu, Jonathan J.

Abstract not provided.

More Details

TYPE Presentation YEAR 2015

OSTI

MueLu: Approaches to Next Generation Platforms

Hu, Jonathan J.; Prokopenko, Andrey V.

Abstract not provided.

More Details

TYPE Presentation YEAR 2015

OSTI