Publications

Results 1–50 of 144

ExaLearn – GenTen Tensor Software ECP Milestone

Kolla, Hemanth K.; Phipps, Eric T.; Wolf, Michael W.

The objective of this milestone was to finish integrating GenTen tensor software with combustion application Pele using the Ascent in situ analysis software, partnering with the ALPINE and Pele teams. Also, to demonstrate the usage of the tensor analysis as part of a combustion simulation.

More Details

TYPE Other Report YEAR 2022

OSTI DOI

Streaming Generalized Canonical Polyadic Tensor Decompositions

Phipps, Eric T.; Johnson, Nick J.; Kolda, Tamara K.

In this paper, we develop a method which we call OnlineGCP for computing the Generalized Canonical Polyadic (GCP) tensor decomposition of streaming data. GCP differs from traditional canonical polyadic (CP) tensor decompositions as it allows for arbitrary objective functions which the CP model attempts to minimize. This approach can provide better fits and more interpretable models when the observed tensor data is strongly non-Gaussian. In the streaming case, tensor data is gradually observed over time and the algorithm must incrementally update a GCP factorization with limited access to prior data. In this work, we extend the GCP formalism to the streaming context by deriving a GCP optimization problem to be solved as new tensor data is observed, formulate a tunable history term to balance reconstruction of recently observed data with data observed in the past, develop a scalable solution strategy based on segregated solves using stochastic gradient descent methods, describe a software implementation that provides performance and portability to contemporary CPU and GPU architectures and integrates with Matlab for enhanced usability, and demonstrate the utility and performance of the approach and software on several synthetic and real tensor data sets.

More Details

TYPE Other Report YEAR 2021

OSTI DOI

Majorize-Minimize Algorithms for Streaming Generalized Canonical Polyadic Tensor Decompositions

Gilman, Kyle G.; Phipps, Eric T.

Abstract not provided.

More Details

TYPE Presentation YEAR 2021

OSTI

Low-Communication Asynchronous Distributed Generalized Canonical Polyadic Tensor Decomposition

Lewis, Cannada L.; Phipps, Eric T.

Abstract not provided.

More Details

TYPE Conference Presenation YEAR 2021

OSTI DOI

Split Bregman optimizer for online generalized CP tensor decomposition

Gilman, Kyle G.; Phipps, Eric T.; Kolla, Hemanth K.

Abstract not provided.

More Details

TYPE Presentation YEAR 2021

OSTI

Distributed Generalized Canonical Polyadic Decomposition

Lewis, Cannada L.; Phipps, Eric T.; Kolda, Tamara G.

Abstract not provided.

More Details

TYPE Conference Presenation YEAR 2021

OSTI DOI

Generalized Canonical Polyadic Tensor Decompositions for Streaming Data

Phipps, Eric T.; Johnson, Nicholas T.; Kolda, Tamara G.

Abstract not provided.

More Details

TYPE Conference Presenation YEAR 2021

OSTI DOI

Low-Communication Asynchronous Distributed Generalized Canonical Polyadic Tensor Decomposition

2021 IEEE High Performance Extreme Computing Conference, HPEC 2021

Lewis, Cannada L.; Phipps, Eric T.

In this work, we show that reduced communication algorithms for distributed stochastic gradient descent improve the time per epoch and strong scaling for the Generalized Canonical Polyadic (GCP) tensor decomposition, but with a cost, achieving convergence becomes more difficult. The implementation, based on MPI, shows that while one-sided algorithms offer a path to asynchronous execution, the performance benefits of optimized allreduce are difficult to best.

More Details

TYPE Conference Paper YEAR 2021

Scopus OSTI DOI

Finite Element Tools for Performance Portability of Implicit and IMEX Simulations on Next Generation Architectures

Pawlowski, Roger P.; Phipps, Eric T.; Trott, Christian R.; Cyr, Eric C.; Shadid, John N.

Abstract not provided.

More Details

TYPE Conference Presenation YEAR 2020

OSTI DOI

Automatic Differentiation of C++ Codes with Sacado

Phipps, Eric T.

Abstract not provided.

More Details

TYPE Presentation YEAR 2020

OSTI

Automatic Differentiation of C++ Codes on Emerging Manycore Architectures with Sacado

Phipps, Eric T.; Pawlowski, Roger P.; Trott, Christian R.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2020

OSTI

Streaming Tensors

Johnson, Nicholas T.; Kolda, Tamara G.; Phipps, Eric T.

Abstract not provided.

More Details

TYPE Presentation YEAR 2020

OSTI

Computing Generalized CP Decompositions on Emerging Parallel Architectures

Phipps, Eric T.; Kolda, Tamara G.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2020

OSTI

A Portable SIMD Primitive using Kokkos for Heterogeneous Architectures

Sahasrabudhe, Damodar S.; Phipps, Eric T.; Rajamanickam, Sivasankaran R.; Berzins, Martin B.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2020

OSTI

Hierarchic Parallelism Memory Management and Helpful Debugging Tools: Updates on Trilinos Discretization Packages

Pawlowski, Roger P.; Phipps, Eric T.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2020

OSTI

A Portable SIMD Primitive Using Kokkos for Heterogeneous Architectures

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

Sahasrabudhe, Damodar; Phipps, Eric T.; Rajamanickam, Sivasankaran R.; Berzins, Martin

As computer architectures are rapidly evolving (e.g. those designed for exascale), multiple portability frameworks have been developed to avoid new architecture-specific development and tuning. However, portability frameworks depend on compilers for auto-vectorization and may lack support for explicit vectorization on heterogeneous platforms. Alternatively, programmers can use intrinsics-based primitives to achieve more efficient vectorization, but the lack of a gpu back-end for these primitives makes such code non-portable. A unified, portable, Single Instruction Multiple Data (simd) primitive proposed in this work, allows intrinsics-based vectorization on cpus and many-core architectures such as Intel Knights Landing (knl), and also facilitates Single Instruction Multiple Threads (simt) based execution on gpus. This unified primitive, coupled with the Kokkos portability ecosystem, makes it possible to develop explicitly vectorized code, which is portable across heterogeneous platforms. The new simd primitive is used on different architectures to test the performance boost against hard-to-auto-vectorize baseline, to measure the overhead against efficiently vectroized baseline, and to evaluate the new feature called the “logical vector length” (lvl). The simd primitive provides portability across cpus and gpus without any performance degradation being observed experimentally.

More Details

TYPE Conference Poster YEAR 2020

Scopus OSTI

Surrogate-based ensemble grouping strategies for embedded sampling-based uncertainty quantification

Lecture Notes in Computational Science and Engineering

D'Elia, Marta D.; Phipps, Eric T.; Rushdi, A.; Ebeida, Mohamed S.

The embedded ensemble propagation approach introduced in Phipps et al. (SIAM J. Sci. Comput. 39(2):C162, 2017) has been demonstrated to be a powerful means of reducing the computational cost of sampling-based uncertainty quantification methods, particularly on emerging computational architectures. A substantial challenge with this method however is ensemble-divergence, whereby different samples within an ensemble choose different code paths. This can reduce the effectiveness of the method and increase computational cost. Therefore grouping samples together to minimize this divergence is paramount in making the method effective for challenging computational simulations. In this work, a new grouping approach based on a surrogate for computational cost built up during the uncertainty propagation is developed and applied to model advection-diffusion problems where computational cost is driven by the number of (preconditioned) linear solver iterations. The approach is developed within the context of locally adaptive stochastic collocation methods, where a surrogate for the number of linear solver iterations, generated from previous levels of the adaptive grid generation, is used to predict iterations for subsequent samples, and group them based on similar numbers of iterations. The effectiveness of the method is demonstrated by applying it to highly anisotropic advection-dominated diffusion problems with a wide variation in solver iterations from sample to sample. It extends the parameter-based grouping approach developed in D’Elia et al. (SIAM/ASA J. Uncertain. Quantif. 6:87, 2017) to more general problems without requiring detailed knowledge of how the uncertain parameters affect the simulation’s cost, and is also less intrusive to the simulation code.

More Details

TYPE Journal Article YEAR 2020

Scopus OSTI

High performance tensor decomposition on emerging manycore architectures

Phipps, Eric T.; Kolda, Tamara G.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2019

OSTI

Rigorous Data Fusion for Computationally Expensive Simulations

Winovich, Nickolas W.; Rushdi, Ahmad R.; Phipps, Eric T.; Ray, Jaideep R.; Lin, Guang L.; Ebeida, Mohamed S.

This manuscript comprises the final report for the 1-year, FY19 LDRD project "Rigorous Data Fusion for Computationally Expensive Simulations," wherein an alternative approach to Bayesian calibration was developed based a new sampling technique called VoroSpokes. Vorospokes is a novel quadrature and sampling framework defined with respect to Voronoi tessellations of bounded domains in R d developed within this project. In this work, we first establish local quadrature and sampling results on convex polytopes using randomly directed rays, or spokes, to approximate the quantities of interest for a specified target function. A theoretical justification for both procedures is provided along with empirical results demonstrating the unbiased convergence in the resulting estimates/samples. The local quadrature and sampling procedures are then extended to global procedures defined on more general domains by applying the local results to the cells of a Voronoi tessellation covering the domain in consideration. We then demonstrate how the proposed global sampling procedure can be used to define a natural framework for adaptively constructing Voronoi Piecewise Surrogate (VPS) approximations based on local error estimates. Finally, we show that the adaptive VPS procedure can be used to form a surrogate model approximation to a specified, potentially unnormalized, density function, and that the global sampling procedure can be used to efficiently draw independent samples from the surrogate density in parallel. The performance of the resulting VoroSpokes sampling framework is assessed on a collection of Bayesian inference problems and is shown to provide highly accurate posterior predictions which align with the results obtained using traditional methods such as Gibbs sampling and random-walk Markov Chain Monte Carlo (MCMC). Importantly, the proposed framework provides a foundation for performing Bayesian inference tasks which is entirely independent from the theory of Markov chains.

More Details

TYPE SAND Report YEAR 2019

OSTI DOI

Embedded UQ on Emerging Computing Architectures

Phipps, Eric T.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2019

OSTI

Parallel Sparse Tensor Decomposition with the Trilinos Parallel Linear Algebra Framework

Devine, Karen D.; Kolda, Tamara G.; Phipps, Eric T.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2019

OSTI

Software for sparse tensor decomposition on emerging computing architectures

SIAM Journal on Scientific Computing

Phipps, Eric T.; Kolda, Tamara G.

In this paper, we develop software for decomposing sparse tensors that is portable to and performant on a variety of multicore, manycore, and GPU computing architectures. The result is a single code whose performance matches optimized architecture-specific implementations. The key to a portable approach is to determine multiple levels of parallelism that can be mapped in different ways to different architectures, and we explain how to do this for the matricized tensor times Khatri-Rao product (MTTKRP), which is the key kernel in canonical polyadic tensor decomposition. Our implementation leverages the Kokkos framework, which enables a single code to achieve high performance across multiple architectures that differ in how they approach fine-grained parallelism. We also introduce a new construct for portable thread-local arrays, which we call compile-time polymorphic arrays. Not only are the specifics of our approaches and implementation interesting for tuning tensor computations, but they also provide a roadmap for developing other portable high-performance codes. As a last step in optimizing performance, we modify the MTTKRP algorithm itself to do a permuted traversal of tensor nonzeros to reduce atomic-write contention. We test the performance of our implementation on 16- and 68-core Intel CPUs and the K80 and P100 NVIDIA GPUs, showing that we are competitive with state-of-the-art architecture-specific codes while having the advantage of being able to run on a variety of architectures.

More Details

TYPE Journal Article YEAR 2019

Scopus OSTI DOI

Hierarchical Parallelism for Performant Embedded Automatic Differentiation on GPUs

Phipps, Eric T.; Pawlowski, Roger P.; Bettencourt, Matthew T.; Cyr, Eric C.; Roberts, Nathan V.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2019

OSTI

Validation Assessment of Hypersonic Double-Cone Flow Simulations using UQ Sensitivity Analysis and Validation Metrics

Kieweg, Sarah K.; Ray, Jaideep R.; Weirs, V.G.; Carnes, Brian C.; Dinzl, Derek J.; Freno, Brian A.; Howard, Micah A.; Phipps, Eric T.; Rider, William J.; Smith, Thomas M.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2019

OSTI

Validation Assessment of Hypersonic Double-Cone Flow Simulations using Uncertainty Quantification Sensitivity Analysis and Validation Metrics

Kieweg, Sarah K.; Ray, Jaideep R.; Weirs, Vincent G.; Carnes, Brian C.; Dinzl, Derek J.; Freno, Brian A.; Howard, Micah A.; Phipps, Eric T.; Rider, William J.; Smith, Thomas M.; Nompelis, Ioannis N.; Candler, Graham C.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2018

OSTI DOI

A Performance Portable SIMD Scalar Type for Effective Vectorization Across Heterogeneous Architectures

Sahasrabudhe, Damodar S.; Phipps, Eric T.; Rajamanickam, Sivasankaran R.; Berzins, Martin B.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2018

OSTI

ExaLearn Application Interview

Rajamanickam, Sivasankaran R.; Wolf, Michael W.; Phipps, Eric T.; Ebeida, Mohamed S.; Debusschere, Bert J.

Abstract not provided.

More Details

TYPE Presentation YEAR 2018

OSTI

Tempus Time-Integration Package & Applications

Ober, Curtis C.; Pawlowski, Roger P.; Conde, Sidafa C.; Kalashnikova, Irina; Phipps, Eric T.; Phillips, Edward G.; Mota, Alejandro M.; Phlipot, Greg [.; Ridzal, Denis R.; Hansen, Michael A.; Fisher, Travis C.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2018

OSTI

ASC ATDM Level 2 Milestone #6358: Assess Status of Next Generation Components and Physics Models in EMPIRE

Bettencourt, Matthew T.; Kramer, Richard M.; Cartwright, Keith C.; Phillips, Edward G.; Ober, Curtis C.; Pawlowski, Roger P.; Swan, Matthew S.; Kalashnikova, Irina; Phipps, Eric T.; Conde, Sidafa C.; Cyr, Eric C.; Ulmer, Craig D.; Kordenbrock, Todd H.; Levy, Scott L.; Templet, Gary J.; Hu, Jonathan J.; Lin, Paul L.; Glusa, Christian A.; Siefert, Christopher S.; Glass, Micheal W.

This report documents the outcome from the ASC ATDM Level 2 Milestone 6358: Assess Status of Next Generation Components and Physics Models in EMPIRE. This Milestone is an assessment of the EMPIRE (ElectroMagnetic Plasma In Realistic Environments) application and three software components. The assessment focuses on the electromagnetic and electrostatic particle-in-cell solu- tions for EMPIRE and its associated solver, time integration, and checkpoint-restart components. This information provides a clear understanding of the current status of the EMPIRE application and will help to guide future work in FY19 in order to ready the application for the ASC ATDM L 1 Milestone in FY20. It is clear from this assessment that performance of the linear solver will have to be a focus in FY19.

More Details

TYPE SAND Report YEAR 2018

OSTI DOI

Developing a Hybrid Multi-fluid/PIC Plasma Capability Using Components

Pawlowski, Roger P.; Phillips, Edward G.; Shadid, John N.; Lin, Paul L.; Cyr, Eric C.; Conde, Sidafa C.; Bettencourt, Matthew T.; Phipps, Eric T.; Trott, Christian R.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2018

OSTI

Towards Towards a Hybrid Multi-fluid/PIC Plasma Capability

Pawlowski, Roger P.; Phillips, Edward G.; Shadid, John N.; Cyr, Eric C.; Conde, Sidafa C.; Bettencourt, Matthew T.; Phipps, Eric T.; Bond, Stephen D.; Trott, Christian R.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2018

OSTI

Solving classification problems using implicit Voronoi cells and local hyperplane sampling

Ebeida, Mohamed S.; Swiler, Laura P.; Phipps, Eric T.; Ray, Jaideep R.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2018

OSTI

Tensor Decompositions on Emerging Manycore Hardware with Genten and Kokkos

Phipps, Eric T.; Kolda, Tamara G.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2018

OSTI

Towards Performance Portable Assembly Tools for Multi-Fluid Plasma Simulation

Pawlowski, Roger P.; Bettencourt, Matthew T.; Cyr, Eric C.; Miller, Sean M.; Phillips, Edward G.; Phipps, Eric T.; Shadid, John N.; Trott, Christian R.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2018

OSTI

Ensemble Grouping Strategies for Embedded Stochastic Collocation Methods Applied to Anisotropic Diffusion Problems

SIAM/ASA Journal on Uncertainty Quantification

D'Elia, Marta D.; Phipps, Eric T.; Edwards, Harold C.; Hu, Jonathan J.; Rajamanickam, Sivasankaran R.

Previous work has demonstrated that propagating groups of samples, called ensembles, together through forward simulations can dramatically reduce the aggregate cost of sampling-based uncertainty propagation methods [E. Phipps, M. D'Elia, H. C. Edwards, M. Hoemmen, J. Hu, and S. Rajamanickam, SIAM J. Sci. Comput., 39 (2017), pp. C162--C193]. However, critical to the success of this approach when applied to challenging problems of scientific interest is the grouping of samples into ensembles to minimize the total computational work. For example, the total number of linear solver iterations for ensemble systems may be strongly influenced by which samples form the ensemble when applying iterative linear solvers to parameterized and stochastic linear systems. In this paper we explore sample grouping strategies for local adaptive stochastic collocation methods applied to PDEs with uncertain input data, in particular canonical anisotropic diffusion problems where the diffusion coefficient is modeled by truncated Karhunen--Loève expansions. Finally, we demonstrate that a measure of the total anisotropy of the diffusion coefficient is a good surrogate for the number of linear solver iterations for each sample and therefore provides a simple and effective metric for grouping samples.

More Details

TYPE Journal Article YEAR 2018

OSTI DOI

Ensemble grouping strategies for embedded stochastic collocation methods applied to anisotropic diffusion problems

SIAM-ASA Journal on Uncertainty Quantification

D'Elia, Marta D.; Edwards, Harold C.; Hu, J.; Phipps, Eric T.; Rajamanickam, Sivasankaran R.

Previous work has demonstrated that propagating groups of samples, called ensembles, together through forward simulations can dramatically reduce the aggregate cost of sampling-based uncertainty propagation methods [E. Phipps, M. D'Elia, H. C. Edwards, M. Hoemmen, J. Hu, and S. Rajamanickam, SIAM J. Sci. Comput., 39 (2017), pp. C162-C193]. However, critical to the success of this approach when applied to challenging problems of scientific interest is the grouping of samples into ensembles to minimize the total computational work. For example, the total number of linear solver iterations for ensemble systems may be strongly influenced by which samples form the ensemble when applying iterative linear solvers to parameterized and stochastic linear systems. In this work we explore sample grouping strategies for local adaptive stochastic collocation methods applied to PDEs with uncertain input data, in particular canonical anisotropic diffusion problems where the diffusion coefficient is modeled by truncated Karhunen-Loève expansions. We demonstrate that a measure of the total anisotropy of the diffusion coefficient is a good surrogate for the number of linear solver iterations for each sample and therefore provides a simple and effective metric for grouping samples.

More Details

TYPE Conference Poster YEAR 2018

Scopus OSTI

SNL ATDM Math Libraries ? Data Propagation Components

Phipps, Eric T.; Turner, Daniel Z.; Maupin, Kathryn A.; Ray, Jaideep R.; Ridzal, Denis R.

Abstract not provided.

More Details

TYPE Presentation YEAR 2018

OSTI

SIMD Scalar Types for Outer-loop Vectorization

Phipps, Eric T.; Kim, Kyungjoo K.; Rajamanickam, Sivasankaran R.; Tupek, Michael R.

Abstract not provided.

More Details

TYPE Presentation YEAR 2017

OSTI

Parallel Tensor Decompositions for Massive Heterogeneous Incomplete Data

Phipps, Eric T.; Kolda, Tamara G.; Anderson-Bergman, Clifford I.; Devine, Karen D.; Dunlavy, Daniel D.; Hong, David H.; Vuduc, Richard V.; Li, Jaijai L.; Young, Jeff Y.; Ballard, Grey B.

Abstract not provided.

More Details

TYPE Presentation YEAR 2017

OSTI

Improving Sampling-based Uncertainty Quantification Performance Through Embedded Ensemble Propagation

Phipps, Eric T.; D'Elia, Marta D.; Ebeida, Mohamed S.; Rushdi, Ahmad A.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2017

OSTI

Tackling UQ in DARMA a Programming Model for Task-Based Execution at Extreme-Scale

Rizzi, Francesco N.; Phipps, Eric T.; Hollman, David S.; Lifflander, Jonathan; Wilke, Jeremiah J.; Markosyan, Aram H.; Kolla, Hemanth K.; Slattengren, Nicole S.; Teranishi, Keita T.; Stewart, James R.; Clay, Robert L.; Bennett, Janine C.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2017

OSTI

Embedded uncertainty quantification methods via stokhos

Handbook of Uncertainty Quantification

Phipps, Eric T.; Salinger, Andrew G.

Stokhos (Phipps, Stokhos embedded uncertainty quantification methods. http://trilinos.org/packages/stokhos/, 2015) is a package within Trilinos (Heroux et al., ACM Trans Math Softw 31(3), 2005; Michael et al., Sci Program 20(2):83-88, 2012) that enables embedded or intrusive uncertainty quantification capabilities to C++ codes. It provides tools for implementing stochastic Galerkin methods and embedded sample propagation through the use of template-based generic programming (Pawlowski et al., Sci Program 20:197-219, 2012; Roger et al., Sci Program 20:327-345, 2012) which allows deterministic simulation codes to be easily modified for embedded uncertainty quantification. It provides tools for forming and solving the resulting linear and nonlinear equations these methods generate, leveraging the large-scale linear and nonlinear solver capabilities provided by Trilinos. Furthermore, Stokhos is integrated with the emerging many-core architecture capabilities provided by the Kokkos (Edwards et al., Sci Program 20(2):89-114, 2012; Edwards et al., J Parallel Distrib Comput 74(12):3202-3216, 2014) and Tpetra packages (Baker and Heroux, Sci Program 20(2):115-128, 2012; Hoemmen et al., Tpetra: next-generation distributed linear algebra. http://trilinos.org/packages/tpetra, 2015) within Trilinos, allowing these embedded uncertainty quantification capabilities to be applied in both shared and distributed memory parallel computational environments. Finally, the Stokhos tools have been incorporated into the Albany simulation code (Pawlowski et al., Sci Program 20:327-345, 2012; Salinger et al., Albany multiphysics simulation code. https://github.com/gahansen/Albany, 2015) enabling embedded uncertainty quantification of a wide variety of large-scale PDE-based simulations.

More Details

TYPE Journal Article YEAR 2017

Scopus OSTI

A Novel Voronoi Piecewise Approach to Solve Stochastic PDEs

Ebeida, Mohamed S.; Phipps, Eric T.; D'Elia, Marta D.; Rushdi, Ahmad A.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2017

OSTI

Intrusive UQ Algorithms for Emerging Computing Platforms

Safta, Cosmin S.; Phipps, Eric T.; Najm, H.N.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2017

OSTI

Ensemble Grouping strategies for embedded Stochastic Collocation

D'Elia, Marta D.; Ebeida, Mohamed S.; Phipps, Eric T.; Rushdi, Ahmad A.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2017

OSTI

Computing Derivatives for UQ on Emerging Manycore Architectures

Phipps, Eric T.; Edwards, Harold C.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2017

OSTI

Automatic Differentiation on Emerging Manycore Architectures with Sacado and Kokkos

Phipps, Eric T.; Edwards, Harold C.

Abstract not provided.

More Details

TYPE Presentation YEAR 2017

OSTI

SNL ATDM Analysis Components

Phipps, Eric T.; Hanks, Byron H.; Rizzi, Francesco N.; Ridzal, Denis R.; Robbins, Joshua R.; Aguilo Valentin, Miguel A.; Copps, Kevin D.

Abstract not provided.

More Details

TYPE Presentation YEAR 2017

OSTI

Embedded ensemble propagation for improving performance, portability, and scalability of uncertainty quantification on emerging computational architectures

SIAM Journal on Scientific Computing

Phipps, Eric T.; D'Elia, Marta D.; Edwards, Harold C.; Hoemmen, M.; Hu, J.; Rajamanickam, Sivasankaran R.

Quantifying simulation uncertainties is a critical component of rigorous predictive simulation. A key component of this is forward propagation of uncertainties in simulation input data to output quantities of interest. Typical approaches involve repeated sampling of the simulation over the uncertain input data and can require numerous samples when accurately propagating uncertainties from large numbers of sources. Often simulation processes from sample to sample are similar, and much of the data generated from each sample evaluation could be reused. We explore a new method for implementing sampling methods that simultaneously propagates groups of samples together in an embedded fashion, which we call embedded ensemble propagation. We show how this approach takes advantage of properties of modern computer architectures to improve performance by enabling reuse between samples, reducing memory bandwidth requirements, improving memory access patterns, improving opportunities for fine-grained parallelization, and reducing communication costs. We describe a software technique for implementing embedded ensemble propagation based on the use of C++ templates and describe its integration with various scientific computing libraries within Trilinos. We demonstrate improved performance, portability, and scalability for the approach applied to the simulation of partial differential equations on a variety of multicore and manycore architectures, including up to 16,384 cores on a Cray XK7 (Titan).

More Details

TYPE Journal Article YEAR 2017

Scopus OSTI DOI

Embedded Analysis For Next-Generation Applications and Architectures

Phipps, Eric T.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2016

OSTI

Results 1–50 of 144

Results 1–50 of 144