Publications Search

We present a parallel hierarchical solver for general sparse linear systems on distributed-memory machines. For large-scale problems, this fully algebraic algorithm is faster and more memory-efficient than sparse direct solvers because it exploits the low-rank structure of fill-in blocks. Depending on the accuracy of low-rank approximations, the hierarchical solver can be used either as a direct solver or as a preconditioner. The parallel algorithm is based on data decomposition and requires only local communication for updating boundary data on every processor. Moreover, the computation-to-communication ratio of the parallel algorithm is approximately the volume-to-surface-area ratio of the subdomain owned by every processor. We present various numerical results to demonstrate the versatility and scalability of the parallel algorithm.

More Details

TYPE Journal Article YEAR 2018

DOI OSTI Scopus DOI OSTI Scopus

Simple effective conservative treatment of uncertainty from sparse samples of random functions

ASCE-ASME Journal of Risk and Uncertainty in Engineering Systems. Part B. Mechanical Engineering

Romero, Vicente J.; Schroeder, Benjamin B.; Dempsey, James F.; Lewis, John R.; Breivik, Nicole L.; Orient, George E.; Antoun, Bonnie R.; Winokur, Justin W.; Glickman, Matthew R.; Red-Horse, John R.

This paper examines the variability of predicted responses when multiple stress-strain curves (reflecting variability from replicate material tests) are propagated through a finite element model of a ductile steel can being slowly crushed. Over 140 response quantities of interest (including displacements, stresses, strains, and calculated measures of material damage) are tracked in the simulations. Each response quantity’s behavior varies according to the particular stress-strain curves used for the materials in the model. We desire to estimate response variability when only a few stress-strain curve samples are available from material testing. Propagation of just a few samples will usually result in significantly underestimated response uncertainty relative to propagation of a much larger population that adequately samples the presiding random-function source. A simple classical statistical method, Tolerance Intervals, is tested for effectively treating sparse stress-strain curve data. The method is found to perform well on the highly nonlinear input-to-output response mappings and non-standard response distributions in the can-crush problem. The results and discussion in this paper support a proposition that the method will apply similarly well for other sparsely sampled random variable or function data, whether from experiments or models. Finally, the simple Tolerance Interval method is also demonstrated to be very economical.

More Details

TYPE Journal Article YEAR 2018

DOI OSTI

Formulation and computation of dynamic, interface-compatible Whitney complexes in three dimensions

Journal of Computational Physics

Siefert, Christopher S.; Kramer, Richard M.; Voth, Thomas E.; Bochev, Pavel B.

A discrete De Rham complex enables compatible, structure-preserving discretizations for a broad range of partial differential equations problems. Such discretizations can correctly reproduce the physics of interface problems, provided the grid conforms to the interface. However, large deformations, complex geometries, and evolving interfaces makes generation of such grids difficult. We develop and demonstrate two formally equivalent approaches that, for a given background mesh, dynamically construct an interface-conforming discrete De Rham complex. Both approaches start by dividing cut elements into interface-conforming subelements but differ in how they build the finite element basis on these subelements. The first approach discards the existing non-conforming basis of the parent element and replaces it by a dynamic set of degrees of freedom of the same kind. The second approach defines the interface-conforming degrees of freedom on the subelements as superpositions of the basis functions of the parent element. These approaches generalize the Conformal Decomposition Finite Element Method (CDFEM) and the extended finite element method with algebraic constraints (XFEM-AC), respectively, across the De Rham complex.

More Details

TYPE Journal Article YEAR 2018

DOI OSTI Scopus

Sparse Matrix-Matrix Multiplication on Multilevel Memory Architectures: Algorithms and Experiments

Deveci, Mehmet D.; Hammond, Simon D.; Wolf, Michael W.; Rajamanickam, Sivasankaran R.

Architectures with multiple classes of memory media are becoming a common part of mainstream supercomputer deployments. So called multi-level memories offer differing characteristics for each memory component including variation in bandwidth, latency and capacity. This paper investigates the performance of sparse matrix multiplication kernels on two leading highperformance computing architectures — Intel's Knights Landing processor and NVIDIA's Pascal GPU. We describe a data placement method and a chunking-based algorithm for our kernels that exploits the existence of the multiple memory spaces in each hardware platform. We evaluate the performance of these methods w.r.t. standard algorithms using the auto-caching mechanisms Our results show that standard algorithms that exploit cache reuse performed as well as multi-memory-aware algorithms for architectures such as Ki\iLs where the memory subsystems have similar latencies. However, for architectures such as GPUS where memory subsystems differ significantly in both bandwidth and latency, multi-memory-aware methods are crucial for good performance. In addition, our new approaches permit the user to run problems that require larger capacities than the fastest memory of each compute node without depending on the software-managed cache mechanisms.

More Details

TYPE Other Report YEAR 2018

DOI OSTI

Cask Transportation Project - Prelimianary Results

Kalinina, Elena A.; Wright, Catherine W.; Gordon, Natalie G.; Saltzstein, Sylvia J.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2018

OSTI

COMPUTATIONAL TOOLS FOR FREE-FORM OPTIMIZATION OF LARGE-SCALE EnMat STRUCTURES

El-Kady, I.; Reinke, Charles M.; van Bloemen Waanders, Bart G.; Ridzal, Denis R.; Su, Mehmet; Fitzpatrick, David; Chilton, Ryan

Abstract not provided.

More Details

TYPE Presentation YEAR 2018

OSTI

Structured grid algorithms in MueLu

Berger-Vergiat, Luc B.; Tuminaro, Raymond S.; Ohm, Peter B.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2018

OSTI

Data-Driven Uncertainty Quantification for Multi-Sensor Analytics

Stracuzzi, David J.; Darling, Michael C.; Chen, Maximillian G.; Peterson, Matthew G.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2018

OSTI

COMPUTATIONAL TOOLS FOR FREE-FORM OPTIMIZATION OF LARGE-SCALE EnMat STRUCTURES

Ridzal, Denis R.

Abstract not provided.

More Details

TYPE Presentation YEAR 2018

OSTI

Making Social-Network Data Sets More Human to Aid National-Security Graph Analytics

Berry, Jonathan W.; Phillips, Cynthia A.; Saia, Jared

More Details

TYPE Presentation YEAR 2018

OSTI

Rebooting Computers to Avoid Meltdown and Spectre

Computer

Conte, Thomas M.; DeBenedictis, Erik; Mendelson, Avi; Milojicic, Dejan

Security vulnerabilities such as Meltdown and Spectre demonstrate how chip complexity grew faster than our ability to manage unintended consequences. Attention to security from the outset should be part of the rememdy, yet complexity must be controlled at a more fundamental level.

More Details

TYPE Journal Article YEAR 2018

DOI OSTI Scopus

Multi-level Uncertainty Quantification of a Wind Turbine Large Eddy Simulation Model

Maniaci, David C.; Frankel, Ari L.; Geraci, Gianluca G.; Blaylock, Myra L.; Eldred, Michael S.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2018

OSTI

Parallel Programming Futures: What We Have and Will Have Will Not Be Enough

Heroux, Michael A.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2018

OSTI

Calibration and Propagation of Model Discrepancy Across Experiments

Maupin, Kathryn A.; Swiler, Laura P.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2018

OSTI

Optimal Approximation of Spectral Risk Measures with Application to PDE-Constrained Optimization

Kouri, Drew P.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2018

OSTI

Optimization and Control Under Uncertainty

Kouri, Drew P.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2018

OSTI

Merlin Element Library Deep Dive

Hemmert, Karl S.

Abstract not provided.

More Details

TYPE Presentation YEAR 2018

OSTI

Fully-coupled and block/Schur complement based algebraic multigrid-based preconditioners for implicit continuum plasma simulations

Lin, Paul L.; Shadid, John N.; Phillips, Edward G.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2018

OSTI

MIRaGE Multiscale Inverse Rapid Group-theory for Engineered-metamaterials

El-Kady, I.; Reinke, Charles M.; van Bloemen Waanders, Bart G.; Ridzal, Denis R.; Su, Mehmet; Fitzpatrick, David; Chilton, Ryan

Abstract not provided.

More Details

TYPE Presentation YEAR 2018

OSTI

FEMLITE: Rapid Electromagnetic Modeling of Periodic Metamaterial Structures

Reinke, Charles M.; El-Kady, I.; van Bloemen Waanders, Bart G.; Ridzal, Denis R.; Su, Mehmet; Fitzpatrick, David; Chilton, Ryan

Abstract not provided.

More Details

TYPE Presentation YEAR 2018

OSTI

A group theory approach to tailored optical properties of metamaterials

El-Kady, I.; Reinke, Charles M.; van Bloemen Waanders, Bart G.; Ridzal, Denis R.; Su, Mehmet; Fitzpatrick, David; Chilton, Ryan

Abstract not provided.

More Details

TYPE Presentation YEAR 2018

OSTI

Investigating Melt Pool Dynamics in Laser-Bed Powder Fusion Using a Two-Color Pyrometer

Koepke, Joshua R.; Jared, Bradley H.; Saiz, David J.; Schwaller, Erich J.; Tung, Daniel J.; Swiler, Laura P.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2018

OSTI

Solvers for Hypersonic Flow Applications

Hu, Jonathan J.; Berger-Vergiat, Luc B.; Tuminaro, Raymond S.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2018

OSTI

Supporting High Performance Analytics with System Software for Virtualized Supercomputing

Younge, Andrew J.

Abstract not provided.

More Details

TYPE Presentation YEAR 2018

OSTI

Developing Numerical Methods for both Near Feed and Target Physics on Z-Machine

McGregor, Duncan A.; Robinson, Allen C.

Abstract not provided.

More Details

TYPE Presentation YEAR 2018

OSTI

Modeling field emission from real surfaces

Moore, Christopher H.; Berg, Morgann B.; Brumbach, Michael T.; Bussmann, Ezra B.; Clem, Paul G.; Hjalmarson, Harold P.; Hopkins, Matthew M.; Schultz, Peter A.; Scrymgeour, David S.; Smith, Sean S.

Abstract not provided.

More Details

TYPE Presentation YEAR 2018

OSTI

An Adaptive Model with Joint Chance Constraints for a Hybrid Wind-Conventional Generator System

Singh, Bismark S.; Morton, David; Santoso, Surya

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2018

DOI OSTI

VTK-m Update (DOECGF 2018)

Moreland, Kenneth D.

Abstract not provided.

More Details

TYPE Presentation YEAR 2018

OSTI

Some directions on algorithms for chance-constrained optimization models

Singh, Bismark S.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2018

OSTI

Collaborative Augmented Reality Virtual Reality (CARVR)

Carvey, Brad

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2018

OSTI

Trinity: Opportunities and Challenges of a Heterogeneous System

Hemmert, Karl S.; Moore, Stan G.; Gallis, Michail A.; Davis, Mike E.; Levesque, John; Hjelm, Nathan; Lujan, James; Morton, David; Nam, Hai A.; Parga, Alex; Peltz Jr., Paul; Shipman, Galen; Torrez, Alfred

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2018

OSTI

The Crazy Future of Vis

Moreland, Kenneth D.

Abstract not provided.

More Details

TYPE Presentation YEAR 2018

OSTI

MULTILEVEL /MULTIFIDELITY MONTE CARLO FOR WAVE PROPAGATION IN HETEROGENEOUS MEDIA

Geraci, Gianluca G.; Eldred, Michael S.; Iaccarino, Gianluca

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2018

OSTI

Methodologies for Enabling Bayesian Calibration in Landice Modeling Towards Probabilistic Projections of Sealevel Change

Perego, Mauro P.; Jakeman, John D.; Kalashnikova, Irina; Price, Stephen; Stadler, Georg

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2018

OSTI OSTI

What?s New in ParaView - DOECGF 2018

Moreland, Kenneth D.

Abstract not provided.

More Details

TYPE Presentation YEAR 2018

OSTI

Sparse multifidelity approximations for forward UQ

Safta, Cosmin S.; Huan, Xun H.; Sargsyan, Khachik S.; Najm, H.N.; Eldred, Michael S.; Geraci, Gianluca G.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2018

OSTI OSTI

Exploiting Geometric Partitioning in Task Mapping for Parallel Computes

Deveci, Mehmet D.; Devine, Karen D.; Laros, James H.; Taylor, Mark A.; Rajamanickam, Sivasankaran R.; Catalyurek, Umit V.

We present a new method for mapping applications' MPI tasks to cores of a parallel computer such that applications' communication time is reduced. We address the case of sparse node allocation, where the nodes assigned to a job are not necessarily located in a contiguous block nor within close proximity to each other in the network, although our methods generalize to contiguous allocations as well. The goal is to assign tasks to cores so that interdependent tasks are performed by "nearby' cores, thus lowering the distance messages must travel, the amount of congestion in the network, and the overall cost of communication. Our new method applies a geometric partitioning algorithm to both the tasks and the processors, and assigns task parts to the corresponding processor parts. We also present a number of algorithmic optimizations that exploit specific features of the network or application. We show that, for the structured finite difference mini-application MiniGhost, our mapping methods reduced communication time up to 75% relative to MiniGhost's default mapping on 128K cores of a Cray XK7 with sparse allocation. For the atmospheric modeling code E3SM/HOMME, our methods reduced communication time up to 31% on 32K cores of an IBM BlueGene/Q with contiguous allocation.

More Details

TYPE Other Report YEAR 2018

DOI OSTI