Publications Search

A Comparative Critical Analysis of Modern Task-Parallel Runtimes

The rise in node-level parallelism has increased interest in task-based parallel runtimes for a wide array of application areas. Applications have a wide variety of task spawning patterns which frequently change during the course of application execution, based on the algorithm or solver kernel in use. Task scheduling and load balance regimes, however, are often highly optimized for specific patterns. This paper uses four basic task spawning patterns to quantify the impact of specific scheduling policy decisions on execution time. We compare the behavior of six publicly available tasking runtimes: Intel Cilk, Intel Threading Building Blocks (TBB), Intel OpenMP, GCC OpenMP, Qthreads, and High Performance ParalleX (HPX). With the exception of Qthreads, the runtimes prove to have schedulers that are highly sensitive to application structure. No runtime is able to provide the best performance in all cases, and those that do provide the best performance in some cases, unfortunately, provide extremely poor performance when application structure does not match the scheduler's assumptions.

More Details

TYPE SAND Report YEAR 2012

DOI OSTI

Simulating neural systems with Xyce

Schiek, Richard; Thornquist, Heidi K.; Warrender, Christina E.; Mei, Ting; Teeter, Corinne M.; Aimone, James B.

Sandias parallel circuit simulator, Xyce, can address large scale neuron simulations in a new way extending the range within which one can perform high-fidelity, multi-compartment neuron simulations. This report documents the implementation of neuron devices in Xyce, their use in simulation and analysis of neuron systems.

More Details

TYPE SAND Report YEAR 2012

DOI OSTI

Exploring the Future of High Performance Computing with Proxy-Apps

Trott, Christian R.

Abstract not provided.

More Details

TYPE Presentation YEAR 2012

OSTI

Next Generation Solver Stack for the Simulation of the Abnormal Thermal Fire Environment for Stockpile Stewardship

Lin, Paul T.

Abstract not provided.

More Details

TYPE Presentation YEAR 2012

OSTI

Simulations of the Effect of Neutrons on Stockpile Transistors

Lin, Paul T.

Abstract not provided.

More Details

TYPE Presentation YEAR 2012

OSTI

Measurements of Spin Life Time of an Antimony-Bound Electron in Silicon

Lu, Tzu M.; Bishop, Nathaniel B.; Tracy, Lisa A.; Blume-Kohout, Robin; Pluym, Tammy; Wendt, Joel R.; Dominguez, Jason; Lilly, Michael; Carroll, M.S.

Abstract not provided.

More Details

TYPE Conference YEAR 2012

OSTI

Emerging HPC Systems and Next Generation Engineering Analysis Applications

Ang, James A.; Barrett, Richard F.; Hammond, Simon; Rodrigues, Arun

Abstract not provided.

More Details

TYPE Presentation YEAR 2012

OSTI

Component-Based Scientific Application Development

Salinger, Andrew G.

Over the past few years, we have defined and gone a long ways towards implementing a component-based strategy for building scientific application codes. We have asserted that this approach offers significant advantages over a model of writing project-based application codes. There are now several technical and programmatic successes that validate these claims. Not only are there net benefits to code projects that follow this strategy, but also the most striking gains are for the long-term impact and productivity of our computational science organizations.

More Details

TYPE SAND Report YEAR 2012

DOI OSTI

Sensitivity analysis techniques applied to a system of hyperbolic conservation laws

Reliability Engineering and System Safety

Weirs, V.G.; Kamm, James R.; Swiler, Laura P.; Tarantola, Stefano; Ratto, Marco; Adams, Brian M.; Rider, William J.; Eldred, Michael

Sensitivity analysis is comprised of techniques to quantify the effects of the input variables on a set of outputs. In particular, sensitivity indices can be used to infer which input parameters most significantly affect the results of a computational model. With continually increasing computing power, sensitivity analysis has become an important technique by which to understand the behavior of large-scale computer simulations. Many sensitivity analysis methods rely on sampling from distributions of the inputs. Such sampling-based methods can be computationally expensive, requiring many evaluations of the simulation; in this case, the Sobol method provides an easy and accurate way to compute variance-based measures, provided a sufficient number of model evaluations are available. As an alternative, meta-modeling approaches have been devised to approximate the response surface and estimate various measures of sensitivity. In this work, we consider a variety of sensitivity analysis methods, including different sampling strategies, different meta-models, and different ways of evaluating variance-based sensitivity indices. The problem we consider is the 1-D Riemann problem. By a careful choice of inputs, discontinuous solutions are obtained, leading to discontinuous response surfaces; such surfaces can be particularly problematic for meta-modeling approaches. The goal of this study is to compare the estimated sensitivity indices with exact values and to evaluate the convergence of these estimates with increasing samples sizes and under an increasing number of meta-model evaluations. © 2011 Elsevier Ltd. All rights reserved.

More Details

TYPE Conference YEAR 2012

Scopus OSTI

Superconducting Electronics and Future Computing

Debenedictis, Erik

Abstract not provided.

More Details

TYPE Conference YEAR 2012

OSTI

Defect and Dopant Properties in CsI and NaI

Schultz, Peter A.

Abstract not provided.

More Details

TYPE Conference YEAR 2012

OSTI

Poisson-Schrodinger Solvers in QCAD

Gao, Xujiao; Nielsen, Erik N.; Muller, Richard P.; Young, Ralph W.; Salinger, Andrew G.; Tezaur, Irina K.

Abstract not provided.

More Details

TYPE Presentation YEAR 2012

OSTI

A Python HPC framework: PyTrilinos ODIN and Seamless

Spotz, William S.

Abstract not provided.

More Details

TYPE Conference YEAR 2012

OSTI

Streaming Connectivity with the X-Stream Model

Phillips, Cynthia A.; Plimpton, Steven J.

Abstract not provided.

More Details

TYPE Presentation YEAR 2012

OSTI

First-principles defect chemistry in C-doped GaAs

Schultz, Peter A.

Abstract not provided.

More Details

TYPE Conference YEAR 2012

OSTI

The Portals 4.0 Network Programming Interface

Brightwell, Ronald B.; Pedretti, Kevin; Wheeler, Kyle B.; Hemmert, Karl S.; Barrett, Brian

This report presents a specification for the Portals 4.0 network programming interface. Portals 4.0 is intended to allow scalable, high-performance network communication between nodes of a parallel computing system. Portals 4.0 is well suited to massively parallel processing and embedded systems. Portals 4.0 represents an adaption of the data movement layer developed for massively parallel processing platforms, such as the 4500-node Intel TeraFLOPS machine. Sandia’s Cplant cluster project motivated the development of Version 3.0, which was later extended to Version 3.3 as part of the Cray Red Storm machine and XT line. Version 4.0 is targeted to the next generation of machines employing advanced network interface architectures that support enhanced offload capabilities.

More Details

TYPE SAND Report YEAR 2012

DOI OSTI

A Survey of Advanced Probabilistic Uncertainty Propagation and Sensitivity Analysis Methods

Swiler, Laura P.; Romero, Vicente J.

Abstract not provided.

More Details

TYPE Conference YEAR 2012

OSTI

A Case of System-Wide Power Management for Scientific Applications

Lofstead, Gerald F.

Abstract not provided.

More Details

TYPE Conference YEAR 2012

OSTI

Experiences Applying Data Staging Technology in Unconventional Ways

Lofstead, Gerald F.; Oldfield, Ron

Abstract not provided.

More Details

TYPE Conference YEAR 2012

OSTI

Dax Design/Breaking Up the Cell Class

Moreland, Kenneth D.

Abstract not provided.

More Details

TYPE Presentation YEAR 2012

OSTI

SNL and VNIITF VTC discussion of Be computational modeling

Desjarlais, Michael P.; Knudson, Marcus D.; Thompson, A.P.; Lane, James M.D.

Abstract not provided.

More Details

TYPE Presentation YEAR 2012

OSTI

A data driven approach to assess team performance through team communication

Forsythe, James C.; Glickman, Matthew R.; Haass, Michael J.; Whetzel, Jonathan H.

Abstract not provided.

More Details

TYPE SAND Report YEAR 2012

DOI OSTI

Research Agenda for Visualization at Exascale

Moreland, Kenneth D.

Abstract not provided.

More Details

TYPE Presentation YEAR 2012

OSTI

Project Grandmaster

Fabian, Nathan; Mcclain, Jonathan T.; Davis, Warren L.

Abstract not provided.

More Details

TYPE Conference YEAR 2012

OSTI

Verifying fossil fuel CO2 emissions with CMAQ

Liu, Zhen; Safta, Cosmin; Sargsyan, Khachik; Van Bloemen Waanders, Bart; Bambha, Ray; Michelsen, Hope A.

Abstract not provided.

More Details

TYPE Conference YEAR 2012

OSTI

Surface engineering of electrospun fibers to optimize ion and electron transport in Li%2B battery cathodes

Missert, Nancy; Garcia, Robert M.; Nagasubramanian, Ganesan; Leung, Kevin; Rempe, Susan; Rogers, David

Abstract not provided.

More Details

TYPE SAND Report YEAR 2012

DOI OSTI

RAID: Motivation and Implementation

Curry, Matthew L.

Abstract not provided.

More Details

TYPE Presentation YEAR 2012

OSTI

Peridigm: A New Paradigm in Computational Peridynamics

Parks, Michael L.

Abstract not provided.

More Details

TYPE Presentation YEAR 2012

OSTI

Generalized Langevin Dynamics in LAMMPS

Proposed for publication in Engineering with Computers.

Baczewski, Andrew D.

Abstract not provided.

More Details

TYPE Journal Article YEAR 2012

OSTI

Discretization Capability Area

Ridzal, Denis

Abstract not provided.

More Details

TYPE Presentation YEAR 2012

OSTI

Transactional Parallel Metadata Services for Integrated Application Workflows

Lofstead, Gerald F.

Abstract not provided.

More Details

TYPE Conference YEAR 2012

OSTI

Peridynamic Modeling of Void Collapse in Representative Plutonium Oxide Microstructures

Littlewood, David J.; Tikare, Veena

Abstract not provided.

More Details

TYPE Conference YEAR 2012

OSTI

Incorporating the min-max mesh optimization method within the Target-Matrix Paradigm

Knupp, Patrick K.; Day, David M.

Abstract not provided.

More Details

TYPE SAND Report YEAR 2012

DOI OSTI

Sandia's Appro %22Compton%22Cluster

Hammond, Simon

Abstract not provided.

More Details

TYPE Conference YEAR 2012

OSTI

Multiscale Modeling of Fracture with Peridynamics

Silling, Stewart

Abstract not provided.

More Details

TYPE Conference YEAR 2012

OSTI

Characterizing and Mitigating Work Time Inflation in Task Parallel Programs

Olivier, Stephen L.

Abstract not provided.

More Details

TYPE Conference YEAR 2012

OSTI

Going to Graduate School and Working on HPC Was Awesome and You Should Think About Doing It Too

Olivier, Stephen L.

Abstract not provided.

More Details

TYPE Conference YEAR 2012

OSTI

Trinity the ASC Program?s Next-GenerationAdvanced Technology System

Doerfler, Douglas W.

Abstract not provided.

More Details

TYPE Presentation YEAR 2012

OSTI

Quasi-statics Modal Analysis and Structural Health Monitoring within the Peridynamic Framework

Littlewood, David J.; Mish, Kyran D.; Pierson, Kendall H.

More Details

TYPE Conference YEAR 2012

OSTI

Unprecedented Scalability and Performance of the new NNSA Tri-Lab Capacity Cluster 2 (TLCC2)

Rajan, Mahesh; Doerfler, Douglas W.; Lin, Paul T.; Hammond, Simon; Barrett, Richard F.; Vaughan, Courtenay T.

Abstract not provided.

More Details

TYPE Conference YEAR 2012

OSTI

INDIVIDUAL DIFFERENCES IN MULTITASKING ABILITY AND ADAPTABILITY

Proposed for publication in Human Factors: The Journal of Human Factors and Ergonomics Society.

Abbott, Robert G.; Haass, Michael J.

Abstract not provided.

More Details

TYPE Journal Article YEAR 2012

OSTI

Copy of The ParaView Tutorial (Version 3.98)

Moreland, Kenneth D.

Abstract not provided.

More Details

TYPE Presentation YEAR 2012

OSTI

Dax Design/Device Adapter Header Order

Moreland, Kenneth D.

Abstract not provided.

More Details

TYPE Presentation YEAR 2012

OSTI

Informing Macroscale Constitutive Laws through Peridynamic Modeling of Grain-Scale Mechanisms in Plutonium Oxide

Littlewood, David J.; Bignell, John; Tikare, Veena

Abstract not provided.

More Details

TYPE Conference YEAR 2012

OSTI

Portals 4 Network Programming Interface

Barrett, Brian; Brightwell, Ronald B.; Hemmert, Karl S.

Abstract not provided.

More Details

TYPE Conference YEAR 2012

OSTI

Leveraging MPI's one-sided communication interface for shared-memory programming

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

Hoefler, Torsten; Dinan, James; Buntinas, Darius; Balaji, Pavan; Barrett, Brian W.; Brightwell, Ronald B.; Gropp, William; Kale, Vivek; Thakur, Rajeev

Hybrid parallel programming with MPI for internode communication in conjunction with a shared-memory programming model to manage intranode parallelism has become a dominant approach to scalable parallel programming. While this model provides a great deal of flexibility and performance potential, it saddles programmers with the complexity of utilizing two parallel programming systems in the same application. We introduce an MPI-integrated shared-memory programming model that is incorporated into MPI through a small extension to the one-sided communication interface. We discuss the integration of this interface with the upcoming MPI 3.0 one-sided semantics and describe solutions for providing portable and efficient data sharing, atomic operations, and memory consistency. We describe an implementation of the new interface in the MPICH2 and Open MPI implementations and demonstrate an average performance improvement of 40% to the communication component of a five-point stencil solver. © 2012 Springer-Verlag.

More Details

TYPE Conference YEAR 2012

Scopus OSTI

A low impact flow control implementation for offload communication interfaces

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

Barrett, Brian W.; Brightwell, Ronald B.; Underwood, Keith D.

Message passing paradigms provide for many to one messaging patterns that result in receive side resource exhaustion. Traditionally, MPI implementations layered over the Portals network programming interface provided a large default unexpected receive buffer space, the user was expected to configure the buffer size to the application demand, and the application was aborted when the buffer space was overrun. The Portals 4 design provides a set of primitives for implementing scalable resource exhaustion recovery without negatively impacting normal operation. A resource exhaustion recovery protocol for MPI implementations is presented, as well as performance results for an Open MPI implementation of the protocol. © 2012 Springer-Verlag.

More Details

TYPE Conference YEAR 2012

Scopus OSTI

ShyLU: A hybrid-hybrid solver for multicore platforms

Proceedings of the 2012 IEEE 26th International Parallel and Distributed Processing Symposium, IPDPS 2012

Rajamanickam, Sivasankaran; Boman, Erik G.; Heroux, Michael A.

With the ubiquity of multicore processors, it is crucial that solvers adapt to the hierarchical structure of modern architectures. We present ShyLU, a "hybrid-hybrid" solver for general sparse linear systems that is hybrid in two ways: First, it combines direct and iterative methods. The iterative part is based on approximate Schur complements where we compute the approximate Schur complement using a value-based dropping strategy or structure-based probing strategy. Second, the solver uses two levels of parallelism via hybrid programming (MPI+threads). ShyLU is useful both in shared-memory environments and on large parallel computers with distributed memory. In the latter case, it should be used as a sub domain solver. We argue that with the increasing complexity of compute nodes, it is important to exploit multiple levels of parallelism even within a single compute node. We show the robustness of ShyLU against other algebraic preconditioners. ShyLU scales well up to 384 cores for a given problem size. We also study the MPI-only performance of ShyLU against a hybrid implementation and conclude that on present multicore nodes MPI-only implementation is better. However, for future multicore machines (96 or more cores) hybrid/ hierarchical algorithms and implementations are important for sustained performance. © 2012 IEEE.

More Details

TYPE Conference YEAR 2012

OSTI Scopus

Multithreaded algorithms for maxmum matching in bipartite graphs

Proceedings of the 2012 IEEE 26th International Parallel and Distributed Processing Symposium, IPDPS 2012

Azad, Ariful; Halappanavar, Mahantesh; Rajamanickam, Sivasankaran; Boman, Erik G.; Khan, Arif; Pothen, Alex

We design, implement, and evaluate algorithms for computing a matching of maximum cardinality in a bipartite graph on multicore and massively multithreaded computers. As computers with larger numbers of slower cores dominate the commodity processor market, the design of multithreaded algorithms to solve large matching problems becomes a necessity. Recent work on serial algorithms for the matching problem has shown that their performance is sensitive to the order in which the vertices are processed for matching. In a multithreaded environment, imposing a serial order in which vertices are considered for matching would lead to loss of concurrency and performance. But this raises the question: Would parallel matching algorithms on multithreaded machines improve performance over a serial algorithm? We answer this question in the affirmative. We report efficient multithreaded implementations of three classes of algorithms based on their manner of searching for augmenting paths: breadth-first-search, depth-first-search, and a combination of both. The Karp-Sipser initialization algorithm is used to make the parallel algorithms practical. We report extensive results and insights using three shared-memory platforms (a 48-core AMD Opteron, a 32-coreIntel Nehalem, and a 128-processor Cray XMT) on a representative set of real-world and synthetic graphs. To the best of our knowledge, this is the first study of augmentation-based parallel algorithms for bipartite cardinality matching that demonstrates good speedups on multithreaded shared memory multiprocessors. © 2012 IEEE.

More Details

TYPE Conference YEAR 2012

Scopus OSTI

Lagrangian shock hydrodynamics on tetrahedral meshes: A stable and accurate variational multiscale approach

Journal of Computational Physics

Scovazzi, Guglielmo S.

Abstract not provided.

More Details

TYPE Journal Article YEAR 2012

OSTI DOI

Publications

Search results