Publications Search

Parallel phase model : a programming model for high-end parallel machines with manycores

Brightwell, Ronald B.; Heroux, Michael A.; Wen, Zhaofang W.

This paper presents a parallel programming model, Parallel Phase Model (PPM), for next-generation high-end parallel machines based on a distributed memory architecture consisting of a networked cluster of nodes with a large number of cores on each node. PPM has a unified high-level programming abstraction that facilitates the design and implementation of parallel algorithms to exploit both the parallelism of the many cores and the parallelism at the cluster level. The programming abstraction will be suitable for expressing both fine-grained and coarse-grained parallelism. It includes a few high-level parallel programming language constructs that can be added as an extension to an existing (sequential or parallel) programming language such as C; and the implementation of PPM also includes a light-weight runtime library that runs on top of an existing network communication software layer (e.g. MPI). Design philosophy of PPM and details of the programming abstraction are also presented. Several unstructured applications that inherently require high-volume random fine-grained data accesses have been implemented in PPM with very promising results.

More Details

TYPE SAND Report YEAR 2009

DOI OSTI

An Algebraic Multigrid Method for Compatible Least-Squares Formulations of Div-Curl Equations

Siefert, Christopher S.; Bochev, Pavel B.; Peterson, Kara J.

Abstract not provided.

More Details

TYPE Conference YEAR 2009

OSTI

What's New in ParaView (DOECGF 2009)

Moreland, Kenneth D.

Abstract not provided.

More Details

TYPE Presentation YEAR 2009

OSTI

UltraVis Overview for DOECGF 2009

Moreland, Kenneth D.

Abstract not provided.

More Details

TYPE Presentation YEAR 2009

OSTI

A semantic disambiguation algorithm to reason about cars from the shapes in over-segmentation of high-resolution orthophotos

Diegert, Carl F.

Abstract not provided.

More Details

TYPE Conference YEAR 2009

OSTI

A semantic disambiguation algorithm to reason about cars from the shapes in over-segmentation of high-resolution orthophotos

Diegert, Carl F.

Abstract not provided.

More Details

TYPE Conference YEAR 2009

OSTI

A scalable and adaptable solution framework within components of the Community Climate System Model

Sprinter Lecture Notes

Rouson, Damian R.; Salinger, Andrew G.

Abstract not provided.

More Details

TYPE Journal Article YEAR 2009

OSTI

Memory in Silico: Building a Neuromimetic Episodic Cognitive Model

Taylor, Shawn E.; Bernard, Michael L.; Vineyard, Craig M.; Verzi, Stephen J.; Morrow, James D.

Abstract not provided.

More Details

TYPE Conference YEAR 2009

OSTI

An extensible operating system design for large-scale parallel machines

Riesen, Rolf; Ferreira, Kurt

Running untrusted user-level code inside an operating system kernel has been studied in the 1990's but has not really caught on. We believe the time has come to resurrect kernel extensions for operating systems that run on highly-parallel clusters and supercomputers. The reason is that the usage model for these machines differs significantly from a desktop machine or a server. In addition, vendors are starting to add features, such as floating-point accelerators, multicore processors, and reconfigurable compute elements. An operating system for such machines must be adaptable to the requirements of specific applications and provide abstractions to access next-generation hardware features, without sacrificing performance or scalability.

More Details

TYPE SAND Report YEAR 2009

DOI OSTI

Red Storm/XT4: A Superior Architecture for Scalability

Doerfler, Douglas W.; Vaughan, Courtenay T.

Abstract not provided.

More Details

TYPE Conference YEAR 2009

OSTI

Algorithmic properties of the midpoint predictor-corrector time integrator

Love, Edward L.; Scovazzi, Guglielmo S.; Rider, William J.

Algorithmic properties of the midpoint predictor-corrector time integration algorithm are examined. In the case of a finite number of iterations, the errors in angular momentum conservation and incremental objectivity are controlled by the number of iterations performed. Exact angular momentum conservation and exact incremental objectivity are achieved in the limit of an infinite number of iterations. A complete stability and dispersion analysis of the linearized algorithm is detailed. The main observation is that stability depends critically on the number of iterations performed.

More Details

TYPE SAND Report YEAR 2009

DOI OSTI

Link Prediction on Evolving Data using Tensor Factorizations

Acar Ataman, Evrim N.; Kolda, Tamara G.; Dunlavy, Daniel D.

Abstract not provided.

More Details

TYPE Conference YEAR 2009

OSTI

Adjoint based optimization and adaptivity for flow and transport problems

Carnes, Brian C.; Bartlett, Roscoe B.

Abstract not provided.

More Details

TYPE Conference YEAR 2009

OSTI

Timed-Run Scheduling

Leung, Vitus J.

Abstract not provided.

More Details

TYPE Conference YEAR 2009

OSTI

Simulation & Modeling

Rodrigues, Arun

Abstract not provided.

More Details

TYPE Conference YEAR 2009

OSTI

Can we continue to build supercomputers out of processors optimized for laptops?

Murphy, Richard C.

Abstract not provided.

More Details

TYPE Conference YEAR 2009

OSTI

Energy Minimizing Algebraic Multigrid for Systems of Partial Differential Equations

Tuminaro, Raymond S.; Hu, Jonathan J.; Cyr, Eric C.

Abstract not provided.

More Details

TYPE Presentation YEAR 2009

OSTI

Verification of complex codes

Ober, Curtis C.

Over the past several years, verifying and validating complex codes at Sandia National Laboratories has become a major part of code development. These aspects tackle two important parts of simulation modeling: determining if the models have been correctly implemented - verification, and determining if the correct models have been selected - validation. In this talk, we will focus on verification and discuss the basics of code verification and its application to a few codes and problems at Sandia.

More Details

TYPE Conference YEAR 2009

OSTI

Notes on a gap in advancing geospatial image processing methods for NA-22

Diegert, Carl F.

Abstract not provided.

More Details

TYPE Presentation YEAR 2009

OSTI

Model-free Learning and Control in a Mobile Robot

Rohrer, Brandon R.

Abstract not provided.

More Details

TYPE Conference YEAR 2009

OSTI

A Flexible Approach for the Statistical Visualization of Ensemble Data

Potter, Kristin C.

Abstract not provided.

More Details

TYPE Conference YEAR 2009

OSTI

LSAView: A Tool for Visual Exploration of Latent Semantic Modeling

Dunlavy, Daniel D.

Abstract not provided.

More Details

TYPE Conference YEAR 2009

OSTI

Xyce Parallel Electronic Simulator : reference guide, version 4.1

Keiter, Eric R.; Mei, Ting M.; Russo, Thomas V.; Pawlowski, Roger P.; Schiek, Richard S.; Santarelli, Keith R.; Coffey, Todd S.; Thornquist, Heidi K.

This document is a reference guide to the Xyce Parallel Electronic Simulator, and is a companion document to the Xyce Users Guide. The focus of this document is (to the extent possible) exhaustively list device parameters, solver options, parser options, and other usage details of Xyce. This document is not intended to be a tutorial. Users who are new to circuit simulation are better served by the Xyce Users Guide.

More Details

TYPE SAND Report YEAR 2009

DOI OSTI

Xyce Parallel Electronic Simulator : users' guide, version 4.1

Keiter, Eric R.; Mei, Ting M.; Russo, Thomas V.; Pawlowski, Roger P.; Schiek, Richard S.; Santarelli, Keith R.; Coffey, Todd S.; Thornquist, Heidi K.

This manual describes the use of the Xyce Parallel Electronic Simulator. Xyce has been designed as a SPICE-compatible, high-performance analog circuit simulator, and has been written to support the simulation needs of the Sandia National Laboratories electrical designers. This development has focused on improving capability over the current state-of-the-art in the following areas: (1) Capability to solve extremely large circuit problems by supporting large-scale parallel computing platforms (up to thousands of processors). Note that this includes support for most popular parallel and serial computers. (2) Improved performance for all numerical kernels (e.g., time integrator, nonlinear and linear solvers) through state-of-the-art algorithms and novel techniques. (3) Device models which are specifically tailored to meet Sandia's needs, including some radiation-aware devices (for Sandia users only). (4) Object-oriented code design and implementation using modern coding practices that ensure that the Xyce Parallel Electronic Simulator will be maintainable and extensible far into the future. Xyce is a parallel code in the most general sense of the phrase - a message passing parallel implementation - which allows it to run efficiently on the widest possible number of computing platforms. These include serial, shared-memory and distributed-memory parallel as well as heterogeneous platforms. Careful attention has been paid to the specific nature of circuit-simulation problems to ensure that optimal parallel efficiency is achieved as the number of processors grows. The development of Xyce provides a platform for computational research and development aimed specifically at the needs of the Laboratory. With Xyce, Sandia has an 'in-house' capability with which both new electrical (e.g., device model development) and algorithmic (e.g., faster time-integration methods, parallel solver algorithms) research and development can be performed. As a result, Xyce is a unique electrical simulation capability, designed to meet the unique needs of the laboratory.

More Details

TYPE SAND Report YEAR 2009

DOI OSTI

CSSE Simulation Tools Quarterly Update FY2009 Q2

Rodrigues, Arun; Adalsteinsson, Helgi A.; Cranford, Scott C.

Abstract not provided.

More Details

TYPE Presentation YEAR 2009

OSTI

Enabling immersive simulation

Abbott, Robert G.; Basilico, Justin D.; Glickman, Matthew R.; Hart, Derek H.; Whetzel, Jonathan H.

The object of the 'Enabling Immersive Simulation for Complex Systems Analysis and Training' LDRD has been to research, design, and engineer a capability to develop simulations which (1) provide a rich, immersive interface for participation by real humans (exploiting existing high-performance game-engine technology wherever possible), and (2) can leverage Sandia's substantial investment in high-fidelity physical and cognitive models implemented in the Umbra simulation framework. We report here on these efforts. First, we describe the integration of Sandia's Umbra modular simulation framework with the open-source Delta3D game engine. Next, we report on Umbra's integration with Sandia's Cognitive Foundry, specifically to provide for learning behaviors for 'virtual teammates' directly from observed human behavior. Finally, we describe the integration of Delta3D with the ABL behavior engine, and report on research into establishing the theoretical framework that will be required to make use of tools like ABL to scale up to increasingly rich and realistic virtual characters.

More Details

TYPE SAND Report YEAR 2009

DOI OSTI

EEG analyses with SOBI

Glickman, Matthew R.

The motivating vision behind Sandia's MENTOR/PAL LDRD project has been that of systems which use real-time psychophysiological data to support and enhance human performance, both individually and of groups. Relevant and significant psychophysiological data being a necessary prerequisite to such systems, this LDRD has focused on identifying and refining such signals. The project has focused in particular on EEG (electroencephalogram) data as a promising candidate signal because it (potentially) provides a broad window on brain activity with relatively low cost and logistical constraints. We report here on two analyses performed on EEG data collected in this project using the SOBI (Second Order Blind Identification) algorithm to identify two independent sources of brain activity: one in the frontal lobe and one in the occipital. The first study looks at directional influences between the two components, while the second study looks at inferring gender based upon the frontal component.

More Details

TYPE SAND Report YEAR 2009

DOI OSTI

An optimization approach for fitting canonical tensor decompositions

Acar Ataman, Evrim N.; Dunlavy, Daniel D.

Tensor decompositions are higher-order analogues of matrix decompositions and have proven to be powerful tools for data analysis. In particular, we are interested in the canonical tensor decomposition, otherwise known as the CANDECOMP/PARAFAC decomposition (CPD), which expresses a tensor as the sum of component rank-one tensors and is used in a multitude of applications such as chemometrics, signal processing, neuroscience, and web analysis. The task of computing the CPD, however, can be difficult. The typical approach is based on alternating least squares (ALS) optimization, which can be remarkably fast but is not very accurate. Previously, nonlinear least squares (NLS) methods have also been recommended; existing NLS methods are accurate but slow. In this paper, we propose the use of gradient-based optimization methods. We discuss the mathematical calculation of the derivatives and further show that they can be computed efficiently, at the same cost as one iteration of ALS. Computational experiments demonstrate that the gradient-based optimization methods are much more accurate than ALS and orders of magnitude faster than NLS.

More Details

TYPE SAND Report YEAR 2009

DOI OSTI

The Dual-Use Dilemma

Gaudioso, Jennifer M.

Abstract not provided.

More Details

TYPE Presentation YEAR 2009

OSTI

Biosecurity Policy Drivers

Gaudioso, Jennifer M.

Abstract not provided.

More Details

TYPE Presentation YEAR 2009

OSTI

The Design for Tractable Analysis (DTA) Framework: A Methodology for the Analysis and Simulation of Complex Systems

International Journal of Decision Support System Technology (IJDSST)

Linebarger, John M.; De Spain, Mark J.; McDonald, Michael J.; Spencer, Floyd W.; Cloutier, Robert J.

The Design for Tractable Analysis (DTA) framework was developed to address the analysis of complex systems and so-called “wicked problems.” DTA is distinctive because it treats analytic processes as key artifacts that can be created and improved through formal design processes. Systems (or enterprises) are analyzed as a whole, in conjunction with decomposing them into constituent elements for domain-specific analyses that are informed by the whole. After using the Systems Modeling Language (SysML) to frame the problem in the context of stakeholder needs, DTA harnesses the Design Structure Matrix (DSM) to structure the analysis of the system and address questions about the emergent properties of the system. The novel use of DSM to “design the analysis” makes DTA particularly suitable for addressing the interdependent nature of complex systems. The use of DTA is demonstrated by a case study of sensor grid placement decisions to secure assets at a fixed site. © 2009, IGI Global. All rights reserved.

More Details

TYPE Journal Article YEAR 2009

Scopus OSTI

Low-dimensional modeling for spatial developing free shear layers

Barone, Matthew F.; van Bloemen Waanders, Bart G.

Abstract not provided.

More Details

TYPE Conference YEAR 2009

OSTI

Networks Grand Challenge LDRD External Advisory Board Meeting

Rountree, Suzanne L.

Abstract not provided.

More Details

TYPE Presentation YEAR 2009

OSTI

SPECIAL FINITE ELEMENT METHODS BASED ON COMPONENT MODE SYNTHESIS TECHNIQUES

ESAIM: Mathematical Modelling and Numerical Analysis

Lehoucq, Richard B.

Abstract not provided.

More Details

TYPE Journal Article YEAR 2009

OSTI

Magnetic-pulse-driven Rayleigh-Taylor instability in plastically deforming metals

Niederhaus, John H.; Alexander, Charles S.; Haill, Thomas A.; Vogler, Tracy V.

Abstract not provided.

More Details

TYPE Conference YEAR 2009

OSTI

Peridynamic modeling of the dynamic response of heterogeneous media

Silling, Stewart A.; Lehoucq, Richard B.

Abstract not provided.

More Details

TYPE Conference YEAR 2009

OSTI

Multilevel Project

Tuminaro, Raymond S.; Hu, Jonathan J.; Siefert, Christopher S.

Abstract not provided.

More Details

TYPE Presentation YEAR 2009

OSTI

A New Parallel Strategy for Transistor-Level Circuit Simulation

Keiter, Eric R.; Thornquist, Heidi K.; Day, David M.; Boman, Erik G.; Hoekstra, Robert J.

Abstract not provided.

More Details

TYPE Conference YEAR 2009

OSTI

Climate Changes in the Arctic and The Challenge to USCG Operations

Mitchiner, John L.; Strickland, James H.; Heermann, Philip D.; Sanzero, George V.

Abstract not provided.

More Details

TYPE Presentation YEAR 2009

OSTI

Performance of an MPI-only semiconductor device simulator on a quad socket/quad core InfiniBand platform

Shadid, John N.

This preliminary study considers the scaling and performance of a finite element (FE) semiconductor device simulator on a capacity cluster with 272 compute nodes based on a homogeneous multicore node architecture utilizing 16 cores. The inter-node communication backbone for this Tri-Lab Linux Capacity Cluster (TLCC) machine is comprised of an InfiniBand interconnect. The nonuniform memory access (NUMA) nodes consist of 2.2 GHz quad socket/quad core AMD Opteron processors. The performance results for this study are obtained with a FE semiconductor device simulation code (Charon) that is based on a fully-coupled Newton-Krylov solver with domain decomposition and multilevel preconditioners. Scaling and multicore performance results are presented for large-scale problems of 100+ million unknowns on up to 4096 cores. A parallel scaling comparison is also presented with the Cray XT3/4 Red Storm capability platform. The results indicate that an MPI-only programming model for utilizing the multicore nodes is reasonably efficient on all 16 cores per compute node. However, the results also indicated that the multilevel preconditioner, which is critical for large-scale capability type simulations, scales better on the Red Storm machine than the TLCC machine.

More Details

TYPE SAND Report YEAR 2009

DOI OSTI

An Extensible Operating System Design for Large-Scale Parallel Machines

Riesen, Rolf; Ferreira, Kurt

Abstract not provided.

More Details

TYPE Conference YEAR 2009

OSTI

I/O trace data from homme_cam_3_2_59 code runs

Ward, Harry L.

Abstract not provided.

More Details

TYPE Presentation YEAR 2009

OSTI

Modeling Populations of Interest in Order to Simulate Cultural Response to Influence Activities

Bernard, Michael L.; Backus, George A.; Glickman, Matthew R.

Abstract not provided.

More Details

TYPE Conference YEAR 2009

OSTI

Calculation of chemical reaction energies using the AM05 density functional

Journal of Computational Chemistry

Wills, Ann E.; Janssen, Curtis L.

Abstract not provided.

More Details

TYPE Journal Article YEAR 2009

OSTI

DAKOTA Training 2008: Optimization and Calibration

Adams, Brian M.; Swiler, Laura P.; Eldred, Michael S.; Gay, David M.

Abstract not provided.

More Details

TYPE Presentation YEAR 2009

OSTI

Improving The Semidefinite Programming Bound To Max Cut

Operations Research Letters

Carr, Robert D.

Abstract not provided.

More Details

TYPE Journal Article YEAR 2009

OSTI

Verification Validation Uncertainty Quantification Predictive Modeling and Simulation: Integration of NW Capabilities into NEAMS

Stewart, James R.

Abstract not provided.

More Details

TYPE Presentation YEAR 2009

OSTI

On the two-domain equations for gas chromatography

Romero, L.A.; Parks, Michael L.

We present an analysis of gas chromatographic columns where the stationary phase is not assumed to be a thin uniform coating along the walls of the cross section. We also give an asymptotic analysis assuming that the parameter {beta} = KD{sup II}{rho}{sup II}/D{sup I}{rho}{sup I} is small. Here K is the partition coefficient, and D{sup i} and {rho}{sup i}, i = I, II are the diffusivity and density in the mobile (i = I) and stationary (i = II) regions.

More Details

TYPE SAND Report YEAR 2009

DOI OSTI

Interoperable mesh components for large-scale, distributed-memory simulations

Journal of Physics: Conference Series

Devine, Karen D.; Diachin, L.; Kraftcheck, J.; Jansen, K.E.; Leung, Vitus J.; Luo, X.; Miller, M.; Ollivier-Gooch, C.; Ovcharenko, A.; Sahni, O.; Shephard, M.S.; Tautges, T.; Xie, T.; Zhou, M.

SciDAC applications have a demonstrated need for advanced software tools to manage the complexities associated with sophisticated geometry, mesh, and field manipulation tasks, particularly as computer architectures move toward the petascale. In this paper, we describe a software component - an abstract data model and programming interface - designed to provide support for parallel unstructured mesh operations. We describe key issues that must be addressed to successfully provide high-performance, distributed-memory unstructured mesh services and highlight some recent research accomplishments in developing new load balancing and MPI-based communication libraries appropriate for leadership class computing. Finally, we give examples of the use of parallel adaptive mesh modification in two SciDAC applications. © 2009 IOP Publishing Ltd.

More Details

TYPE Conference YEAR 2009

Scopus OSTI

DOE's Institute for Advanced Architecture and Algorithms: An application-driven approach

Journal of Physics: Conference Series

Murphy, Richard C.

This paper describes an application driven methodology for understanding the impact of future architecture decisions on the end of the MPP era. Fundamental transistor device limitations combined with application performance characteristics have created the switch to multicore/multithreaded architectures. Designing large-scale supercomputers to match application demands is particularly challenging since performance characteristics are highly counter-intuitive. In fact, data movement more than FLOPS dominates. This work discusses some basic performance analysis for a set of DOE applications, the limits of CMOS technology, and the impact of both on future architectures. © 2009 IOP Publishing Ltd.

More Details

TYPE Conference YEAR 2009

OSTI Scopus

Publications

Search results