Publications

Results 8301–8350 of 9,998


Poblano v1.0: a Matlab toolbox for gradient-based optimization

Dunlavy, Daniel D.; Kolda, Tamara G.

We present Poblano v1.0, a Matlab toolbox for solving gradient-based unconstrained optimization problems. Poblano implements three optimization methods (nonlinear conjugate gradients, limited-memory BFGS, and truncated Newton) that require only first-order derivative information. In this paper, we describe the Poblano methods, provide numerous examples of how to use Poblano, and present results from using Poblano to solve problems from a standard test collection of unconstrained optimization problems.
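Poblano itself is a Matlab toolbox; as an illustration of the same class of first-order methods it implements, here is a Python sketch applying limited-memory BFGS (via SciPy's optimizer, not Poblano's API) to the Rosenbrock test function, using only gradient information:

```python
# Illustrative sketch only: limited-memory BFGS on a standard unconstrained
# test problem, supplying just first-order (gradient) information.
import numpy as np
from scipy.optimize import minimize

def rosenbrock(x):
    """Classic unconstrained test problem; minimum at (1, 1)."""
    return 100.0 * (x[1] - x[0] ** 2) ** 2 + (1.0 - x[0]) ** 2

def rosenbrock_grad(x):
    """Analytic gradient -- only first-order information is required."""
    return np.array([
        -400.0 * x[0] * (x[1] - x[0] ** 2) - 2.0 * (1.0 - x[0]),
        200.0 * (x[1] - x[0] ** 2),
    ])

result = minimize(rosenbrock, x0=np.array([-1.2, 1.0]),
                  jac=rosenbrock_grad, method="L-BFGS-B")
print(result.x)  # converges close to [1, 1]
```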


On calculating the equilibrium structure of molecular crystals

Wills, Ann E.; Wixom, Ryan R.; Mattsson, Thomas M.

The difficulty of calculating the ambient properties of molecular crystals, such as the explosive PETN, has long hampered much needed computational investigations of these materials. One reason for the shortcomings is that the exchange-correlation functionals available for Density Functional Theory (DFT) based calculations do not correctly describe the weak intermolecular van der Waals forces present in molecular crystals. However, this weak interaction also poses other challenges for the computational schemes used. We will discuss these issues in the context of calculations of the lattice constants and structure of PETN with a number of different functionals, and also discuss whether these limitations can be circumvented for studies at non-ambient conditions.


HPC top 10 InfiniBand machine: a 3D torus IB interconnect on Red Sky

Naegle, John H.; Monk, Stephen T.; Schutt, James A.; Doerfler, Douglas W.; Rajan, Mahesh R.

This presentation discusses the following topics: (1) Red Sky background; (2) 3D torus interconnect concepts; (3) difficulties of a torus in IB; (4) new routing code for a 3D torus in IB; (5) the Red Sky 3D torus implementation; and (6) managing a large IB machine.
Computing at Sandia falls into two categories. (1) Capability computing: designed for scaling of single large runs; usually proprietary for maximum performance; Red Storm is Sandia's current capability machine. (2) Capacity computing: computing for the masses, with hundreds of jobs and hundreds of users; extreme reliability required; flexibility for a changing workload. Thunderbird will be decommissioned this quarter; Red Sky is our future capacity computing platform; the Red Mesa machine is for the National Renewable Energy Laboratory.
Red Sky's main themes are: (1) Cheaper: 5X the capacity of Thunderbird at 2/3 the cost, and substantially cheaper per flop than our last large capacity machine purchase. (2) Leaner: lower operational costs; three security environments via a modular fabric; expandable, upgradeable, and extensible; designed for a 6-year life cycle. (3) Greener: 15% less power (1/6th the power per flop); 40% less water (5M gallons saved annually); 10X better cooling efficiency; and a 4X denser footprint.


Combining dynamical decoupling with optimal control for improved QIP

Carroll, Malcolm; Witzel, Wayne W.

Constructing high-fidelity control pulses that are robust to control and system/environment fluctuations is a crucial objective for quantum information processing (QIP). We combine dynamical decoupling (DD) with optimal control (OC) to identify control pulses that achieve this objective numerically. Previous DD work has shown that general errors up to (but not including) third order can be removed from {pi}- and {pi}/2-pulses without concatenation. By systematically integrating DD and OC, we are able to increase pulse fidelity beyond this limit. Our hybrid method of quantum control incorporates a newly-developed algorithm for robust OC, providing a nested DD-OC approach to generate robust controls. Motivated by solid-state QIP, we also incorporate relevant experimental constraints into this DD-OC formalism. To demonstrate the advantage of our approach, the resulting quantum controls are compared to previous DD results in open and uncertain model systems.


Solid-liquid phase coexistence of alkali nitrates from molecular dynamics simulations

Jayaraman, Saivenkataraman J.

Alkali nitrate eutectic mixtures are finding application as industrial heat transfer fluids in concentrated solar power generation systems. An important property for such applications is the melting point, or phase coexistence temperature. We have computed melting points for lithium, sodium, and potassium nitrate from molecular dynamics simulations using a recently developed method, which uses thermodynamic integration to compute the free energy difference between the solid and liquid phases. The computed melting point for NaNO3 was within 15 K of its experimental value, while for LiNO3 and KNO3 the computed melting points were within 100 K of the experimental values [4]. We are currently extending the approach to calculate melting temperatures for binary mixtures of lithium and sodium nitrate.
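The thermodynamic-integration idea behind such calculations can be sketched minimally: the free energy difference between two states is the integral of the ensemble average of dU/dlambda along a coupling path, and the melting point is the temperature at which the solid-liquid free energy difference vanishes. The dU/dlambda values below are made up for illustration; in practice each point comes from a separate MD simulation.

```python
# Minimal thermodynamic-integration sketch with hypothetical data:
# Delta F = integral over lambda in [0, 1] of <dU/dlambda>.
import numpy as np

lam = np.linspace(0.0, 1.0, 11)        # coupling parameter lambda
du_dlam = 5.0 - 3.0 * lam              # hypothetical <dU/dlambda>, kJ/mol

# Trapezoidal quadrature of the integrand along the coupling path.
delta_F = np.sum(0.5 * (du_dlam[1:] + du_dlam[:-1]) * np.diff(lam))
print(delta_F)                         # 3.5 kJ/mol for this made-up data
```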


A brief parallel I/O tutorial

Ward, Harry L.

This document provides common best practices for the efficient utilization of parallel file systems, aimed at analysts and application developers. A multi-program, parallel supercomputer provides effective compute power by aggregating a host of lower-power processors over a network. In general, one either constructs the application to distribute parts of the work to the available nodes and processors and then collects the result (a parallel application), or one launches a large number of small jobs, each doing similar work on different subsets (a campaign).
The I/O system on these machines is usually implemented as a tightly-coupled parallel application itself, providing the concept of a 'file' to the host applications. A 'file' is an addressable store of bytes, and that address space is global in nature; in essence, the I/O system provides a global address space. Beyond the simple reality that the I/O system is normally composed of a smaller, less capable collection of hardware, that global address space will cause problems if not carefully utilized. How much of a problem, and the ways in which those problems manifest, will differ, but that it is problem prone has been well established.
Worse, the file system is a shared resource on the machine - a system service. What an application does when it uses the file system impacts all users. No portion of the available resource is reserved; instead, the I/O system responds to requests by scheduling and queuing based on instantaneous demand. Using the system well contributes to the overall throughput of the machine. From a solely self-centered perspective, using it well reduces the time that the application or campaign is subject to impact by others.
The developer's goal should be to accomplish I/O in a way that minimizes interaction with the I/O system, maximizes the amount of data moved per call, and provides the I/O system the most information about each transfer.
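The "fewer, larger requests" guidance above can be sketched with a simple write buffer: many small writes are accumulated in memory and flushed to the file system as one large request. Class name, file path, and buffer size below are arbitrary choices for illustration, not part of any particular I/O library.

```python
# Sketch: buffer small writes in memory, flush as one large request.
import os
import tempfile

class BufferedWriter:
    def __init__(self, path, buffer_bytes=4 * 1024 * 1024):
        self.f = open(path, "wb")
        self.buf = bytearray()
        self.limit = buffer_bytes
        self.flushes = 0           # number of actual file-system requests

    def write(self, data):
        self.buf += data
        if len(self.buf) >= self.limit:
            self.flush()

    def flush(self):
        if self.buf:
            self.f.write(self.buf)  # one large request instead of many
            self.buf.clear()
            self.flushes += 1

    def close(self):
        self.flush()
        self.f.close()

path = os.path.join(tempfile.mkdtemp(), "out.dat")
w = BufferedWriter(path, buffer_bytes=1024)
for _ in range(1000):
    w.write(b"x" * 16)      # 1000 small 16-byte writes...
w.close()
print(w.flushes)            # ...become 16 file-system requests
```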


Porting LAMMPS to GPUs

Brown, William M.; Crozier, Paul C.; Plimpton, Steven J.

LAMMPS is a classical molecular dynamics code, and an acronym for Large-scale Atomic/Molecular Massively Parallel Simulator. LAMMPS has potentials for soft materials (biomolecules, polymers), solid-state materials (metals, semiconductors), and coarse-grained or mesoscopic systems. It can be used to model atoms or, more generically, as a parallel particle simulator at the atomic, meso, or continuum scale. LAMMPS runs on single processors or in parallel using message-passing techniques and a spatial decomposition of the simulation domain. The code is designed to be easy to modify or extend with new functionality.
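The spatial decomposition mentioned above can be sketched as a toy: the simulation box is divided into a grid of subdomains, and each particle is assigned to the subdomain (in a real run, the MPI rank) that owns its coordinates. Box size and grid shape here are arbitrary, and this is not LAMMPS code.

```python
# Toy spatial-decomposition sketch: assign particles to subdomain owners.
import numpy as np

box = np.array([10.0, 10.0, 10.0])   # box side lengths
grid = np.array([2, 2, 2])           # 2x2x2 process grid -> 8 subdomains

rng = np.random.default_rng(0)
positions = rng.uniform(0.0, box, size=(1000, 3))

# Map each coordinate to a grid cell; clamp to guard the upper box edge.
cell = np.minimum((positions / (box / grid)).astype(int), grid - 1)
owner = cell[:, 0] * grid[1] * grid[2] + cell[:, 1] * grid[2] + cell[:, 2]

counts = np.bincount(owner, minlength=grid.prod())
print(counts.sum())   # 1000: every particle is owned by exactly one rank
```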


Foundational development of an advanced nuclear reactor integrated safety code

Schmidt, Rodney C.; Hooper, Russell H.; Humphries, Larry; Lorber, Alfred L.; Spotz, William S.

This report describes the activities and results of a Sandia LDRD project whose objective was to develop and demonstrate foundational aspects of a next-generation nuclear reactor safety code that leverages advanced computational technology. The project scope was directed towards the systems-level modeling and simulation of an advanced, sodium cooled fast reactor, but the approach developed has a more general applicability. The major accomplishments of the LDRD are centered around the following two activities. (1) The development and testing of LIME, a Lightweight Integrating Multi-physics Environment for coupling codes that is designed to enable both 'legacy' and 'new' physics codes to be combined and strongly coupled using advanced nonlinear solution methods. (2) The development and initial demonstration of BRISC, a prototype next-generation nuclear reactor integrated safety code. BRISC leverages LIME to tightly couple the physics models in several different codes (written in a variety of languages) into one integrated package for simulating accident scenarios in a liquid sodium cooled 'burner' nuclear reactor. Other activities and accomplishments of the LDRD include (a) further development, application and demonstration of the 'non-linear elimination' strategy to enable physics codes that do not provide residuals to be incorporated into LIME, (b) significant extensions of the RIO CFD code capabilities, (c) complex 3D solid modeling and meshing of major fast reactor components and regions, and (d) an approach for multi-physics coupling across non-conformal mesh interfaces.
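The strong-coupling idea behind LIME can be illustrated minimally: two "physics codes" each expose a residual, and Newton's method is applied to the concatenated nonlinear system rather than lagging one code behind the other. The residual functions and Jacobian below are toy stand-ins, not LIME's actual interface.

```python
# Toy strongly-coupled Newton solve over two "codes'" residuals.
import numpy as np

def residual(u):
    x, y = u
    return np.array([
        x**2 + y - 3.0,    # "code A" residual
        x + y**2 - 5.0,    # "code B" residual, coupled through x and y
    ])

def jacobian(u):
    x, y = u
    return np.array([[2.0 * x, 1.0],
                     [1.0, 2.0 * y]])

u = np.array([1.0, 1.0])
for _ in range(20):
    du = np.linalg.solve(jacobian(u), -residual(u))
    u = u + du
    if np.linalg.norm(residual(u)) < 1e-12:
        break
print(u)   # converges to the coupled solution (1, 2)
```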


A framework for reduced order modeling with mixed moment matching and peak error objectives

SIAM Journal on Scientific Computing

Santarelli, Keith R.

We examine a new method of producing reduced order models for LTI systems which attempts to minimize a bound on the peak error between the original and reduced order models subject to a bound on the peak value of the input. The method, which can be implemented by solving a set of linear programming problems that are parameterized via a single scalar quantity, is able to minimize an error bound subject to a number of moment matching constraints. Moreover, because all optimization is performed in the time domain, the method can also be used to perform model reduction for infinite dimensional systems, rather than being restricted to finite order state space descriptions. We begin by contrasting the method we present here with two classes of standard model reduction algorithms, namely, moment matching algorithms and singular value-based methods. After motivating the class of reduction tools we propose, we describe the algorithm (which minimizes the L1 norm of the difference between the original and reduced order impulse responses) and formulate the corresponding linear programming problem that is solved during each iteration of the algorithm. We then prove that, for a certain class of LTI systems, the method we propose can be used to produce reduced order models of arbitrary accuracy even when the original system is infinite dimensional. We then show how to incorporate moment matching constraints into the basic error bound minimization algorithm, and present three examples which utilize the techniques described herein. We conclude with some comments on extensions to multi-input, multi-output systems, as well as some general comments for future work. © 2010 Society for Industrial and Applied Mathematics.
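The error bound that motivates minimizing the L1 norm of the impulse-response difference can be checked numerically: for a discrete LTI system, the peak output error between the original and reduced models is bounded by that L1 norm times the peak input value. The two example impulse responses below are arbitrary stable filters chosen for illustration, not systems from the paper.

```python
# Numerical check of the bound: max|y - y_hat| <= ||h - h_hat||_1 * max|u|.
import numpy as np

N = 200
n = np.arange(N)
h_full = 0.9**n + 0.5 * 0.6**n     # impulse response of the "original" model
h_red = 1.3 * 0.88**n              # impulse response of a crude reduction

l1_error = np.sum(np.abs(h_full - h_red))

rng = np.random.default_rng(1)
u = rng.uniform(-1.0, 1.0, size=N)         # bounded input, |u| <= 1
y_full = np.convolve(h_full, u)[:N]
y_red = np.convolve(h_red, u)[:N]

peak_error = np.max(np.abs(y_full - y_red))
print(peak_error <= l1_error)              # True: the L1 norm is a valid bound
```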



A switched state feedback law for the stabilization of LTI systems

Proceedings of the 2010 American Control Conference, ACC 2010

Santarelli, Keith R.

Inspired by prior work in the design of switched feedback controllers for second order systems, we develop a switched state feedback control law for the stabilization of LTI systems of arbitrary dimension. The control law operates by switching between two static gain vectors in such a way that the state trajectory is driven onto a stable n - 1 dimensional hyperplane (where n represents the system dimension). We begin by briefly examining relevant geometric properties of the phase portraits in the case of two-dimensional systems and show how these geometric properties can be expressed as algebraic constraints on the switched vector fields that are applicable to LTI systems of arbitrary dimension. We then describe an explicit procedure for designing stabilizing controllers and illustrate the closed-loop transient performance via two examples. © 2010 AACC.
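The flavor of driving a trajectory onto a stable hyperplane by switching can be shown on a double integrator: the control switches based on which side of the line s = x1 + x2 = 0 the state lies, and once on the surface the state decays along it. This is a sliding-mode-style toy, not the paper's construction; the gain, surface, and time step are arbitrary.

```python
# Toy switched control on a double integrator x1' = x2, x2' = u.
import numpy as np

dt, K = 1e-3, 2.0
x = np.array([1.0, 1.0])
x0_norm = np.linalg.norm(x)

for _ in range(20000):            # simulate 20 seconds with forward Euler
    s = x[0] + x[1]               # switching surface s = x1 + x2
    u = -K if s > 0 else K        # switch between two constant controls
    x = x + dt * np.array([x[1], u])

print(np.linalg.norm(x) < 0.1)    # state is driven near the origin
```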


Advanced I/O for large-scale scientific applications

Oldfield, Ron A.

As scientific simulations scale to petascale machines and beyond, the data volumes they generate pose a dual problem. First, with increasing machine sizes, careful tuning of I/O routines becomes more and more important to keep the time spent in I/O acceptable. It is not uncommon, for instance, for an application to spend 20% of its runtime performing I/O on a 'tuned' system; careful management of the I/O routines can reduce that to 5% or even less in some cases. Second, the data volumes are so large, on the order of 10s to 100s of TB, that discovering the scientifically valid contributions requires assistance at runtime to both organize and annotate the data. Waiting for offline processing is not feasible, due both to the impact on the I/O system and to the time required.
To reduce this load and improve scientists' ability to use the large amounts of data being produced, new data management techniques are required. First, there is a need for techniques for efficient movement of data from the compute space to storage. These techniques should understand the underlying system infrastructure and adapt to changing system conditions. Technologies include aggregation networks, data staging nodes for closer parity to the I/O subsystem, and autonomic I/O routines that can detect system bottlenecks and choose different approaches, such as splitting the output into multiple targets or staggering output processes. Such methods must be end-to-end: even with properly managed asynchronous techniques, it is still essential to properly manage the later synchronous interaction with the storage system to maintain acceptable performance. Second, annotations and other metadata must be incorporated into the generated data to help the scientist understand the output of the simulation run as a whole, and to select data and data features without concern for which files or other storage technologies were employed.
All of these features should be attained while maintaining a simple deployment for the science code and eliminating the need to allocate additional computational resources.
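The aggregation/staging idea described above can be sketched as follows: instead of every process issuing its own small write, the blocks are gathered to one "staging" process, which issues a single large, contiguous write. Ranks are simulated in-process here; a real implementation would use MPI and a parallel file system.

```python
# Sketch: gather many ranks' small blocks, then issue one large write.
import os
import tempfile

num_ranks = 64
blocks = [bytes([r]) * 4096 for r in range(num_ranks)]  # 4 KiB per "rank"

# Staging step: gather all blocks into one contiguous buffer...
staged = b"".join(blocks)

# ...then issue a single write instead of 64 small ones.
path = os.path.join(tempfile.mkdtemp(), "staged.dat")
with open(path, "wb") as f:
    f.write(staged)

print(os.path.getsize(path))    # 64 * 4096 = 262144 bytes in one request
```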


Lightweight storage and overlay networks for fault tolerance

Oldfield, Ron A.

The next generation of capability-class, massively parallel processing (MPP) systems is expected to have hundreds of thousands to millions of processors. In such environments, it is critical to have fault-tolerance mechanisms, including checkpoint/restart, that scale with the size of applications and the percentage of the system on which the applications execute. For application-driven, periodic checkpoint operations, the state of the art does not provide a scalable solution. For example, on today's massive-scale systems executing applications that consume most of the memory of the employed compute nodes, checkpoint operations generate I/O that consumes nearly 80% of the total I/O usage. Motivated by this observation, this project aims to improve I/O performance for application-directed checkpoints through the use of lightweight storage architectures and overlay networks. Lightweight storage provides direct access to underlying storage devices; overlay networks provide caching and processing capabilities in the compute-node fabric. The combination has the potential to significantly reduce I/O overhead for large-scale applications. This report describes our combined efforts to model and understand overheads for application-directed checkpoints, as well as the implementation and performance analysis of a checkpoint service that uses available compute nodes as a network cache for checkpoint operations.
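Checkpoint-overhead modeling of this kind has a classic closed form: Young's approximation for the optimal interval between checkpoints, tau = sqrt(2 * delta * M), where delta is the cost of one checkpoint and M is the system's mean time between failures. The numbers below are hypothetical, chosen only to show the formula's scale.

```python
# Young's approximation for the optimal checkpoint interval.
import math

delta = 600.0       # checkpoint cost: 10 minutes, in seconds (hypothetical)
mtbf = 86400.0      # mean time between failures: 24 hours, in seconds

tau = math.sqrt(2.0 * delta * mtbf)
print(tau / 3600.0)   # optimal interval in hours, about 2.83
```

Checkpointing much more often than tau wastes time writing state; much less often wastes time recomputing lost work after a failure.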
