Publications


A scalable distributed parallel breadth-first search algorithm on BlueGene/L

Proceedings of the ACM/IEEE 2005 Supercomputing Conference, SC'05

Yoo, Andy; Chow, Edmond; Henderson, Keith; McLendon, William; Hendrickson, Bruce A.; Çatalyürek, Ümit

Many emerging large-scale data science applications require searching large graphs distributed across multiple memories and processors. This paper presents a distributed breadth-first search (BFS) scheme that scales for random graphs with up to three billion vertices and 30 billion edges. Scalability was tested on IBM BlueGene/L with 32,768 nodes at the Lawrence Livermore National Laboratory. Scalability was obtained through a series of optimizations, in particular those that ensure scalable use of memory. We use 2D (edge) partitioning of the graph instead of conventional 1D (vertex) partitioning to reduce communication overhead. For Poisson random graphs, we show that the expected size of the messages is scalable for both 2D and 1D partitionings. Finally, we have developed efficient collective communication functions for the 3D torus architecture of BlueGene/L that also take advantage of the structure in the problem. The performance and characteristics of the algorithm are measured and reported. © 2005 IEEE.
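
The core of the approach is a level-synchronous BFS: the whole frontier is expanded each round, and under the 2D (edge) partitioning each round's candidate vertices are exchanged only along processor rows and columns. Below is a minimal serial sketch of level-synchronous frontier expansion only; it uses a made-up graph format and is illustrative, not the authors' 2D-partitioned MPI implementation.

    def bfs_levels(adj, source):
        """Level-synchronous BFS: expand the entire frontier each round.

        adj: dict mapping vertex -> list of neighbours (illustrative format).
        Returns a dict mapping vertex -> BFS level.
        """
        level = {source: 0}
        frontier = {source}
        depth = 0
        while frontier:
            depth += 1
            next_frontier = set()
            for u in frontier:            # in the parallel scheme, frontier edges are
                for v in adj[u]:          # spread over a 2D processor grid and candidate
                    if v not in level:    # vertices are exchanged along rows/columns
                        level[v] = depth
                        next_frontier.add(v)
            frontier = next_frontier
        return level

    # toy usage
    g = {0: [1, 2], 1: [0, 3], 2: [0, 3], 3: [1, 2]}
    print(bfs_levels(g, 0))   # {0: 0, 1: 1, 2: 1, 3: 2}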

Finding strongly connected components in distributed graphs

Journal of Parallel and Distributed Computing

McLendon, William; Hendrickson, Bruce A.; Plimpton, Steven J.; Rauchwerger, Lawrence

The traditional serial algorithm for finding the strongly connected components of a graph is based on depth-first search and has complexity linear in the size of the graph. Depth-first search is difficult to parallelize, which creates the need for a different parallel algorithm for this problem. We describe the implementation of a recently proposed parallel algorithm that finds strongly connected components in distributed graphs, and discuss how it is used in a radiation transport solver. © 2005 Elsevier Inc. All rights reserved.
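
The parallel algorithm implemented here replaces depth-first search with reachability computations. Below is a minimal serial sketch of the general forward/backward divide-and-conquer idea; it is illustrative only and omits all distributed-memory and solver-specific details.

    def reach(adj, start):
        """Vertices reachable from `start` in the graph `adj` (dict: vertex -> successor list)."""
        seen, stack = {start}, [start]
        while stack:
            u = stack.pop()
            for v in adj.get(u, ()):
                if v not in seen:
                    seen.add(v)
                    stack.append(v)
        return seen

    def sccs(vertices, adj, radj):
        """Divide-and-conquer SCC via forward/backward reachability from a pivot.

        adj and radj are successor and predecessor lists restricted to `vertices`.
        """
        if not vertices:
            return []
        pivot = next(iter(vertices))
        fwd, bwd = reach(adj, pivot), reach(radj, pivot)
        scc = fwd & bwd                                    # the pivot's strongly connected component
        out = [scc]
        for part in (fwd - scc, bwd - scc, vertices - (fwd | bwd)):
            sub_adj = {v: [w for w in adj.get(v, ()) if w in part] for v in part}
            sub_radj = {v: [w for w in radj.get(v, ()) if w in part] for v in part}
            out.extend(sccs(part, sub_adj, sub_radj))      # the three remainders are independent
        return out

    # toy usage: two components, {0, 1, 2} and {3}
    g  = {0: [1], 1: [2], 2: [0, 3], 3: []}
    rg = {0: [2], 1: [0], 2: [1], 3: [2]}
    print(sccs({0, 1, 2, 3}, g, rg))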

A fast high accuracy volume renderer for unstructured data

Proceedings - IEEE Symposium on Volume Visualization and Graphics 2004. VolVis 2004

Moreland, Kenneth D.; Angel, Edward

In this paper, we describe an unstructured mesh volume renderer. Our renderer is interactive and accurately integrates light intensity an order of magnitude faster than previous methods. We employ a projective technique that takes advantage of the expanded programmability of the latest 3D graphics hardware. We also analyze an optical model commonly used for scientific volume rendering and derive a new method to compute it that is very accurate but computationally feasible in real time. We demonstrate a system that can accurately produce a volume rendering of an unstructured mesh with a first-order approximation to any classification method. Furthermore, our system is capable of rendering over 300 thousand tetrahedra per second yet is independent of the classification scheme used. © 2004 IEEE.

Identifying generalities in data sets using periodic Hopfield networks : initial status report

Link, Hamilton E.; Backer, Alejandro B.

We present a novel class of dynamic neural networks that is capable of learning, in an unsupervised manner, attractors that correspond to generalities in a data set. Upon presentation of a test stimulus, the networks follow a sequence of attractors that correspond to subsets of increasing size or generality in the original data set. The networks, inspired by those of the insect antennal lobe, build upon a modified Hopfield network in which nodes are periodically suppressed, global inhibition is gradually strengthened, and the weight of input neurons is gradually decreased relative to recurrent connections. This allows the networks to converge on a Hopfield network's equilibrium within each suppression cycle, and to switch between attractors in between cycles. The fast, mutually reinforcing excitatory connections that dominate dynamics within cycles ensure the robust, error-tolerant behavior that characterizes Hopfield networks. The cyclic inhibition releases the network from what would otherwise be stable equilibria or attractors. Increasing global inhibition and decreasing dependence on the input lead successive attractors to differ and to display increasing generality. As the network is faced with stronger inhibition, only neurons connected by stronger mutually excitatory connections will remain on; successive attractors will consist of sets of neurons that are more strongly correlated, and will tend to select increasingly generic characteristics of the data. Using artificial data, we were able to identify configurations of the network that appeared to produce a sequence of increasingly general results. The next logical steps are to apply these networks to suitable real-world data that can be characterized by a hierarchy of increasing generality and to observe the networks' performance. This report describes the work, data, and results, the current understanding of the results, and how the work could be continued. The code, data, and preliminary results are included and are available as an archive.
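
A minimal sketch of the dynamics described above, with made-up parameter names: an asynchronous Hopfield update augmented by a global inhibition term that grows from cycle to cycle, an input drive whose weight decays from cycle to cycle, and a partial suppression of activity between cycles. It is a toy illustration, not the implementation described in the report.

    import numpy as np

    def periodic_hopfield(W, x_input, n_cycles=5, steps_per_cycle=200,
                          inhib_step=0.05, input_decay=0.7, seed=0):
        """Return the (approximate) attractor reached in each suppression cycle.

        W: symmetric weight matrix with zero diagonal; x_input: binary input pattern.
        inhib_step and input_decay are assumed schedules, not values from the report.
        """
        rng = np.random.default_rng(seed)
        n = len(x_input)
        state = np.array(x_input, dtype=float)
        inhibition, input_weight = 0.0, 1.0
        attractors = []
        for _ in range(n_cycles):
            for _ in range(steps_per_cycle):
                i = rng.integers(n)                    # asynchronous update of one unit
                field = (W[i] @ state                  # recurrent (excitatory) drive
                         + input_weight * x_input[i]   # decaying dependence on the input
                         - inhibition * state.sum())   # growing global inhibition
                state[i] = 1.0 if field > 0 else 0.0
            attractors.append(state.copy())            # approximate equilibrium for this cycle
            inhibition += inhib_step                   # strengthen global inhibition
            input_weight *= input_decay                # weaken the input relative to recurrence
            state[rng.random(n) < 0.5] = 0.0           # periodic suppression of a subset of nodes
        return attractors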

Development of the augmented musculature device

Rohrer, Brandon R.; Pankretz, Ty D.

We developed an Augmented Musculature Device (AMD) that assists the movements of its wearer. It has direct application to aiding military and law enforcement personnel, the neurologically impaired, or those requiring any type of cybernetic assistance. The AMD consists of a collection of artificial muscles, each individually actuated, strategically placed along the surface of the human body. The actuators employed by the AMD are known as 'air muscles' and operate pneumatically. They are commercially available from several vendors and are relatively inexpensive. They have a remarkably high force-to-weight ratio--as high as 400:1 (as compared with 16:1 typical of DC motors). They are flexible and elastic, even when powered, making them ideal for interaction with humans.

Integration of biological ion channels onto optically addressable micro-fluidic electrode arrays for single molecule characterization

Brozik, Susan M.; Carles, Elizabeth L.; Flemming, Jeb H.; Bachand, George B.; Frink, Laura J.

The challenge of modeling the organization and function of biological membranes on a solid support has received considerable attention in recent years, primarily driven by potential applications in biosensor design. Affinity-based biosensors show great promise for extremely sensitive detection of BW agents and toxins. Receptor molecules have been successfully incorporated into phospholipid bilayers supported on sensing platforms. However, a collective body of data detailing a mechanistic understanding of the membrane processes involved in receptor-substrate interactions, and of the competition between localized perturbations and delocalized responses that results in reorganization of transmembrane protein structure, has yet to be produced. This report describes a systematic procedure to develop a detailed correlation between (recognition-induced) protein restructuring and the function of a ligand-gated ion channel by combining single-molecule fluorescence spectroscopy with single-channel current recordings. The document is divided into three sections: (1) the thermodynamics and diffusion properties of gramicidin, studied using single-molecule fluorescence imaging; (2) preliminary work on the 5-HT3 serotonin receptor; and (3) the design and fabrication of a miniaturized platform that combines these two technologies (spectroscopic and single-channel electrochemical techniques) for single-molecule analysis, with the longer-term goal of using the physical and electronic changes caused by a specific molecular recognition event as a transduction pathway in affinity-based biosensors for biotoxin detection.

Climate change effects on international stability : a white paper

Boslough, Mark B.; Sprigg, James A.; Backus, George A.; Taylor, Mark A.; McNamara, Laura A.; Murphy, Kathryn M.; Malczynski, Leonard A.

This white paper represents a summary of work intended to lay the foundation for development of a climatological/agent model of climate-induced conflict. The paper combines several loosely-coupled efforts and is the final report for a four-month late-start Laboratory Directed Research and Development (LDRD) project funded by the Advanced Concepts Group (ACG). The project involved contributions by many participants having diverse areas of expertise, with the common goal of learning how to tie together the physical and human causes and consequences of climate change. We performed a review of relevant literature on conflict arising from environmental scarcity. Rather than simply reviewing the previous work, we actively collected data from the referenced sources, reproduced some of the work, and explored alternative models. We used the unfolding crisis in Darfur (western Sudan) as a case study of conflict related to or triggered by climate change, and as an exercise for developing a preliminary concept map. We also outlined a plan for implementing agents in a climate model and defined a logical progression toward the ultimate goal of running both types of models simultaneously in a two-way feedback mode, where the behavior of agents influences the climate and climate change affects the agents. Finally, we offer some "lessons learned" in attempting to keep a diverse and geographically dispersed group working together by using Web-based collaborative tools.

The Common Geometry Module (CGM)

Tautges, Timothy J.

The Common Geometry Module (CGM) is a code library which provides geometry functionality used for mesh generation and other applications. This functionality includes that commonly found in solid modeling engines, like geometry creation, query and modification; CGM also includes capabilities not commonly found in solid modeling engines, like geometry decomposition tools and support for shared material interfaces. CGM is built upon the ACIS solid modeling engine, but also includes geometry capability developed beside and on top of ACIS. CGM can be used as-is to provide geometry functionality for codes needing this capability. However, CGM can also be extended using derived classes in C++, allowing the geometric model to serve as the basis for other applications, for example mesh generation. CGM is supported on Sun Solaris, SGI, HP, IBM, DEC, Linux and Windows NT platforms. CGM also includes support for loading ACIS models on parallel computers, using MPI-based communication. Future plans for CGM are to port it to different solid modeling engines, including Pro/Engineer or SolidWorks. CGM is being released into the public domain under an LGPL license; the ACIS-based engine is available to ACIS licensees on request.

Final report : compliant thermo-mechanical MEMS actuators, LDRD #52553

Baker, Michael S.; Plass, R.A.; Headley, Thomas J.; Walraven, J.A.

Thermal actuators have proven to be a robust actuation method in surface-micromachined MEMS processes. Their higher output force and lower input voltage make them an attractive alternative to more traditional electrostatic actuation methods. A predictive model of thermal actuator behavior has been developed and validated that can be used as a design tool to customize the performance of an actuator to a specific application. This tool has also been used to better understand thermal actuator reliability by comparing the maximum actuator temperature to the measured lifetime. Modeling thermal actuator behavior requires the use of two sequentially coupled models, the first to predict the temperature increase of the actuator due to the applied current and the second to model the mechanical response of the structure due to the increase in temperature. These two models have been developed using Matlab for the thermal response and ANSYS for the structural response. Both models have been shown to agree well with experimental data. In a parallel effort, the reliability and failure mechanisms of thermal actuators have been studied. Their response to electrical overstress and electrostatic discharge has been measured and a study has been performed to determine actuator lifetime at various temperatures and operating conditions. The results from this study have been used to determine a maximum reliable operating temperature that, when used in conjunction with the predictive model, enables us to design in reliability and customize the performance of an actuator at the design stage.

Large-scale stabilized FE computational analysis of nonlinear steady state transport/reaction systems

Proposed for publication in Computer Methods in Applied Mechanics and Engineering.

Shadid, John N.; Salinger, Andrew G.; Pawlowski, Roger P.; Lin, Paul L.; Hennigan, Gary L.; Tuminaro, Raymond S.; Lehoucq, Richard B.

The solution of the governing steady transport equations for momentum, heat and mass transfer in fluids undergoing non-equilibrium chemical reactions can be extremely challenging. The difficulties arise from both the complexity of the nonlinear solution behavior and the nonlinear, coupled, non-symmetric nature of the system of algebraic equations that results from spatial discretization of the PDEs. In this paper, we briefly review progress on developing a stabilized finite element (FE) capability for numerical solution of these challenging problems. The discussion considers the stabilized FE formulation for the low Mach number Navier-Stokes equations with heat and mass transport and non-equilibrium chemical reactions, and the solution methods necessary for detailed analysis of these complex systems. The solution algorithms include robust nonlinear and linear solution schemes, parameter continuation methods, and linear stability analysis techniques. Our discussion considers computational efficiency, scalability, and some implementation issues of the solution methods. Computational results are presented for a CFD benchmark problem as well as for a number of large-scale, 2D and 3D, engineering transport/reaction applications.

Performance of fully-coupled algebraic multilevel domain decomposition preconditioners for incompressible flow and transport

Proposed for publication in International Journal for Numerical Methods in Engineering.

Sala, Marzio S.; Shadid, John N.; Tuminaro, Raymond S.

This study investigates algebraic multilevel domain decomposition preconditioners of the Schwarz type for solving linear systems associated with Newton-Krylov methods. The key component of the preconditioner is a coarse approximation based on algebraic multigrid ideas to approximate the global behavior of the linear system. The algebraic multilevel preconditioner is based on an aggressive coarsening graph partitioning of the non-zero block structure of the Jacobian matrix. The scalability of the preconditioner is presented as well as comparisons with a two-level Schwarz preconditioner using a geometric coarse grid operator. These comparisons are obtained on large-scale distributed-memory parallel machines for systems arising from incompressible flow and transport using a stabilized finite element formulation. The results demonstrate the influence of the smoothers and coarse level solvers for a set of 3D example problems. For preconditioners with more than one level, careful attention needs to be given to the balance of robustness and convergence rate for the smoothers and the cost of applying these methods. For properly chosen parameters, the two- and three-level preconditioners are demonstrated to be scalable to 1024 processors.

Locally conservative least-squares finite element methods for Darcy flows

Proposed for publication in Computer Methods in Applied Mechanics and Engineering.

Bochev, Pavel B.

Least-squares finite-element methods for Darcy flow offer several advantages relative to the mixed-Galerkin method: the avoidance of stability conditions between finite-element spaces, the efficiency of solving symmetric and positive definite systems, and the convenience of using standard, continuous nodal elements for all variables. However, conventional C^0 implementations conserve mass only approximately and for this reason they have found limited acceptance in applications where locally conservative velocity fields are of primary interest. In this paper, we show that a properly formulated compatible least-squares method offers the same level of local conservation as a mixed method. The price paid for gaining favourable conservation properties is that one has to give up what is arguably the least important advantage attributed to least-squares finite-element methods: one can no longer use continuous nodal elements for all variables. As an added benefit, compatible least-squares methods inherit the best computational properties of both Galerkin and mixed-Galerkin methods and, in some cases, yield identical results, while offering the advantages of not having to deal with stability conditions and yielding positive definite discrete problems. Numerical results that illustrate our findings are provided.
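
For orientation, the least-squares idea for the Darcy system amounts to minimizing a residual functional; the following is the standard first-order form, not the specific compatible discretization developed in the paper:

    % Darcy system: velocity u, pressure p, permeability K, source f
    \mathbf{u} + K\,\nabla p = \mathbf{0}, \qquad \nabla\cdot\mathbf{u} = f \quad \text{in } \Omega,
    % least-squares functional minimized over the chosen finite element spaces
    J(\mathbf{u},p) \;=\; \tfrac{1}{2}\,\big\|\mathbf{u} + K\,\nabla p\big\|_{0}^{2}
                    \;+\; \tfrac{1}{2}\,\big\|\nabla\cdot\mathbf{u} - f\big\|_{0}^{2}.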

Full employment and competition in the Aspen economic model: implications for modeling acts of terrorism

Sprigg, James A.; Ehlen, Mark E.

Acts of terrorism could have a range of broad impacts on an economy, including changes in consumer (or demand) confidence and the ability of productive sectors to respond to changes. As a first step toward a model of terrorism-based impacts, we develop here a model of production and employment that characterizes dynamics in ways useful toward understanding how terrorism-based shocks could propagate through the economy; subsequent models will introduce the role of savings and investment into the economy. We use Aspen, a powerful economic modeling tool developed at Sandia, to demonstrate for validation purposes that a single-firm economy converges to the known monopoly equilibrium price, output, and employment levels, while multiple-firm economies converge toward the competitive equilibria typified by lower prices and higher output and employment. However, we find that competition also leads to churn by consumers seeking lower prices, making it difficult for firms to optimize with respect to wages, prices, and employment levels. Thus, competitive firms generate market "noise" in the steady state as they search for prices and employment levels that will maximize profits. In the context of this model, not only could terrorism depress overall consumer confidence and economic activity but terrorist acts could also cause normal short-run dynamics to be misinterpreted by consumers as a faltering economy.
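
The qualitative mechanism (firms adjusting prices by trial and error while consumers churn to the cheapest seller) can be illustrated with a toy agent loop. This is a heavily simplified sketch with invented parameters, not the Aspen model.

    import random

    class Firm:
        def __init__(self, price):
            self.price, self.sales = price, 0

    def simulate(n_firms=5, n_consumers=200, rounds=60, seed=1):
        """Toy Bertrand-style market: consumers buy from the cheapest firm each round;
        firms undercut when they sell nothing and edge prices up when they do sell."""
        random.seed(seed)
        firms = [Firm(price=10.0 + random.random()) for _ in range(n_firms)]
        for _ in range(rounds):
            for f in firms:
                f.sales = 0
            for _ in range(n_consumers):
                min(firms, key=lambda f: f.price).sales += 1        # consumer churn
            for f in firms:
                # the floor of 1.0 stands in for marginal cost in this toy
                f.price = max(1.0, f.price - 0.5) if f.sales == 0 else f.price + 0.1
        return sorted(round(f.price, 2) for f in firms)

    print(simulate())   # prices end up near the floor but keep jittering ("market noise")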

Seeded perturbations in wire array z-pinches

Jones, Brent M.; Deeney, Christopher D.; Mckenney, John M.; Garasi, Christopher J.; Mehlhorn, Thomas A.; Robinson, Allen C.; Wunsch, Scott E.

The impact of 3D structure on wire array z-pinch dynamics is a topic of current interest, and has been studied by the controlled seeding of wire perturbations. First, Al wires were etched at Sandia, creating 20% radial perturbations with variable axial wavelength. Observations of magnetic bubble formation in the etched regions during experiments on the MAGPIE accelerator are discussed and compared to 3D MHD modeling. Second, thin NaF coatings of 1 mm axial extent were deposited on Al wires and fielded on the Zebra accelerator. Little or no axial transport of the NaF spectroscopic dopant was observed in spatially resolved K-shell spectra, which places constraints on particle diffusivity in dense z-pinch plasmas. Finally, technology development for seeding perturbations is discussed.

Market disruption, cascading effects, and economic recovery: a life-cycle hypothesis model

Sprigg, James A.

This paper builds upon previous work [Sprigg and Ehlen, 2004] by introducing a bond market into a model of production and employment. The previous paper described an economy in which households choose whether to enter the labor and product markets based on wages and prices. Firms experiment with prices and employment levels to maximize their profits. We developed agent-based simulations using Aspen, a powerful economic modeling tool developed at Sandia, to demonstrate that multiple-firm economies converge toward the competitive equilibria typified by lower prices and higher output and employment, but also suffer from market noise stemming from consumer churn. In this paper we introduce a bond market as a mechanism for household savings. We simulate an economy of continuous overlapping generations in which each household grows older in the course of the simulation and continually revises its target level of savings according to a life-cycle hypothesis. Households can seek employment, earn income, purchase goods, and contribute to savings until they reach the mandatory retirement age; upon retirement households must draw from savings in order to purchase goods. This paper demonstrates the simultaneous convergence of product, labor, and savings markets to their calculated equilibria, and simulates how a disruption to a productive sector will create cascading effects in all markets. Subsequent work will use similar models to simulate how disruptions, such as terrorist attacks, would interplay with consumer confidence to affect financial markets and the broader economy.

PRAM C: a new programming environment for fine-grain and coarse-grain parallelism

Wen, Zhaofang W.

In the search for "good" parallel programming environments for Sandia's current and future parallel architectures, we revisit a long-standing open question: can the PRAM parallel algorithms designed by theoretical computer scientists over the last two decades be implemented efficiently? This open question has co-existed with ongoing efforts in the HPC community to develop practical parallel programming models that can simultaneously provide ease of use, expressiveness, performance, and scalability. Unfortunately, no single model has met all these competing requirements. Here we propose a parallel programming environment, PRAM C, to bridge the gap between theory and practice. This is an attempt to provide an affirmative answer to the PRAM question and to satisfy these competing practical requirements. The environment consists of a new thin runtime layer and an ANSI C extension. The C extension has two control constructs and one additional data type concept, "shared". This C extension should enable easy translation from PRAM algorithms to real parallel programs, much like the translation from sequential algorithms to C programs. The thin runtime layer bundles fine-grained communication requests into coarse-grained communication to be served by message passing. Although the PRAM represents SIMD-style fine-grained parallelism, a stand-alone PRAM C environment can support both fine-grained and coarse-grained parallel programming in either a MIMD or SPMD style, interoperate with existing MPI libraries, and use existing hardware. The PRAM C model can also be integrated easily with existing models. Unlike related efforts proposing innovative hardware with the goal of realizing the PRAM, ours can be a pure software solution whose purpose is to provide a practical programming environment for existing parallel machines; it also has the potential to perform well on future parallel architectures.

Successful technical trading agents using genetic programming

Farnsworth, Grant V.; Kelly, John A.; Pryor, Richard J.

Genetic programming (GP) has proved to be a highly versatile and useful tool for identifying relationships in data for which a more precise theoretical construct is unavailable. In this project, we use a GP search to develop trading strategies for agent-based economic models. These strategies use stock prices and technical indicators, such as the moving average convergence/divergence (MACD) and various exponentially weighted moving averages, to generate buy and sell signals. We analyze the effect of complexity constraints on the strategies as well as the relative performance of various indicators. We also present innovations in the classical genetic programming algorithm that appear to improve convergence for this problem. Technical strategies developed by our GP algorithm can be used to control the behavior of agents in economic simulation packages, such as ASPEN-D, adding variety to the current market fundamentals approach. The exploitation of arbitrage opportunities by technical analysts may help increase the efficiency of the simulated stock market, as it does in the real world. By improving the behavior of simulated stock markets, we can better estimate the effects of shocks to the economy due to terrorism or natural disasters.
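
For context on the indicators mentioned, here is a minimal sketch of an exponentially weighted moving average and a simplified MACD-style crossover signal (textbook definitions; the GP-evolved strategies themselves are far more varied):

    def ewma(prices, span):
        """Exponentially weighted moving average with smoothing alpha = 2 / (span + 1)."""
        alpha, out = 2.0 / (span + 1), []
        for p in prices:
            out.append(p if not out else alpha * p + (1 - alpha) * out[-1])
        return out

    def macd_signal(prices, fast=12, slow=26):
        """MACD line = EWMA(fast) - EWMA(slow); its sign is used here as a crude buy/sell signal."""
        macd = [f - s for f, s in zip(ewma(prices, fast), ewma(prices, slow))]
        return ["buy" if m > 0 else "sell" for m in macd]

    prices = [100, 101, 103, 102, 105, 107, 106, 104, 103, 105]
    print(macd_signal(prices)[-3:])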

Spin stabilized magnetic levitation of horizontal rotors

Romero, L.A.

In this paper we present an analysis of a new configuration for achieving spin stabilized magnetic levitation. In the classical configuration, the rotor spins about a vertical axis, and the spin stabilizes the lateral instability of the top in the magnetic field. In this new configuration the rotor spins about a horizontal axis, and the spin stabilizes the axial instability of the top in the magnetic field.

ML 3.1 smoothed aggregation user's guide

Sala, Marzio S.; Tuminaro, Raymond S.; Hu, Jonathan J.

ML is a multigrid preconditioning package intended to solve linear systems of equations Ax = b where A is a user supplied n x n sparse matrix, b is a user supplied vector of length n and x is a vector of length n to be computed. ML should be used on large sparse linear systems arising from partial differential equation (PDE) discretizations. While technically any linear system can be considered, ML should be used on linear systems that correspond to things that work well with multigrid methods (e.g. elliptic PDEs). ML can be used as a stand-alone package or to generate preconditioners for a traditional iterative solver package (e.g. Krylov methods). We have supplied support for working with the Aztec 2.1 and AztecOO iterative package [16]. However, other solvers can be used by supplying a few functions. This document describes one specific algebraic multigrid approach: smoothed aggregation. This approach is used within several specialized multigrid methods: one for the eddy current formulation for Maxwell's equations, and a multilevel and domain decomposition method for symmetric and nonsymmetric systems of equations (like elliptic equations, or compressible and incompressible fluid dynamics problems). Other methods exist within ML but are not described in this document. Examples are given illustrating the problem definition and exercising multigrid options.
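
As background, the smoothed aggregation construction itself is easy to sketch: build a tentative piecewise-constant prolongator from aggregates of fine nodes, then smooth it with one damped-Jacobi step. The example below uses a small 1D Laplacian and a naive contiguous aggregation; it is illustrative and is not the ML implementation.

    import numpy as np

    def laplacian_1d(n):
        return 2.0 * np.eye(n) - np.eye(n, k=1) - np.eye(n, k=-1)

    def smoothed_aggregation_prolongator(A, agg_size=3, omega=2.0 / 3.0):
        """P = (I - omega * D^-1 A) * P_tent with contiguous aggregates of size agg_size.

        omega = 2/3 is a standard damped-Jacobi weight for this model problem.
        """
        n = A.shape[0]
        n_agg = (n + agg_size - 1) // agg_size
        P_tent = np.zeros((n, n_agg))
        for i in range(n):
            P_tent[i, i // agg_size] = 1.0               # piecewise-constant tentative basis
        Dinv = np.diag(1.0 / np.diag(A))
        return (np.eye(n) - omega * Dinv @ A) @ P_tent   # damped-Jacobi smoothing of P_tent

    A = laplacian_1d(9)
    P = smoothed_aggregation_prolongator(A)
    A_coarse = P.T @ A @ P                               # Galerkin coarse-grid operator
    print(A_coarse.shape)                                # (3, 3)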

Solution verification for explicit transient dynamics problems in the presence of hourglass and contact forces

Proposed for publication in Computer Methods in Applied Mechanics and Engineering.

Stewart, James R.

This paper presents solution verification studies applicable to a class of problems involving wave propagation, frictional contact, geometrical complexity, and localized incompressibility. The studies are in support of a validation exercise of a phenomenological screw failure model. The numerical simulations are performed using a fully explicit transient dynamics finite element code, employing both standard four-node tetrahedral and eight-node mean quadrature hexahedral elements. It is demonstrated that verifying the accuracy of the simulation involves not only consideration of the mesh discretization error, but also the effect of the hourglass control and the contact enforcement. In particular, the proper amount of hourglass control and the behavior of the contact search and enforcement algorithms depend greatly on the mesh resolution. We carry out the solution verification exercise using mesh refinement studies and describe our systematic approach to handling the complicating issues. It is shown that hourglassing and contact must both be carefully monitored as the mesh is refined, and it is often necessary to make adjustments to the hourglass and contact user input parameters to accommodate finer meshes. We introduce in this paper the hourglass energy, which is used as an 'error indicator' for the hourglass control. If the hourglass energy does not tend to zero with mesh refinement, then an hourglass control parameter is changed and the calculation is repeated.

Visualization of salt-induced stress perturbations

Rogers, David R.; Brannon, Rebecca M.

An important challenge encountered during post-processing of finite element analyses is the visualizing of three-dimensional fields of real-valued second-order tensors. Namely, as finite element meshes become more complex and detailed, evaluation and presentation of the principal stresses becomes correspondingly problematic. In this paper, we describe techniques used to visualize simulations of perturbed in-situ stress fields associated with hypothetical salt bodies in the Gulf of Mexico. We present an adaptation of the Mohr diagram, a graphical paper and pencil method used by the material mechanics community for estimating coordinate transformations for stress tensors, as a new tensor glyph for dynamically exploring tensor variables within three-dimensional finite element models. This interactive glyph can be used as either a probe or a filter through brushing and linking.
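
The quantities a Mohr-diagram glyph encodes come directly from the eigendecomposition of the stress tensor at a point; a minimal sketch of that computation (standard mechanics, not the authors' visualization code):

    import numpy as np

    def mohr_parameters(sigma):
        """Principal stresses and the (center, radius) pairs of the three Mohr circles.

        sigma: symmetric 3x3 Cauchy stress tensor.
        """
        s3, s2, s1 = np.sort(np.linalg.eigvalsh(sigma))   # so that s1 >= s2 >= s3
        circles = [((s1 + s3) / 2, (s1 - s3) / 2),
                   ((s1 + s2) / 2, (s1 - s2) / 2),
                   ((s2 + s3) / 2, (s2 - s3) / 2)]
        return (s1, s2, s3), circles

    stress = np.array([[50.0, 10.0,  0.0],
                       [10.0, 20.0,  5.0],
                       [ 0.0,  5.0, -30.0]])
    principals, circles = mohr_parameters(stress)
    print(principals)
    print(circles)   # the largest radius is the maximum shear stress at the point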

Geomechanics of penetration : experimental and computational approaches : final report for LDRD project 38718

Holcomb, David J.; Fossum, Arlo F.; Gettemy, Glen L.; Hardy, Robert D.; Bronowski, David R.; Rivas, Raul R.; Preece, Dale S.

The purpose of the present work is to increase our understanding of which properties of geomaterials most influence the penetration process, with a goal of improving our predictive ability. Two primary approaches were followed: development of a realistic constitutive model for geomaterials and design of an experimental approach to study penetration from the target's point of view. A realistic constitutive model, with parameters based on measurable properties, can be used for sensitivity analysis to determine the properties that are most important in influencing the penetration process. An immense literature exists that is devoted to the problem of predicting penetration into geomaterials or similar man-made materials such as concrete. Various formulations have been developed that use an analytic or, more commonly, numerical solution for the spherical or cylindrical cavity expansion as a sort of Green's function to establish the forces acting on a penetrator. This approach has had considerable success in modeling the behavior of penetrators, both as to path and depth of penetration. However, the approach is not well adapted to the problem of understanding what is happening to the material being penetrated. Without a picture of the stress and strain state imposed on the highly deformed target material, it is not easy to determine what properties of the target are important in influencing the penetration process. We developed an experimental arrangement that allows greater control of the deformation than is possible in actual penetrator tests, yet approximates the deformation processes imposed by a penetrator. Using explosive line charges placed in a central borehole, we loaded cylindrical specimens in a manner equivalent to an increment of penetration, allowing the measurement of the associated strains and accelerations and the retrieval of specimens from the more-or-less intact cylinder. Results show clearly that the deformation zone is highly concentrated near the borehole, with almost no damage occurring beyond half a borehole diameter. This implies penetration is not strongly influenced by anything but the material within a diameter or so of the penetration. For penetrator tests, target size should not matter strongly once target diameters exceed some small multiple of the penetrator diameter. Penetration into jointed rock should not be much affected unless a discontinuity is within a similar range. Accelerations measured at several points along a radius from the borehole are consistent with highly concentrated damage and energy absorption; at the borehole wall, accelerations were an order of magnitude higher than at half a diameter, but at the outer surface, 8 diameters away, accelerations were as expected for propagation through an elastic medium. Accelerations measured at the outer surface of the cylinders increased significantly with cure time for the concrete. As strength increased, less damage was observed near the explosively-driven borehole wall, consistent with the lower energy absorption expected and observed for stronger concrete. As it is the energy-absorbing properties of a target that ultimately stop a penetrator, we believe this may point the way to a more readily determined equivalent of the S number.

Conceptual framework for biosecurity levels

Gaudioso, Jennifer M.; Salerno, Reynolds M.

Biosecurity must be implemented without impeding biomedical and bioscience research. Existing security literature and regulatory requirements do not present a comprehensive approach or clear model for biosecurity, nor do they wholly recognize the operational issues within laboratory environments. To help address these issues, the concept of Biosecurity Levels should be developed. Biosecurity Levels would have increasing levels of security protections depending on the attractiveness of the pathogens to adversaries. Pathogens and toxins would be placed in a Biosecurity Level based on their security risk. Specifically, the security risk would be a function of an agent's weaponization potential and consequences of use. To demonstrate the concept, examples of security risk assessments for several human, animal, and plant pathogens will be presented. Higher security than that currently mandated by federal regulations would be applied for those very few agents that represent true weapons threats and lower levels for the remainder.

Scalable fault tolerant algorithms for linear-scaling coupled-cluster electronic structure methods

Janssen, Curtis L.; Leininger, Matthew L.

By means of coupled-cluster theory, molecular properties can be computed with an accuracy often exceeding that of experiment. The high-degree polynomial scaling of the coupled-cluster method, however, remains a major obstacle in the accurate theoretical treatment of mainstream chemical problems, despite tremendous progress in computer architectures. Although it has long been recognized that this super-linear scaling is non-physical, the development of efficient reduced-scaling algorithms for massively parallel computers has not been realized. We here present a locally correlated, reduced-scaling, massively parallel coupled-cluster algorithm. A sparse data representation for handling distributed, sparse multidimensional arrays has been implemented along with a set of generalized contraction routines capable of handling such arrays. The parallel implementation entails a coarse-grained parallelization, reducing interprocessor communication and distributing the largest data arrays but replicating as many arrays as possible without introducing memory bottlenecks. The performance of the algorithm is illustrated by several series of runs for glycine chains using a Linux cluster with an InfiniBand interconnect.

Stability of biological networks as represented in Random Boolean Nets

Slepoy, Alexander S.; Thompson, Marshall A.

We explore the stability of Random Boolean Networks as a model of biological interaction networks. We introduce the surface-to-volume ratio as a measure of the stability of a network. The surface is defined as the set of states within a basin of attraction that are mapped outside the basin by a bit-flip operation. The volume is defined as the total number of states in the basin. We report the development of an object-oriented Boolean network analysis code (Attract) to investigate the structure of stable vs. unstable networks. We find two distinct types of stable networks. The first type is the nearly trivial stable network with a few basins of attraction. The second type contains many basins. We conclude that stable networks of the second type are extremely rare.
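
The surface-to-volume measure defined above can be made concrete with a small enumeration; the sketch below builds a random Boolean network, finds each state's attractor, and computes the ratio per basin. It is an illustration, not the Attract code.

    import itertools, random

    def random_boolean_network(n, k, seed=0):
        """Each node gets k random inputs and a random Boolean truth table."""
        rng = random.Random(seed)
        inputs = [rng.sample(range(n), k) for _ in range(n)]
        tables = [{bits: rng.randint(0, 1) for bits in itertools.product((0, 1), repeat=k)}
                  for _ in range(n)]
        def step(state):
            return tuple(tables[i][tuple(state[j] for j in inputs[i])] for i in range(n))
        return step

    def attractor_of(step, state):
        """Follow the trajectory until a state repeats; return the cycle as a frozenset."""
        seen, path = {}, []
        while state not in seen:
            seen[state] = len(path)
            path.append(state)
            state = step(state)
        return frozenset(path[seen[state]:])

    def surface_to_volume(step, n):
        basins = {}
        for state in itertools.product((0, 1), repeat=n):
            basins.setdefault(attractor_of(step, state), set()).add(state)
        ratios = {}
        for att, basin in basins.items():
            surface = sum(
                1 for s in basin
                if any(attractor_of(step, tuple(b ^ (i == j) for j, b in enumerate(s))) != att
                       for i in range(n)))
            ratios[att] = surface / len(basin)
        return ratios

    step = random_boolean_network(n=8, k=2)
    for att, ratio in surface_to_volume(step, 8).items():
        print(len(att), round(ratio, 3))   # attractor length, surface-to-volume ratio of its basin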

Optimal neuronal tuning for finite stimulus spaces

Proposed for publication in Neural Computation.

Brown, William M.; Backer, Alejandro B.

The efficiency of neuronal encoding in sensory and motor systems has been proposed as a first principle governing response properties within the central nervous system. We present a continuation of a theoretical study presented by Zhang and Sejnowski, where the influence of neuronal tuning properties on encoding accuracy is analyzed using information theory. When a finite stimulus space is considered, we show that the encoding accuracy improves with narrow tuning for one- and two-dimensional stimuli. For three dimensions and higher, there is an optimal tuning width.

Dynamic context discrimination : psychological evidence for the Sandia Cognitive Framework

Speed, Ann S.

Human behavior is a function of an iterative interaction between the stimulus environment and past experience. It is not simply a matter of the current stimulus environment activating the appropriate experience or rule from memory (e.g., if it is dark and I hear a strange noise outside, then I turn on the outside lights and investigate). Rather, it is a dynamic process that takes into account not only things one would generally do in a given situation, but things that have recently become known (e.g., there have recently been coyotes seen in the area and one is known to be rabid), as well as other immediate environmental characteristics (e.g., it is snowing outside, I know my dog is outside, I know the police are already outside, etc.). All of these factors combine to inform me of the most appropriate behavior for the situation. If it were the case that humans had a rule for every possible contingency, the amount of storage that would be required to enable us to fluidly deal with most situations we encounter would rapidly become biologically untenable. We can all deal with contingencies like the one above with fairly little effort, but if it isn't based on rules, what is it based on? The assertion of the Cognitive Systems program at Sandia for the past 5 years is that at the heart of this ability to effectively navigate the world is an ability to discriminate between different contexts (i.e., Dynamic Context Discrimination, or DCD). While this assertion in and of itself might not seem earthshaking, it is compelling that this ability and its components show up in a wide variety of paradigms across different subdisciplines in psychology. We begin by outlining, at a high functional level, the basic ideas of DCD. We then provide evidence from several different literatures and paradigms that support our assertion that DCD is a core aspect of cognitive functioning. Finally, we discuss DCD and the computational model that we have developed as an instantiation of DCD in more detail. Before commencing with our overview of DCD, we should note that DCD is not necessarily a theory in the classic sense. Rather, it is a description of cognitive functioning that seeks to unify highly similar findings across a wide variety of literatures. Further, we believe that such convergence warrants a central place in efforts to computationally emulate human cognition. That is, DCD is a general principle of cognition. It is also important to note that while we are drawing parallels across many literatures, these are functional parallels and are not necessarily structural ones. That is, we are not saying that the same neural pathways are involved in these phenomena. We are only saying that the different neural pathways that are responsible for the appearance of these various phenomena follow the same functional rules - the mechanisms are the same even if the physical parts are distinct. Furthermore, DCD is not a causal mechanism - it is an emergent property of the way the brain is constructed. DCD is the result of neurophysiology (cf. John, 2002, 2003). Finally, it is important to note that we are not proposing a generic learning mechanism such that one biological algorithm can account for all situation interpretation. Rather, we are pointing out that there are strikingly similar empirical results across a wide variety of disciplines that can be understood, in part, by similar cognitive processes. 
It is entirely possible, even assumed in some cases (i.e., primary language acquisition) that these more generic cognitive processes are complemented and constrained by various limits which may or may not be biological in nature (cf. Bates & Elman, 1996; Elman, in press).

Analysis and control of distributed cooperative systems

Feddema, John T.; Schoenwald, David A.; Parker, Eric P.; Wagner, John S.

As part of the DARPA Information Processing Technology Office (IPTO) Software for Distributed Robotics (SDR) Program, Sandia National Laboratories has developed analysis and control software for coordinating tens to thousands of autonomous cooperative robotic agents (primarily unmanned ground vehicles) performing military operations such as reconnaissance, surveillance and target acquisition; countermine and explosive ordnance disposal; force protection and physical security; and logistics support. Due to the nature of these applications, the control techniques must be distributed, and they must not rely on high bandwidth communication between agents. At the same time, a single soldier must be able to easily direct these large-scale systems. Finally, the control techniques must be provably convergent so as not to cause undue harm to civilians. In this project, provably convergent, moderate-communication-bandwidth, distributed control algorithms have been developed that can be regulated by a single soldier. We have simulated in great detail the control of small numbers of vehicles (up to 20) navigating throughout a building, and we have simulated in lesser detail the control of larger numbers of vehicles (up to 1000) trying to locate several targets in a large outdoor facility. Finally, we have experimentally validated the resulting control algorithms on smaller numbers of autonomous vehicles.

Sensor placement in municipal water networks

Proposed for publication in the Journal of Water Resources Planning and Management.

Hart, William E.; Phillips, Cynthia A.; Berry, Jonathan W.; Watson, Jean-Paul W.

We present a model for optimizing the placement of sensors in municipal water networks to detect maliciously injected contaminants. An optimal sensor configuration minimizes the expected fraction of the population at risk. We formulate this problem as a mixed-integer program, which can be solved with generally available solvers. We find optimal sensor placements for three test networks with synthetic risk and population data. Our experiments illustrate that this formulation can be solved relatively quickly and that the predicted sensor configuration is relatively insensitive to uncertainties in the data used for prediction.
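
The flavor of such a formulation can be stated compactly. The following is a generic placement model in the spirit of the abstract (binary placement variables x_j, assignment variables y_ij matching each contamination scenario i to the location j where it is first detected, a sensor budget k, scenario probabilities p_i, and impact coefficients r_ij); it is a sketch, not the paper's exact model, and in practice a dummy "undetected" outcome is usually added:

    \min \; \sum_{i} p_i \sum_{j} r_{ij}\, y_{ij}
    \quad \text{s.t.} \quad
    \sum_{j} y_{ij} = 1 \;\; \forall i, \qquad
    y_{ij} \le x_j \;\; \forall i, j, \qquad
    \sum_{j} x_j \le k, \qquad
    x_j,\, y_{ij} \in \{0, 1\}.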

Unified parallel C and the computing needs of Sandia National Laboratories

Wen, Zhaofang W.

As Sandia looks toward petaflops computing and other advanced architectures, it is necessary to provide a programming environment that can exploit this additional computing power while supporting reasonable development time for applications. Thus, we evaluate the Partitioned Global Address Space (PGAS) programming model as implemented in Unified Parallel C (UPC) for its applicability. We report on our experiences in implementing sorting and minimum spanning tree algorithms on a test system, a Cray T3E, with UPC support. We describe several macros that could serve as language extensions and several building-block operations that could serve as a foundation for a PGAS programming library. We analyze the limitations of the UPC implementation available on the test system, and suggest improvements necessary before UPC can be used in a production environment.

The Sandia GeoModel : theory and user's guide

Fossum, A.F.; Brannon, Rebecca M.

The mathematical and physical foundations and domain of applicability of Sandia's GeoModel are presented along with descriptions of the source code and user instructions. The model is designed to be used in conventional finite element architectures, and (to date) it has been installed in five host codes without requiring customization of the model subroutines for any of these different installations. Although developed for application to geological materials, the GeoModel actually applies to a much broader class of materials, including rock-like engineered materials (such as concretes and ceramics) and even to metals when simplified parameters are used. Nonlinear elasticity is supported through an empirically fitted function that has been found to be well suited to a wide variety of materials. Fundamentally, the GeoModel is a generalized plasticity model. As such, it includes a yield surface, but the term 'yield' is generalized to include any form of inelastic material response, including microcrack growth and pore collapse. The GeoModel supports deformation-induced anisotropy in a limited capacity through kinematic hardening (in which the initially isotropic yield surface is permitted to translate in deviatoric stress space to model Bauschinger effects). Aside from kinematic hardening, however, the governing equations are otherwise isotropic. The GeoModel is a genuine unification and generalization of simpler models. The GeoModel can employ up to 40 material input and control parameters in the rare case when all features are used. Simpler idealizations (such as linear elasticity, or von Mises yield, or Mohr-Coulomb failure) can be replicated by simply using fewer parameters. For high-strain-rate applications, the GeoModel supports rate dependence through an overstress model.

Computational Fluid Dynamic simulations of pipe elbow flow

Homicz, Gregory F.

One problem facing today's nuclear power industry is flow-accelerated corrosion and erosion in pipe elbows. The Korean Atomic Energy Research Institute (KAERI) is performing experiments in their Flow-Accelerated Corrosion (FAC) test loop to better characterize these phenomena, and develop advanced sensor technologies for the condition monitoring of critical elbows on a continuous basis. In parallel with these experiments, Sandia National Laboratories is performing Computational Fluid Dynamic (CFD) simulations of the flow in one elbow of the FAC test loop. The simulations are being performed using the FLUENT commercial software developed and marketed by Fluent, Inc. The model geometry and mesh were created using the GAMBIT software, also from Fluent, Inc. This report documents the results of the simulations that have been made to date; baseline results employing the RNG k-ε turbulence model are presented. The predicted value for the diametrical pressure coefficient is in reasonably good agreement with published correlations. Plots of the velocities, pressure field, wall shear stress, and turbulent kinetic energy adjacent to the wall are shown within the elbow section. Somewhat to our surprise, these indicate that the maximum values of both wall shear stress and turbulent kinetic energy occur near the elbow entrance, on the inner radius of the bend. Additional simulations were performed for the same conditions, but with the RNG k-ε model replaced by either the standard k-ε or the realizable k-ε turbulence model. The predictions using the standard k-ε model are quite similar to those obtained in the baseline simulation. However, with the realizable k-ε model, more significant differences are evident. The maxima in both wall shear stress and turbulent kinetic energy now appear on the outer radius, near the elbow exit, and are approximately 11% and 14% greater, respectively, than those predicted in the baseline calculation; secondary maxima in both quantities still occur near the elbow entrance on the inner radius. Which set of results better reflects reality must await experimental corroboration. Additional calculations demonstrate that whether or not FLUENT's radial equilibrium pressure distribution option is used in the PRESSURE OUTLET boundary condition has no significant impact on the flowfield near the elbow. Simulations performed with and without the chemical sensor and associated support bracket that were present in the experiments demonstrate that the latter have a negligible influence on the flow in the vicinity of the elbow. The fact that the maxima in wall shear stress and turbulent kinetic energy occur on the inner radius is therefore not an artifact of having introduced the sensor into the flow.

Peridynamic modeling of membranes and fibers

Silling, Stewart A.

The peridynamic theory of continuum mechanics allows damage, fracture, and long-range forces to be treated as natural components of the deformation of a material. In this paper, the peridynamic approach is applied to small-thickness two- and one-dimensional structures. For membranes, a constitutive model is described appropriate for rubbery sheets that can form cracks. This model is used to perform numerical simulations of the stretching and dynamic tearing of membranes. A similar approach is applied to one-dimensional string-like structures that undergo stretching, bending, and failure. Long-range forces similar to van der Waals interactions at the nanoscale influence the equilibrium configurations of these structures, how they deform, and possibly their self-assembly.
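
For reference, the bond-based peridynamic equation of motion underlying this treatment is

    \rho\,\ddot{\mathbf{u}}(\mathbf{x},t)
      \;=\; \int_{H_{\mathbf{x}}} \mathbf{f}\big(\mathbf{u}(\mathbf{x}',t)-\mathbf{u}(\mathbf{x},t),\;
            \mathbf{x}'-\mathbf{x}\big)\, dV_{\mathbf{x}'} \;+\; \mathbf{b}(\mathbf{x},t),

where H_x is the neighborhood of x within the horizon, f is the pairwise force function, and b is the body force density; in the peridynamic setting, constitutive behavior enters through the choice of f.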

Approach and development strategy for an agent-based model of economic confidence

Sprigg, James A.; Jorgensen, Craig R.; Pryor, Richard J.

We are extending the existing features of Aspen, a powerful economic modeling tool, and introducing new features to simulate the role of confidence in economic activity. The new model is built from a collection of autonomous agents that represent households, firms, and other relevant entities like financial exchanges and governmental authorities. We simultaneously model several interrelated markets, including those for labor, products, stocks, and bonds. We also model economic tradeoffs, such as decisions of households and firms regarding spending, savings, and investment. In this paper, we review some of the basic principles and model components and describe our approach and development strategy for emulating consumer, investor, and business confidence. The model of confidence is explored within the context of economic disruptions, such as those resulting from disasters or terrorist events.

On the Held-Karp relaxation for the asymmetric and symmetric traveling salesman problems

Mathematical Programming

Carr, Robert D.

A long-standing conjecture in combinatorial optimization says that the integrality gap of the famous Held-Karp relaxation of the metric STSP (Symmetric Traveling Salesman Problem) is precisely 4/3. In this paper, we show that a slight strengthening of this conjecture implies a tight 4/3 integrality gap for a linear programming relaxation of the metric ATSP (Asymmetric Traveling Salesman Problem). Our main tools are a new characterization of the integrality gap for linear objective functions over polyhedra, and the isolation of "hard-to-round" solutions of the relaxations. © Springer-Verlag 2004.
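
For context, the Held-Karp (subtour elimination) relaxation of the symmetric TSP on a graph G = (V, E) with edge costs c_e is the linear program

    \min \sum_{e \in E} c_e x_e
    \quad \text{s.t.} \quad
    \sum_{e \in \delta(v)} x_e = 2 \;\; \forall v \in V, \qquad
    \sum_{e \in \delta(S)} x_e \ge 2 \;\; \forall\, \emptyset \neq S \subsetneq V, \qquad
    0 \le x_e \le 1,

and the conjecture referred to above states that the worst-case ratio between the optimal tour length and the optimal value of this LP is exactly 4/3.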

A user's guide to Sandia's Latin hypercube sampling software : LHS UNIX library/standalone version

Swiler, Laura P.; Wyss, Gregory D.

This document is a reference guide for the UNIX Library/Standalone version of the Latin Hypercube Sampling Software. This software has been developed to generate Latin hypercube multivariate samples. This version runs on Linux or UNIX platforms. This manual covers the use of the LHS code in a UNIX environment, run either as a standalone program or as a callable library. The underlying code in the UNIX Library/Standalone version of LHS is almost identical to the updated Windows version of LHS released in 1998 (SAND98-0210). However, some modifications were made to customize it for a UNIX environment and as a library that is called from the DAKOTA environment. This manual covers the use of the LHS code as a library and in the standalone mode under UNIX.
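
The basic Latin hypercube construction is simple to sketch: stratify each variable's range into N equal-probability bins and randomly pair one draw per bin across variables. A minimal illustration for uniform [0, 1) variables follows; the LHS library itself supports many distributions and correlation control among inputs.

    import random

    def latin_hypercube(n_samples, n_vars, seed=0):
        """One draw per equal-probability stratum in each dimension, randomly paired."""
        rng = random.Random(seed)
        columns = []
        for _ in range(n_vars):
            # one point inside each of the n_samples strata, then shuffle the stratum order
            col = [(i + rng.random()) / n_samples for i in range(n_samples)]
            rng.shuffle(col)
            columns.append(col)
        return [tuple(col[i] for col in columns) for i in range(n_samples)]

    for row in latin_hypercube(5, 2):
        print(row)   # each variable takes exactly one value in each fifth of [0, 1)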

Validating DOE's Office of Science "capability" computing needs

Leland, Robert; Camp, William

A study was undertaken to validate the 'capability' computing needs of DOE's Office of Science. More than seventy members of the community provided information about algorithmic scaling laws, so that the impact of having access to Petascale capability computers could be assessed. We have concluded that the Office of Science community has described credible needs for Petascale capability computing.

Acceleration of the Generalized Global Basis (GGB) method for nonlinear problems

Proposed for publication in Journal of Computational Physics.

Tuminaro, Raymond S.; Shadid, John N.

Two heuristic strategies intended to enhance the performance of the generalized global basis (GGB) method [H. Waisman, J. Fish, R.S. Tuminaro, J. Shadid, The Generalized Global Basis (GGB) method, International Journal for Numerical Methods in Engineering 61(8), 1243-1269] applied to nonlinear systems are presented. The standard GGB method accelerates a multigrid scheme by an additional coarse grid correction that filters out slowly converging modes. This correction requires a potentially costly eigen calculation. This paper considers reusing previously computed eigenspace information. The GGBα scheme enriches the prolongation operator with new eigenvectors, while the modified method (MGGB) selectively reuses the same prolongation. Both methods use the criterion of principal angles between the subspaces spanned by the previous and current prolongation operators. Numerical examples clearly indicate significant time savings, in particular for the MGGB scheme.

Solving elliptic finite element systems in near-linear time with support preconditioners

Proposed for publication in the SIAM Journal on Matrix Analysis and Applications.

Boman, Erik G.; Hendrickson, Bruce A.

We consider linear systems arising from the use of the finite element method for solving scalar linear elliptic problems. Our main result is that these linear systems, which are symmetric and positive semidefinite, are well approximated by symmetric diagonally dominant matrices. Our framework for defining matrix approximation is support theory. Significant graph theoretic work has already been developed in the support framework for preconditioners in the diagonally dominant case, and in particular it is known that such systems can be solved with iterative methods in nearly linear time. Thus, our approximation result implies that these graph theoretic techniques can also solve a class of finite element problems in nearly linear time. We show that the support number bounds, which control the number of iterations in the preconditioned iterative solver, depend on mesh quality measures but not on the problem size or shape of the domain.

An analytically solvable eigenvalue problem for the linear elasticity equations

Romero, L.A.

Analytic solutions are useful for code verification. Structural vibration codes approximate solutions to the eigenvalue problem for the linear elasticity equations (Navier's equations). Unfortunately the verification method of 'manufactured solutions' does not apply to vibration problems. Verification books (for example [2]) tabulate a few of the lowest modes, but are not useful for computations of large numbers of modes. A closed form solution is presented here for all the eigenvalues and eigenfunctions for a cuboid solid with isotropic material properties. The boundary conditions correspond physically to a greased wall.
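
The eigenvalue problem in question is the time-harmonic form of Navier's equations, with Lamé parameters λ and μ and density ρ; the 'greased wall' condition is the standard frictionless one (zero normal displacement and zero tangential traction on each face):

    \mu\,\nabla^2 \mathbf{u} + (\lambda + \mu)\,\nabla(\nabla\cdot\mathbf{u}) = -\rho\,\omega^2\,\mathbf{u}
      \quad \text{in the cuboid},
    \qquad
    \mathbf{u}\cdot\mathbf{n} = 0, \quad (\boldsymbol{\sigma}\mathbf{n})_{\parallel} = \mathbf{0}
      \quad \text{on each face},

with the eigenvalues ω² giving the squared natural frequencies.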

Will Moore's law be sufficient?

DeBenedictis, Erik

It seems well understood that supercomputer simulation is an enabler for scientific discoveries, weapons, and other activities of value to society. It also seems widely believed that Moore's Law will make progressively more powerful supercomputers over time and thus enable more of these contributions. This paper seeks to add detail to these arguments, revealing them to be generally correct but not a smooth and effortless progression. This paper will review some key problems that can be solved with supercomputer simulation, showing that more powerful supercomputers will be useful up to a very high yet finite limit of around 10^21 FLOPS (1 zettaflops). The review will also show the basic nature of these extreme problems. This paper will review work by others showing that the theoretical maximum supercomputer power is very high indeed, but will explain how a straightforward extrapolation of Moore's Law will lead to technological maturity in a few decades. The power of a supercomputer at the maturity of Moore's Law will be very high by today's standards at 10^16-10^19 FLOPS (100 petaflops to 10 exaflops), depending on architecture, but distinctly below the level required for the most ambitious applications. Having established that Moore's Law will not be the last word in supercomputing, this paper will explore the nearer term issue of what a supercomputer will look like at the maturity of Moore's Law. Our approach will quantify the maximum performance permitted by the laws of physics for extension of current technology and then find a design that approaches this limit closely. We study a 'multi-architecture' for supercomputers that combines a microprocessor with other 'advanced' concepts and find it can reach the limits as well. This approach should be quite viable in the future because the microprocessor would provide compatibility with existing codes and programming styles while the 'advanced' features would provide a boost to the limits of performance.
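
As a rough illustration of the kind of extrapolation involved (the starting point and doubling periods below are assumptions chosen for the example, not figures taken from the paper):

    def extrapolate_flops(start_flops, doubling_years, years):
        """Peak performance after `years` of steady Moore's-law doubling."""
        return start_flops * 2 ** (years / doubling_years)

    # e.g. ~3e14 FLOPS (roughly a circa-2005 leadership machine) carried forward 30 years
    for doubling_years in (1.5, 2.0):
        print(doubling_years, f"{extrapolate_flops(3e14, doubling_years, 30):.1e}")
    # a 2-year doubling gives ~1e19 FLOPS; a 1.5-year doubling gives ~3e20 FLOPS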

More Details

A comparison of inexact newton and coordinate descent mesh optimization techniques

Knupp, Patrick K.

We compare inexact Newton and coordinate descent methods for optimizing the quality of a mesh by repositioning the vertices, where quality is measured by the harmonic mean of the mean-ratio metric. The effects of problem size, element size heterogeneity, and various vertex displacement schemes on the performance of these algorithms are assessed for a series of tetrahedral meshes.

More Details

Advanced parallel programming models research and development opportunities

Brightwell, Ronald B.; Wen, Zhaofang W.

There is currently a large research and development effort within the high-performance computing community on advanced parallel programming models. This research can potentially have an impact on parallel applications, system software, and computing architectures in the next several years. Given Sandia's expertise and unique perspective in these areas, particularly on very large-scale systems, there are many areas in which Sandia can contribute to this effort. This technical report provides a survey of past and present parallel programming model research projects and provides a detailed description of the Partitioned Global Address Space (PGAS) programming model. The PGAS model may offer several improvements over the traditional distributed memory message passing model, which is the dominant model currently being used at Sandia. This technical report discusses these potential benefits and outlines specific areas where Sandia's expertise could contribute to current research activities. In particular, we describe several projects in the areas of high-performance networking, operating systems and parallel runtime systems, compilers, application development, and performance evaluation.

More Details

Communication-aware processor allocation for supercomputers

Leung, Vitus J.; Phillips, Cynthia A.

We give processor-allocation algorithms for grid architectures, where the objective is to select processors from a set of available processors to minimize the average number of communication hops. The associated clustering problem is as follows: Given n points in R{sup d}, find a size-k subset with minimum average pairwise L{sub 1} distance. We present a natural approximation algorithm and show that it is a 7/4-approximation for 2D grids. In d dimensions, the approximation guarantee is 2 - 1/2d, which is tight. We also give a polynomial-time approximation scheme (PTAS) for constant dimension d and report on experimental results.
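
The clustering objective above is easy to state concretely. The following sketch is illustrative only (it is not the approximation algorithm from the paper, and the point set is made up); it simply evaluates the average pairwise L{sub 1} distance of a candidate subset of grid processors, which is the quantity the allocation algorithms try to minimize.

#include <cstdio>
#include <cmath>
#include <vector>

// Average pairwise L1 distance of a set of points in R^d.
// This is the clustering objective described in the abstract.
double averagePairwiseL1(const std::vector<std::vector<double>>& pts) {
    double total = 0.0;
    std::size_t n = pts.size();
    for (std::size_t i = 0; i < n; ++i)
        for (std::size_t j = i + 1; j < n; ++j) {
            double d = 0.0;
            for (std::size_t k = 0; k < pts[i].size(); ++k)
                d += std::fabs(pts[i][k] - pts[j][k]);
            total += d;
        }
    return total / (n * (n - 1) / 2.0);
}

int main() {
    // A hypothetical size-4 subset of free processors on a 2D grid.
    std::vector<std::vector<double>> subset = {{0, 0}, {0, 1}, {1, 0}, {2, 2}};
    std::printf("average pairwise L1 distance = %f\n", averagePairwiseL1(subset));
    return 0;
}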

More Details

AztecOO user guide

Heroux, Michael A.

The Trilinos{trademark} Project is an effort to facilitate the design, development, integration and ongoing support of mathematical software libraries. AztecOO{trademark} is a package within Trilinos that enables the use of the Aztec solver library [19] with Epetra{trademark} [13] objects. AztecOO provides access to Aztec preconditioners and solvers by implementing the Aztec 'matrix-free' interface using Epetra. While Aztec is written in C and procedure-oriented, AztecOO is written in C++ and is object-oriented. In addition to providing access to Aztec capabilities, AztecOO also provides some significant new functionality. In particular, it provides an extensible status testing capability that allows expression of sophisticated stopping criteria, as is needed in production use of iterative solvers. AztecOO also provides mechanisms for using Ifpack [2], ML [20] and AztecOO itself as preconditioners.
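
As a concrete illustration of the AztecOO-Epetra coupling described above, the minimal serial sketch below assembles a small 1D Laplacian as an Epetra matrix and solves it with AztecOO's conjugate gradient solver. It is written from the publicly documented interfaces rather than taken from the user guide, and the solver and preconditioner options are arbitrary choices, not recommendations.

#include "Epetra_SerialComm.h"
#include "Epetra_Map.h"
#include "Epetra_CrsMatrix.h"
#include "Epetra_Vector.h"
#include "Epetra_LinearProblem.h"
#include "AztecOO.h"

int main() {
  Epetra_SerialComm comm;
  const int n = 100;
  Epetra_Map map(n, 0, comm);

  // Assemble a tridiagonal (1D Laplacian) matrix.
  Epetra_CrsMatrix A(Copy, map, 3);
  double vals[3] = {-1.0, 2.0, -1.0};
  for (int i = 0; i < n; ++i) {
    int cols[3] = {i - 1, i, i + 1};
    if (i == 0)          A.InsertGlobalValues(i, 2, &vals[1], &cols[1]);
    else if (i == n - 1) A.InsertGlobalValues(i, 2, &vals[0], &cols[0]);
    else                 A.InsertGlobalValues(i, 3, vals, cols);
  }
  A.FillComplete();
  Epetra_Vector x(map), b(map);
  b.PutScalar(1.0);

  // Hand the Epetra objects to AztecOO and iterate with CG and a
  // domain-decomposition (ILU) preconditioner.
  Epetra_LinearProblem problem(&A, &x, &b);
  AztecOO solver(problem);
  solver.SetAztecOption(AZ_solver, AZ_cg);
  solver.SetAztecOption(AZ_precond, AZ_dom_decomp);
  solver.Iterate(500, 1.0e-8);
  return 0;
}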

More Details

Sundance 2.0 tutorial

Long, Kevin R.

Sundance is a system of software components that allows construction of an entire parallel simulator and its derivatives using a high-level symbolic language. With this high-level problem description, it is possible to specify a weak formulation of a PDE and its discretization method in a small amount of user-level code; furthermore, because derivatives are easily available, a simulation in Sundance is immediately suitable for accelerated PDE-constrained optimization algorithms. This paper is a tutorial for setting up and solving linear and nonlinear PDEs in Sundance. With several simple examples, we show how to set up mesh objects, geometric regions for BC application, the weak form of the PDE, and boundary conditions. Each example then illustrates use of an appropriate solver and solution visualization.

More Details

Xyce parallel electronic simulator design : mathematical formulation, version 2.0

Keiter, Eric R.; Hutchinson, Scott A.; Hoekstra, Robert J.; Russo, Thomas V.

This document is intended to contain a detailed description of the mathematical formulation of Xyce, a massively parallel SPICE-style circuit simulator developed at Sandia National Laboratories. The target audience of this document is people in the role of 'service provider'. An example of such a person would be a linear solver expert who is spending a small fraction of his time developing solver algorithms for Xyce. Such a person probably is not an expert in circuit simulation, and would benefit from a description of the equations solved by Xyce. In this document, modified nodal analysis (MNA) is described in detail, with a number of examples. Issues that are unique to circuit simulation, such as voltage limiting, are also described in detail.

More Details

Final report on grand challenge LDRD project : a revolution in lighting : building the science and technology base for ultra-efficient solid-state lighting

Simmons, J.A.; Fischer, Arthur J.; Crawford, Mary H.; Abrams, B.L.; Biefeld, Robert M.; Koleske, Daniel K.; Allerman, A.A.; Figiel, J.J.; Creighton, J.R.; Coltrin, Michael E.; Tsao, Jeffrey Y.; Mitchell, Christine C.; Kerley, Thomas M.; Wang, George T.; Bogart, Katherine B.; Seager, Carleton H.; Campbell, Jonathan C.; Follstaedt, D.M.; Norman, Adam K.; Kurtz, S.R.; Wright, Alan F.; Myers, S.M.; Missert, Nancy A.; Copeland, Robert G.; Provencio, P.N.; Wilcoxon, Jess P.; Hadley, G.R.; Wendt, J.R.; Kaplar, Robert K.; Shul, Randy J.; Rohwer, Lauren E.; Tallant, David T.; Simpson, Regina L.; Moffat, Harry K.; Salinger, Andrew G.; Pawlowski, Roger P.; Emerson, John A.; Thoma, Steven T.; Cole, Phillip J.; Boyack, Kevin W.; Garcia, Marie L.; Allen, Mark S.; Burdick, Brent B.; Rahal, Nabeel R.; Monson, Mary A.; Chow, Weng W.; Waldrip, Karen E.

This SAND report is the final report on Sandia's Grand Challenge LDRD Project 27328, 'A Revolution in Lighting -- Building the Science and Technology Base for Ultra-Efficient Solid-state Lighting.' This project, which for brevity we refer to as the SSL GCLDRD, is considered one of Sandia's most successful GCLDRDs. As a result, this report reviews not only technical highlights, but also the genesis of the idea for Solid-state Lighting (SSL), the initiation of the SSL GCLDRD, and the goals, scope, success metrics, and evolution of the SSL GCLDRD over the course of its life. One way in which the SSL GCLDRD was different from other GCLDRDs was that it coincided with a larger effort by the SSL community - primarily industrial companies investing in SSL, but also universities, trade organizations, and other Department of Energy (DOE) national laboratories - to support a national initiative in SSL R&D. Sandia was a major player in publicizing the tremendous energy savings potential of SSL, and in helping to develop, unify and support community consensus for such an initiative. Hence, our activities in this area, discussed in Chapter 6, were substantial: white papers; SSL technology workshops and roadmaps; support for the Optoelectronics Industry Development Association (OIDA), DOE and Senator Bingaman's office; extensive public relations and media activities; and a worldwide SSL community website. Many science and technology advances and breakthroughs were also enabled under this GCLDRD, resulting in: 55 publications; 124 presentations; 10 book chapters and reports; 5 U.S. patent applications, including 1 already issued; and 14 patent disclosures not yet applied for. Twenty-six invited talks were given, at prestigious venues such as the American Physical Society Meeting, the Materials Research Society Meeting, the AVS International Symposium, and the Electrochemical Society Meeting. This report contains a summary of these science and technology advances and breakthroughs, with Chapters 1-5 devoted to the five technical task areas: (1) Fundamental Materials Physics; (2) III-Nitride Growth Chemistry and Substrate Physics; (3) III-Nitride MOCVD Reactor Design and In-Situ Monitoring; (4) Advanced Light-Emitting Devices; and (5) Phosphors and Encapsulants. Chapter 7 (Appendix A) contains a listing of publications, presentations, and patents. Finally, the SSL GCLDRD resulted in numerous actual and pending follow-on programs for Sandia, including multiple grants from DOE and the Defense Advanced Research Projects Agency (DARPA), and Cooperative Research and Development Agreements (CRADAs) with SSL companies. Many of these follow-on programs arose out of contacts developed through our External Advisory Committee (EAC). In this and other ways, the EAC played a very important role. Chapter 8 (Appendix B) contains the full (unedited) text of the EAC reviews that were held periodically during the course of the project.

More Details

Inferring genetic networks from microarray data

Davidson, George S.; May, Elebeoba E.; Faulon, Jean-Loup M.

In theory, it should be possible to infer realistic genetic networks from time series microarray data. In practice, however, network discovery has proved problematic. The three major challenges are: (1) inferring the network; (2) estimating the stability of the inferred network; and (3) making the network visually accessible to the user. Here we describe a method, tested on publicly available time series microarray data, which addresses these concerns. The inference of genetic networks from genome-wide experimental data is an important biological problem which has received much attention. Approaches to this problem have typically included application of clustering algorithms [6]; the use of Boolean networks [12, 1, 10]; the use of Bayesian networks [8, 11]; and the use of continuous models [21, 14, 19]. Overviews of the problem and general approaches to network inference can be found in [4, 3]. Our approach to network inference is similar to earlier methods in that we use both clustering and Boolean network inference. However, we have attempted to extend the process to better serve the end-user, the biologist. In particular, we have incorporated a system to assess the reliability of our network, and we have developed tools which allow interactive visualization of the proposed network.

More Details

Xyce Parallel Electronic Simulator : users' guide, version 2.0

Keiter, Eric R.; Hutchinson, Scott A.; Hoekstra, Robert J.; Russo, Thomas V.; Rankin, Eric R.; Pawlowski, Roger P.; Wix, Steven D.; Fixel, Deborah A.

This manual describes the use of the Xyce Parallel Electronic Simulator. Xyce has been designed as a SPICE-compatible, high-performance analog circuit simulator capable of simulating electrical circuits at a variety of abstraction levels. Primarily, Xyce has been written to support the simulation needs of the Sandia National Laboratories electrical designers. This development has focused on improving capability over the current state-of-the-art in the following areas: {sm_bullet} Capability to solve extremely large circuit problems by supporting large-scale parallel computing platforms (up to thousands of processors). Note that this includes support for most popular parallel and serial computers. {sm_bullet} Improved performance for all numerical kernels (e.g., time integrator, nonlinear and linear solvers) through state-of-the-art algorithms and novel techniques. {sm_bullet} Device models which are specifically tailored to meet Sandia's needs, including many radiation-aware devices. {sm_bullet} A client-server or multi-tiered operating model wherein the numerical kernel can operate independently of the graphical user interface (GUI). {sm_bullet} Object-oriented code design and implementation using modern coding practices that ensure that the Xyce Parallel Electronic Simulator will be maintainable and extensible far into the future. Xyce is a parallel code in the most general sense of the phrase - a message-passing parallel implementation - which allows it to run efficiently on the widest possible number of computing platforms. These include serial, shared-memory, and distributed-memory parallel platforms as well as heterogeneous platforms. Careful attention has been paid to the specific nature of circuit-simulation problems to ensure that optimal parallel efficiency is achieved as the number of processors grows. One feature required by designers is the ability to add device models, many specific to the needs of Sandia, to the code. To this end, the device package in the Xyce Parallel Electronic Simulator is designed to support a variety of device model inputs. These input formats include standard analytical models, behavioral models, look-up tables, and mesh-level PDE device models. Combined with this flexible interface is an architectural design that greatly simplifies the addition of circuit models. One of the most important features of Xyce is that it provides a platform for computational research and development aimed specifically at the needs of the Laboratory. With Xyce, Sandia now has an 'in-house' capability with which both new electrical (e.g., device model development) and algorithmic (e.g., faster time-integration methods) research and development can be performed. Ultimately, these capabilities are migrated to end users.

More Details

Xyce Parallel Electronic Simulator : reference guide, version 2.0

Keiter, Eric R.; Hutchinson, Scott A.; Hoekstra, Robert J.; Russo, Thomas V.; Rankin, Eric R.; Pawlowski, Roger P.; Fixel, Deborah A.; Wix, Steven D.

This document is a reference guide to the Xyce Parallel Electronic Simulator, and is a companion document to the Xyce Users' Guide. The focus of this document is to list, as exhaustively as possible, the device parameters, solver options, parser options, and other usage details of Xyce. This document is not intended to be a tutorial. Users who are new to circuit simulation are better served by the Xyce Users' Guide.

More Details

Feature length-scale modeling of LPCVD & PECVD MEMS fabrication processes

Proposed for publication in the Journal of Microsystems Technologies.

Plimpton, Steven J.; Schmidt, Rodney C.

The surface micromachining processes used to manufacture MEMS devices and integrated circuits transpire at such small length scales and are sufficiently complex that a theoretical analysis of them is particularly inviting. Under development at Sandia National Laboratories (SNL) is Chemically Induced Surface Evolution with Level Sets (ChISELS), a level-set based feature-scale modeler of such processes. The theoretical models used, a description of the software, and some example results are presented here. The focus to date has been on low-pressure chemical vapor deposition (LPCVD) and plasma-enhanced chemical vapor deposition (PECVD) processes, both of which are employed in SNL's SUMMiT V technology. Examples of step coverage of SiO{sub 2} into a trench by each of the LPCVD and PECVD processes are presented.

More Details

A new algorithm for computing multivariate Gauss-like quadrature points

Taylor, Mark A.

The diagonal-mass-matrix spectral element method has proven very successful in geophysical applications dominated by wave propagation. For these problems, the ability to run fully explicit time stepping schemes at relatively high order makes the method more competitive than finite element methods which require the inversion of a mass matrix. The method relies on Gauss-Lobatto points to be successful, since the grid points used are required to produce well-conditioned polynomial interpolants and to be high quality 'Gauss-like' quadrature points that exactly integrate a space of polynomials of higher dimension than the number of quadrature points. These two requirements have traditionally limited the diagonal-mass-matrix spectral element method to use square or quadrilateral elements, where tensor products of Gauss-Lobatto points can be used. In non-tensor product domains such as the triangle, both optimal interpolation points and Gauss-like quadrature points are difficult to construct and there are few analytic results. To extend the diagonal-mass-matrix spectral element method to (for example) triangular elements, one must find appropriate points numerically. One successful approach has been to perform numerical searches for high quality interpolation points, as measured by the Lebesgue constant (such as minimum-energy electrostatic points and Fekete points). However, these points typically do not have any Gauss-like quadrature properties. In this work, we describe a new numerical method to look for Gauss-like quadrature points in the triangle, based on a previous algorithm for computing Fekete points. Performing a brute force search for such points is extremely difficult. A common strategy to increase the numerical efficiency of these searches is to reduce the number of unknowns by imposing symmetry conditions on the quadrature points. Motivated by spectral element methods, we propose a different way to reduce the number of unknowns: we look for quadrature formulas that have the same number of points as the number of basis functions used in the spectral element method's transform algorithm. This is an important requirement if they are to be used in a diagonal-mass-matrix spectral element method. This restriction allows for the construction of cardinal functions (Lagrange interpolating polynomials). The ability to construct cardinal functions leads to a remarkable expression relating the variation in the quadrature weights to the variation in the quadrature points. This relation in turn leads to an analytical expression for the gradient of the quadrature error with respect to the quadrature points. Thus the quadrature weights have been completely removed from the optimization problem, and we can implement an exact steepest descent algorithm for driving the quadrature error to zero. Results from the algorithm will be presented for the triangle and the sphere.

More Details

The two-level Newton method and its application to electronic simulation

Keiter, Eric R.; Hutchinson, Scott A.; Hoekstra, Robert J.; Russo, Thomas V.; Rankin, Eric R.

Coupling between transient simulation codes of different fidelity can often be performed at the nonlinear solver level, if the time scales of the two codes are similar. A good example is electrical mixed-mode simulation, in which an analog circuit simulator is coupled to a PDE-based semiconductor device simulator. Semiconductor simulation problems, such as single-event upset (SEU), often require the fidelity of a mesh-based device simulator but are only meaningful when dynamically coupled with an external circuit. For such problems a mixed-level simulator is desirable, but the two types of simulation generally have different (somewhat conflicting) numerical requirements. To address these considerations, we have investigated variations of the two-level Newton algorithm, which preserves tight coupling between the circuit and the PDE device, while optimizing the numerics for both. The research was done within Xyce, a massively parallel electronic simulator under development at Sandia National Laboratories.

More Details

Teuchos::RefCountPtr beginner's guide : an introduction to the Trilinos smart reference-counted pointer class for (almost) automatic dynamic memory management in C++

Bartlett, Roscoe B.

Dynamic memory management in C++ is one of the most common areas of difficulty and errors for amateur and expert C++ developers alike. The improper use of operator new and operator delete is arguably the most common cause of incorrect program behavior and segmentation faults in C++ programs. Here we introduce a templated concrete C++ class Teuchos::RefCountPtr<>, which is part of the Trilinos tools package Teuchos, that combines the concepts of smart pointers and reference counting to build a low-overhead but effective tool for simplifying dynamic memory management in C++. We discuss why the use of raw pointers for memory management, managed through explicit calls to operator new and operator delete, is so difficult to accomplish without making mistakes and how programs that use raw pointers for memory management can easily be modified to use RefCountPtr<>. In addition, explicit calls to operator delete are fragile and result in memory leaks in the presence of C++ exceptions. In its most basic usage, RefCountPtr<> automatically determines when operator delete should be called to free an object allocated with operator new and is not fragile in the presence of exceptions. The class supports more sophisticated use cases as well. This document describes just the most basic usage of RefCountPtr<> to allow developers to get started using it right away. However, more detailed information on the design and advanced features of RefCountPtr<> is provided by the companion document 'Teuchos::RefCountPtr : The Trilinos Smart Reference-Counted Pointer Class for (Almost) Automatic Dynamic Memory Management in C++'.
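
A minimal usage sketch of the pattern described above, with a made-up Widget class: the object allocated with operator new is handed to a RefCountPtr immediately, copies share ownership by incrementing the count, and operator delete is called automatically when the last reference disappears.

#include "Teuchos_RefCountPtr.hpp"
#include <iostream>

class Widget {                         // hypothetical user class
public:
  void work() const { std::cout << "working\n"; }
};

int main() {
  // Wrap the raw result of operator new right away; no explicit delete needed.
  Teuchos::RefCountPtr<Widget> w = Teuchos::rcp(new Widget);
  {
    Teuchos::RefCountPtr<Widget> alias = w;   // reference count becomes 2
    alias->work();
  }                                           // alias goes away; count drops to 1
  w->work();
  return 0;                                   // count hits 0; the Widget is deleted
}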

More Details

Historical precedence and technical requirements of biological weapons use : a threat assessment

Salerno, Reynolds M.; Barnett, Natalie B.; Gaudioso, Jennifer M.; Estes, Daniel P.

The threat from biological weapons is assessed through both a comparative historical analysis of the patterns of biological weapons use and an assessment of the technological hurdles to proliferation and use that must be overcome. The history of biological weapons is studied to learn how agents have been acquired and what types of states and substate actors have used agents. Substate actors have generally been more willing than states to use pathogens and toxins and they have focused on those agents that are more readily available. There has been an increasing trend of bioterrorism incidents over the past century, but states and substate actors have struggled with one or more of the necessary technological steps. These steps include acquisition of a suitable agent, production of an appropriate quantity and form, and effective deployment. The technological hurdles associated with the steps present a real barrier to producing a high consequence event. However, the ever increasing technological sophistication of society continually lowers the barriers, resulting in a low but increasing probability of a high consequence bioterrorism event.

More Details

Amesos 1.0 reference guide

Sala, Marzio S.

This document describes the main functionalities of the Amesos package, version 1.0. Amesos, available as part of Trilinos 4.0, provides an object-oriented interface to several serial and parallel sparse direct solver libraries for the solution of linear systems of equations A X = B, where A is a real sparse, distributed matrix defined as an Epetra_RowMatrix object, and X and B are defined as Epetra_MultiVector objects. Amesos provides a common look-and-feel to several direct solvers, insulating the user from each package's details, such as matrix and vector formats, and data distribution.
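
A minimal sketch of the common look-and-feel the abstract refers to, written from the publicly documented Amesos_BaseSolver interface and assuming the KLU solver is enabled in the build; the tiny diagonal system is only there so the example is complete and runnable.

#include "Epetra_SerialComm.h"
#include "Epetra_Map.h"
#include "Epetra_CrsMatrix.h"
#include "Epetra_Vector.h"
#include "Epetra_LinearProblem.h"
#include "Amesos.h"
#include "Amesos_BaseSolver.h"

int main() {
  Epetra_SerialComm comm;
  Epetra_Map map(10, 0, comm);

  // Trivial diagonal system, just so there is something to factor.
  Epetra_CrsMatrix A(Copy, map, 1);
  for (int i = 0; i < 10; ++i) {
    double v = 2.0;
    A.InsertGlobalValues(i, 1, &v, &i);
  }
  A.FillComplete();
  Epetra_Vector x(map), b(map);
  b.PutScalar(1.0);

  // The three-phase interface shared by all Amesos solvers.
  Epetra_LinearProblem problem(&A, &x, &b);
  Amesos factory;
  Amesos_BaseSolver* solver = factory.Create("Amesos_Klu", problem);
  if (solver) {
    solver->SymbolicFactorization();   // analyze the sparsity pattern
    solver->NumericFactorization();    // factor A
    solver->Solve();                   // back-solve for X
    delete solver;
  }
  return 0;
}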

More Details

Trilinos 4.0 tutorial

Sala, Marzio S.; Heroux, Michael A.; Day, David M.

The Trilinos Project is an effort to facilitate the design, development, integration and ongoing support of mathematical software libraries. The goal of the Trilinos Project is to develop parallel solver algorithms and libraries within an object-oriented software framework for the solution of large-scale, complex multiphysics engineering and scientific applications. The emphasis is on developing robust, scalable algorithms in a software framework, using abstract interfaces for flexible interoperability of components while providing a full-featured set of concrete classes that implement all the abstract interfaces. This document introduces the use of Trilinos, version 4.0. The presented material includes, among other topics, the definition of distributed matrices and vectors with Epetra, the iterative solution of linear systems with AztecOO, incomplete factorizations with IFPACK, multilevel and domain decomposition preconditioners with ML, direct solution of linear systems with Amesos, and iterative solution of nonlinear systems with NOX. The tutorial is a self-contained introduction, intended to help computational scientists effectively apply the appropriate Trilinos package to their applications. Basic examples suitable for imitation are presented. This document is a companion to the Trilinos User's Guide [20] and Trilinos Development Guides [21,22]. Please note that the documentation included in each of the Trilinos packages is of fundamental importance.

More Details

ML 3.0 smoothed aggregation user's guide

Sala, Marzio S.; Hu, Jonathan J.; Tuminaro, Raymond S.

ML is a multigrid preconditioning package intended to solve linear systems of equations Ax = b, where A is a user-supplied n x n sparse matrix, b is a user-supplied vector of length n, and x is a vector of length n to be computed. ML should be used on large sparse linear systems arising from partial differential equation (PDE) discretizations. While technically any linear system can be considered, ML should be used on linear systems that correspond to problems for which multigrid methods work well (e.g., elliptic PDEs). ML can be used as a stand-alone package or to generate preconditioners for a traditional iterative solver package (e.g., Krylov methods). We have supplied support for working with the AZTEC 2.1 and AZTECOO iterative packages [15]. However, other solvers can be used by supplying a few functions. This document describes one specific algebraic multigrid approach: smoothed aggregation. This approach is used within several specialized multigrid methods: one for the eddy current formulation of Maxwell's equations, and a multilevel and domain decomposition method for symmetric and non-symmetric systems of equations (like elliptic equations, or compressible and incompressible fluid dynamics problems). Other methods exist within ML but are not described in this document. Examples are given illustrating the problem definition and exercising multigrid options.
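
From Epetra-based codes, the smoothed aggregation approach described above is commonly used through ML's MultiLevelPreconditioner class and handed to an AztecOO Krylov solver. The sketch below shows that coupling in outline; it is written from memory of the public examples (header names and the "SA" default parameter list are assumptions), and the 1D Laplacian is only a stand-in for a real elliptic discretization.

#include "Epetra_SerialComm.h"
#include "Epetra_Map.h"
#include "Epetra_CrsMatrix.h"
#include "Epetra_Vector.h"
#include "Epetra_LinearProblem.h"
#include "AztecOO.h"
#include "Teuchos_ParameterList.hpp"
#include "ml_MultiLevelPreconditioner.h"

int main() {
  Epetra_SerialComm comm;
  const int n = 1000;
  Epetra_Map map(n, 0, comm);

  // 1D Laplacian: a simple elliptic operator of the kind ML targets.
  Epetra_CrsMatrix A(Copy, map, 3);
  double vals[3] = {-1.0, 2.0, -1.0};
  for (int i = 0; i < n; ++i) {
    int cols[3] = {i - 1, i, i + 1};
    if (i == 0)          A.InsertGlobalValues(i, 2, &vals[1], &cols[1]);
    else if (i == n - 1) A.InsertGlobalValues(i, 2, &vals[0], &cols[0]);
    else                 A.InsertGlobalValues(i, 3, vals, cols);
  }
  A.FillComplete();
  Epetra_Vector x(map), b(map);
  b.PutScalar(1.0);

  // Build a smoothed-aggregation preconditioner using ML's "SA" defaults.
  Teuchos::ParameterList mlList;
  ML_Epetra::SetDefaults("SA", mlList);
  ML_Epetra::MultiLevelPreconditioner prec(A, mlList);

  // Use it inside an AztecOO CG iteration.
  Epetra_LinearProblem problem(&A, &x, &b);
  AztecOO solver(problem);
  solver.SetPrecOperator(&prec);
  solver.SetAztecOption(AZ_solver, AZ_cg);
  solver.Iterate(200, 1.0e-8);
  return 0;
}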

More Details

A mathematical framework for multiscale science and engineering : the variational multiscale method and interscale transfer operators

Wagner, Gregory J.; Bochev, Pavel B.; Christon, Mark A.; Collis, Samuel S.; Lehoucq, Richard B.; Shadid, John N.; Slepoy, Alexander S.

Existing approaches in multiscale science and engineering have evolved from a range of ideas and solutions that are reflective of their original problem domains. As a result, research in multiscale science has followed widely diverse and disjoint paths, which presents a barrier to cross pollination of ideas and application of methods outside their application domains. The status of the research environment calls for an abstract mathematical framework that can provide a common language to formulate and analyze multiscale problems across a range of scientific and engineering disciplines. In such a framework, critical common issues arising in multiscale problems can be identified, explored and characterized in an abstract setting. This type of overarching approach would allow categorization and clarification of existing models and approximations in a landscape of seemingly disjoint, mutually exclusive and ad hoc methods. More importantly, such an approach can provide context for both the development of new techniques and their critical examination. As with any new mathematical framework, it is necessary to demonstrate its viability on problems of practical importance. At Sandia, lab-centric, prototype application problems in fluid mechanics, reacting flows, magnetohydrodynamics (MHD), shock hydrodynamics and materials science span an important subset of DOE Office of Science applications and form an ideal proving ground for new approaches in multiscale science.

More Details

ML 3.1 developer's guide

Sala, Marzio S.; Hu, Jonathan J.; Tuminaro, Raymond S.

ML development was started in 1997 by Ray Tuminaro and Charles Tong. Currently, there are several full- and part-time developers. The kernel of ML is written in ANSI C, and there is a rich C++ interface for Trilinos users and developers. ML can be customized to run geometric and algebraic multigrid; it can solve a scalar or a vector equation (with constant number of equations per grid node), and it can solve a form of Maxwell's equations. For a general introduction to ML and its applications, we refer to the Users Guide [SHT04], and to the ML web site, http://software.sandia.gov/ml.

More Details

Containment of uranium in the proposed Egyptian geologic repository for radioactive waste using hydroxyapatite

Moore, Robert C.; Hasan, Ahmed H.; Larese, Kathleen C.; Headley, Thomas J.; Zhao, Hongting Z.; Salas, Fred S.

Currently, the Egyptian Atomic Energy Authority is designing a shallow-land disposal facility for low-level radioactive waste. To ensure containment and prevent migration of radionuclides from the site, the use of a reactive backfill material is being considered. One material under consideration is hydroxyapatite, Ca{sub 10}(PO{sub 4}){sub 6}(OH){sub 2}, which has a high affinity for the sorption of many radionuclides. Hydroxyapatite has many properties that make it an ideal material for use as a backfill, including low water solubility (K{sub sp}>10{sup -40}), high stability under reducing and oxidizing conditions over a wide temperature range, availability, and low cost. However, there is often considerable variation in the properties of apatites depending on source and method of preparation. In this work, we characterized and compared a synthetic hydroxyapatite with hydroxyapatites prepared from cattle bone calcined at 500 C, 700 C, 900 C and 1100 C. The analysis indicated that the synthetic hydroxyapatite was similar in morphology to the cattle hydroxyapatite prepared at 500 C. With increasing calcination temperature, the crystallinity and crystal size of the hydroxyapatites increased and the BET surface area and carbonate concentration decreased. Batch sorption experiments were performed to determine the effectiveness of each material for sorbing uranium. Sorption of U was strong for all of the apatite materials evaluated, regardless of apatite type. Sixty-day desorption experiments indicated that desorption of uranium from each hydroxyapatite was negligible.

More Details

Algebraic multigrid methods for constrained linear systems with applications to contact problems in solid mechanics

Numerical Linear Algebra with Applications

Adams, Mark F.

This paper develops a general framework for applying algebraic multigrid techniques to constrained systems of linear algebraic equations that arise in applications with discretized PDEs. We discuss constraint coarsening strategies for constructing multigrid coarse grid spaces and several classes of multigrid smoothers for these systems. The potential of these methods is investigated with their application to contact problems in solid mechanics. Published in 2004 by John Wiley & Sons, Ltd.

More Details

Taking ASCI supercomputing to the end game

DeBenedictis, Erik

The ASCI supercomputing program is broadly defined as running physics simulations on progressively more powerful digital computers. What happens if we extrapolate the computer technology to its end? We have developed a model for key ASCI computations running on a hypothetical computer whose technology is parameterized in ways that account for advancing technology. This model includes technology information such as Moore's Law for transistor scaling and developments in cooling technology. The model also includes limits imposed by laws of physics, such as thermodynamic limits on power dissipation, limits on cooling, and the limitation of signal propagation velocity to the speed of light. We apply this model and show that ASCI computations will advance smoothly for another 10-20 years to an 'end game' defined by thermodynamic limits and the speed of light. Performance levels at the end game will vary greatly by specific problem, but will be in the Exaflops to Zettaflops range for currently anticipated problems. We have also found an architecture that would be within a constant factor of giving optimal performance at the end game. This architecture is an evolutionary derivative of the mesh-connected microprocessor (such as ASCI Red Storm or IBM Blue Gene/L). We provide designs for the necessary enhancement to microprocessor functionality and the power-efficiency of both the processor and memory system. The technology we develop in the foregoing provides a 'perfect' computer model with which we can rate the quality of realizable computer designs, both in this writing and as a way of designing future computers. This report focuses on classical computers based on irreversible digital logic, and more specifically on algorithms that simulate space; reversible logic, analog computers, and other ways to address stockpile stewardship are outside the scope of this report.

More Details

Molecular simulations of MEMS and membrane coatings (PECASE)

Thompson, Aidan P.

The goal of this Laboratory Directed Research & Development (LDRD) effort was to design, synthesize, and evaluate organic-inorganic nanocomposite membranes for solubility-based separations, such as the removal of higher hydrocarbons from air streams, using experiment and theory. We synthesized membranes by depositing alkylchlorosilanes on the nanoporous surfaces of alumina substrates, using techniques from the self-assembled monolayer literature to control the microstructure. We measured the permeability of these membranes to different gas species, in order to evaluate their performance in solubility-based separations. Membrane design goals were met by manipulating the pore size, alkyl group size, and alkyl surface density. We employed molecular dynamics simulation to gain further understanding of the relationship between membrane microstructure and separation performance.

More Details

Verification of Euler/Navier-Stokes codes using the method of manufactured solutions

International Journal for Numerical Methods in Fluids

Roy, C.J.; Nelson, C.C.; Smith, T.M.; Ober, Curtis C.

The method of manufactured solutions is used to verify the order of accuracy of two finite-volume Euler and Navier-Stokes codes. The Premo code employs a node-centred approach using unstructured meshes, while the Wind code employs a similar scheme on structured meshes. Both codes use Roe's upwind method with MUSCL extrapolation for the convective terms and central differences for the diffusion terms, thus yielding a numerical scheme that is formally second-order accurate. The method of manufactured solutions is employed to generate exact solutions to the governing Euler and Navier-Stokes equations in two dimensions along with additional source terms. These exact solutions are then used to accurately evaluate the discretization error in the numerical solutions. Through global discretization error analyses, the spatial order of accuracy is observed to be second order for both codes, thus giving a high degree of confidence that the two codes are free from coding mistakes in the options exercised. Examples of coding mistakes discovered using the method are also given. © 2004 John Wiley and Sons, Ltd.
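
The order-verification step described above reduces to a simple calculation once global discretization errors are available on two systematically refined meshes. The sketch below is generic (it is not taken from either code, and the error values are made up); it extracts an observed order of accuracy from error norms at two mesh spacings.

#include <cmath>
#include <cstdio>

// Observed order of accuracy from global discretization errors E(h) and E(h/r),
// where r is the grid refinement ratio: p_obs = log(E_coarse / E_fine) / log(r).
double observedOrder(double errorCoarse, double errorFine, double refinementRatio) {
    return std::log(errorCoarse / errorFine) / std::log(refinementRatio);
}

int main() {
    // Hypothetical L2 errors from a manufactured solution on meshes with spacings h and h/2.
    double eCoarse = 4.1e-3, eFine = 1.05e-3, r = 2.0;
    std::printf("observed order of accuracy = %.2f\n", observedOrder(eCoarse, eFine, r));
    // A value near 2 is consistent with a formally second-order scheme.
    return 0;
}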

More Details

LDRD report : parallel repartitioning for optimal solver performance

Devine, Karen D.; Boman, Erik G.; Heaphy, Robert T.; Hendrickson, Bruce A.; Heroux, Michael A.

We have developed infrastructure, utilities and partitioning methods to improve data partitioning in linear solvers and preconditioners. Our efforts included incorporation of data repartitioning capabilities from the Zoltan toolkit into the Trilinos solver framework, (allowing dynamic repartitioning of Trilinos matrices); implementation of efficient distributed data directories and unstructured communication utilities in Zoltan and Trilinos; development of a new multi-constraint geometric partitioning algorithm (which can generate one decomposition that is good with respect to multiple criteria); and research into hypergraph partitioning algorithms (which provide up to 56% reduction of communication volume compared to graph partitioning for a number of emerging applications). This report includes descriptions of the infrastructure and algorithms developed, along with results demonstrating the effectiveness of our approaches.

More Details

A filter-based evolutionary algorithm for constrained optimization

Proposed for publication in Evolutionary Computations.

Hart, William E.

We introduce a filter-based evolutionary algorithm (FEA) for constrained optimization. The filter used by an FEA explicitly imposes the concept of dominance on a partially ordered solution set. We show that the algorithm is provably robust for both linear and nonlinear problems and constraints. FEAs use a finite pattern of mutation offsets, and our analysis is closely related to recent convergence results for pattern search methods. We discuss how properties of this pattern impact the ability of an FEA to converge to a constrained local optimum.
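
The dominance concept the filter imposes can be made concrete: a candidate with objective value f and aggregate constraint violation h dominates another if it is no worse in both measures and strictly better in at least one. The sketch below is a generic illustration of that test and the corresponding filter update (it is not the FEA from the paper, and the candidate values are made up).

#include <cstdio>
#include <vector>

struct Entry { double f; double h; };   // objective value and constraint violation

// (f1, h1) dominates (f2, h2) if it is no worse in both measures
// and strictly better in at least one.
bool dominates(const Entry& a, const Entry& b) {
    return a.f <= b.f && a.h <= b.h && (a.f < b.f || a.h < b.h);
}

// Accept a candidate only if no current filter entry dominates it,
// then drop any entries the candidate dominates.
bool addToFilter(std::vector<Entry>& filter, const Entry& cand) {
    for (const Entry& e : filter)
        if (dominates(e, cand)) return false;          // rejected by the filter
    std::vector<Entry> kept;
    for (const Entry& e : filter)
        if (!dominates(cand, e)) kept.push_back(e);
    kept.push_back(cand);
    filter.swap(kept);
    return true;
}

int main() {
    std::vector<Entry> filter;
    addToFilter(filter, {3.0, 0.5});                   // made-up candidates
    addToFilter(filter, {2.5, 0.7});
    bool ok = addToFilter(filter, {2.6, 0.8});         // dominated by (2.5, 0.7)
    std::printf("third candidate accepted: %s, filter size = %zu\n",
                ok ? "yes" : "no", filter.size());
    return 0;
}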

More Details

Verification, validation, and predictive capability in computational engineering and physics

Applied Mechanics Reviews

Oberkampf, William L.; Trucano, Timothy G.; Hirsch, Charles

Views of the state of the art in verification and validation (V&V) in computational physics are discussed. These views are described in a framework in which predictive capability relies on V&V, as well as on other factors that affect predictive capability. Research topics addressed include the development of improved procedures for using the phenomena identification and ranking table (PIRT) to prioritize V&V activities, and the method of manufactured solutions for code verification. Also addressed are the development and use of hierarchical validation diagrams, and the construction and use of validation metrics incorporating statistical measures.

More Details

The signature molecular descriptor: 3. Inverse-quantitative structure-activity relationship of ICAM-1 inhibitory peptides

Journal of Molecular Graphics and Modelling

Churchwell, Carla J.; Rintoul, Mark D.; Martin, Shawn; Visco, Donald P.; Kotu, Archana; Larson, Richard S.; Sillerud, Laurel O.; Brown, David C.; Faulon, Jean L.

We present a methodology for solving the inverse-quantitative structure-activity relationship (QSAR) problem using the molecular descriptor called signature. This methodology is detailed in four parts. First, we create a QSAR equation that correlates the occurrence of a signature to the activity values using a stepwise multilinear regression technique. Second, we construct constraint equations, specifically the graphicality and consistency equations, which facilitate the reconstruction of the solution compounds directly from the signatures. Third, we solve the set of constraint equations, which are both linear and Diophantine in nature. Last, we reconstruct and enumerate the solution molecules and calculate their activity values from the QSAR equation. We apply this inverse-QSAR method to a small set of LFA-1/ICAM-1 peptide inhibitors to assist in the search and design of more-potent inhibitory compounds. Many novel inhibitors were predicted, a number of which are predicted to be more potent than the strongest inhibitor in the training set. Two of the more potent inhibitors were synthesized and tested in-vivo, confirming them to be the strongest inhibiting peptides to date. Some of these compounds can be recycled to train a new QSAR and develop a more focused library of lead compounds. © 2003 Elsevier Inc. All rights reserved.

More Details

Covering a set of points with a minimum number of turns

International Journal of Computational Geometry and Applications

Collins, Michael J.

Given a finite set of points in Euclidean space, we can ask what is the minimum number of times a piecewise-linear path must change direction in order to pass through all of them. We prove some new upper and lower bounds for the rectilinear version of this problem in which all motion is orthogonal to the coordinate axes. We also consider the more general case of arbitrary directions.

More Details

Communication patterns and allocation strategies

Leung, Vitus J.

Motivated by observations about job runtimes on the CPlant system, we use a trace-driven microsimulator to begin characterizing the performance of different classes of allocation algorithms on jobs with different communication patterns in space-shared parallel systems with mesh topology. We show that relative performance varies considerably with communication pattern. The Paging strategy using the Hilbert space-filling curve and the Best Fit heuristic performed best across several communication patterns.

More Details

Simulating economic effects of disruptions in the telecommunications infrastructure

Barton, Dianne C.; Eidson, Eric D.; Schoenwald, David A.; Cox, Roger G.; Reinert, Rhonda K.

CommAspen is a new agent-based model for simulating the interdependent effects of market decisions and disruptions in the telecommunications infrastructure on other critical infrastructures in the U.S. economy such as banking and finance, and electric power. CommAspen extends and modifies the capabilities of Aspen-EE, an agent-based model previously developed by Sandia National Laboratories to analyze the interdependencies between the electric power system and other critical infrastructures. CommAspen has been tested on a series of scenarios in which the communications network has been disrupted, due to congestion and outages. Analysis of the scenario results indicates that communications networks simulated by the model behave as their counterparts do in the real world. Results also show that the model could be used to analyze the economic impact of communications congestion and outages.

More Details

Trilinos 3.1 tutorial

Heroux, Michael A.; Sala, Marzio S.

This document introduces the use of Trilinos, version 3.1. Trilinos has been written to support, in a rigorous manner, the solver needs of the engineering and scientific applications at Sandia National Laboratories. The aim of this manuscript is to present the basic features of some of the Trilinos packages. The presented material includes the definition of distributed matrices and vectors with Epetra, the iterative solution of linear systems with AztecOO, incomplete factorizations with IFPACK, multilevel methods with ML, direct solution of linear systems with Amesos, and iterative solution of nonlinear systems with NOX. With the help of several examples, some of the most important classes and methods are detailed for the inexperienced user. Most examples are extensively commented throughout the text; further comments can be found in the source of each example. This document is a companion to the Trilinos User's Guide and Trilinos Development Guides. The documentation included in each of the Trilinos packages is also of fundamental importance.

More Details

Application of multidisciplinary analysis to gene expression

Davidson, George S.; Haaland, David M.; Martin, Shawn

Molecular analysis of cancer, at the genomic level, could lead to individualized patient diagnostics and treatments. The developments to follow will signal a significant paradigm shift in the clinical management of human cancer. Despite our initial hopes, however, it seems that simple analysis of microarray data cannot elucidate clinically significant gene functions and mechanisms. Extracting biological information from microarray data requires a complicated path involving multidisciplinary teams of biomedical researchers, computer scientists, mathematicians, statisticians, and computational linguists. The integration of the diverse outputs of each team is the limiting factor in the progress to discover candidate genes and pathways associated with the molecular biology of cancer. Specifically, one must deal with sets of significant genes identified by each method and extract whatever useful information may be found by comparing these different gene lists. Here we present our experience with such comparisons, and share methods developed in the analysis of an infant leukemia cohort studied on Affymetrix HG-U95A arrays. In particular, spatial gene clustering, hyper-dimensional projections, and computational linguistics were used to compare different gene lists. In spatial gene clustering, different gene lists are grouped together and visualized on a three-dimensional expression map, where genes with similar expressions are co-located. In another approach, projections from gene expression space onto a sphere clarify how groups of genes can jointly have more predictive power than groups of individually selected genes. Finally, online literature is automatically rearranged to present information about genes common to multiple groups, or to contrast the differences between the lists. The combination of these methods has improved our understanding of infant leukemia. While the complicated reality of the biology dashed our initial, optimistic hopes for simple answers from microarrays, we have made progress by combining very different analytic approaches.

More Details

Compact optimization can outperform separation: A case study in structural proteomics

4OR

Carr, Robert D.; Lancia, Giuseppe G.

In Combinatorial Optimization, one is frequently faced with linear programming (LP) problems with exponentially many constraints, which can be solved either using separation or what we call compact optimization. The former technique relies on a separation algorithm, which, given a fractional solution, tries to produce a violated valid inequality. Compact optimization relies on describing the feasible region of the LP by a polynomial number of constraints, in a higher dimensional space. A commonly held belief is that compact optimization does not perform as well as separation in practice. In this paper, we report on an application in which compact optimization does in fact largely outperform separation. The problem arises in structural proteomics, and concerns the comparison of 3-dimensional protein folds. Our computational results show that compact optimization achieves an improvement of up to two orders of magnitude over separation. We discuss some reasons why compact optimization works in this case but not, e.g., for the LP relaxation of the TSP. © Springer-Verlag 2004.

More Details

Color Snakes for Dynamic Lighting Conditions on Mobile Manipulation Platforms

IEEE International Conference on Intelligent Robots and Systems

Schaub, Hanspeter; Smith, Christopher E.

Statistical active contour models (aka statistical pressure snakes) have attractive properties for use in mobile manipulation platforms as both a method for use in visual servoing and as a natural component of a human-computer interface. Unfortunately, the constantly changing illumination expected in outdoor environments presents problems for statistical pressure snakes and for their image gradient-based predecessors. This paper introduces a new color-based variant of statistical pressure snakes that gives superior performance under dynamic lighting conditions and improves upon the previously published results of attempts to incorporate color imagery into active deformable models.

More Details

Equilibration of long chain polymer melts in computer simulations

Journal of Chemical Physics

Auhl, Rolf; Everaers, Ralf; Grest, Gary S.; Kremer, Kurt; Plimpton, Steven J.

Equilibrated melts of long chain polymers were prepared. The combination of molecular dynamic (MD) relaxation, double-bridging and slow push-off allowed the efficient and controlled preparation of equilibrated melts of short, medium, and long chains, respectively. Results were obtained for an off-lattice bead-spring model with chain lengths up to N=7000 beads.

More Details

Final report for the endowment of simulator agents with human-like episodic memory LDRD

Forsythe, James C.; Speed, Ann S.; Lippitt, Carl E.; Schaller, Mark J.; Xavier, Patrick G.; Thomas, Edward V.; Schoenwald, David A.

This report documents work undertaken to endow the cognitive framework currently under development at Sandia National Laboratories with a human-like memory for specific life episodes. Capabilities have been demonstrated within the context of three separate problem areas. The first year of the project developed a capability whereby simulated robots were able to utilize a record of shared experience to perform surveillance of a building to detect a source of smoke. The second year focused on simulations of social interactions providing a queriable record of interactions such that a time series of events could be constructed and reconstructed. The third year addressed tools to promote desktop productivity, creating a capability to query episodic logs in real time allowing the model of a user to build on itself based on observations of the user's behavior.

More Details

Epetra developers coding guidelines

Heroux, Michael A.

Epetra is a package of classes for the construction and use of serial and distributed parallel linear algebra objects. It is one of the base packages in Trilinos. This document describes guidelines for Epetra coding style. The issues discussed here go beyond correct C++ syntax to address practices that make code more readable and self-consistent. The guidelines presented here are intended to aid current and future development of Epetra specifically. They reflect design decisions that were made in the early development stages of Epetra. Some of the guidelines are contrary to more commonly used conventions, but we choose to continue these practices for the purposes of self-consistency. These guidelines are intended to be complementary to policies established in the Trilinos Developers Guide.

More Details

Unique Signal mathematical analysis task group FY03 status report

Cooper, Arlin C.; Johnston, Anna M.

The Unique Signal is a key constituent of Enhanced Nuclear Detonation Safety (ENDS). Although the Unique Signal approach is well prescribed and mathematically assured, there are numerous unsolved mathematical problems that could help assess the risk of deviations from the ideal approach. Some of the mathematics-based results shown in this report are: 1. The risk that two patterns with poor characteristics (easily generated by inadvertent processes) could be combined through exclusive-or mixing to generate an actual Unique Signal pattern has been investigated and found to be minimal (not significant when compared to the incompatibility metric of actual Unique Signal patterns used in nuclear weapons). 2. The risk of generating actual Unique Signal patterns with linear feedback shift registers is minimal, but the patterns in use are not as invulnerable to inadvertent generation by dependent processes as previously thought. 3. New methods of testing pair-wise incompatibility threats have resulted in no significant problems found for the set of Unique Signal patterns currently used. Any new patterns introduced would have to be carefully assessed for compatibility with existing patterns, since some new patterns under consideration were found to be deficient when associated with other patterns in use. 4. Markov models were shown to correspond to some of the engineered properties of Unique Signal sequences. This gives new support for the original design objectives. 5. Potential dependence among events (caused by a variety of communication protocols) has been studied. New evidence has been derived of the risk associated with combined communication of multiple events, and of the improvement in abnormal-environment safety that can be achieved through separate-event communication.

More Details

ChemCell : a particle-based model of protein chemistry and diffusion in microbial cells

Plimpton, Steven J.; Slepoy, Alexander S.

Prokaryotic single-cell microbes are the simplest of all self-sufficient living organisms. Yet microbes create and use much of the molecular machinery present in more complex organisms, and the macro-molecules in microbial cells interact in regulatory, metabolic, and signaling pathways that are prototypical of the reaction networks present in all cells. We have developed a simple simulation model of a prokaryotic cell that treats proteins, protein complexes, and other organic molecules as particles which diffuse via Brownian motion and react with nearby particles in accord with chemical rate equations. The code models protein motion and chemistry within an idealized cellular geometry. It has been used to simulate several simple reaction networks and compared to more idealized models which do not include spatial effects. In this report we describe an initial version of the simulation code that was developed with FY03 funding. We discuss the motivation for the model, highlight its underlying equations, and describe simulations of a 3-stage kinase cascade and a portion of the carbon fixation pathway in the Synechococcus microbe.
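
The particle update described above, Brownian diffusion followed by a probabilistic reaction with a nearby partner, can be sketched in a few lines. The snippet below is a generic illustration of that style of update, not ChemCell code; the diffusion coefficient, timestep, reaction probability, and the neighbor-search placeholder are all made up.

#include <cmath>
#include <cstdio>
#include <random>

int main() {
    std::mt19937 rng(12345);
    std::normal_distribution<double> gauss(0.0, 1.0);
    std::uniform_real_distribution<double> uniform(0.0, 1.0);

    // Made-up parameters for one particle species.
    const double D = 1.0e-12;      // diffusion coefficient (m^2/s)
    const double dt = 1.0e-4;      // timestep (s)
    const double pReact = 0.05;    // reaction probability per encounter

    double pos[3] = {0.0, 0.0, 0.0};
    const double sigma = std::sqrt(2.0 * D * dt);   // RMS displacement per axis

    for (int step = 0; step < 1000; ++step) {
        // Brownian move: independent Gaussian displacement along each axis.
        for (int k = 0; k < 3; ++k)
            pos[k] += sigma * gauss(rng);

        // If a reaction partner were found nearby (not modeled here),
        // fire the reaction with the prescribed probability.
        bool partnerNearby = false;   // placeholder for a neighbor search
        if (partnerNearby && uniform(rng) < pReact) {
            // ... apply the reaction (change species, remove particles, etc.)
        }
    }
    std::printf("final position: %g %g %g\n", pos[0], pos[1], pos[2]);
    return 0;
}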

More Details

Improved kinematic options in ALEGRA

Robinson, Allen C.; Farnsworth, Grant V.

Algorithms for higher order accuracy modeling of kinematic behavior within the ALEGRA framework are presented. These techniques improve the behavior of the code when kinematic errors are found, ensure orthonormality of the rotation tensor at each time step, and increase the accuracy of the Lagrangian stretch and rotation tensor update algorithm. The implementation of these improvements in ALEGRA is described. A short discussion of issues related to improving the accuracy of the stress update procedures is also included.

More Details

Large deformation solid-fluid interaction via a level set approach

Rao, Rekha R.; Noble, David R.; Schunk, Randy; Wilkes, Edward D.; Baer, Thomas A.; Notz, Patrick N.

Solidification and blood flow seemingly have little in common, but each involves a fluid in contact with a deformable solid. In these systems, the solid-fluid interface moves as the solid advects and deforms, often traversing the entire domain of interest. Currently, these problems cannot be simulated without innumerable expensive remeshing steps, mesh manipulations or decoupling the solid and fluid motion. Despite the wealth of progress recently made in mechanics modeling, this glaring inadequacy persists. We propose a new technique that tracks the interface implicitly and circumvents the need for remeshing and remapping the solution onto the new mesh. The solid-fluid boundary is tracked with a level set algorithm that changes the equation type dynamically depending on the phases present. This novel approach to coupled mechanics problems promises to give accurate stresses, displacements and velocities in both phases, simultaneously.

More Details

High throughput instruments, methods, and informatics for systems biology

Davidson, George S.; Sinclair, Michael B.; Thomas, Edward V.; Werner-Washburne, Margaret; Martin, Shawn; Boyack, Kevin W.; Wylie, Brian N.; Haaland, David M.; Timlin, Jerilyn A.; Keenan, Michael R.

High throughput instruments and analysis techniques are required in order to make good use of the genomic sequences that have recently become available for many species, including humans. These instruments and methods must work with tens of thousands of genes simultaneously, and must be able to identify the small subsets of those genes that are implicated in the observed phenotypes, or, for instance, in responses to therapies. Microarrays represent one such high throughput method, which continue to find increasingly broad application. This project has improved microarray technology in several important areas. First, we developed the hyperspectral scanner, which has discovered and diagnosed numerous flaws in techniques broadly employed by microarray researchers. Second, we used a series of statistically designed experiments to identify and correct errors in our microarray data to dramatically improve the accuracy, precision, and repeatability of the microarray gene expression data. Third, our research developed new informatics techniques to identify genes with significantly different expression levels. Finally, natural language processing techniques were applied to improve our ability to make use of online literature annotating the important genes. In combination, this research has improved the reliability and precision of laboratory methods and instruments, while also enabling substantially faster analysis and discovery.

More Details

Hybrid cryptography key management

Torgerson, Mark D.; Beaver, Cheryl L.; Collins, Michael J.; Draelos, Timothy J.; Gallup, Donald R.; Neumann, William D.

Wireless communication networks are highly resource-constrained; thus many security protocols which work in other settings may not be efficient enough for use in wireless environments. This report considers a variety of cryptographic techniques which enable secure, authenticated communication when resources such as processor speed, battery power, memory, and bandwidth are tightly limited.

More Details

Parallel tempering Monte Carlo in LAMMPS

Rintoul, Mark D.; Sears, Mark P.; Plimpton, Steven J.

We present here the details of the implementation of the parallel tempering Monte Carlo technique in LAMMPS, a heavily used massively parallel molecular dynamics code at Sandia. This technique allows many replicas of a system to be run at different simulation temperatures. At various points in the simulation, configurations can be swapped between different temperature environments and then continued. This allows large regions of energy space to be sampled very quickly, and allows minimum energy configurations to emerge in very complex systems, such as large biomolecular systems. By including this algorithm in an existing code, we immediately gain all of the previous work that had been put into LAMMPS, and make this technique quickly available to the entire Sandia and international LAMMPS community. Finally, we present an example of this code applied to folding a small protein.
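The core of parallel tempering is the standard replica-exchange acceptance rule: a swap between two temperatures is accepted with probability min(1, exp[(1/kT_i - 1/kT_j)(E_i - E_j)]). The sketch below illustrates that rule with hypothetical replica energies and temperatures; it is not the LAMMPS implementation.

    import math, random

    # Metropolis-style acceptance test for exchanging configurations between
    # replicas at temperatures T_i and T_j with energies E_i and E_j.
    def try_swap(E_i, E_j, T_i, T_j, k_B=1.0):
        delta = (1.0 / (k_B * T_i) - 1.0 / (k_B * T_j)) * (E_i - E_j)
        return delta >= 0.0 or random.random() < math.exp(delta)

    # Sweep over adjacent replica pairs and exchange temperature labels on acceptance.
    temps = [300.0, 350.0, 400.0, 500.0]
    energies = [-105.2, -98.7, -91.3, -80.1]     # hypothetical replica energies
    for i in range(len(temps) - 1):
        if try_swap(energies[i], energies[i + 1], temps[i], temps[i + 1]):
            temps[i], temps[i + 1] = temps[i + 1], temps[i]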

More Details

Developing close combat behaviors for simulated soldiers using genetic programming techniques

Schaller, Mark J.; Pryor, Richard J.

Genetic programming is a powerful methodology for automatically producing solutions to problems in a variety of domains. It has been used successfully to develop behaviors for RoboCup soccer players and simple combat agents. We will attempt to use genetic programming to solve a problem in the domain of strategic combat, keeping in mind the end goal of developing sophisticated behaviors for compound defense and infiltration. The simplified problem at hand is that of two armed agents in a small room, containing obstacles, fighting against each other for survival. The base case and three changes are considered: a memory of positions using stacks, context-dependent genetic programming, and strongly typed genetic programming. Our work demonstrates slight improvements from the first two techniques, and no significant improvement from the last.

More Details

Architectural requirements for the Red Storm computing system

Tomkins, James; Camp, William

This report is based on the Statement of Work (SOW) describing the various requirements for delivering a new supercomputer system to Sandia National Laboratories (Sandia) as part of the Department of Energy's (DOE) Accelerated Strategic Computing Initiative (ASCI) program. This system is named Red Storm and will be a distributed memory, massively parallel processor (MPP) machine built primarily out of commodity parts. The requirements presented here distill extensive architectural and design experience accumulated over a decade and a half of research, development and production operation of similar machines at Sandia. Red Storm will have an unusually high bandwidth, low latency interconnect, specially designed hardware and software reliability features, a lightweight kernel compute-node operating system and the ability to rapidly switch major sections of the machine between classified and unclassified computing environments. Particular attention has been paid to architectural balance in the design of Red Storm, and it is therefore expected to achieve an atypically high fraction of its peak speed of 41 TeraOPS on real scientific computing applications. In addition, Red Storm is designed to be upgradeable to many times this initial peak capability while still retaining appropriate balance in key design dimensions. Installation of the Red Storm computer system at Sandia's New Mexico site is planned for 2004, and it is expected that the system will be operated for a minimum of five years following installation.

More Details

The Sandia petaflops planner

DeBenedictis, Erik

The Sandia Petaflops Planner is a tool for projecting the design and performance of parallel supercomputers into the future. The mathematical basis of these projections is the International Technology Roadmap for Semiconductors (ITRS, or a detailed version of Moore's Law) and DOE balance factors for supercomputer procurements. The planner is capable of various forms of scenario analysis, cost estimation, and technology analysis. The tool is described along with technology conclusions regarding PFLOPS-level supercomputers in the upcoming decade.

More Details

Algorithmic support for commodity-based parallel computing systems

Leung, Vitus J.; Phillips, Cynthia A.

The Computational Plant or Cplant is a commodity-based distributed-memory supercomputer under development at Sandia National Laboratories. Distributed-memory supercomputers run many parallel programs simultaneously. Users submit their programs to a job queue. When a job is scheduled to run, it is assigned to a set of available processors. Job runtime depends not only on the number of processors but also on the particular set of processors assigned to it. Jobs should be allocated to localized clusters of processors to minimize communication costs and to avoid bandwidth contention caused by overlapping jobs. This report introduces new allocation strategies and performance metrics based on space-filling curves and one-dimensional allocation strategies. These algorithms are general and simple. Preliminary simulations and Cplant experiments indicate that both space-filling curves and one-dimensional packing improve processor locality compared to the sorted free list strategy previously used on Cplant. These new allocation strategies are implemented in Release 2.0 of the Cplant System Software that was phased into the Cplant systems at Sandia by May 2002. Experimental results then demonstrated that the average number of communication hops between the processors allocated to a job strongly correlates with the job's completion time. This report also gives processor-allocation algorithms for minimizing the average number of communication hops between the assigned processors for grid architectures. The associated clustering problem is as follows: Given n points in R{sup d}, find k points that minimize their average pairwise L{sub 1} distance. Exact and approximate algorithms are given for these optimization problems. One of these algorithms has been implemented on Cplant and will be included in Cplant System Software, Version 2.1, to be released. In more preliminary work, we suggest improvements to the scheduler separate from the allocator.
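The allocation metric discussed above, the average pairwise L1 (Manhattan) distance between the grid coordinates assigned to a job, is easy to compute directly. The sketch below evaluates it for a compact block versus a scattered set of nodes on a hypothetical 2-D grid; it illustrates the metric only, not the report's allocation algorithms.

    from itertools import combinations

    # Average pairwise L1 distance of an allocation; lower means more localized.
    def avg_pairwise_l1(coords):
        pairs = list(combinations(coords, 2))
        return sum(sum(abs(a - b) for a, b in zip(p, q)) for p, q in pairs) / len(pairs)

    compact   = [(x, y) for x in range(2) for y in range(2)]      # 2x2 block
    scattered = [(0, 0), (5, 1), (2, 7), (9, 9)]                  # spread-out nodes
    print(avg_pairwise_l1(compact), avg_pairwise_l1(scattered))   # compact is smaller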

More Details

Algorithm development for Prognostics and Health Management (PHM)

Swiler, Laura P.; Campbell, James E.; Lowder, Kelly S.; Doser, Adele D.

This report summarizes the results of a three-year LDRD project on prognostics and health management. 'Prognostics' refers to the capability to predict the probability of system failure over some future time interval (an alternative definition is the capability to predict the remaining useful life of a system). Prognostics are integrated with health monitoring (through inspections, sensors, etc.) to provide an overall PHM capability that optimizes maintenance actions and results in higher availability at a lower cost. Our goal in this research was to develop PHM tools that could be applied to a wide variety of equipment (repairable, non-repairable, manufacturing, weapons, battlefield equipment, etc.) and require minimal customization to move from one system to the next. Thus, our approach was to develop a toolkit of reusable software objects/components and an architecture for their use. We have developed two software tools: an Evidence Engine and a Consequence Engine. The Evidence Engine integrates information from a variety of sources in order to take into account all the evidence that impacts a prognosis for system health. The Evidence Engine has the capability for feature extraction, trend detection, information fusion through Bayesian Belief Networks (BBN), and estimation of remaining useful life. The Consequence Engine involves algorithms to analyze the consequences of various maintenance actions. The Consequence Engine takes as input a maintenance and use schedule, spares information, and time-to-failure data on components, then generates maintenance and failure events, and evaluates performance measures such as equipment availability, mission capable rate, time to failure, and cost. This report summarizes the capabilities we have developed, describes the approach and architecture of the two engines, and provides examples of their use.

More Details

Detection and reconstruction of error control codes for engineered and biological regulatory systems

May, Elebeoba E.; Johnston, Anna M.; Hart, William E.; Watson, Jean-Paul W.; Pryor, Richard J.; Rintoul, Mark D.

A fundamental challenge for all communication systems, engineered or living, is the problem of achieving efficient, secure, and error-free communication over noisy channels. Information theoretic principles have been used to develop effective coding theory algorithms to successfully transmit information in engineering systems. Living systems also successfully transmit biological information through genetic processes such as replication, transcription, and translation, where the genome of an organism is the contents of the transmission. Decoding of received bit streams is fairly straightforward when the channel encoding algorithms are efficient and known. If the encoding scheme is unknown or part of the data is missing or intercepted, how would one design a viable decoder for the received transmission? For such systems, blind reconstruction of the encoding/decoding system would be a vital step in recovering the original message. Communication engineers may not frequently encounter this situation, but for computational biologists and biotechnologists this is an immediate challenge. The goal of this work is to develop methods for detecting and reconstructing the encoder/decoder system for engineered and biological data. Building on Sandia's strengths in discrete mathematics, algorithms, and communication theory, we use linear programming and will use evolutionary computing techniques to construct efficient algorithms for modeling the coding system for minimally errored engineered data streams and genomic regulatory DNA and RNA sequences. The objective for the initial phase of this project is to construct solid parallels between biological literature and fundamental elements of communication theory. In this light, the milestones for FY2003 were focused on defining genetic channel characteristics and providing an initial approximation for key parameters, including coding rate, memory length, and minimum distance values. A secondary objective addressed the question of determining similar parameters for a received, noisy, error-control encoded data set. In addition to these goals, we initiated exploration of algorithmic approaches to determine if a data set could be approximated with an error-control code and performed initial investigations into optimization-based methodologies for extracting the encoding algorithm given the coding rate of encoded noise-free and noisy data streams.
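Two of the code parameters mentioned above, the coding rate k/n and the minimum distance, are simple to compute for a small known block code. The sketch below does so by brute force for a hypothetical binary code; it is illustrative only and uses no data from the project.

    from itertools import combinations

    # Brute-force minimum Hamming distance and rate of a small binary block code.
    # The codeword set is a made-up (n = 7) example.
    codewords = ["0000000", "1101001", "0110100", "1011101"]

    def hamming(a, b):
        return sum(x != y for x, y in zip(a, b))

    d_min = min(hamming(a, b) for a, b in combinations(codewords, 2))
    rate = (len(codewords).bit_length() - 1) / len(codewords[0])   # k/n for 2^k codewords
    print(d_min, rate)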

More Details

Avoiding spurious submovement decompositions: A globally optimal algorithm

Biological Cybernetics

Rohrer, Brandon R.

Evidence for the existence of discrete sub-movements underlying continuous human movement has motivated many attempts to "extract" them. Although they produce visually convincing results, all of the methodologies that have been employed are prone to produce spurious decompositions. Examples of potential failures are given. A branch-and-bound algorithm for submovement extraction, capable of global nonlinear minimization (and hence capable of avoiding spurious decompositions), is developed and demonstrated.
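The branch-and-bound search itself is not reproduced here, but the ingredients it operates on are easy to state: a parametric submovement speed profile (commonly the minimum-jerk profile) and a reconstruction error that the extractor minimizes over submovement parameters. The sketch below illustrates those two pieces under that assumption, with hypothetical parameters; it is not the paper's algorithm.

    import numpy as np

    # Minimum-jerk speed profile of one submovement: start time t0, duration D,
    # amplitude (distance) A. Zero outside [t0, t0 + D].
    def min_jerk_speed(t, t0, D, A):
        tau = np.clip((t - t0) / D, 0.0, 1.0)
        return A * (30 * tau**2 - 60 * tau**3 + 30 * tau**4) / D

    # Squared reconstruction error of a candidate decomposition (list of (t0, D, A)).
    def reconstruction_error(t, observed_speed, submovements):
        model = sum(min_jerk_speed(t, t0, D, A) for (t0, D, A) in submovements)
        return np.sum((observed_speed - model) ** 2)

    t = np.linspace(0, 2, 400)
    observed = min_jerk_speed(t, 0.0, 1.0, 0.1) + min_jerk_speed(t, 0.4, 1.2, 0.08)
    guess = [(0.0, 1.0, 0.1), (0.5, 1.0, 0.08)]
    print(reconstruction_error(t, observed, guess))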

More Details

A network architecture for Petaflops supercomputers

DeBenedictis, Erik

If we are to build a supercomputer with a speed of 10{sup 15} floating point operations per second (1 PetaFLOPS), interconnect technology will need to be improved considerably over what it is today. In this report, we explore one possible interconnect design for such a network. The guiding principle in this design is the optimization of all components for the finiteness of the speed of light. To achieve a linear speedup in time over well-tested supercomputers of today's designs will require scaling up of processor power and bandwidth and scaling down of latency. Latency scaling is the most challenging: it requires a 100 ns user-to-user latency for messages traveling the full diameter of the machine. Meeting this constraint requires simultaneously minimizing wire length through 3D packaging, new low-latency electrical signaling mechanisms, extremely fast routers, and new network interfaces. In this report, we outline approaches and implementations that will meet the requirements when implemented as a system. No technology breakthroughs are required.
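A back-of-envelope check shows why the 100 ns budget is dominated by signal propagation. The machine diameter and propagation-speed fraction below are illustrative assumptions, not figures from the report.

    # Signals in copper or fiber travel at roughly 0.6-0.7 c; whatever time the
    # wires consume must be subtracted from the 100 ns end-to-end budget.
    c = 3.0e8                  # m/s, speed of light in vacuum
    signal_speed = 0.65 * c    # assumed propagation speed in cable/board
    diameter_m = 15.0          # assumed physical diameter of the machine

    wire_delay_ns = diameter_m / signal_speed * 1e9
    remaining_ns = 100.0 - wire_delay_ns   # left for routers, NICs, and software
    print(f"wire delay ~{wire_delay_ns:.0f} ns, leaving ~{remaining_ns:.0f} ns")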

More Details

An assessment of semi-discrete central schemes for hyperbolic conservation laws

Christon, Mark A.; Ketcheson, David I.; Robinson, Allen C.

High-resolution finite volume methods for solving systems of conservation laws have been widely embraced in research areas ranging from astrophysics to geophysics and aero-thermodynamics. These methods are typically at least second-order accurate in space and time, deliver non-oscillatory solutions in the presence of near discontinuities, e.g., shocks, and introduce minimal dispersive and diffusive effects. High-resolution methods promise to provide greatly enhanced solution methods for Sandia's mainstream shock hydrodynamics and compressible flow applications, and they admit the possibility of a generalized framework for treating multi-physics problems such as the coupled hydrodynamics, electro-magnetics and radiative transport found in Z pinch physics. In this work, we describe initial efforts to develop a generalized 'black-box' conservation law framework based on modern high-resolution methods and implemented in an object-oriented software framework. The framework is based on the solution of systems of general non-linear hyperbolic conservation laws using Godunov-type central schemes. In our initial efforts, we have focused on central or central-upwind schemes that can be implemented with only a knowledge of the physical flux function and the minimal/maximal eigenvalues of the Jacobian of the flux functions, i.e., they do not rely on extensive Riemann decompositions. Initial experimentation with high-resolution central schemes suggests that contact discontinuities, with the concomitant linearly degenerate eigenvalues of the flux Jacobian, do not pose algorithmic difficulties. However, central schemes can produce significant smearing of contact discontinuities and excessive dissipation for rotational flows. Comparisons between 'black-box' central schemes and the piecewise parabolic method (PPM), which relies heavily on a Riemann decomposition, show that roughly equivalent accuracy can be achieved for the same computational cost with both methods. However, PPM clearly outperforms the central schemes in terms of accuracy at a given grid resolution, at the cost of additional complexity in the numerical flux functions. Overall we have observed that the finite volume schemes, implemented within a well-designed framework, are extremely efficient with (potentially) very low memory storage. Finally, we have found by computational experiment that second and third-order strong-stability-preserving (SSP) time integration methods with the number of stages greater than the order provide a usefully enhanced stability region. However, we observe that non-SSP and non-optimal SSP schemes with SSP factors less than one can still be very useful if used with time steps below the standard CFL limit. The 'well-designed' integration schemes that we have examined appear to perform well in all instances where the time step is maintained below the standard physical CFL limit.
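The central-scheme property highlighted above, that only the physical flux and a local wave-speed bound are needed, is easiest to see in one dimension. The sketch below applies a first-order local Lax-Friedrichs (Rusanov) flux to Burgers' equation as a stand-in problem; it is an illustration of the idea, not the framework described in the report.

    import numpy as np

    # Scalar 1D conservation law u_t + f(u)_x = 0 with the Rusanov flux:
    # F_{i+1/2} = 0.5*(f(u_i) + f(u_{i+1})) - 0.5*a_{i+1/2}*(u_{i+1} - u_i),
    # where a is a local bound on |f'(u)|. No Riemann decomposition is used.
    def f(u):
        return 0.5 * u * u          # Burgers flux

    def wave_speed(u):
        return np.abs(u)            # |f'(u)| for Burgers

    N, L, cfl = 200, 1.0, 0.4
    dx = L / N
    x = (np.arange(N) + 0.5) * dx
    u = np.where(x < 0.5, 1.0, 0.0)            # Riemann initial data

    t, t_end = 0.0, 0.25
    while t < t_end:
        a = np.maximum(wave_speed(u), wave_speed(np.roll(u, -1)))   # local speeds
        flux = 0.5 * (f(u) + f(np.roll(u, -1))) - 0.5 * a * (np.roll(u, -1) - u)
        dt = min(cfl * dx / max(a.max(), 1e-12), t_end - t)
        u = u - dt / dx * (flux - np.roll(flux, 1))                 # periodic BCs
        t += dt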

More Details

Three-dimensional z-pinch wire array modeling with ALEGRA-HEDP

Proposed for publication in the Computer Physics Communications.

Robinson, Allen C.; Garasi, Christopher J.

An understanding of the dynamics of z-pinch wire array explosion and collapse is of critical interest to the development and future of pulsed power inertial confinement fusion experiments. Experimental results clearly show the extreme three-dimensional nature of the wire explosion and collapse process. The physics of this process can be approximated by the resistive magnetohydrodynamic (MHD) equations augmented by thermal and radiative transport modeling. Z-pinch MHD physics is dominated by material regions whose conductivity properties vary drastically as material passes from solid through melt into plasma regimes. At the same time void regions between the wires are modeled as regions of very low conductivity. This challenging physical situation requires a sophisticated three-dimensional modeling approach matched by sufficient computational resources to make progress in predictive modeling and improved physical understanding.

More Details

Implementing scalable disk-less clusters using the Network File System (NFS)

Laros, James H.; Ward, Harry L.

This paper describes a methodology for implementing disk-less cluster systems using the Network File System (NFS) that scales to thousands of nodes. This method has been successfully deployed and is currently in use on several production systems at Sandia National Labs. This paper will outline our methodology and implementation, discuss hardware and software considerations in detail and present cluster configurations with performance numbers for various management operations like booting.

More Details

Modeling air blast on thin-shell structures with Zapotec

Bessette, Gregory B.; Vaughan, Courtenay T.; Bell, Raymond L.; Attaway, Stephen W.

A new capability for modeling thin-shell structures within the coupled Euler-Lagrange code, Zapotec, is under development. The new algorithm creates an artificial material interface for the Eulerian portion of the problem by expanding a Lagrangian shell element such that it has an effective thickness that spans one or more Eulerian cells. The algorithm implementation is discussed along with several examples involving blast loading on plates.

More Details

Stability of Streamline Upwind Petrov-Galerkin (SUPG) finite elements for transient advection-diffusion problems

Proposed for publication in Journal of Computer Methods in Application and Mechanical Engineering.

Bochev, Pavel B.; Gunzburger, Max D.; Shadid, John N.

Implicit time integration coupled with SUPG discretization in space leads to additional terms that provide consistency and improve the phase accuracy for convection dominated flows. Recently, it has been suggested that for small Courant numbers these terms may dominate the streamline diffusion term, ostensibly causing destabilization of the SUPG method. While consistent with a straightforward finite element stability analysis, this contention is not supported by computational experiments and contradicts earlier von Neumann stability analyses of the semidiscrete SUPG equations. This prompts us to re-examine finite element stability of the fully discrete SUPG equations. A careful analysis of the additional terms reveals that, regardless of the time step size, they are always dominated by the consistent mass matrix. Consequently, SUPG cannot be destabilized for small Courant numbers. Numerical results that illustrate our conclusions are reported.

More Details

Initial evaluation of Centroidal Voronoi Tessellation method for statistical sampling and function integration

Romero, Vicente J.; Gunzburger, Max D.

A recently developed Centroidal Voronoi Tessellation (CVT) unstructured sampling method is investigated here to assess its suitability for use in statistical sampling and function integration. CVT efficiently generates a highly uniform distribution of sample points over arbitrarily shaped M-Dimensional parameter spaces. It has recently been shown on several 2-D test problems to provide superior point distributions for generating locally conforming response surfaces. In this paper, its performance as a statistical sampling and function integration method is compared to that of Latin-Hypercube Sampling (LHS) and Simple Random Sampling (SRS) Monte Carlo methods, and Halton and Hammersley quasi-Monte-Carlo sequence methods. Specifically, sampling efficiencies are compared for function integration and for resolving various statistics of response in a 2-D test problem. It is found that on balance CVT performs best of all these sampling methods on our test problems.
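CVT point sets are commonly generated with Lloyd-type iterations (or probabilistic variants): assign a dense cloud of random points to their nearest generator, then move each generator to the centroid of its region. The sketch below is such a variant on the unit square, with illustrative sizes; the report's CVT construction may differ in detail.

    import numpy as np

    # Monte Carlo Lloyd iteration for a CVT sample set on the unit square.
    rng = np.random.default_rng(1)
    k = 16                                     # number of CVT sample points
    gens = rng.uniform(0, 1, size=(k, 2))      # initial generators

    for it in range(50):
        cloud = rng.uniform(0, 1, size=(20000, 2))              # dense random cloud
        d = np.linalg.norm(cloud[:, None, :] - gens[None, :, :], axis=-1)
        owner = d.argmin(axis=1)                                 # nearest generator
        for j in range(k):
            members = cloud[owner == j]
            if len(members):
                gens[j] = members.mean(axis=0)                   # move to centroid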

More Details

Acoustic telemetry

Drumheller, Douglas S.; Kuszmaul, Scott S.

Broadcasting messages through the earth is a daunting task. Indeed, broadcasting a normal telephone conversation through the earth by wireless means is impossible with today's technology. Most of us don't care, but some do. Industries that drill into the earth need wireless communication to broadcast navigation parameters. This allows them to steer their drill bits. They also need information about the natural formation that they are drilling. Measurements of parameters such as pressure, temperature, and gamma radiation levels can tell them if they have found a valuable resource such as a geothermal reservoir or a stratum bearing natural gas. Wireless communication methods are available to the drilling industry. Information is broadcast via either pressure waves in the drilling fluid or electromagnetic waves in the earth and well tubing. Data transmission can only travel one way, at rates around a few baud. Given that normal Internet telephone modems operate near 20,000 baud, these data rates are truly very slow. Moreover, communication is often interrupted or permanently blocked by drilling conditions or natural formation properties. Here we describe a tool that communicates with stress waves traveling through the steel drill pipe and production tubing in the well. It's based on an old idea called Acoustic Telemetry. But what we present here is more than an idea. This tool exists, it has been used in drilling several wells, and it works. Currently, it's the first and only acoustic telemetry tool that can withstand the drilling environment. It broadcasts one way over a limited range at much faster rates than existing methods, but we also know how to build a system that can communicate both up and down wells of indefinite length.

More Details

On the role of code comparisons in verification and validation

Trucano, Timothy G.; Pilch, Martin P.; Oberkampf, William L.

This report presents a perspective on the role of code comparison activities in verification and validation. We formally define the act of code comparison as the Code Comparison Principle (CCP) and investigate its application in both verification and validation. One of our primary conclusions is that the use of code comparisons for validation is improper and dangerous. We also conclude that while code comparisons may be argued to provide a beneficial component in code verification activities, there are higher quality code verification tasks that should take precedence. Finally, we provide a process for application of the CCP that we believe is minimal for achieving benefit in verification processes.

More Details

Radiation transport algorithms on trans-petaflops supercomputers of different architectures

DeBenedictis, Erik; Christopher, Thomas W.

We seek to understand which supercomputer architecture will be best for supercomputers at the Petaflops scale and beyond. The process we use is to predict the cost and performance of several leading architectures at various years in the future. The basis for predicting the future is an expanded version of Moore's Law called the International Technology Roadmap for Semiconductors (ITRS). We abstract leading supercomputer architectures into chips connected by wires, where the chips and wires have electrical parameters predicted by the ITRS. We then compute the cost of a supercomputer system and the run time on a key problem of interest to the DOE (radiation transport). These calculations are parameterized by the time into the future and the technology expected to be available at that point. We find the new advanced architectures have substantial performance advantages but conventional designs are likely to be less expensive (due to economies of scale). We do not find a universal 'winner'; instead, the right architectural choice is likely to involve non-technical factors such as the availability of capital and how long people are willing to wait for results.

More Details

An overview of Trilinos

Heroux, Michael A.; Kolda, Tamara G.; Long, Kevin R.; Hoekstra, Robert J.; Pawlowski, Roger P.; Phipps, Eric T.; Salinger, Andrew G.; Williams, Alan B.; Hu, Jonathan J.; Lehoucq, Richard B.; Thornquist, Heidi K.; Tuminaro, Raymond S.; Willenbring, James M.; Bartlett, Roscoe B.; Howle, Victoria E.

The Trilinos Project is an effort to facilitate the design, development, integration and ongoing support of mathematical software libraries. In particular, our goal is to develop parallel solver algorithms and libraries within an object-oriented software framework for the solution of large-scale, complex multi-physics engineering and scientific applications. Our emphasis is on developing robust, scalable algorithms in a software framework, using abstract interfaces for flexible interoperability of components while providing a full-featured set of concrete classes that implement all abstract interfaces. Trilinos uses a two-level software structure designed around collections of packages. A Trilinos package is an integral unit usually developed by a small team of experts in a particular algorithms area such as algebraic preconditioners, nonlinear solvers, etc. Packages exist underneath the Trilinos top level, which provides a common look-and-feel, including configuration, documentation, licensing, and bug-tracking. Trilinos packages are primarily written in C++, but provide some C and Fortran user interface support. We provide an open architecture that allows easy integration with other solver packages and we deliver our software to the outside community via the Gnu Lesser General Public License (LGPL). This report provides an overview of Trilinos, discussing the objectives, history, current development and future plans of the project.

More Details

Trilinos users guide

Heroux, Michael A.; Willenbring, James M.

The Trilinos Project is an effort to facilitate the design, development, integration and ongoing support of mathematical software libraries. A new software capability is introduced into Trilinos as a package. A Trilinos package is an integral unit usually developed by a small team of experts in a particular algorithms area such as algebraic preconditioners, nonlinear solvers, etc. The Trilinos Users Guide is a resource for new and existing Trilinos users. Topics covered include how to configure and build Trilinos, what is required to integrate an existing package into Trilinos and examples of how those requirements can be met, as well as what tools and services are available to Trilinos packages. Also discussed are some common practices that are followed by many Trilinos package developers. Finally, a snapshot of current Trilinos packages and their interoperability status is provided, along with a list of supported computer platforms.

More Details

Microtubule-templated biomimetic mineralization of lepidocrocite

Proposed for publication in Advanced Functional Materials.

Bunker, B.C.; Boal, Andrew B.; Headley, Thomas J.; Tissot, Ralph G.

Protein microtubules (MTs) 25 nm in diameter and tens of micrometers long have been used as templates for the biomimetic mineralization of FeOOH. Exposure of MTs to anaerobic aqueous solutions of Fe{sup 2+} buffered to neutral pH followed by aerial oxidation leads to the formation of iron oxide coated MTs. The iron oxide layer was found to grow via a two-step process: initially formed 10-30 nm thick coatings were found to be amorphous in structure and comprised of several iron-containing species. Further growth resulted in MTs coated with highly crystalline layers of lepidocrocite with a controllable thickness of up to 125 nm. On the micrometer size scale, these coated MTs were observed to form large, irregular bundles containing hundreds of individually coated MTs. Iron oxide grew selectively on the MT surface, a result of the highly charged MT surface that provided an interface favorable for iron oxide nucleation. This result illustrates that MTs can be used as scaffolds for the in-situ production of high-aspect-ratio inorganic nanowires.

More Details

Containment of uranium in the proposed Egyptian geologic repository for radioactive waste using hydroxyapatite

Larese, Kathleen C.; Moore, Robert C.; Hasan, Ahmed H.; Headley, Thomas J.; Zhao, Hongting Z.; Salas, Fred S.

Currently, the Egyptian Atomic Energy Authority is designing a shallow-land disposal facility for low-level radioactive waste. To ensure containment and prevent migration of radionuclides from the site, the use of a reactive backfill material is being considered. One material under consideration is hydroxyapatite, Ca{sub 10}(PO{sub 4}){sub 6}(OH){sub 2}, which has a high affinity for the sorption of many radionuclides. Hydroxyapatite has many properties that make it an ideal material for use as a backfill, including low water solubility (K{sub sp} > 10{sup -40}), high stability under reducing and oxidizing conditions over a wide temperature range, availability, and low cost. However, there is often considerable variation in the properties of apatites depending on source and method of preparation. In this work, we characterized and compared a synthetic hydroxyapatite with hydroxyapatites prepared from cattle bone calcined at 500 C, 700 C, 900 C and 1100 C. The analysis indicated the synthetic hydroxyapatite was similar in morphology to the cattle hydroxyapatite prepared at 500 C. With increasing calcination temperature, the crystallinity and crystal size of the hydroxyapatites increased and the BET surface area and carbonate concentration decreased. Batch sorption experiments were performed to determine the effectiveness of each material to sorb uranium. Sorption of U was strong regardless of apatite type, indicating that all of the apatite materials evaluated are effective uranium sorbents. Sixty-day desorption experiments indicated that desorption of uranium from each hydroxyapatite was negligible.

More Details

A comparison of computational methods for the maximum contact map overlap of protein pairs

Proposed for publication in INFORMS J on Computing.

Hart, William E.; Carr, Robert D.

The maximum contact map overlap (MAX-CMO) between a pair of protein structures can be used as a measure of protein similarity. It is a purely topological measure and does not depend on the sequences of the proteins involved in the comparison. More importantly, MAX-CMO presents a very favorable mathematical structure which allows the formulation of integer, linear and Lagrangian models that can be used to obtain guarantees of optimality. It is not the intention of this paper to discuss the mathematical properties of MAX-CMO in detail, as this has been dealt with elsewhere. In this paper we compare three algorithms that can be used to obtain maximum contact map overlaps between protein structures. We will point to the weaknesses and strengths of each one. It is our hope that this paper will encourage researchers to develop new and improved methods for protein comparison based on MAX-CMO.
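A contact map is typically defined by thresholding pairwise residue distances, and the overlap of two maps under a given residue alignment is the number of contacts mapped onto contacts; MAX-CMO maximizes that count over alignments. The sketch below builds the maps and scores a fixed alignment with hypothetical coordinates and cutoff; it does not attempt the hard optimization step and is not one of the paper's algorithms.

    import numpy as np

    # Contact map: residue pairs (i, j), |i - j| >= 2, within a distance cutoff.
    def contact_map(coords, cutoff=8.0):
        d = np.linalg.norm(coords[:, None, :] - coords[None, :, :], axis=-1)
        i, j = np.nonzero((d < cutoff) & (np.triu(np.ones_like(d), k=2) > 0))
        return set(zip(i.tolist(), j.tolist()))

    # Overlap of two maps under a fixed residue alignment (dict: A index -> B index).
    def overlap(map_a, map_b, alignment):
        return sum((alignment.get(i), alignment.get(j)) in map_b for i, j in map_a)

    A = np.random.rand(30, 3) * 20.0
    B = A + np.random.rand(30, 3)              # a perturbed copy of A
    ident = {i: i for i in range(30)}          # identity alignment
    print(overlap(contact_map(A), contact_map(B), ident))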

More Details

Convergence of a discretized self-adaptive evolutionary algorithm on multi-dimensional problems

Proposed for publication in IEEE Transactions on Evolutionary Computation.

Hart, William E.; Delaurentis, John M.

We consider the convergence properties of a non-elitist self-adaptive evolutionary strategy (ES) on multi-dimensional problems. In particular, we apply our recent convergence theory for a discretized (1,{lambda})-ES to design a related (1,{lambda})-ES that converges on a class of separable, unimodal multi-dimensional problems. The distinguishing feature of self-adaptive evolutionary algorithms (EAs) is that the control parameters (like mutation step lengths) are evolved by the evolutionary algorithm. Thus the control parameters are adapted in an implicit manner that relies on the evolutionary dynamics to ensure that more effective control parameters are propagated during the search. Self-adaptation is a central feature of EAs like evolutionary strategies (ES) and evolutionary programming (EP), which are applied to continuous design spaces. Rudolph summarizes theoretical results concerning self-adaptive EAs and notes that the theoretical underpinnings for these methods are essentially unexplored. In particular, convergence theories that ensure convergence to a limit point on continuous spaces have only been developed by Rudolph, Hart, DeLaurentis and Ferguson, and Auger et al. In this paper, we illustrate how our analysis of a (1,{lambda})-ES for one-dimensional unimodal functions can be used to ensure convergence of a related ES on multi-dimensional functions. This (1,{lambda})-ES randomly selects a search dimension in each iteration, along which new points are generated. For a general class of separable functions, our analysis shows that the ES searches along each dimension independently, and thus this ES converges to the (global) minimum.
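For orientation, the sketch below is a textbook-style non-elitist (1,{lambda})-ES with a self-adapted step length that mutates one randomly chosen coordinate per iteration, mirroring the per-dimension search described above. The objective and constants are hypothetical; this is an illustration, not the discretized algorithm analyzed in the paper.

    import numpy as np

    # (1, lambda)-ES with log-normal self-adaptation of the mutation step length,
    # mutating a single randomly chosen search dimension per iteration.
    rng = np.random.default_rng(2)
    f = lambda x: np.sum(x * x)            # separable, unimodal test function

    dim, lam, tau = 5, 10, 0.3
    x, sigma = rng.uniform(-5, 5, dim), 1.0

    for gen in range(200):
        axis = rng.integers(dim)           # search dimension for this iteration
        kids = []
        for _ in range(lam):
            s = sigma * np.exp(tau * rng.normal())   # self-adapted step length
            y = x.copy()
            y[axis] += s * rng.normal()
            kids.append((f(y), y, s))
        _, x, sigma = min(kids, key=lambda k: k[0])  # non-elitist: best offspring only
    print(x, sigma)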

More Details

Blastwall effects on down range explosively-induced overpressure

Preece, Dale S.; Saul, W.V.

Blastwalls are often assumed to be the answer for facility protection from malevolent explosive assault, particularly from large vehicle bombs (LVBs). The assumption is made that the blastwall, if it is built strong enough to survive, will provide substantial protection to facilities and people on the side opposite the LVB. This paper will demonstrate through computer simulations and experimental data the behavior of explosively induced air blasts during interaction with blastwalls. It will be shown that air blasts can effectively wrap around and over blastwalls. Significant pressure reduction can be expected on the downstream side of the blastwall but substantial pressure will continue to propagate. The effectiveness of the blastwall in reducing blast overpressure depends on the geometry of the blastwall and the location of the explosive relative to the blastwall.

More Details

The impact of instructions on aircraft visual inspection performance : a first look at the overall results

Wenner, Caren; Spencer, Floyd W.

The purpose of this study was to investigate the impact of instructions on aircraft visual inspection performance and strategy. Forty-two inspectors from industry were asked to perform inspections of six areas of a Boeing 737. Six different instruction versions were developed for each inspection task, varying in the number and type of directed inspections. The amount of time spent inspecting, the number of calls made, and the number of the feedback calls detected all varied widely across the inspectors. However, inspectors who used instructions with a higher number of directed inspections referred to the instructions more often during and after the task, and found a higher percentage of a selected set of feedback cracks than inspectors using other instruction versions. This suggests that specific instructions can help overall inspection performance, not just performance on the defects specified. Further, instructions were shown to change the way an inspector approaches a task.

More Details

Implications of a PIM architectural model for MPI

Underwood, Keith; Brightwell, Ronald B.

Memory may be the only system component that is more commoditized than a microprocessor. To simultaneously exploit this and address the impending memory wall, processing in memory (PIM) research efforts are considering ways to move processing into memory without significantly increasing the cost of the memory. As such, PIM devices may become the basis for future commodity clusters. Although these PIM devices may leverage new computational paradigms such as hardware support for multi-threading and traveling threads, they must provide support for legacy programming models if they are to supplant commodity clusters. This paper presents a prototype implementation of MPI over a traveling thread mechanism called parcels. A performance analysis indicates that the direct hardware support of a traveling thread model can lead to an efficient, lightweight MPI implementation.

More Details

Mechanisms for radiation dose-rate sensitivity of bipolar transistors

Hjalmarson, Harold P.; Shaneyfelt, Marty R.; Schwank, James R.; Edwards, Arthur H.; Hembree, Charles E.; Mattsson, Thomas M.

Mechanisms for enhanced low-dose-rate sensitivity are described. In these mechanisms, bimolecular reactions dominate the kinetics at high dose rates thereby causing a sub-linear dependence on total dose, and this leads to a dose-rate dependence. These bimolecular mechanisms include electron-hole recombination, hydrogen recapture at hydrogen source sites, and hydrogen dimerization to form hydrogen molecules. The essence of each of these mechanisms is the dominance of the bimolecular reactions over the radiolysis reaction at high dose rates. However, at low dose rates, the radiolysis reaction dominates leading to a maximum effect of the radiation.

More Details

Equation of state for a high-density glass

Wills, Ann E.

Properties of relevance for the equation of state (EOS) for a high-density glass are discussed. We review the effects of failure waves, comminuted phase, and compaction on the validity of the Mie-Grueneisen EOS. The specific heat and the Grueneisen parameter at standard conditions for a {rho}{sub 0} = 5.085 g/cm{sup 3} glass ('Glass A') are then estimated to be 522 mJ/g/K and 0.1-0.3, respectively. The latter value is substantially smaller than the value of 2.1751 given in the SESAME tables for a high-density glass with {rho}{sub 0} = 5.46 g/cm{sup 3}. The present unusual value of the Grueneisen parameter is confirmed by the volume dependence determined from fitting the Mie-Grueneisen EOS to shock data in Ref. [2].
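For reference, the Mie-Grueneisen form referred to above relates pressure to a reference curve (typically the principal Hugoniot or an isentrope) through the Grueneisen parameter; the standard textbook expression, written here for orientation rather than taken from the report, is

    p(\rho, e) = p_{\mathrm{ref}}(\rho) + \frac{\Gamma(\rho)}{v}\left[e - e_{\mathrm{ref}}(\rho)\right],
    \qquad \Gamma = v\left(\frac{\partial p}{\partial e}\right)_{v}, \qquad v = \frac{1}{\rho},

so a small Grueneisen parameter implies that the pressure is only weakly sensitive to thermal energy at fixed volume.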

More Details

Submovements grow larger, fewer, and more blended during stroke recovery

Proposed for publication in Journal of Neuroscience.

Rohrer, Brandon R.

Submovements are hypothesized building blocks of human movement, discrete ballistic movements of which more complex movements are composed. Using a novel algorithm, submovements were extracted from the point-to-point movements of 41 persons recovering from stroke. Analysis of the extracted submovements showed that, over the course of therapy, patients' submovements tended to increase in peak speed and duration. The number of submovements employed to produce a given movement decreased. The time between the peaks of adjacent submovements decreased for inpatients (those less than 1 month post-stroke), but not for outpatients (those greater than 12 months post-stroke) as a group. Submovements became more overlapped for all patients, but more markedly for inpatients. The strength and consistency with which submovement overlap quantified patients' recovery indicate that its analysis might be a useful tool for measuring learning or other changes in motor behavior in future human movement studies.

More Details

Projection of the Cost-Effectiveness of PIMs for Particle Transport Codes

DeBenedictis, Erik; Christopher, Thomas W.

PIM (Processor in Memory) architectures are being proposed for future supercomputers because they reduce the problems that SMP MPPs have with latency. However, they do not meet the SMP MPP balance factors. Because PIMs are relatively processor rich and memory starved, it is unclear whether an ASCI application could run on them, either as-is or with recoding. The KBA (Koch-Baker-Alcouffe) algorithm (Koch, 1992) for particle transport (radiation transport) is shown not to fit on PIMs as written. When redesigned with a 3-D allocation of cells to PIMs, the resulting algorithm is projected to execute an order of magnitude faster and more cost-effectively than the KBA algorithm, albeit with high initial hardware costs.

More Details

Spontaneous ionization of hydrogen atoms at the Si-SiO2 interface

Proposed for publication in Physical Review B.

Hjalmarson, Harold P.; Edwards, Arthur H.; Schultz, Peter A.

We present a series of electronic structure calculations that demonstrate a mechanism for spontaneous ionization of hydrogen at the Si-SiO{sub 2} interface. Specifically, we show that an isolated neutral hydrogen atom will spontaneously give up its charge and bond to a threefold coordinated oxygen atom. We refer to this entity as a proton. We have calculated the potential surface and found it to be entirely attractive. In contrast, hydrogen molecules will not undergo an analogous reaction. We relate these calculations both to proton generation experiments and to hydrogen plasma experiments.

More Details
Results 9601–9800 of 9,998