High-Fidelity Two-Qubit Quantum Gates in a Scalable Ion Trap
Abstract not provided.
Computer
The slowing of Moore's law offers IEEE and its members a unique opportunity to influence research toward continued growth in computing performance.
A method for providing non-diffuse transport of material quantities in arbitrary Lagrangian-Eulerian (ALE) dynamic solid mechanics computations is presented. ALE computations are highly desirable for simulating dynamic problems that incorporate multiple materials and large deformations. Despite the advantages of using ALE for such problems, the method is associated with diffusion of material quantities due to the advection transport step of the computational cycle. This drawback poses great difficulty for applications of material failure for which discrete features are important, but are smeared out as a result of the diffusive advection operation. The focus of this work is an ALE method that incorporates transport of variables on discrete, massless points that move with the velocity field, referred to as Lagrangian material tracers (LMT), and consequently prevents diffusion of certain material quantities of interest. A detailed description of the algorithm is provided along with discussion of its computational aspects. Simulation results include a simple proof of concept, verification using a manufactured solution, and fragmentation of a uniformly loaded thin ring that clearly demonstrates the improvement offered by the ALE LMT method.
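The tracer update at the heart of the LMT method is simple to state: each massless point is moved with the velocity field interpolated at its current position, while the history variables it carries are evolved only by the material model and never passed through the grid advection step. The sketch below illustrates that update under stated assumptions (a forward-Euler move, a placeholder velocity interpolator, and a single example history variable); it is an illustration of the idea, not the paper's implementation.

```cpp
// Sketch of the Lagrangian material tracer (LMT) idea: massless points
// carry history variables and move with the interpolated velocity field,
// so those quantities never pass through the diffusive advection step.
// The velocity interpolator is a placeholder for the ALE code's own.
#include <array>
#include <functional>
#include <vector>

struct Tracer {
    std::array<double, 3> x;  // current position
    double damage;            // example history variable carried along
};

void advect_tracers(std::vector<Tracer>& tracers, double dt,
                    const std::function<std::array<double, 3>(
                        const std::array<double, 3>&)>& velocity_at) {
    for (auto& t : tracers) {
        auto v = velocity_at(t.x);      // interpolate mesh velocity
        for (int d = 0; d < 3; ++d)
            t.x[d] += dt * v[d];        // forward-Euler Lagrangian move
        // t.damage is untouched here: it evolves only via the material
        // model, never via grid advection, so it cannot diffuse.
    }
}
```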
Proceedings - 2016 IEEE 30th International Parallel and Distributed Processing Symposium, IPDPS 2016
Graph algorithms are challenging to parallelize on manycore architectures due to complex data dependencies and irregular memory access. We consider the well-studied problem of coloring the vertices of a graph. In many applications it is important to compute a coloring with few colors in near-linear time. In parallel, the optimistic (speculative) coloring method of Gebremedhin and Manne is the preferred approach, but it needs to be modified for manycore architectures. We discuss a range of implementation issues for this vertex-based optimistic approach. We also propose a novel edge-based optimistic approach that has more parallelism and is better suited to GPUs. We study the performance empirically on two architectures (Xeon Phi and GPU) and across many data sets (from finite element problems to social networks). Our implementation uses the Kokkos library, so it is portable across platforms. We show that on GPUs we significantly reduce the number of colors (geometric mean 4X, up to 48X) compared to the widely used cuSPARSE library. In addition, our edge-based algorithm is 1.5 times faster on average than cuSPARSE, with speedups up to 139X on a circuit problem. We also show the effect of the coloring on a conjugate gradient solver that uses a multi-colored symmetric Gauss-Seidel method as the preconditioner; the higher coloring quality found by the proposed methods reduces the overall solve time by up to 33% compared to cuSPARSE.
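The vertex-based optimistic scheme referenced above can be sketched compactly: every uncolored vertex speculatively takes the smallest color not seen among its neighbors, conflicts between neighbors colored identically in the same round are then detected, and the losers are recolored in the next round. Below is a minimal serial/OpenMP illustration of that idea (the CSR graph layout and the lower-index-wins tie-break are assumptions), not the paper's Kokkos implementation or its edge-based variant.

```cpp
// Gebremedhin-Manne-style optimistic (speculative) vertex coloring sketch.
// Graph is in CSR form (off/adj); `color` must be initialized to -1.
#include <vector>

void optimistic_color(const std::vector<int>& off,
                      const std::vector<int>& adj,
                      std::vector<int>& color) {
    const int n = (int)off.size() - 1;
    std::vector<int> worklist(n);
    for (int i = 0; i < n; ++i) worklist[i] = i;

    while (!worklist.empty()) {
        // Phase 1: speculatively color every vertex in the worklist,
        // reading neighbor colors that may be stale under parallelism.
        #pragma omp parallel for
        for (int k = 0; k < (int)worklist.size(); ++k) {
            int v = worklist[k];
            std::vector<char> used(n + 1, 0);  // forbidden colors
            for (int e = off[v]; e < off[v + 1]; ++e)
                if (color[adj[e]] >= 0) used[color[adj[e]]] = 1;
            int c = 0;
            while (used[c]) ++c;               // smallest free color
            color[v] = c;
        }
        // Phase 2: detect conflicts; the lower-indexed endpoint keeps
        // its color, the other is re-queued for the next round.
        std::vector<int> next;
        for (int k = 0; k < (int)worklist.size(); ++k) {
            int v = worklist[k];
            for (int e = off[v]; e < off[v + 1]; ++e) {
                int u = adj[e];
                if (color[u] == color[v] && u < v) { next.push_back(v); break; }
            }
        }
        worklist.swap(next);
        for (int v : worklist) color[v] = -1;  // uncolor the losers
    }
}
```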
Proceedings - 2016 IEEE 30th International Parallel and Distributed Processing Symposium, IPDPS 2016
Scalable sparse LU factorization is critical for efficient numerical simulation of circuits and electrical power grids. In this work, we present a new scalable sparse direct solver called Basker. Basker introduces a new algorithm to parallelize the Gilbert-Peierls algorithm for sparse LU factorization. As architectures evolve, there is a need for algorithms that are hierarchical in nature, matching the hierarchy in thread teams, individual threads, and vector-level parallelism. Basker is designed to map well to this hierarchy in architectures. There is also a need for data layouts to match the multiple levels of hierarchy in memory. Basker uses a two-dimensional hierarchical structure of sparse matrices that maps to the hierarchy in memory architectures and to the hierarchy in parallelism. We present performance evaluations of Basker on the Intel Sandy Bridge and Xeon Phi platforms using circuit and power grid matrices taken from the University of Florida sparse matrix collection and from Xyce circuit simulations. Basker achieves a geometric mean speedup of 5.91× on CPU (16 cores) and 7.4× on Xeon Phi (32 cores) relative to KLU. Basker outperforms Intel MKL Pardiso (PMKL) by as much as 30× on CPU (16 cores) and 7.5× on Xeon Phi (32 cores) for low fill-in circuit matrices. Furthermore, Basker provides a 5.4× speedup on a challenging matrix sequence taken from an actual Xyce simulation.
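For readers unfamiliar with Gilbert-Peierls, its core is a left-looking factorization in which column j of the factors is obtained by a lower-triangular solve against the columns already computed. The sketch below shows the dense analogue of that column update (no pivoting, no sparsity, no blocking); the actual algorithm adds a symbolic depth-first search to predict each column's nonzero pattern, and Basker layers its two-dimensional blocked parallelism on top.

```cpp
// Dense left-looking LU in the spirit of Gilbert-Peierls (illustrative
// sketch only; no pivoting). A is n*n, column-major, factored in place:
// U ends up on and above the diagonal, unit-diagonal L strictly below.
#include <vector>
#include <cstddef>

void left_looking_lu(int n, std::vector<double>& A) {
    for (int j = 0; j < n; ++j) {
        double* col = &A[(std::size_t)j * n];
        // Triangular solve against previously factored columns: after
        // step k, col[k] holds the final U(k,j).
        for (int k = 0; k < j; ++k) {
            double ukj = col[k];
            for (int i = k + 1; i < n; ++i)
                col[i] -= A[(std::size_t)k * n + i] * ukj;  // col -= L(:,k)*U(k,j)
        }
        // Scale the strictly lower part to form L(:,j).
        double piv = col[j];
        for (int i = j + 1; i < n; ++i) col[i] /= piv;
    }
}
```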
Proceedings - 2016 IEEE 30th International Parallel and Distributed Processing Symposium, IPDPS 2016
Remote Direct Memory Access (RDMA) is expected to be an integral communication mechanism for future exascale systems, enabling asynchronous data transfers so that applications may fully utilize all CPU resources while simultaneously sharing data amongst remote nodes. We examine network-induced memory contention (NiMC), the interaction between RDMA and the memory subsystem that arises when applications and out-of-band services compete for memory resources, and NiMC's resulting impact on application-level performance. For a range of hardware technologies and HPC workloads, we quantify NiMC and show that its impact grows with scale, resulting in up to 3X performance degradation at scales as small as 8K processes, even in applications that have previously been shown to be performance resilient in the presence of noise. We also evaluate three potential techniques to reduce NiMC's performance impact, namely hardware offloading, core reservation, and software-based network throttling. While all three of these solutions show promise, we provide guidelines that help select the best solution for a given environment.
Proceedings - 2016 IEEE 30th International Parallel and Distributed Processing Symposium, IPDPS 2016
All many-core systems require fine-grained shared-memory parallelism; however, the most efficient way to extract such parallelism is far from trivial. Fine-grained parallel algorithms face various performance trade-offs related to tasking, accesses to global data structures, and use of shared cache. While programming models provide high-level abstractions, such as data and task parallelism, algorithmic choices remain open on how best to implement irregular algorithms, such as sparse factorizations, while taking into account the trade-offs mentioned above. In this paper, we compare these performance trade-offs for task and data parallelism on different hardware architectures, including Intel Sandy Bridge, Intel Xeon Phi, and IBM Power8. We do this by comparing the scaling of a new task-parallel incomplete sparse Cholesky factorization called Tacho and a new data-parallel incomplete sparse LU factorization called Basker. Both solvers use the Kokkos programming model and were developed within the ShyLU package of Trilinos. Using these two codes, we demonstrate how high-level programming choices affect performance and overhead costs on multiple multi-/many-core systems. We find that Kokkos is able to provide comparable performance with both parallel-for and task/futures on traditional x86 multicores. However, the choice of which high-level abstraction to use on many-core systems depends on both the architecture and the input matrices.
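As a concrete point of reference, the data-parallel abstraction referred to above looks like the following in Kokkos: a parallel_for that maps one logical iteration per entry onto whatever backend is active, plus a parallel_reduce for reductions. This is a minimal, self-contained sketch of the programming model (a toy dot product, not code from Tacho or Basker); the task/future side uses Kokkos's task scheduler instead and is not shown.

```cpp
// Minimal Kokkos data-parallel kernel: parallel_for + parallel_reduce.
#include <Kokkos_Core.hpp>
#include <cstdio>

int main(int argc, char* argv[]) {
    Kokkos::initialize(argc, argv);
    {
        const int n = 1 << 20;
        Kokkos::View<double*> x("x", n), y("y", n);
        // Data-parallel fill: one logical iteration per entry, mapped by
        // Kokkos onto threads/vector lanes of the chosen backend.
        Kokkos::parallel_for("fill", n, KOKKOS_LAMBDA(const int i) {
            x(i) = 1.0;
            y(i) = 2.0;
        });
        double dot = 0.0;
        Kokkos::parallel_reduce("dot", n,
            KOKKOS_LAMBDA(const int i, double& sum) { sum += x(i) * y(i); },
            dot);
        printf("dot = %f\n", dot);
    }
    Kokkos::finalize();
    return 0;
}
```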
International Conference for High Performance Computing, Networking, Storage and Analysis, SC
In next-generation extreme-scale systems, application performance will be limited by memory performance characteristics. The first exascale system is projected to contain many petabytes of memory. In addition to the sheer volume of the memory required, device trends, such as shrinking feature sizes and reduced supply voltages, have the potential to increase the frequency of memory errors. As a result, resilience to memory errors is a key challenge. In this paper, we evaluate the viability of using memory compression to repair detectable uncorrectable errors (DUEs) in memory. We develop a software library, evaluate its performance and demonstrate that it is able to significantly compress memory of HPC applications. Further, we show that exploiting compressed memory pages to correct memory errors can significantly improve application performance on next-generation systems.
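As a rough illustration of the repair idea, the sketch below keeps a compressed shadow copy of a memory page and restores the page from it when a DUE is reported. It uses zlib for the compression step; the paper's own library, compression scheme, and the OS/hardware path that delivers the DUE notification are abstracted away, so treat this as a sketch of the concept only.

```cpp
// Toy sketch: keep a compressed shadow copy of a page and restore from it
// when a detectable uncorrectable error (DUE) is reported. Uses zlib;
// the DUE notification path is hypothetical and not shown.
#include <zlib.h>
#include <vector>

struct ShadowPage {
    std::vector<unsigned char> blob;  // compressed copy of the page
    uLong raw_size = 0;
};

ShadowPage snapshot(const unsigned char* page, uLong size) {
    ShadowPage s;
    s.raw_size = size;
    uLongf cap = compressBound(size);
    s.blob.resize(cap);
    compress(s.blob.data(), &cap, page, size);  // zlib one-shot compress
    s.blob.resize(cap);
    return s;
}

// Called from the (hypothetical) DUE handler: overwrite the corrupted
// page with the decompressed shadow copy.
bool repair(unsigned char* page, const ShadowPage& s) {
    uLongf out = s.raw_size;
    return uncompress(page, &out, s.blob.data(),
                      (uLong)s.blob.size()) == Z_OK && out == s.raw_size;
}
```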
International Conference for High Performance Computing, Networking, Storage and Analysis, SC
Electrical power efficiency is a primary concern in designing modern HPC systems. Common strategies to improve CPU power efficiency rely on increased parallelism within a processor, enabled both by an increase in the vector capabilities within the core and by the number of cores within a processor. Although many-core processors have been available for some time, achieving power-efficient performance has been challenging due to the offload model. Here, we evaluate the performance of the molecular dynamics code LAMMPS on two new Intel® processors, including the second-generation many-core Intel® Xeon Phi™ processor, which is available as a bootable CPU. We describe our approach to measuring power consumption out-of-band and the software optimizations necessary to achieve energy efficiency. We analyze the benefits of Intel® Advanced Vector Extensions 512 (AVX-512) instructions and demonstrate increased simulation rates with over 9X the CPU+DRAM power efficiency when compared to the unoptimized code on previous-generation processors.
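The vectorization benefit described above comes from keeping the hot inner loops contiguous and branch-light so the compiler can map them onto 512-bit lanes (eight doubles per AVX-512 instruction). The toy kernel below illustrates the loop shape in question via an OpenMP simd pragma; it is a generic Lennard-Jones-style force expression written for illustration, not LAMMPS source.

```cpp
// Branch-free, unit-stride inner loop of the kind AVX-512 vectorizes well.
#include <cstddef>

void lj_forces(std::size_t n, const double* r2,  // squared pair distances
               double* f /* pairwise force magnitudes */) {
    #pragma omp simd
    for (std::size_t i = 0; i < n; ++i) {
        double inv2 = 1.0 / r2[i];
        double inv6 = inv2 * inv2 * inv2;
        // Derivative of 4*(r^-12 - r^-6), with constants folded into units.
        f[i] = (48.0 * inv6 * inv6 - 24.0 * inv6) * inv2;
    }
}
```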
Computer Methods in Applied Mechanics and Engineering
Peridynamics is a nonlocal extension of classical continuum mechanics that is well-suited for solving problems with discontinuities such as cracks. This paper extends the peridynamic formulation to decompose a problem domain into a number of smaller overlapping subdomains and to enable the use of different time steps in different subdomains. This approach allows regions of interest to be isolated and solved at a small time step for increased accuracy while the rest of the problem domain can be solved at a larger time step for greater computational efficiency. Performance of the proposed method in terms of stability, accuracy, and computational cost is examined and several numerical examples are presented to corroborate the findings.
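The control flow of such a multi-time-step scheme can be summarized compactly: the region of interest is advanced with m small steps of size dt for every single large step DT = m*dt taken by the rest of the domain, with interface data reconciled once per coarse step. The sketch below shows only this subcycling skeleton; the callbacks are placeholders standing in for the peridynamic force evaluations and the paper's coupling treatment.

```cpp
// Subcycling skeleton: subdomain A (region of interest) takes m small
// steps for each large step taken by subdomain B, then the overlap
// region is reconciled. Callbacks are hypothetical placeholders.
void advance(double t_end, double dt, int m,
             void (*step_fine)(double),    // advance subdomain A by dt
             void (*step_coarse)(double),  // advance subdomain B by m*dt
             void (*exchange_interface)()) {
    const double DT = m * dt;
    for (double t = 0.0; t < t_end; t += DT) {
        step_coarse(DT);          // one large step in B
        for (int k = 0; k < m; ++k)
            step_fine(dt);        // m small steps in A
        exchange_interface();     // reconcile the overlap region
    }
}
```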
Peridynamic correspondence material models provide a way to combine a material model from the local theory with the inherent capabilities of peridynamics to model long-range forces and fracture. However, correspondence models in a typical particle discretization suffer from zero-energy mode instability. These instabilities are shown here to be an aspect of material stability. A stability condition is derived for state-based materials starting from the requirement of potential energy minimization. It is shown that all correspondence materials fail this stability condition due to zero-energy deformation modes of the family. To eliminate these modes, a term is added to the correspondence strain energy density that resists deviations from a uniform deformation. The resulting material model satisfies the stability condition while effectively leaving the stress tensor unchanged. Computational examples demonstrate the effectiveness of the modified material model in avoiding zero-energy mode instability in a peridynamic particle code.
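The abstract does not give the added term explicitly, but a natural form for a penalty that "resists deviations from a uniform deformation" is an energy on the nonuniform part of the deformation state, that is, on the difference between the actual deformed bond and its image under the deformation gradient. The expression below is one such form, written as an assumption consistent with the description above (α is a stabilization coefficient, ω the influence function, and H the family).

```latex
% Hedged reconstruction (assumption): a stabilization energy penalizing
% the deviation of the deformation state Y<xi> from the uniform
% deformation F*xi over the family H.
\[
  W_{\text{stab}} \;=\; \frac{\alpha}{2} \int_{\mathcal{H}}
    \omega\langle\boldsymbol{\xi}\rangle \,
    \bigl\lvert \underline{\mathbf{Y}}\langle\boldsymbol{\xi}\rangle
      - \mathbf{F}\,\boldsymbol{\xi} \bigr\rvert^{2}
    \, dV_{\boldsymbol{\xi}}
\]
```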
Proceedings of the ACM SIGACT-SIGMOD-SIGART Symposium on Principles of Database Systems
We present history-independent alternatives to a B-tree, the primary indexing data structure used in databases. A data structure is history independent (HI) if it is impossible to deduce any information by examining the bit representation of the data structure that is not already available through the API. We show how to build a history-independent cache-oblivious B-tree and a history-independent external-memory skip list. One of the main contributions is a data structure we build on the way: a history-independent packed-memory array (PMA). The PMA supports efficient range queries, one of the most important operations for answering database queries. Our HI PMA matches the asymptotic bounds of prior non-HI packed-memory arrays and sparse tables. Specifically, a PMA maintains a dynamic set of elements in sorted order in a linear-sized array. Inserts and deletes take amortized O(log^2 N) element moves with high probability. Simple experiments with our implementation of HI PMAs corroborate our theoretical analysis. Comparisons to regular PMAs give preliminary indications that the practical cost of adding history independence is not too large. Our HI cache-oblivious B-tree bounds match those of prior non-HI cache-oblivious B-trees. Searches take O(log_B N) I/Os; inserts and deletes take O((log^2 N)/B + log_B N) amortized I/Os with high probability; and range queries returning k elements take O(log_B N + k/B) I/Os. Our HI external-memory skip list achieves optimal bounds with high probability, analogous to in-memory skip lists: O(log_B N) I/Os for point queries and amortized O(log_B N) I/Os for inserts/deletes. Range queries returning k elements run in O(log_B N + k/B) I/Os. In contrast, the best possible high-probability bound for inserting into the folklore B-skip list, which promotes elements with probability 1/B, is just Θ(log N) I/Os. This is no better than the bounds one gets from running an in-memory skip list in external memory.
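For context, the packed-memory array at the heart of these results keeps a sorted sequence in an array interleaved with gaps, so most inserts only shift a few elements into a nearby gap; when a neighborhood fills up, a window of the array is spread back out evenly. The sketch below shows that mechanism in its simplest form (linear search, whole-array respread on overflow, no density thresholds); it deliberately omits the randomized machinery the paper adds for history independence.

```cpp
// Minimal packed-memory array sketch: sorted keys with gaps (nullopt).
#include <algorithm>
#include <cstddef>
#include <optional>
#include <vector>

struct PMA {
    std::vector<std::optional<int>> slot;  // nullopt marks a gap
    explicit PMA(std::size_t cap) : slot(cap, std::nullopt) {}

    void insert(int key) {
        // Insertion point: first occupied slot holding a key >= `key`.
        // (Linear scan for clarity; a real PMA binary-searches.)
        std::size_t i = 0;
        while (i < slot.size() && (!slot[i] || *slot[i] < key)) ++i;
        // First gap at or after i absorbs the shift, if one exists.
        std::size_t g = i;
        while (g < slot.size() && slot[g]) ++g;
        if (g == slot.size()) { rebalance(key); return; }  // no room here
        for (std::size_t j = g; j > i; --j) slot[j] = slot[j - 1];
        slot[i] = key;
    }

    // Overflow handling, maximally simplified: double the capacity and
    // spread all keys back out evenly (a real PMA respreads only a
    // window chosen by density thresholds).
    void rebalance(int key) {
        std::vector<int> keys;
        for (auto& s : slot)
            if (s) keys.push_back(*s);
        keys.insert(std::lower_bound(keys.begin(), keys.end(), key), key);
        slot.assign(2 * slot.size(), std::nullopt);
        for (std::size_t k = 0; k < keys.size(); ++k)
            slot[k * slot.size() / keys.size()] = keys[k];
    }
};
```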
Proceedings of the 6th International Workshop on Runtime and Operating Systems for Supercomputers (ROSS 2016), held in conjunction with HPDC 2016
As supercomputers move toward exascale, the number of cores per node continues to increase, but the I/O bandwidth between nodes is increasing more slowly, so computational power is outstripping I/O bandwidth. This imbalance, in turn, encourages moving as much of an HPC workflow as possible onto the node in order to minimize data movement. One particular method of application composition, enclaves, co-locates different operating systems and runtimes on the same node, where they communicate by in situ communication mechanisms. In this work, we describe a mechanism for communicating between composed applications. We implement it using copy-on-write cooperating with XEMEM shared memory to provide consistent, implicitly unsynchronized communication across enclaves. We then evaluate this mechanism using a composed application and analytics between the Kitten Lightweight Kernel and Linux on top of the Hobbes Operating System and Runtime. The results show a 3% overhead compared to an application running in isolation, demonstrating the viability of this approach.
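The essential copy-on-write behavior is easy to demonstrate with plain POSIX primitives: mapping a shared region MAP_PRIVATE gives a reader an in-place view whose pages are copied only when written, leaving the producer's copy untouched. The sketch below illustrates exactly that semantics, with a file-backed mapping standing in for an XEMEM-exported region; it is a POSIX illustration of the mechanism, not the Hobbes/XEMEM implementation.

```cpp
// Copy-on-write view of a shared region via MAP_PRIVATE (POSIX sketch).
#include <sys/mman.h>
#include <fcntl.h>
#include <unistd.h>
#include <cstdio>

int main() {
    const size_t len = 4096;
    int fd = open("/tmp/shared_region", O_RDWR | O_CREAT, 0600);
    if (fd < 0 || ftruncate(fd, len) != 0) return 1;

    // Analytics-side view: private mapping => copy-on-write pages.
    double* view = (double*)mmap(nullptr, len, PROT_READ | PROT_WRITE,
                                 MAP_PRIVATE, fd, 0);
    if (view == MAP_FAILED) return 1;

    view[0] += 1.0;  // triggers a page copy; the backing file is untouched
    printf("private view: %f\n", view[0]);

    munmap(view, len);
    close(fd);
    return 0;
}
```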