Publications Search

Final report for Cognitive Computing for Security LDRD 165613. It reports on the development of hybrid of general purpose/ne uromorphic computer architecture, with an emphasis on potential implementation with memristors.

More Details

TYPE SAND Report YEAR 2015

DOI OSTI

Practical Resilient Cases for FA-MPI a Transactional Fault-Tolerant MPI

Brightwell, Ronald B.; Hassani, Amin; Skjellum, Anthony; Bangalore, Purushotham

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2015

OSTI

XVis: Visualization for the Extreme-Scale Scientific-Computation Ecosystem: Year-end report FY15 Q4

Moreland, Kenneth D.; Sewell, Christopher; Childs, Hank; Ma, Kwan-Liu; Geveci, Berk; Meredith, Jeremy

The XVis project brings together the key elements of research to enable scientific discovery at extreme scale. Scientific computing will no longer be purely about how fast computations can be performed. Energy constraints, processor changes, and I/O limitations necessitate significant changes in both the software applications used in scientific computation and the ways in which scientists use them. Components for modeling, simulation, analysis, and visualization must work together in a computational ecosystem, rather than working independently as they have in the past. This project provides the necessary research and infrastructure for scientific discovery in this new computational ecosystem by addressing four interlocking challenges: emerging processor technology, in situ integration, usability, and proxy analysis.

More Details

TYPE Other Report YEAR 2015

DOI OSTI

Adaptive Self-Tuning Networks

Draelos, Timothy J.; Peterson, Matthew G.; Lawry, Benjamin J.; Knox, Hunter A.; Faust, Aleksanrdra; Chael, Eric P.; Young, Christopher J.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2015

OSTI

SemiImplicit Coupling of Ice Sheet Momentum and Thickness Evolution Equations

Perego, Mauro P.; Price, Stephen; Hoffman, Matthew; Salinger, Andrew G.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2015

OSTI

Multifidelity surrogates of groundwater flow

Asher, Michael; Jakeman, John D.; Jakeman, A.J.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2015

OSTI

Full waveform inversion methods for source and media characterization before and after SPE5

Phillips-Alonge, Kristin E.; Knox, Hunter A.; Ober, Curtis C.; Abbott, Robert A.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2015

OSTI

A time-parallel method for the solution of PDE-constrained optimization problems

Ridzal, Denis R.; Cyr, Eric C.; Hajghassem, Mona

We study a time-parallel approach to solving quadratic optimization problems with linear time-dependent partial differential equation (PDE) constraints. These problems arise in formulations of optimal control, optimal design and inverse problems that are governed by parabolic PDE models. They may also arise as subproblems in algorithms for the solution of optimization problems with nonlinear time-dependent PDE constraints, e.g., in sequential quadratic programming methods. We apply a piecewise linear finite element discretization in space to the PDE constraint, followed by the Crank-Nicolson discretization in time. The objective function is discretized using finite elements in space and the trapezoidal rule in time. At this point in the discretization, auxiliary state variables are introduced at each discrete time interval, with the goal to enable: (i) a decoupling in time; and (ii) a fixed-point iteration to recover the solution of the discrete optimality system. The fixed-point iterative schemes can be used either as preconditioners for Krylov subspace methods or as smoothers for multigrid (in time) schemes. We present promising numerical results for both use cases.

More Details

TYPE Other Report YEAR 2015

DOI OSTI

DSMC Simulations of the Hydrodynamic Instabilities in Gases

Gallis, Michail A.; Torczynski, J.R.; Plimpton, Steven J.; Koehler, Timothy P.

Abstract not provided.

More Details

TYPE Presentation YEAR 2015

OSTI

Sparta Validation and Verification

Gallis, Michail A.; Torczynski, J.R.; Plimpton, Steven J.; Koehler, Timothy P.

Abstract not provided.

More Details

TYPE Presentation YEAR 2015

OSTI

DSMC Simulations of Gas Flows

Gallis, Michail A.; Torczynski, J.R.; Plimpton, Steven J.; Koehler, Timothy P.

Abstract not provided.

More Details

TYPE Presentation YEAR 2015

OSTI

Preconditioning Communication-Avoiding Krylov Methods

Rajamanickam, Sivasankaran R.; Yamazaki, I.; Boman, Erik G.; Prokopenko, Andrey V.; Heroux, Michael A.; Dongarra, J.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2015

OSTI

SDAV Progress Report (2015-12)

Moreland, Kenneth D.

Abstract not provided.

More Details

TYPE Presentation YEAR 2015

OSTI

Why We Use Bad Color Maps and What You Can Do About It

Moreland, Kenneth D.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2015

OSTI

Inversion for Eigenvalues and Modes Using Sierra-SD and ROL

Walsh, Timothy W.; Aquino, Wilkins A.; Ridzal, Denis R.; Kouri, Drew P.

In this report we formulate eigenvalue-based methods for model calibration using a PDE-constrained optimization framework. We derive the abstract optimization operators from first principles and implement these methods using Sierra-SD and the Rapid Optimization Library (ROL). To demon- strate this approach, we use experimental measurements and an inverse solution to compute the joint and elastic foam properties of a low-fidelity unit (LFU) model.

More Details

TYPE SAND Report YEAR 2015

DOI OSTI

A Data-Driven Approach to PDE-Constrained Optimization under Uncertainty

Kouri, Drew P.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2015

OSTI

Topology Optimization of Cellular Structure

Additive manufacturing

Robbins, Joshua R.; Owen, Steven J.; Clark, Brett W.; Voth, Thomas E.

This paper presents an end-to-end design process for compliance minimization based topological optimization of cellular structures through to the realization of a final printed product. Homogenization is used to derive properties representative of these structures through direct numerical simulation of unit cell models of the underlying periodic structure. The resulting homogenized properties are then used assuming uniform distribution of the cellular structure to compute the final macro-scale structure. A new method is then presented for generating an STL representation of the final optimized part that is suitable for printing on typical industrial machines. Quite fine cellular structures are shown to be possible using this method as compared to other approaches that use nurb based CAD representations of the geometry. Finally, results are presented that illustrate the fine-scale stresses developed in the final macro-scale optimized part and suggestions are made as to incorporate these features into the overall optimization process.

More Details

TYPE Journal Article YEAR 2015

OSTI

Density Functional Theory of Extreme Environments

Magyar, Rudolph J.

Abstract not provided.

More Details

TYPE Presentation YEAR 2015

OSTI

A Sparse Coding Model of the Hippocampal Dentate Gyrus

Aimone, James B.; Parekh, Ojas D.; Severa, William M.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2015

OSTI

A Massively Parallel Scalable Implicit SPH Solver

Trask, Nathaniel; Maxey, Martin; Kim, Kyungjoo K.; Perego, Mauro P.; Parks, Michael L.; Yang, Kai; Pan, Wenxiao; Tartakovsky, Alex

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2015

OSTI

Finding Non-Human Nodes in Social Networks Using Only Topology

Berry, Jonathan W.; Kearns, Aaron; Phillips, Cynthia A.; Saia, Jared

More Details

TYPE Conference Poster YEAR 2015

OSTI

The energy scaling advantages of RRAM crossbars

2015 4th Berkeley Symposium on Energy Efficient Electronic Systems, E3S 2015 - Proceedings

Agarwal, Sapan A.; Parekh, Ojas D.; Quach, Tu-Thach Q.; James, Conrad D.; Aimone, James B.; Marinella, Matthew J.

As transistors start to approach fundamental limits and Moore's law slows down, new devices and architectures are needed to enable continued performance gains. New approaches based on RRAM (resistive random access memory) or memristor crossbars can enable the processing of large amounts of data[1, 2]. One of the most promising applications for RRAM crossbars is brain inspired or neuromorphic computing[3, 4].

More Details

TYPE Conference Poster YEAR 2015

OSTI Scopus

Millivolt switches will support better energy-reliability tradeoffs

2015 4th Berkeley Symposium on Energy Efficient Electronic Systems, E3S 2015 - Proceedings

DeBenedictis, Erik; Zima, Hans

Millivolt switches will not only improve energy efficiency, but will enable a new capability to manage the energy-reliability tradeoff. By effectively utilizing this system-level capability, it may be possible to obtain one or two additional generations of scaling beyond current projections. Millivolt switches will enable further energy scaling, a process that is expected to continue until the technology encounters thermal noise errors [Theis 10]. If thermal noise errors can be accommodated at higher levels through a new form of error correction, it may be possible to scale about 3× lower in system energy than is currently projected. A general solution to errors would also address long standing problems with Cosmic Ray strikes, weak and aging parts, some cyber security vulnerabilities, etc.

More Details

TYPE Conference Poster YEAR 2015

DOI OSTI Scopus

Millivolt switches will support better energy-reliability tradeoffs

2015 4th Berkeley Symposium on Energy Efficient Electronic Systems, E3S 2015 - Proceedings

DeBenedictis, Erik; Zima, Hans

Millivolt switches will not only improve energy efficiency, but will enable a new capability to manage the energy-reliability tradeoff. By effectively utilizing this system-level capability, it may be possible to obtain one or two additional generations of scaling beyond current projections. Millivolt switches will enable further energy scaling, a process that is expected to continue until the technology encounters thermal noise errors [Theis 10]. If thermal noise errors can be accommodated at higher levels through a new form of error correction, it may be possible to scale about 3× lower in system energy than is currently projected. A general solution to errors would also address long standing problems with Cosmic Ray strikes, weak and aging parts, some cyber security vulnerabilities, etc.

More Details

TYPE Conference Poster YEAR 2015

DOI OSTI Scopus

Infrastructure for in situ system monitoring and application data analysis

Proceedings of ISAV 2015: 1st International Workshop on In Situ Infrastructures for Enabling Extreme-Scale Analysis and Visualization, Held in conjunction with SC 2015: The International Conference for High Performance Computing, Networking, Storage and Analysis

Brandt, James M.; Devine, Karen D.; Gentile, Ann C.

We present an architecture for high-performance computers that integrates in situ analysis of hardware and system monitoring data with application-specific data to reduce application runtimes and improve overall platform utilization. Large-scale high-performance computing systems typically use monitoring as a tool unrelated to application execution. Monitoring data flows from sampling points to a centralized off-system machine for storage and post-processing when root-cause analysis is required. Along the way, it may also be used for instantaneous threshold-based error detection. Applications can know their application state and possibly allocated resource state, but typically, they have no insight into globally shared resource state that may affect their execution. By analyzing performance data in situ rather than off-line, we enable applications to make real-time decisions about their resource utilization. We address the particular case of in situ network congestion analysis and its potential to improve task placement and data partitioning. We present several design and analysis considerations.

More Details

TYPE Conference Poster YEAR 2015

DOI OSTI Scopus

Local recovery and failure masking for stencil-based applications at extreme scales

International Conference for High Performance Computing, Networking, Storage and Analysis, SC

Gamell, Marc; Teranishi, Keita T.; Heroux, Michael A.; Mayo, Jackson M.; Kolla, Hemanth K.; Chen, Jacqueline H.; Parashar, Manish

Application resilience is a key challenge that has to be addressed to realize the exascale vision. Online recovery, even when it involves all processes, can dramatically reduce the overhead of failures as compared to the more traditional approach where the job is terminated and restarted from the last checkpoint. In this paper we explore how local recovery can be used for certain classes of applications to further reduce overheads due to resilience. Specifically we develop programming support and scalable runtime mechanisms to enable online and transparent local recovery for stencil-based parallel applications on current leadership class systems. We also show how multiple independent failures can be masked to effectively reduce the impact on the total time to solution. We integrate these mechanisms with the S3D combustion simulation, and experimentally demonstrate (using the Titan Cray-XK7 system at ORNL) the ability to tolerate high failure rates (i.e., node failures every 5 seconds) with low overhead while sustaining performance, at scales up to 262144 cores.

More Details

TYPE Conference Poster YEAR 2015

DOI OSTI Scopus

STS-k: A multilevel sparse triangular solution scheme for NUMA multicores

International Conference for High Performance Computing, Networking, Storage and Analysis, SC

Kabir, Humayun; Booth, Joshua D.; Aupy, Guillaume; Benoit, Anne; Robert, Yves; Raghavan, Padma

We consider techniques to improve the performance of parallel sparse triangular solution on non-uniform memory architecture multicores by extending earlier coloring and level set schemes for single-core multiprocessors. We develop STS-k, where k represents a small number of transformations for latency reduction from increased spatial and temporal locality of data accesses. We propose a graph model of data reuse to inform the development of STS-k and to prove that computing an optimal cost schedule is NP-complete. We observe significant speed-ups with STS-3 on 32-core Intel Westmere-Ex and 24-core AMD 'MagnyCours' processors. Incremental gains solely from the 3-level transformations in STS-3 for a fixed ordering, correspond to reductions in execution times by factors of 1.4(Intel) and 1.5(AMD) for level sets and 2(Intel) and 2.2(AMD) for coloring. On average, execution times are reduced by a factor of 6(Intel) and 4(AMD) for STS-3 with coloring compared to a reference implementation using level sets.

More Details

TYPE Conference Poster YEAR 2015

DOI OSTI Scopus

Early experiences with node-level power capping on the cray XC40 platform

Proceedings of E2SC 2015: 3rd International Workshop on Energy Efficient Supercomputing - Held in conjunction with SC 2015: The International Conference for High Performance Computing, Networking, Storage and Analysis

Laros, James H.; Olivier, Stephen L.; Ferreira, Kurt B.; Shipman, Galen; Shu, Wei

Power consumption of extreme-scale supercomputers has become a key performance bottleneck. Yet current practices do not leverage power management opportunities, instead running at maximum power. This is not sustainable. Future systems will need to manage power as a critical resource, directing it to where it has greatest benefit. Power capping is one mechanism for managing power budgets, however its behavior is not well understood. This paper presents an empirical evaluation of several key HPC workloads running under a power cap on a Cray XC40 system, and provides a comparison of this technique with p-state control, demonstrating the performance differences of each. These results show: 1.) Maximum performance requires ensuring the cap is not reached; 2.) Performance slowdown under a cap can be attributed to cascading delays which result in unsynchronized performance variability across nodes; and, 3.) Due to lag in reaction time, considerable time is spent operating above the set cap. This work provides a timely and much needed comparison of HPC application performance under a power cap and attempts to enable users and system administrators to understand how to best optimize application performance on power-constrained HPC systems.

More Details

TYPE Conference Poster YEAR 2015

DOI OSTI Scopus

A task-based linear algebra Building Blocks approach for scalable graph analytics

2015 IEEE High Performance Extreme Computing Conference, HPEC 2015

Wolf, Michael W.; Berry, Jonathan W.; Stark, Dylan T.

It is challenging to obtain scalable HPC performance on real applications, especially for data science applications with irregular memory access and computation patterns. To drive co-design efforts in architecture, system, and application design, we are developing miniapps representative of data science workloads. These in turn stress the state of the art in Graph BLAS-like Graph Algorithm Building Blocks (GABB). In this work, we outline a Graph BLAS-like, linear algebra based approach to miniTri, one such miniapp. We describe a task-based prototype implementation and give initial scalability results.

More Details

TYPE Conference Poster YEAR 2015

DOI OSTI Scopus

A checkpoint compression study for high-performance computing systems

International Journal of High Performance Computing Applications

Ferreira, Kurt B.; Arnold, Dorian; Ibtesham, Dewan

As high-performance computing systems continue to increase in size and complexity, higher failure rates and increased overheads for checkpoint/restart (CR) protocols have raised concerns about the practical viability of CR protocols for future systems. Previously, compression has proven to be a viable approach for reducing checkpoint data volumes and, thereby, reducing CR protocol overhead leading to improved application performance. In this article, we further explore compression-based CR optimization by exploring its baseline performance and scaling properties, evaluating whether improved compression algorithms might lead to even better application performance and comparing checkpoint compression against and alongside other software- and hardware-based optimizations. Our results highlights are that: (1) compression is a very viable CR optimization; (2) generic, text-based compression algorithms appear to perform near optimally for checkpoint data compression and faster compression algorithms will not lead to better application performance; (3) compression-based optimizations fare well against and alongside other software-based optimizations; and (4) while hardware-based optimizations outperform software-based ones, they are not as cost effective.

More Details

TYPE Journal Article YEAR 2015

DOI OSTI Scopus DOI OSTI Scopus

VTK-m: Building a Visualization Toolkit for Massively Threaded Architectures (UltraVis 2015)

Moreland, Kenneth D.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2015

OSTI

PLATO Platinum Topology Optimization

Blacker, Ted D.; Robbins, Joshua R.; Owen, Steven J.; Aguilo Valentin, Miguel A.; Clark, Brett W.; Voth, Thomas E.

Abstract not provided.

More Details

TYPE Presentation YEAR 2015

OSTI

Overtime: A Tool for Analyzing Performance Variation due to Network Interference

Grant, Ryan E.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2015

DOI OSTI

Lightweight Threading with MPI Using Persistent Communcations Semantics

Grant, Ryan E.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2015

OSTI

Early Experiences with Node-Level Power Capping on the Cray XC40 Platform [PowerPoint]

Laros, James H.; Olivier, Stephen L.; Ferreira, Kurt B.; Shipman, Galen; Shu, Wei

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2015

DOI OSTI

NNSA/CEA Cooperation in Computer Science: Visualization SC15 Update

Bergeret, Nicolas; Brugger, Eric; De Verdiere, Guillaume C.; Crossno, Patricia J.; Guilbaud, Claire; Patchett, John

Abstract not provided.

More Details

TYPE Presentation YEAR 2015

OSTI

Power API HPC Controls and Monitoring

Laros, James H.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2015

OSTI

Interconnect So*ware/Hardware Co-Design for Extreme-Scale Systems

Brightwell, Ronald B.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2015

OSTI

Coupling Meshfree Peridynamics with Local Finite-Element Models

Littlewood, David J.; Silling, Stewart A.; Seleson, Pablo D.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2015

OSTI

So you might want to work at a national lab?

Phillips, Cynthia A.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2015

OSTI

NNSA Advanced Architecture Test Beds

Laros, James H.; Ang, James A.; Hammond, Simon D.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2015

OSTI

Risk-Averse Optimization of Large-Scale Multiphysics Systems

Kouri, Drew P.; Aquino, Wilkins A.; Ridzal, Denis R.; Rockafellar, Ralph T.; Shapiro, Alexander; Uryasev, Stan

Abstract not provided.

More Details

TYPE Presentation YEAR 2015

OSTI

Scheduling In-Situ Analytics in Next-generation Applications

Mondragon, Oscar H.; Bridges, Patrick G.; Ferreira, Kurt B.; Widener, Patrick W.; Levy, Scott L.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2015

OSTI

Maintainability and Performance for LAMMPS

Trott, Christian R.; Shan, Tzu-Ray S.; Moore, Stan G.; Thompson, Aidan P.; Plimpton, Steven J.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2015

OSTI

Programming Models for Parallel Architectures and Requirements for Pre-Exascale

Hammond, Simon D.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2015

OSTI

ARM HPC Users Group

Hammond, Simon D.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2015

OSTI

Power API for HPC: Standardizing Power Measurement and Control

Laros, James H.; Laros, James H.; Kelly, Suzanne M.; Levenhagen, Michael J.; DeBonis, David D.; Olivier, Stephen L.; Grant, Ryan E.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2015

OSTI

DSMC Simulations of the Rayleigh-Taylor Instability in Gases

Gallis, Michail A.; Koehler, Timothy P.; Torczynski, J.R.; Plimpton, Steven J.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2015

OSTI

Albany: Integrating Algorithmic Components to Build Advanced Applications

Salinger, Andrew G.; Bradley, Andrew M.; Demeshko, Irina D.; Hansen, Glen H.; Perego, Mauro P.; Phipps, Eric T.; Kalashnikova, Irina; Tuminaro, Raymond S.; Granzow, Brian; Ibanez, Dan; Shephard, Mark

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2015

OSTI

Component-Based Application Code Development Part 2: Demonstration on a Land-Ice Model and Proposed Extension to Other Climate Components

Kalashnikova, Irina; Salinger, Andrew G.; Perego, Mauro P.; Tuminaro, Raymond S.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2015

OSTI

Finding Triangles and Tracking Particles: Using Spiking Neural Networks for Pattern Identification Algorithms

Severa, William M.; Parekh, Ojas D.; James, Conrad D.; Aimone, James B.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2015

OSTI

On the edge: State tomography boundaries and model selection

Scholten, Travis L.

Abstract not provided.

More Details

TYPE Presentation YEAR 2015

OSTI

Encouraging Good Color Use for All

Moreland, Kenneth D.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2015

OSTI

Versatile Formal Methods Applied to Quantum Information

Witzel, Wayne W.; Rudinger, Kenneth M.; Sarovar, Mohan S.

Using a novel formal methods approach, we have generated computer-veri ed proofs of major theorems pertinent to the quantum phase estimation algorithm. This was accomplished using our Prove-It software package in Python. While many formal methods tools are available, their practical utility is limited. Translating a problem of interest into these systems and working through the steps of a proof is an art form that requires much expertise. One must surrender to the preferences and restrictions of the tool regarding how mathematical notions are expressed and what deductions are allowed. Automation is a major driver that forces restrictions. Our focus, on the other hand, is to produce a tool that allows users the ability to con rm proofs that are essentially known already. This goal is valuable in itself. We demonstrate the viability of our approach that allows the user great exibility in expressing state- ments and composing derivations. There were no major obstacles in following a textbook proof of the quantum phase estimation algorithm. There were tedious details of algebraic manipulations that we needed to implement (and a few that we did not have time to enter into our system) and some basic components that we needed to rethink, but there were no serious roadblocks. In the process, we made a number of convenient additions to our Prove-It package that will make certain algebraic manipulations easier to perform in the future. In fact, our intent is for our system to build upon itself in this manner.

More Details

TYPE Other Report YEAR 2015

DOI OSTI

Kokkos Tutorial

Edwards, Harold C.; Trott, Christian R.; Amelang, Jeff

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2015

OSTI

k-Means Clustering for Two-Level Memory Systems

Berry, Jonathan W.; Bender, M.; Hammond, Simon D.; Moore, B.; Phillips, Cynthia; Moseley, B.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2015

OSTI

Solving Graph Laplacians for Complex Networks

Boman, Erik G.; Deweese, Kevin; Gilbert, John R.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2015

OSTI

Analysis of Critical Infrastructures with the Pyomo Modeling Software

Hart, William E.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2015

OSTI

Aeras: Sandia's Next Generation Global Atmosphere Model

Spotz, William S.

Abstract not provided.

More Details

TYPE Presentation YEAR 2015

OSTI

LFLR for MPI+X

Teranishi, Keita T.; Heroux, Michael A.

Abstract not provided.

More Details

TYPE Presentation YEAR 2015

OSTI

Grandmaster: Interactive text-based analytics of social media [PowerPoint]

Fabian, Nathan D.; Davis, Warren L.; Raybourn, Elaine M.; Lakkaraju, Kiran L.; Whetzel, Jonathan H.

More Details

TYPE Conference Poster YEAR 2015

DOI OSTI

Transport Physics in Thin-Film Oxides: From Capacitors to Memristors

Tierney, Brian D.; Hjalmarson, Harold P.; McLain, Michael L.; Hughart, David R.; Marinella, Matthew J.; Mamaluy, Denis M.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2015

OSTI

Solution of Eulerian Frame MHD Simulations with Multi-stage RK Implicit / Explicit (IMEX) Time integration

Shadid, John N.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2015

OSTI

Challenges for Materials Modeling at the Atomic and Meso Scales

Plimpton, Steven J.

Abstract not provided.

More Details

TYPE Presentation YEAR 2015

OSTI

Qthreads and Thoughts on ULT Standardization

Brightwell, Ronald B.; Olivier, Stephen L.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2015

OSTI

Logo for Gordon Conference on 2D Electronics Beyond Graphene

Baczewski, Andrew D.; Tomanek, David

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2015

OSTI

Grandmaster: Interactive text-based analytics of social media

Fabian, Nathan D.; Davis, Warren L.; Raybourn, Elaine M.; Lakkaraju, Kiran L.; Whetzel, Jonathan H.

More Details

TYPE Conference Poster YEAR 2015

DOI OSTI

Stereo-DIC vs 3D Scanning LDV for Modal Testing [PowerPoint]

Reu, Phillip L.; Rohe, Daniel P.; Jacobs-O'Malley, Laura D.; Turner, Daniel Z.

More Details

TYPE Conference Poster YEAR 2015

OSTI

OpenMP Tasks: New Features for 4.5 [PowerPoint]

Olivier, Stephen L.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2015

OSTI

Enabling tractable exploration of the performance of adaptive mesh refinement

Proceedings - IEEE International Conference on Cluster Computing, ICCC

Vaughan, Courtenay T.; Barrett, Richard F.

A broad range of physical phenomena in science and engineering can be explored using finite difference and volume based application codes. Incorporating Adaptive Mesh Refinement (AMR) into these codes focuses attention on the most critical parts of a simulation, enabling increased numerical accuracy of the solution while limiting memory consumption. However, adaptivity comes at the cost of increased runtime complexity, which is particularly challenging on emerging and expected future architectures. In order to explore the design space offered by new computing environments, we have developed a proxy application called miniAMR. MiniAMR exposes a range of the important issues that will significantly impact the performance potential of full application codes. In this paper, we describe miniAMR, demonstrate what is designed to represent in a full application code, and illustrate how it can be used to exploit future high performance computing architectures. To ensure an accurate understanding of what miniAMR is intended to represent, we compare it with CTH, a shock hydrodynamics code in heavy use throughout several computational science and engineering communities.

More Details

TYPE Conference Poster YEAR 2015

OSTI Scopus

Comparing global link arrangements for dragonfly networks

Proceedings - IEEE International Conference on Cluster Computing, ICCC

Hastings, Emily; Rincon-Cruz, David; Spehlmann, Marc; Meyers, Sofia; Xu, Anda; Bunde, David P.; Leung, Vitus J.

High-performance computing systems are shifting away from traditional interconnect topologies to exploit new technologies and to reduce interconnect power consumption. The Dragonfly topology is one promising candidate for new systems, with several variations already in production. It is hierarchical, with local links forming groups and global links joining the groups. At each level, the interconnect is a clique, with a link between each pair of switches in a group and a link between each pair of groups. This paper shows that the intergroup links can be made in meaningfully different ways. We evaluate three previously-proposed approaches for link organization (called global link arrangements) in two ways. First, we use bisection bandwidth, an important and commonly-used measure of the potential for communication bottlenecks. We show that the global link arrangements often give bisection bandwidths differing by 10s of percent, with the specific separation varying based on the relative bandwidths of local and global links. For the link bandwidths used in a current Dragonfly implementation, it is 33%. Second, we show that the choice of global link arrangement can greatly impact the regularity of task mappings for nearest neighbor stencil communication patterns, an important pattern in scientific applications.

More Details

TYPE Conference Poster YEAR 2015

DOI OSTI Scopus

K-Means clustering on two-level memory systems

ACM International Conference Proceeding Series

Bender, Michael A.; Berry, Jonathan W.; Hammond, Simon D.; Moore, Branden J.; Moseley, Benjamin; Phillips, Cynthia A.

In recent work we quantified the anticipated performance boost when a sorting algorithm is modified to leverage user- Addressable "near-memory," which we call scratchpad. This architectural feature is expected in the Intel Knight's Land- ing processors that will be used in DOE's next large-scale supercomputer. This paper expands our analytical study of the scratch- pad to consider k-means clustering, a classical data-analysis technique that is ubiquitous in the literature and in prac- Tice. We present new theoretical results using the model introduced in [13], which measures memory transfers and assumes that computations are memory-bound. Our the- oretical results indicate that scratchpad-aware versions of k-means clustering can expect performance boosts for high- dimensional instances with relatively few cluster centers. These constraints may limit the practical impact of scratch- pad for k-means acceleration, so we discuss their origins and practical implications. We corroborate our theory with ex- perimental runs on a system instrumented to mimic one with scratchpad memory. We also contribute a semi-formalization of the computa- Tional properties that are necessary and sufficient to predict a performance boost from scratchpad-aware variants of al- gorithms. We have observed and studied these properties in the context of sorting, and now clustering. We conclude with some thoughts on the application of these properties to new areas. Specifically, we believe that dense linear algebra has similar properties to k-means, while sparse linear algebra and FFT computations are more sim-ilar to sorting. The sparse operations are more common in scientific computing, so we expect scratchpad to have signif- icant impact in that area.

More Details

TYPE Conference Poster YEAR 2015

DOI OSTI Scopus

Trajectory analysis via a geometric feature space approach

Statistical Analysis and Data Mining

Laros, James H.; Wilson, Andrew T.

This study aimed to organize a body of trajectories in order to identify, search for and classify both common and uncommon behaviors among objects such as aircraft and ships. Existing comparison functions such as the Fréchet distance are computationally expensive and yield counterintuitive results in some cases. We propose an approach using feature vectors whose components represent succinctly the salient information in trajectories. These features incorporate basic information such as the total distance traveled and the distance between start/stop points as well as geometric features related to the properties of the convex hull, trajectory curvature and general distance geometry. Additionally, these features can generally be mapped easily to behaviors of interest to humans who are searching large databases. Most of these geometric features are invariant under rigid transformation. Furthermore, we demonstrate the use of different subsets of these features to identify trajectories similar to an exemplar, cluster a database of several hundred thousand trajectories and identify outliers.

More Details

TYPE Journal Article YEAR 2015

DOI OSTI

Trajectory analysis via a geometric feature space approach

Statistical Analysis and Data Mining

Laros, James H.; Wilson, Andrew T.

This study aimed to organize a body of trajectories in order to identify, search for and classify both common and uncommon behaviors among objects such as aircraft and ships. Existing comparison functions such as the Fréchet distance are computationally expensive and yield counterintuitive results in some cases. We propose an approach using feature vectors whose components represent succinctly the salient information in trajectories. These features incorporate basic information such as the total distance traveled and the distance between start/stop points as well as geometric features related to the properties of the convex hull, trajectory curvature and general distance geometry. Additionally, these features can generally be mapped easily to behaviors of interest to humans who are searching large databases. Most of these geometric features are invariant under rigid transformation. We demonstrate the use of different subsets of these features to identify trajectories similar to an exemplar, cluster a database of several hundred thousand trajectories and identify outliers.

More Details

TYPE Conference Poster YEAR 2015

DOI OSTI Scopus

Computing quality scores and uncertainty for approximate pattern matching in geospatial semantic graphs

Statistical Analysis and Data Mining

Stracuzzi, David J.; Brost, Randolph B.; Phillips, Cynthia A.; Robinson, David G.; Wilson, Alyson G.; Woodbridge, Diane W.

Geospatial semantic graphs provide a robust foundation for representing and analyzing remote sensor data. In particular, they support a variety of pattern search operations that capture the spatial and temporal relationships among the objects and events in the data. However, in the presence of large data corpora, even a carefully constructed search query may return a large number of unintended matches. This work considers the problem of calculating a quality score for each match to the query, given that the underlying data are uncertain. We present a preliminary evaluation of three methods for determining both match quality scores and associated uncertainty bounds, illustrated in the context of an example based on overhead imagery data.

More Details

TYPE Journal Article YEAR 2015

DOI OSTI Scopus

Rapid development of an ice sheet climate application using the components-based approach

Salinger, Andrew G.; Kalashnikova, Irina

Abstract not provided.

More Details

TYPE Other Report YEAR 2015

DOI OSTI

Temperature Dependence of Grain Boundary Structure and Motion

Foiles, Stephen M.; O'Brien, Christopher J.; Abdeljawad, Fadi F.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2015

OSTI

Grain Boundary Segregation: Atomistic and Phase Field Treatments

Foiles, Stephen M.; Abdeljawad, Fadi F.; O'Brien, Christopher J.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2015

OSTI

Enhancing Human Performance at the Airport Security Checkpoint:: Human Factors Research at the Transportation Security Administration

Speed, Ann S.; Kudrick, Bonnie

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2015

OSTI

A Multi-Year Plan for Enhancing Turbulence Modeling in Hydra-TH Revised and Updated Version 2.0

Smith, Thomas M.; Berndt, Markus; Baglietto, Emilio; Magolan, Ben

The purpose of this report is to document a multi-year plan for enhancing turbulence modeling in Hydra-TH for the Consortium for Advanced Simulation of Light Water Reactors (CASL) program. Hydra-TH is being developed to the meet the high- fidelity, high-Reynolds number CFD based thermal hydraulic simulation needs of the program. This work is being conducted within the thermal hydraulics methods (THM) focus area. This report is an extension of THM CASL milestone L3:THM.CFD.P10.02 [33] (March, 2015) and picks up where it left off. It will also serve to meet the requirements of CASL THM level three milestone, L3:THM.CFD.P11.04, scheduled for completion September 30, 2015. The objectives of this plan will be met by: maturation of recently added turbulence models, strategic design/development of new models and systematic and rigorous testing of existing and new models and model extensions. While multi-phase turbulent flow simulations are important to the program, only single-phase modeling will be considered in this report. Large Eddy Simulation (LES) is also an important modeling methodology. However, at least in the first year, the focus is on steady-state Reynolds Averaged Navier-Stokes (RANS) turbulence modeling.

More Details

TYPE Other Report YEAR 2015

DOI OSTI

Building and Using Tabular Equations of State with Uncertainty Information

Carpenter, John H.; Robinson, Allen C.; Debusschere, Bert D.; Wills, Ann E.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2015

OSTI

The ACME project's dycore performance strategy for next generation architectures

Taylor, Mark A.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2015

OSTI

Lightweight threading with MPI using Persistent Communications Semantics

Grant, Ryan E.; Skjellum, Anthony; Bangalore, Purushotham V.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2015

OSTI

Preparing for Exascale: Modeling MPI for Many-Core Systems using Fine-Grain Queues

Bridges, Patrick G.; Dosanjh, Matthew D.; Grant, Ryan E.; Farmer, Shane; Skjellum, Anthony; Brightwell, Ronald B.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2015

DOI OSTI

Measuring Parallel Software Scalability: You?re Doing it Wrong

Moreland, Kenneth D.

Abstract not provided.

More Details

TYPE Presentation YEAR 2015

OSTI

VTK-m Overview (Intel Design Review)

Moreland, Kenneth D.

Abstract not provided.

More Details

TYPE Presentation YEAR 2015

OSTI

XFEM for Multi-Material Eulerian Solid/Hydrodyanmics in ALEGRA

Voth, Thomas E.; Sanchez, Jason J.; Mosso, Stewart J.; Kramer, Richard M.

Abstract not provided.

More Details

TYPE Presentation YEAR 2015

OSTI

Power Aware Dynamic Provisioning of HPC Networks

Groves, Taylor G.; Grant, Ryan E.

Future exascale systems are under increased pressure to find power savings. The network, while it consumes a considerable amount of power is often left out of the picture when discussing total system power. Even when network power is being considered, the references are frequently a decade or older and rely on models that lack validation on modern inter- connects. In this work we explore how dynamic mechanisms of an Infiniband network save power and at what granularity we can engage these features. We explore this within the context of the host controller adapter (HCA) on the node and for the fabric, i.e. switches, using three different mechanisms of dynamic link width, frequency and disabling of links for QLogic and Mellanox systems. Our results show that while there is some potential for modest power savings, real world systems need to improved responsiveness to adjustments in order to fully leverage these savings. This page intentionally left blank.

More Details

TYPE SAND Report YEAR 2015

DOI OSTI

Parallel scaling analysis for ALEGRA on the Excalibur system

Niederhaus, John H.; Drake, Richard R.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2015

OSTI

Quantifying neural information content: A case study of the impact of hippocampal adult neurogenesis through computational modeling

Vineyard, Craig M.; Verzi, Stephen J.; James, Conrad D.; Aimone, James B.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2015

OSTI

Advances in Domain Mapping of Massively Parallel Scientific Computations

Leland, Robert; Hendrickson, Bruce A.

One of the most important concerns in parallel computing is the proper distribution of workload across processors. For most scientific applications on massively parallel machines, the best approach to this distribution is to employ data parallelism; that is, to break the datastructures supporting a computation into pieces and then to assign those pieces to different processors. Collectively, these partitioning and assignment tasks comprise the domain mapping problem.

More Details

TYPE Other Report YEAR 2015

DOI OSTI