Publications Search

LDMS-GPU: Lightweight Distributed Metric Service (LDMS) for NVIDIA GPGPUs

Elwazir, Ammar; Badawy, Abdel-Hameed A.; Aaziz, Omar R.; Cook, Jeanine C.

GPUs are now a fundamental accelerator for many high-performance computing applications. They are viewed by many as a technology facilitator for the surge in fields like machine learning and Convolutional Neural Networks. To deliver the best performance on a GPU, we need to create monitoring tools to ensure that we optimize the code to get the most performance and efficiency out of a GPU. Since NVIDIA GPUs are currently the most commonly implemented in HPC applications and systems, NVIDIA tools are the solution for performance monitoring. The Light-Weight Distributed Metric System (LDMS) at Sandia is an infrastructure widely adopted for large-scale systems and application monitoring. Sandia has developed CPU application monitoring capability within LDMS. Therefore, we chose to develop a GPU monitoring capability within the same framework. In this report, we discuss the current limitations in the NVIDIA monitoring tools, how we overcame such limitations, and present an overview of the tool we built to monitor GPU performance in LDMS and its capabilities. Also, we discuss our current validation results. Most of the performance counter results are the same in both vendor tools and our tool when using LDMS to collect these results. Furthermore, our tool provides these statistics during the entire runtime of the tool as a time series and not just aggregate statistics at the end of the application run. This allows the user to see the progress of the behavior of the applications during their lifetime.

More Details

TYPE SAND Report YEAR 2020

DOI OSTI

ECP Container Status 2020

Younge, Andrew J.

Abstract not provided.

More Details

TYPE Presentation YEAR 2020

OSTI

Quantum transport in Si:P δ-layer wires

Mendez Granado, Juan P.; Mamaluy, Denis M.; Gao, Xujiao G.; Anderson, Evan M.; Campbell, DeAnna M.; Ivie, Jeffrey A.; Lu, Tzu-Ming L.; Schmucker, Scott W.; Misra, Shashank M.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2020

OSTI

High-order property-preserving semi-Lagrangian tracer transport and physics-dynamics-grid remap in EAMv2

Bradley, Andrew M.

Abstract not provided.

More Details

TYPE Presentation YEAR 2020

OSTI

Elastoplastic Constitutive Model Calibration with Automatic Differentiation-based Sensitivities

Seidl, Daniel T.; Granzow, Brian N.

Abstract not provided.

More Details

TYPE Conference Presenation YEAR 2020

DOI OSTI

Peridynamic Modeling of a Stress Pulse in a Heterogeneous Medium

Silling, Stewart A.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2020

OSTI

Quantum Scientific Open User Testbed (QSCOUT) and Jaqal the Quantum Assembly Language

Landahl, Andrew J.

Abstract not provided.

More Details

TYPE Presentation YEAR 2020

OSTI

Solidification Kinetics

Moore, Stan G.; Thompson, Aidan P.

Abstract not provided.

More Details

TYPE Presentation YEAR 2020

OSTI

FY20 CSSE L2 Milestone 7186

Templet Jr., Gary J.; Glickman, Matthew R.; Kordenbrock, Todd H.; Levy, Scott L.; Lofstead, Gerald F.; Mauldin, Jeff; Otahal, Thomas J.; Ulmer, Craig D.; Widener, Patrick W.; Oldfield, Ron A.

Abstract not provided.

More Details

TYPE Presentation YEAR 2020

OSTI

Fault Tracking and Modeling in Advanced Node Processors of Single Event Effects

Cannon, Matthew J.; Rodrigues, Arun; Black, Dolores A.; Black, Jeffrey B.; Bustamante, Luis G.; Feinberg, Benjamin F.; Quinn, Heather; Clark, Lawrence T.; Brunhaver, John S.; Barnaby, Hugh; McLain, Michael L.; Agarwal, Sapan A.; Marinella, Matthew J.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2020

OSTI

Evolving Spiking Circuit Motifs Using Weight Agnostic Neural Networks

Anwar, Abrar

Abstract not provided.

More Details

TYPE Conference Proceeding YEAR 2020

OSTI

Incorporating GPUs into Earth System Science

Taylor, Mark A.

Abstract not provided.

More Details

TYPE Conference Presenation YEAR 2020

DOI OSTI

LANL/SANDIA/VOROCRUST EOFY 2020 meeting

LaForce, Tara; Jordan, Spencer H.; Ebeida, Mohamed S.; McLendon, William C.

Abstract not provided.

More Details

TYPE Presentation YEAR 2020

OSTI

MPI Partitioned Communication

Grant, Ryan E.; Dosanjh, Matthew D.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2020

OSTI

Evaluating the Efficiency of OpenMP Tasking for Unbalanced Computationon Diverse CPU Architectures

Olivier, Stephen L.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2020

OSTI

Lattice Structures for Dynamic Impact Environments

Jared, Bradley H.; Damm, David L.; Stacy, Shawn C.; Saiz, David J.; Moore, David G.; Epstein, Collin E.; Boyce, Brad B.; Jensen, Scott C.; McCarthy, Riley M.; Alberdi, Ryan A.; Heiden, Michael J.; Robbins, Joshua R.; Ruggles, Timothy R.; Branch, Brittany A.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2020

OSTI

Assessing Global Sensitivity Analysis for Credibility in Machine Learning Explainability

Laros, James H.; Smith, Michael R.; Field, Richard V.; Maxfield, Trevor; Rushdi, Ahmad R.

Abstract not provided.

More Details

TYPE Presentation YEAR 2020

OSTI

Architectural Algorithmic and Systems Engineering Issues for Reversible Computing

Frank, Michael P.

Abstract not provided.

More Details

TYPE Conference Presenation YEAR 2020

DOI OSTI

Analog architectures for neural network acceleration based on non-volatile memory

Applied Physics Reviews

Xiao, Tianyao X.; Bennett, Christopher H.; Feinberg, Benjamin F.; Agarwal, Sapan A.; Marinella, Matthew J.

Analog hardware accelerators, which perform computation within a dense memory array, have the potential to overcome the major bottlenecks faced by digital hardware for data-heavy workloads such as deep learning. Exploiting the intrinsic computational advantages of memory arrays, however, has proven to be challenging principally due to the overhead imposed by the peripheral circuitry and due to the non-ideal properties of memory devices that play the role of the synapse. We review the existing implementations of these accelerators for deep supervised learning, organizing our discussion around the different levels of the accelerator design hierarchy, with an emphasis on circuits and architecture. We explore and consolidate the various approaches that have been proposed to address the critical challenges faced by analog accelerators, for both neural network inference and training, and highlight the key design trade-offs underlying these techniques.

More Details

TYPE Journal Article YEAR 2020

DOI OSTI Scopus

On mixed-integer programming formulations for the unit commitment problem

INFORMS Journal on Computing

Knueven, Ben; Ostrowski, James; Watson, Jean-Paul W.

We provide a comprehensive overview of mixed-integer programming formulations for the unit commitment (UC) problem. UC formulations have been an especially active area of research over the past 12 years due to their practical importance in power grid operations, and this paper serves as a capstone for this line of work. We additionally provide publicly available reference implementations of all formulations examined. We computationally test existing and novel UC formulations on a suite of instances drawn from both academic and real-world data sources. Driven by our computational experience from this and previous work, we contribute some additional formulations for both generator production upper bounds and piecewise linear production costs. By composing new UC formulations using existing components found in the literature and new components introduced in this paper, we demonstrate that performance can be significantly improved—and in the process, we identify a new state-of-the-art UC formulation.

More Details

TYPE Journal Article YEAR 2020

DOI OSTI Scopus