Publications Search

Neural network (NN) inference is an essential part of modern systems and is found at the heart of numerous applications ranging from image recognition to natural language processing. In situ NN accelerators can efficiently perform NN inference using resistive crossbars, which makes them a promising solution to the data movement challenges faced by conventional architectures. Although such accelerators demonstrate significant potential for dense NNs, they often do not benefit from sparse NNs, which contain relatively few non-zero weights. Processing sparse NNs on in situ accelerators results in wasted energy to charge the entire crossbar where most elements are zeros. To address this limitation, this letter proposes Granular Matrix Reordering (GMR): a preprocessing technique that enables an energy-efficient computation of sparse NNs on in situ accelerators. GMR reorders the rows and columns of sparse weight matrices to maximize the crossbars' utilization and minimize the total number of crossbars needed to be charged. The reordering process does not rely on sparsity patterns and incurs no accuracy loss. Overall, GMR achieves an average of 28 percent and up to 34 percent reduction in energy consumption over seven pruned NNs across four different pruning methods and network architectures.
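The effect of row and column reordering on crossbar usage can be illustrated with a toy example. The sketch below is not the GMR algorithm from the letter; it is a naive stand-in that sorts rows and columns by non-zero count, and the function names and the tile size are assumptions made for illustration only. Because only rows and columns are permuted, the matrix-vector product can be recovered by permuting the input and output vectors accordingly, so no weights are changed.

```python
def count_active_tiles(matrix, tile):
    """Count tile x tile blocks (stand-ins for crossbars) holding any non-zero weight."""
    active = set()
    for r, row in enumerate(matrix):
        for c, value in enumerate(row):
            if value != 0:
                active.add((r // tile, c // tile))
    return len(active)


def reorder_by_density(matrix):
    """Naive heuristic (NOT GMR): move the densest rows and columns to the top-left."""
    n_rows, n_cols = len(matrix), len(matrix[0])
    row_order = sorted(range(n_rows),
                       key=lambda r: -sum(v != 0 for v in matrix[r]))
    col_order = sorted(range(n_cols),
                       key=lambda c: -sum(matrix[r][c] != 0 for r in range(n_rows)))
    reordered = [[matrix[r][c] for c in col_order] for r in row_order]
    return reordered, row_order, col_order


# Hypothetical 4x4 sparse weight matrix mapped onto 2x2 crossbars.
weights = [
    [5, 0, 7, 0],
    [0, 0, 0, 0],
    [3, 0, 9, 0],
    [0, 0, 0, 0],
]
packed, row_order, col_order = reorder_by_density(weights)
```

In this toy case the four non-zero weights initially touch all four 2x2 tiles, and after reordering they fit in a single tile.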

TYPE Journal Article YEAR 2020

DOI OSTI Scopus

Recent experiences with Machine Learning Perspectives from Algorithms Architectures and Applications

Rajamanickam, Sivasankaran R.

Abstract not provided.

TYPE Presentation YEAR 2020

OSTI

Response of Wrought and Additively Manufactured 304L Stainless Steel at Mbar Stresses

Specht, Paul E.; Adams, David P.; Mitchell, John A.; Brown, Justin L.; Branch, Brittany A.; Silling, Stewart A.; Wise, Jack L.; Brown, Donald; Palmer, Todd

Abstract not provided.

TYPE Presentation YEAR 2020

OSTI

srMO-BO-3GP: A sequential regularized multi-objective constrained Bayesian optimization for design applications

Laros, James H.; Eldred, Michael S.; Wang, Yan; Mccann, Scott

Abstract not provided.

TYPE Conference Poster YEAR 2020

DOI OSTI

Presentation: Models and Analysis of Fuel Switching Generation Impacts on Power System Resilience

Wilches-Bernal, Felipe; Knueven, Bernard; Staid, Andrea S.; Watson, Jean-Paul

Abstract not provided.

TYPE Conference Poster YEAR 2020

OSTI

Multilevel and multifidelity uncertainty quantification for cardiovascular hemodynamics

Computer Methods in Applied Mechanics and Engineering

Fleeter, Casey M.; Geraci, Gianluca G.; Schiavazzi, Daniele E.; Kahn, Andrew M.; Marsden, Alison L.

Standard approaches for uncertainty quantification in cardiovascular modeling pose challenges due to the large number of uncertain inputs and the significant computational cost of realistic three-dimensional simulations. We propose an efficient uncertainty quantification framework utilizing a multilevel multifidelity Monte Carlo (MLMF) estimator to improve the accuracy of hemodynamic quantities of interest while maintaining reasonable computational cost. This is achieved by leveraging three cardiovascular model fidelities, each with varying spatial resolution, to rigorously quantify the variability in hemodynamic outputs. We employ two low-fidelity models (zero- and one-dimensional) to construct several different estimators. Our goal is to investigate and compare the efficiency of estimators built from combinations of these two low-fidelity model alternatives and our high-fidelity three-dimensional models. We demonstrate this framework on healthy and diseased models of aortic and coronary anatomy, including uncertainties in material property and boundary condition parameters. Our goal is to demonstrate that for this application it is possible to accelerate the convergence of the estimators by utilizing an MLMF paradigm. Therefore, we compare our approach to single-fidelity Monte Carlo estimators and to a multilevel Monte Carlo approach based only on three-dimensional simulations, but leveraging multiple spatial resolutions. We demonstrate significant, on the order of 10 to 100 times, reduction in total computational cost with the MLMF estimators. We also examine the differing properties of the MLMF estimators in healthy versus diseased models, as well as global versus local quantities of interest. As expected, global quantities such as outlet pressure and flow show larger reductions than local quantities, such as those relating to wall shear stress, as the latter rely more heavily on the highest fidelity model evaluations. Similarly, healthy models show larger reductions than diseased models. In all cases, our workflow coupling Dakota's MLMF estimators with the SimVascular cardiovascular modeling framework makes uncertainty quantification feasible for constrained computational budgets.
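The core idea behind multifidelity estimators can be sketched with a single-level, two-fidelity control variate. This is a simplified illustration, not Dakota's MLMF implementation or the authors' workflow: the function name, the uniform input distribution on [0, 1], and the toy models in the usage note are all assumptions. A few expensive high-fidelity evaluations are corrected using many cheap low-fidelity evaluations that are correlated with them.

```python
import random
import statistics


def control_variate_estimate(high, low, n_high, n_low, seed=0):
    """Two-fidelity control-variate estimate of E[high(X)], X ~ Uniform(0, 1).

    Assumes n_low >= n_high >= 2; the first n_high samples are shared by both models.
    """
    rng = random.Random(seed)
    xs = [rng.random() for _ in range(n_low)]

    low_all = [low(x) for x in xs]              # cheap model on every sample
    high_vals = [high(x) for x in xs[:n_high]]  # expensive model on a small subset
    low_vals = low_all[:n_high]

    mean_high = statistics.fmean(high_vals)
    mean_low = statistics.fmean(low_vals)

    # Control-variate weight: sample covariance(high, low) / sample variance(low).
    cov = sum((h - mean_high) * (l - mean_low)
              for h, l in zip(high_vals, low_vals)) / (n_high - 1)
    alpha = cov / statistics.variance(low_vals)

    # Shift the small-sample high-fidelity mean using the better-resolved low-fidelity mean.
    return mean_high + alpha * (statistics.fmean(low_all) - mean_low)
```

For example, `control_variate_estimate(lambda x: x * x, lambda x: x, n_high=50, n_low=5000)` estimates the mean of `x * x` (exactly 1/3) from only 50 "expensive" evaluations. The stronger the correlation between the two models, the larger the variance reduction.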

TYPE Journal Article YEAR 2020

DOI OSTI Scopus

Timely Reporting of Heavy Hitters using External Memory

Proceedings of the ACM SIGMOD International Conference on Management of Data

Pandey, Prashant; Singh, Shikha; Bender, Michael A.; Berry, Jonathan W.; Farach-Colton, Martin; Johnson, Rob; Kroeger, Thomas M.; Phillips, Cynthia A.

Given an input stream S of size N, a φ-heavy hitter is an item that occurs at least φN times in S. The problem of finding heavy-hitters is extensively studied in the database literature. We study a real-time heavy-hitters variant in which an element must be reported shortly after we see its T = φN-th occurrence (and hence becomes a heavy hitter). We call this the Timely Event Detection (TED) Problem. The TED problem models the needs of many real-world monitoring systems, which demand accurate (i.e., no false negatives) and timely reporting of all events from large, high-speed streams, and with a low reporting threshold (high sensitivity). Like the classic heavy-hitters problem, solving the TED problem without false-positives requires large space (Ω(N) words). Thus in-RAM heavy-hitters algorithms typically sacrifice accuracy (i.e., allow false positives), sensitivity, or timeliness (i.e., use multiple passes). We show how to adapt heavy-hitters algorithms to external memory to solve the TED problem on large high-speed streams while guaranteeing accuracy, sensitivity, and timeliness. Our data structures are limited only by I/O-bandwidth (not latency) and support a tunable trade-off between reporting delay and I/O overhead. With a small bounded reporting delay, our algorithms incur only a logarithmic I/O overhead. We implement and validate our data structures empirically using the Firehose streaming benchmark. Multi-threaded versions of our structures can scale to process 11M observations per second before becoming CPU bound. In comparison, a naive adaptation of the standard heavy-hitters algorithm to external memory would be limited by the storage device's random I/O throughput, i.e., ∼100K observations per second.
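The reporting rule in the TED problem can be made concrete with a small in-RAM baseline. This sketch is not the external-memory data structure described in the abstract: it keeps an exact count for every distinct item, which is precisely the large-space approach the paper sets out to avoid. The function name is hypothetical, and the stream length is assumed to be known up front so that the threshold T can be computed.

```python
import math
from collections import Counter


def timely_heavy_hitters(stream, phi):
    """Yield (position, item) at the instant an item reaches its T-th occurrence.

    Exact counting baseline: zero reporting delay and no false positives,
    at the cost of one counter per distinct item.
    """
    items = list(stream)
    threshold = max(1, math.ceil(phi * len(items)))  # T = ceil(phi * N)
    counts = Counter()
    for position, item in enumerate(items):
        counts[item] += 1
        if counts[item] == threshold:  # report exactly once, at the T-th occurrence
            yield position, item


reports = list(timely_heavy_hitters(["a", "b", "a", "c", "a", "b", "a", "d"], phi=0.25))
# reports == [(2, 'a'), (5, 'b')]
```

With N = 8 and φ = 0.25 the threshold is T = 2, so `a` is reported at its second occurrence (position 2) and `b` at position 5, while `c` and `d` never qualify.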

TYPE Conference Poster YEAR 2020

DOI OSTI Scopus

Synthetic Training Images for Real-World Object Detection

Gastelum, Zoe N.; Shead, Timothy M.; Laros, James H.

Abstract not provided.

TYPE Conference Poster YEAR 2020

OSTI

TOPIC MODELING WITH NATURAL LANGUAGE PROCESSING FOR IDENTIFICATION OF NUCLEAR PROLIFERATION-RELEVANT SCIENTIFIC AND TECHNICAL PUBLICATIONS

Bisila, Jonathan B.; Dunlavy, Daniel D.; Gastelum, Zoe N.; Ulmer, Craig D.

Abstract not provided.

TYPE Conference Poster YEAR 2020

OSTI

FreeFunctionBlas-LEWG

Trott, Christian R.

Abstract not provided.

TYPE Presentation YEAR 2020

OSTI

Dragonfly-Inspired Interception

Chance, Frances S.

Abstract not provided.

TYPE Presentation YEAR 2020

OSTI

Layer-Parallel Training: Nested Iteration

Cyr, Eric C.; Guenther, Stefanie; Ruthotto, Lars; Schroder, Jacob B.; Gauger, Nico R.

Abstract not provided.

TYPE Conference Poster YEAR 2020

OSTI

Uncertainty Analysis of a COVID-19 Medical Resource Model

Swiler, Laura P.; Portone, Teresa P.

Abstract not provided.

TYPE Presentation YEAR 2020

OSTI

Neuromorphic Hardware and Architectures

Cardwell, Suma G.

Abstract not provided.

TYPE Presentation YEAR 2020

OSTI

PowerAPI: A Standardized Interface to Power/Energy Monitoring and Control

Grant, Ryan E.

Abstract not provided.

TYPE Conference Poster YEAR 2020

OSTI

A linearity preserving nodal variation limiting algorithm for continuous Galerkin discretization of ideal MHD equations

Journal of Computational Physics

Mabuza, Sibusiso M.; Shadid, John N.; Cyr, Eric C.; Pawlowski, Roger P.; Kuzmin, Dmitri

In this work, a stabilized continuous Galerkin (CG) method for magnetohydrodynamics (MHD) is presented. Ideal, compressible inviscid MHD equations are discretized in space on unstructured meshes using piecewise linear or bilinear finite element bases to get a semi-discrete scheme. Stabilization is then introduced to the semi-discrete method in a strategy that follows the algebraic flux correction paradigm. This involves adding some artificial diffusion to the high order, semi-discrete method and mass lumping in the time derivative term. The result is a low order method that provides local extremum diminishing properties for hyperbolic systems. The difference between the low order method and the high order method is scaled element-wise using a limiter and added to the low order scheme. The limiter is solution dependent and computed via an iterative linearity preserving nodal variation limiting strategy. The stabilization also involves an optional consistent background high order dissipation that reduces phase errors. The resulting stabilized scheme is a semi-discrete method that can be applied to inviscid shock MHD problems and may even be extended to resistive and viscous MHD problems. To satisfy the divergence-free constraint of the MHD equations, we add parabolic divergence cleaning to the system. Various time integration methods can be used to discretize the scheme in time. We demonstrate the robustness of the scheme by solving several shock MHD problems.

TYPE Journal Article YEAR 2020

DOI OSTI Scopus

Case Study: Debugging Other People's Libraries via PRELOAD

Siefert, Christopher S.; Laros, James H.

Abstract not provided.

TYPE Conference Poster YEAR 2020

OSTI

Inverting the design process with shape and topology optimization

Robbins, Joshua R.

Abstract not provided.

TYPE Presentation YEAR 2020

OSTI

Truly heterogeneous HPC: Co-design to achieve what science needs from HPC

Cardwell, Suma G.

Abstract not provided.

TYPE Conference Poster YEAR 2020

OSTI

Machine Learning to Compare Arctic Simulations With Observed Data

Nichol, Jeffrey N.; Peterson, Matthew G.; Peterson, Kara J.; Stracuzzi, David J.; Fricke, Matthew

Abstract not provided.

TYPE Conference Poster YEAR 2020

OSTI

Physics-informed graph neural nets (pigNNS): A unification of NN architectures with mimetic PDE discretization

Trask, Nathaniel A.

Abstract not provided.

TYPE Presentation YEAR 2020

OSTI

Probing a Set of Trajectories to Maximize Captured Information

Leibniz International Proceedings in Informatics, LIPIcs

Fekete, Sándor P.; Hill, Alexander; Krupke, Dominik; Mayer, Tyler; Mitchell, Joseph S.B.; Parekh, Ojas D.; Phillips, Cynthia A.

We study a trajectory analysis problem we call the Trajectory Capture Problem (TCP), in which, for a given input set T of trajectories in the plane, and an integer k ≥ 2, we seek to compute a set of k points ("portals") to maximize the total weight of all subtrajectories of T between pairs of portals. This problem naturally arises in trajectory analysis and summarization. We show that the TCP is NP-hard (even in very special cases) and give some first approximation results. Our main focus is on attacking the TCP with practical algorithm-engineering approaches, including integer linear programming (to solve instances to provable optimality) and local search methods. We study the integrality gap arising from such approaches. We analyze our methods on different classes of data, including benchmark instances that we generate. Our goal is to understand the best performing heuristics, based on both solution time and solution quality. We demonstrate that we are able to compute provably optimal solutions for real-world instances.

2012 ACM Subject Classification: Theory of computation → Design and analysis of algorithms.

TYPE Conference Poster YEAR 2020

OSTI Scopus
