Publications

Results 1–25 of 88
Date Inputs. Currently set to enter a start and end date.
Current Filters Clear all
  • Remove author filter×
Publication Type Year

Understanding Memory Failures on a Petascale Arm System

The 31st International Symposium on High-Performance Parallel and Distributed Computing

Kurt Brian Ferreira, Scott Larson Nicoll Levy, Joshua David Hemmert, Kevin Pedretti

Conference Paper – 2022 Conference Paper 2022

SNL ATDM Software Ecosystem Operating Systems and On-Node Runtime

2022 Exascale Computing Project Annual Meeting (Virtual)

Stephen Lecler Olivier, Ronald B. Brightwell, Matthew Dosanjh, Kurt Brian Ferreira, Scott Larson Nicoll Levy, Kevin Pedretti, Andrew J Younge

Display or Poster (non-conference) – 2022 Display or Poster (non-conference) 2022

Characterizing Failures in HPC Using Benford?s Law

The SIAM Conference on Parallel Processing for Scientific Computing (SIAM PP22)

Kurt Brian Ferreira, Scott Larson Nicoll Levy

Conference Presentation – 2022 Conference Presentation 2022

"Smarter" NICs for Faster Molecular Dynamics: A Case Study

36th IEEE International Parallel & Distributed Processing Symposium

Sara Karamati, Clayton Hughes, Karl Scott Hemmert, Ryan E. Grant, William Whitney Schonbein, Scott Larson Nicoll Levy, Thomas M. Conte, Jeffrey Young, Richard W. Buduc

Conference Proceeding – 2022 Conference Proceeding 2022

Characterizing Per-node Memory Failures Using Benford?s Law

FTXS 2021 Workshop on Fault Tolerance for HPC at eXtreme Scale held in conjuction with SC21

Kurt Brian Ferreira, Scott Larson Nicoll Levy

Abstract – 2021 Abstract 2021

A Benchmark to Understand Communication Performance in Hybrid MPI and GPU Applications

ExaMPI21Workshop on Exascale MPI

Keira Haskins, Bridges, Kurt Brian Ferreira, Scott Larson Nicoll Levy

Conference Paper – 2021 Conference Paper 2021

A Benchmark to Understand Communication Performance in Hybrid MPI and GPU Applications

ExaMPI21Workshop on Exascale MPI

Keira Haskins, Patrick Bridges, Kurt Brian Ferreira, Scott Larson Nicoll Levy

Conference Paper – 2021 Conference Paper 2021

Characterizing Memory Failures Using Benford?s Law

14th Workshop on Resiliency in High Performance Computing (Resilience) in Clusters, Clouds, and Grids

Kurt Brian Ferreira, Scott Larson Nicoll Levy

Conference Paper – 2021 Conference Paper 2021

Characterizing Per-node Memory Failures Using Benford?s Law

Workshop on Fault Tolerance for HPC at eXtreme Scale (FTXS 2021)

Kurt Brian Ferreira, Scott Larson Nicoll Levy

Conference Paper – 2021 Conference Paper 2021

Evaluating MPI Resource Usage Summary Statistics

Journal of Parallel Computing

Kurt Brian Ferreira, Scott Larson Nicoll Levy

https://www.osti.gov/search/identifier:1822241

Journal Article – 2021 Journal Article 2021

Understanding the Effects of DRAM Correctable Error Logging at Scale

IEEE Cluster Conference

Kurt Brian Ferreira, Scott Larson Nicoll Levy, Victor G. Kuhns, Nathan DeBardelaben, Sean Blanchard

Conference Paper – 2021 Conference Paper 2021

MiniMod: A Modular Miniapplication Benchmarking Framework for HPC

IEEE Cluster 2021

William Pepper Marts, Matthew Dosanjh, William Whitney Schonbein, Scott Larson Nicoll Levy, Ryan Eric Grant, Patrick Bridges

Conference Paper – 2021 Conference Paper 2021

pMEMCPY: a simple, lightweight, and portable I/O library for storing data in persistent memory

REX-IO at IEEE Cluster 2021

Luke Logan, Gerald Fredrick Lofstead, Scott Larson Nicoll Levy, Patrick Widener, Xian-He Sun, Anthony Kougkas

Conference Paper – 2021 Conference Paper 2021

An Initial Examination of the Effect of Container Resource Constraints on Application Perturbation

Workshop on Resource Arbitration for Dynamic Runtimes (RADR)

Scott Larson Nicoll Levy, Kurt Brian Ferreira

https://www.osti.gov/search/identifier:1869756

Conference Presentation – 2021 Conference Presentation 2021

SNL ATDM Software Ecosystem Operating Systems and On-Node Runtime

2021 Exascale Computing Project Annual Meeting (Virtual)

Stephen Lecler Olivier, Ronald B. Brightwell, Kurt Brian Ferreira, Ryan Eric Grant, Scott Larson Nicoll Levy, Kevin Pedretti, Andrew J Younge

https://www.osti.gov/search/identifier:1861479

Display or Poster (non-conference) – 2021 Display or Poster (non-conference) 2021

Co-design of System Software for Compute Accelerators and SmartNICs

ASCR Workshop on Reimagining Codesign

Ryan Eric Grant, Scott Larson Nicoll Levy, William Whitney Schonbein

https://www.osti.gov/search/identifier:1847622

Conference Paper – 2021 Conference Paper 2021

Examining the Impact of Approximate Coordination on Checkpoint/Restart

https://ckpt-symposium.lbl.gov/home

Kurt Brian Ferreira, Scott Larson Nicoll Levy

Abstract – 2020 Abstract 2020

Low-cost MPI Multithreaded Message Matching Benchmarking

International Conference on High Performance Computing and Communication (HPCC)

William Whitney Schonbein, Ryan Eric Grant, Scott Larson Nicoll Levy, Matthew Dosanjh, William Pepper Marts

Conference Presentation – 2020 Conference Presentation 2020

Low-cost MPI Multithreaded Message Matching Benchmarking

International Conference on High Performance Computing and Communications (HPCC)

William Whitney Schonbein, Scott Larson Nicoll Levy, William Pepper Marts, Matthew Dosanjh, Ryan Eric Grant

https://www.osti.gov/search/identifier:1830913

Conference Paper – 2020 Conference Paper 2020

RaDD Runtimes: Radical and Different Distributed Runtimes with SmartNICs

International Conference for High Performance Computing, Networking, Storage and Analysis (SC)

Ryan Eric Grant, William Whitney Schonbein, Scott Larson Nicoll Levy

https://www.osti.gov/search/identifier:1825980

Conference Presentation – 2020 Conference Presentation 2020

RaDD Runtimes: Radical and Different Distributed Runtimes with SmartNICs

Fourth Annual Workshop on Emerging Parallel and Distributed Runtime Systems and Middleware

Ryan Eric Grant, Whit Schonbein, Scott Larson Nicoll Levy

https://www.osti.gov/search/identifier:1825981

Conference Paper – 2020 Conference Paper 2020

Evaluating MPI Message Size Summary Statistics

EuroMPI/USA '20

Scott Larson Nicoll Levy, Kurt Brian Ferreira

https://www.osti.gov/search/identifier:1825984

Conference Proceeding – 2020 Conference Proceeding 2020

FY20 CSSE L2 Milestone 7186

Completion of L2 Milestone 7186

Gary J. Templet Jr., Matthew R. Glickman, Todd Henry Kordenbrock, Scott Larson Nicoll Levy, Gerald Fredrick Lofstead, Jeff Mauldin, Thomas Jay Otahal, Craig D. Ulmer, Patrick Widener, Ron A. Oldfield

https://www.osti.gov/search/identifier:1820290

Presentation (non-conference) – 2020 Presentation (non-conference) 2020

Data Services for Visualization and Analysis ? ASC Level II Milestone (7186)

Gary J. Templet Jr., Matthew R. Glickman, Todd Henry Kordenbrock, Scott Larson Nicoll Levy, Gerald Fredrick Lofstead, Jeff Mauldin, Thomas Jay Otahal, Craig D. Ulmer, Patrick Widener, Ron A. Oldfield

https://www.osti.gov/search/identifier:1663267

SAND Report – 2020 SAND Report 2020

ALAMO: Autonomous Lightweight Allocation, Management and Optimization

Smoky Mountains Computational Sciences and Engineering Conference

Ronald B. Brightwell, Kurt Brian Ferreira, Ryan Eric Grant, Scott Larson Nicoll Levy, Gerald Fredrick Lofstead, Stephen Lecler Olivier, Kevin Pedretti, Andrew J Younge, Ann C. Gentile, Bradley Keith Brandt

https://www.osti.gov/search/identifier:1818044

Conference Paper – 2020 Conference Paper 2020
Document Title Type Year
Results 1–25 of 88