Ron Brightwell

Scalable System Software

rbbrigh@sandia.gov

(505) 844-2099

Sandia National Laboratories, New Mexico
P.O. Box 5800
Albuquerque, NM 87185-1319

Current Activities

Bio

I received my BS in mathematics in 1991 and my MS in computer science in 1994 from Mississippi State University. I joined Sandia National Laboratories in 1995 after serving as a graduate research assistant in the system software thrust at the MSU/NSF Engineering Research Center for Computational Field Simulation (now known as the High Performance Computing Collaboratory). While at Sandia, I’ve worked on several research and development projects associated with system software and high-performance networking for large-scale, massively parallel, distributed-memory, scientific computing systems. I’ve designed and developed high-performance implementations of the Message Passing Interface (MPI) standard on several platforms, including the Cray T3D and T3E, the Intel Paragon and TeraFLOPS (ASCI/Red), Sandia’s Computational Plant Linux clusters, and the Cray Red Storm (XT3). My research interests include high-performance, scalable communication interfaces and protocols for system area networks, operating systems for massively parallel processing machines, and parallel program performance analysis libraries and tools. I’m a Senior Member of the IEEE and the IEEE Computer Society and a Senior Member of the Association for Computing Machinery.

Recent Publications

William Whitney Schonbein, Brian Barrett, Ronald B. Brightwell, Ryan Grant, Karl Scott Hemmert, Kevin Pedretti, Keith Underwood, Rolf Riesen, Torsten Hoefler, Mathieu Barbe, Luiz H. Suraty Filho, Alexandre Ratchov, Arthur Maccabe, (2022). The Portals 4.3 Network Programming Interface https://www.osti.gov/search/identifier:1875218 Document ID: 1551092

Ronald B. Brightwell, (2022). Recent Sandia R&D Activities in HPC Networking CEA/NNSA Collaboration Meeting Document ID: 1540672

Stephen Lecler Olivier, Ronald B. Brightwell, Matthew Dosanjh, Kurt Brian Ferreira, Scott Larson Nicoll Levy, Kevin Pedretti, Andrew J Younge, (2022). SNL ATDM Software Ecosystem Operating Systems and On-Node Runtime 2022 Exascale Computing Project Annual Meeting (Virtual) Document ID: 1505231

Stephen Lecler Olivier, Ronald B. Brightwell, Kurt Brian Ferreira, Ryan Eric Grant, Scott Larson Nicoll Levy, Kevin Pedretti, Andrew J Younge, (2021). SNL ATDM Software Ecosystem Operating Systems and On-Node Runtime 2021 Exascale Computing Project Annual Meeting (Virtual) https://www.osti.gov/search/identifier:1861479 Document ID: 1293055

Kevin Pedretti, Ronald B. Brightwell, Andrew J Younge, Jack Lange, (2021). HPC Operating System Research Areas and Challenges ASCR OS/R Roundtable https://www.osti.gov/search/identifier:1843100 Document ID: 1266799

Ron A. Oldfield, Michael Wolf, Ronald B. Brightwell, (2020). ECP Capability Assessment Report for Software Technologies — Kokkos, Kokkos Kernels, VTK-m, Operating Systems and On-Node Runtimes https://www.osti.gov/search/identifier:1717885 Document ID: 1230336

Kevin Pedretti, Andrew J Younge, Simon David Hammond, James H. Laros, Matthew Jon Curry, Michael James Aguilar, Robert J. Hoekstra, Ronald B. Brightwell, (2020). Chronicles of Astra: Challenges and Lessons from the First Petascale Arm Supercomputer SC20 https://www.osti.gov/search/identifier:1822114 Document ID: 1207577

Ronald B. Brightwell, Kurt Brian Ferreira, Ryan Eric Grant, Scott Larson Nicoll Levy, Gerald Fredrick Lofstead, Stephen Lecler Olivier, Kevin Pedretti, Andrew J Younge, Ann C. Gentile, Bradley Keith Brandt, (2020). ALAMO: Autonomous Lightweight Allocation, Management and Optimization Smoky Mountains Computational Sciences and Engineering Conference https://www.osti.gov/search/identifier:1818044 Document ID: 1195366

Gabrielle Trujillo, Daniel Z. Turner, Ronald B. Brightwell, Ron A. Oldfield, Robert L. Clay, (2019). September 2019 ECP ST Project Review ECP-ST Review https://www.osti.gov/search/identifier:1646043 Document ID: 1032128

Ronald B. Brightwell, (2019). Meeting the Future Needs of HPC with MPI Fifth International Workshop on Communication Architectures for HPC, Big Data, Deep Learning and Clouds at Extreme Scale https://www.osti.gov/search/identifier:1642594 Document ID: 1021401

Ronald B. Brightwell, (2019). Thoughts on Autonomous Resource Management for HPC Fifth Workshop on Programming Abstractions for Data Locality https://www.osti.gov/search/identifier:1642595 Document ID: 1021403

Ronald B. Brightwell, (2019). Opportunities and Challenges for Accelerated Network Interfaces in HPC EuroMPI’19 https://www.osti.gov/search/identifier:1642596 Document ID: 1021404

Ronald B. Brightwell, (2019). Memory Technology Impacts on Current, Near-Term, and Future Systems ISC-HPC https://www.osti.gov/search/identifier:1641049 Document ID: 985212

Ryan Eric Grant, Matthew Dosanjh, Michael J. Levenhagen, Ronald B. Brightwell, Anthony Skjellum, (2019). Finepoints: Partitioned Multithreaded MPI Communication ISC High Performance 2019 https://www.osti.gov/search/identifier:1639543 Document ID: 947758

Ronald B. Brightwell, (2019). Meeting the Future Needs of HPC with MPI MPI-Beyond Workshop https://www.osti.gov/search/identifier:1644496 Document ID: 936618

Stephen Lecler Olivier, Ronald B. Brightwell, Kevin Pedretti, Andrew J Younge, Noah Evans, Scott Larson Nicoll Levy, Kurt Brian Ferreira, Ryan Eric Grant, (2019). SNL ATDM Software Ecosystem 2019 Exascale Computing Project Annual Meeting https://www.osti.gov/search/identifier:1583026 Document ID: 902074

Ronald B. Brightwell, (2018). System Software Perspective on Resilience Workshop on Modeling & Simulation of Systems and Applications https://www.osti.gov/search/identifier:1582144 Document ID: 853193

Ronald B. Brightwell, (2018). Vanguard Astra: Maturing the ARM Software Ecosystem for U.S. DOE/ASC Supercomputing International Workshop on High Performance Computing: From Clouds and Big Data to Exascale and Beyond https://www.osti.gov/search/identifier:1576170 Document ID: 842285

Ronald B. Brightwell, (2018). Hardware/Software Co-Design for High Performance Interconnects for Extreme-Scale Systems Argonne Training Program on Extreme Scale Computing https://www.osti.gov/search/identifier:1576171 Document ID: 842286

Susan Lacy, Ronald B. Brightwell, (2018). Behind the Scenes of High Performance Computing https://www.osti.gov/search/identifier:1734468 Document ID: 841255

Andrew J Younge, Ryan Eric Grant, Ronald B. Brightwell, (2018). Portals 4: Status of Specification and Implementation CEA Meeting https://www.osti.gov/search/identifier:1526809 Document ID: 807792

Nathan Hjelm, Matthew Dosanjh, Taylor Groves, Ryan Eric Grant, Ronald B. Brightwell, Patrick Bridges, Dorian Arnold, (2018). Improving MPI Multi-threaded RMA Performance International Conference on Parallel Processing (ICPP) https://www.osti.gov/search/identifier:1525924 Document ID: 808201

Ronald B. Brightwell, (2018). Architectural Convergence of Big Data and Extreme-Scale Computing: Marriage of Convenience or Conviction Charm++ Workshop https://www.osti.gov/search/identifier:1510848 Document ID: 795296

Ronald B. Brightwell, (2018). Resource Management Challenges in the Era of Extreme Heterogeneity Charm++ Workshop https://www.osti.gov/search/identifier:1575169 Document ID: 784678

Andrew J Younge, Kevin Pedretti, Ryan Eric Grant, Ronald B. Brightwell, (2018). A Tale of Two Systems: Using Containers to Deploy HPC Applications on Supercomputers and Clouds IEEE CloudCom 2017 https://www.osti.gov/search/identifier:1497564 Document ID: 737730

Ronald B. Brightwell, Stephen Lecler Olivier, (2018). Enhancing Qthreads for ECP Science and Energy Impact 2018 Exascale Computing Project Annual Meeting https://www.osti.gov/search/identifier:1806498 Document ID: 749829

Stephen Lecler Olivier, Kevin Pedretti, Ronald B. Brightwell, (2018). ATDM Operating Systems and On-Node Runtime 2018 ECP Annual Meeting https://www.osti.gov/search/identifier:1524520 Document ID: 749801

Ronald B. Brightwell, (2017). December 2017 ECP ST Project Review: ECP Project WBS 2.3.5.04 (SNL ATDM Software Ecosystem) https://www.osti.gov/search/identifier:1415115 Document ID: 738335

Ronald B. Brightwell, Stephen Lecler Olivier, (2017). December 2017 ECP ST Project Review: ECP Project WBS 2.3.1.15 (Qthreads) ECP ST 2017 Project Review (Online) https://www.osti.gov/search/identifier:1488744 Document ID: 738068

Andrew J Younge, Kevin Pedretti, Ryan Eric Grant, Ronald B. Brightwell, (2017). A Tale of Two Systems: Using Containers to Deploy HPC Applications on Supercomputers and Clouds IEEE International Conference on Cloud Computing Technology and Science https://www.osti.gov/search/identifier:1483132 Document ID: 725458

Ronald B. Brightwell, (2017). What Will Determine the Future Success of MPI? EuroMPI/USA https://www.osti.gov/search/identifier:1509607 Document ID: 703097

Andrew J Younge, Kevin Pedretti, Ryan Eric Grant, Brian Gaines, Ronald B. Brightwell, (2017). Enabling Diverse Software Stacks on Supercomputers using High Performance Virtual Clusters IEEE Cluster 2017 https://www.osti.gov/search/identifier:1476150 Document ID: 671722

James A. Ang, Ronald B. Brightwell, Simon David Hammond, Karl Scott Hemmert, Robert J. Hoekstra, James H. Laros, Kevin Pedretti, Arun F. Rodrigues, (2017). Sandia’s ARM-centric Co-Design Strategy: Introduction to the NNSA/ASC Vanguard Project ARM Research Summit, Going ARM Workshop https://www.osti.gov/search/identifier:1470952 Document ID: 671961

Ronald B. Brightwell, (2017). Challenges and Opportunities for HPC Interconnects and MPI MVAPICH User Group Meeting https://www.osti.gov/search/identifier:1466535 Document ID: 670573

Andrew J Younge, Kevin Pedretti, Ryan Eric Grant, Brian Gaines, Ronald B. Brightwell, (2017). Enabling Diverse Software Stacks on Supercomputers using High Performance Virtual Clusters IEEE Cluster 2017 https://www.osti.gov/search/identifier:1466489 Document ID: 659900

Ronald B. Brightwell, (2017). HPC Co-Design DOD Briefing for Steve Rinaldi https://www.osti.gov/search/identifier:1465100 Document ID: 670136

Ryan Eric Grant, Matthew Dosanjh, Ronald B. Brightwell, (2017). Preparing MPI for Exascale Cis Rf Erb https://www.osti.gov/search/identifier:1464895 Document ID: 659870

Torsten Hoefler, Salvatore Di Girolamo, Konstantin Taranov, Ryan Eric Grant, Ronald B. Brightwell, (2017). sPIN: High-performance streaming Processing in the Network The International Conference for High Performance Computing, Networking, Storage and Analysis (Supercomputing) https://www.osti.gov/search/identifier:1462935 Document ID: 659116

Ronald B. Brightwell, Kevin Pedretti, (2017). Embracing Diversity: OS Support for Integrating High-Performance Computing and Data Analytics Seminar at NSA https://www.osti.gov/search/identifier:1506830 Document ID: 637933

Brian Barrett, Ronald B. Brightwell, Ryan Eric Grant, Karl Scott Hemmert, Kevin Pedretti, Kyle Wheeler, Keith D Underwood, Rolf Riesen, Arthur B. Maccabe, Trammell Hudson, (2017). The Portals 4.1 Network Programming Interface https://www.osti.gov/search/identifier:1365498 Document ID: 612564

Stephen Lecler Olivier, Ronald B. Brightwell, (2017). Qthreads and On-Node Runtime Coordination ECP https://www.osti.gov/search/identifier:1429452 Document ID: 578152

Kevin Pedretti, Ronald B. Brightwell, (2017). ATDM Operating System Project: A Multi-Stack Approach for Application Composition and Performance ECP Annual Meeting https://www.osti.gov/search/identifier:1427102 Document ID: 577865

Ronald B. Brightwell, Stephen Lecler Olivier, (2017). Enhancing Qthreads for ECP Science and Energy Impact And Sandia ATDM On-Node Runtime Coordination 2017 Exascale Computing Project Annual Meeting https://www.osti.gov/search/identifier:1505468 Document ID: 577549

Shane Farmer, Anthony Skjellum, Patrick G Bridges, Matthew Dosanjh, Ryan Eric Grant, Ronald B. Brightwell, (2016). Modeling Concurrent Point-to-Point Communication Cost in MPI Performance Models ExaMPI https://www.osti.gov/search/identifier:1413407 Document ID: 565465

Micheal W. Glass, Harold C. Edwards, Janine Camille Bennett, Stephen Lecler Olivier, Ronald B. Brightwell, Simon David Hammond, Michael A. Heroux, Sivasankaran Rajamanickam, Roger P. Pawlowski, Eric T. Phipps, Craig D. Ulmer, Kenneth D. Moreland, Kevin Pedretti, (2016). Sandia’s ATDM/ASD Components Exascale Computing Project PI & Integration Meeting Document ID: 553961

Ronald B. Brightwell, (2016). Embracing Diversity: OS Support for Integrating High-Performance Computing and Data Analytics Workshop on Interfaces for an Exascale OS https://www.osti.gov/search/identifier:1378843 Document ID: 528802

Stephen Lecler Olivier, Kevin Pedretti, Ronald B. Brightwell, (2016). Software Requirements for ATDM On-Node Resource Management https://www.osti.gov/search/identifier:1561714 Document ID: 475827

Ronald B. Brightwell, Stephen Lecler Olivier, (2016). Qthreads: Run Time Library Support for Task Parallel Programming Annual NNSA/CEA Collaboration Meeting https://www.osti.gov/search/identifier:1367392 Document ID: 475371

Kevin Pedretti, Ronald B. Brightwell, Shyamali Mukherjee, Noah Evans, Brian Kocoloski, Jiannan Ouyang, Peter Dinda, Kyle Hale, Patrick Bridges, Oscar Mondragon, Michael Lang, (2016). Hobbes: A Multi-Stack Approach for Application Composition and Performance Isolation 2016 Cis Erb https://www.osti.gov/search/identifier:1368803 Document ID: 453749

Ronald B. Brightwell, (2016). XPRESS: eXascale PRogramming Environment and System Software X-Stack PI Meeting https://www.osti.gov/search/identifier:1581533 Document ID: 431312

Ronald B. Brightwell, (2016). OS/Runtime Abstractions and Interfaces for Managing the Memory Hierarchy SOS Workshop https://www.osti.gov/search/identifier:1365001 Document ID: 431115

Matthew Dosanjh, Taylor Groves, Ryan Eric Grant, Ronald B. Brightwell, Patrick G Bridges, (2016). RMA-MT: A Benchmark Suite for Assessing MPI Multi-threaded RMA Performance 16th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing https://www.osti.gov/search/identifier:1347651 Document ID: 419942

Ronald B. Brightwell, Amin Hassani, Anthony Skjellum, Purushotham Bangalore, (2015). Practical Resilient Cases for FA-MPI, a Transactional Fault-Tolerant MPI Workshop on Exascale MPI https://www.osti.gov/search/identifier:1340216 Document ID: 365987

Ronald B. Brightwell, (2015). XPRESS: eXascale Programming Environment and System Software ASCR X-Stack and OS/R PI Meeting https://www.osti.gov/search/identifier:1335551 Document ID: 365572

Ronald B. Brightwell, (2015). Interconnect Software/Hardware Co-Design for Extreme-Scale Systems Workshop on Exascale MPI https://www.osti.gov/search/identifier:1337996 Document ID: 354802

Ronald B. Brightwell, Stephen Lecler Olivier, (2015). Qthreads and Thoughts on ULT Standardization ACM/IEEE Supercomputing 2015 https://www.osti.gov/search/identifier:1331922 Document ID: 354396

Ronald B. Brightwell, (2015). Hardware Support for OS/Runtime and Interconnect Workshop on Software Tools and Techniques for HPC, Clouds, and Server-Class SoCs https://www.osti.gov/search/identifier:1576133 Document ID: 353448

Alice Koniges, Jayashree Ajay Candadai, Hartmut Kaiser, Kevin Huck, Jeremy Kemp, Thomas Heller, Matthew Anderson, Andrew Lumsdaine, Adrian Serio, Michael Wolf, Bryce Lelbach, Ronald B. Brightwell, Thomas Sterling, (2015). HPX Applications and Performance Adaptation The International Conference for High Performance Computing, Networking, Storage and Analysis https://www.osti.gov/search/identifier:1332791 Document ID: 343082

Patrick G Bridges, Matthew Dosanjh, Ryan Eric Grant, Shane Farmer, Anthony Skjellum, Ronald B. Brightwell, (2015). Preparing for Exascale: Modeling MPI for Many-Core Systems using Fine-Grain Queues Workshop on Exascale MPI at Supercomputing Conference 2015 (ExaMPI) https://www.osti.gov/search/identifier:1328652 Document ID: 342626

Matthew Dosanjh, Ryan Eric Grant, Patrick G Bridges, Ronald B. Brightwell, (2015). Re-evaluating Network Onload vs. Offload for the Many-Core Era IEEE Cluster 2015 https://www.osti.gov/search/identifier:1531117 Document ID: 321433

Matthew Dosanjh, Ryan Eric Grant, Patrick Bridges, Ronald B. Brightwell, (2015). Re-evaluating Network Onload vs. Offload for the Many-Core Era IEEE Cluster 2015 https://www.osti.gov/search/identifier:1327501 Document ID: 319236

Rolf (Intel) Riesen, Barney (ORNL) Maccabe, Balazs (RIKEN) Gerofi, David (INTEL) Lombard, John (PITT) Lange, Kevin Pedretti, Kurt Brian Ferreira, Mike (LANL) Lang, Pardo (INTEL) Keppel, Robert (INTEL) Wisniewski, Ronald B. Brightwell, Todd (INTEL) Inglett, Yoonho (IBM) Park, Yutaka (RIKEN) Ishikawa, (2015). Panel: What is a Lightweight Kernel? International Workshop on Runtime and Operating Systems for Supercomputers ROSS 2015 Held in conjunction with HPDC 2015 https://www.osti.gov/search/identifier:1258200 Document ID: 275870

Ronald B. Brightwell, Robert L. Clay, (2015). Asynchronous Many Task Runtime System Working Group Salishan Conference on High Speed Computing https://www.osti.gov/search/identifier:1252434 Document ID: 265014

Matthew Dosanjh, Ryan Eric Grant, Patrick G Bridges, Ronald B. Brightwell, (2015). Re-evaluating Network Onload vs. Offload for the Many-Core Era IEEE Cluster 2015 https://www.osti.gov/search/identifier:1245930 Document ID: 243319

Matthew Dosanjh, Ryan Eric Grant, Patrick G Bridges, Ronald B. Brightwell, (2015). Re-evaluating Network Onload vs. Offload for the Many-Core Era International Conference On Supercomputing https://www.osti.gov/search/identifier:1244900 Document ID: 220207

Ronald B. Brightwell, (2014). Hobbes Extreme Scale OS Workshop on Application Interfaces for an Exascale OS https://www.osti.gov/search/identifier:1504059 Document ID: 219235

Ronald B. Brightwell, Brian Barrett, Ryan Eric Grant, Simon David Hammond, Karl Scott Hemmert, (2014). An Evaluation of MPI Message Rate on Hybrid-Core Processors International Journal of High Performance Computing Applications https://www.osti.gov/search/identifier:1185011 Document ID: 219389

Mark A. Gonzales, Ronald B. Brightwell, (2014). The Impact of MPI Queue Usage on Message Latency 2004 International Conference on Parallel Processing (ICPP-04) https://www.osti.gov/search/identifier:1185011 Document ID: 5220421

Brian W Barrett, Ronald B. Brightwell, Ryan Eric Grant, Karl Scott Hemmert, Kevin Pedretti, Kyle Wheeler, Keith D Underwood, Rolf Riesen, Arthur B. Maccabe, Trammell Hudson, (2014). The Portals 4.0.2 Networking Programming Interface https://www.osti.gov/search/identifier:1561686 Document ID: 208014

Matthew Dosanjh, Ryan Eric Grant, Patrick (UNM) Bridges, Ronald B. Brightwell, (2014). Re-evaluating Network Onload vs. Offload for the Many-Core Era Parallel, Distributed, and Network-Based Processing (PDP) https://www.osti.gov/search/identifier:1315581 Document ID: 155573

Matthew Dosanjh, Ryan Eric Grant, Patrick (UNM) Bridges, Ronald B. Brightwell, (2014). Re-evaluating Network Onload vs. Offload for the Many-Core Era Parallel, Distributed, and Network-Based Processing (PDP) https://www.osti.gov/search/identifier:1315580 Document ID: 154908

Ronald B. Brightwell, Amin Hassani, Anthony Skjellum, (2014). Comparing, Contrasting, Generalizing, and Integrating Two Current Designs for Fault-Tolerant MPI EuroMPI/Asia https://www.osti.gov/search/identifier:1314607 Document ID: 154999

Ronald B. Brightwell, (2014). Hobbes: Using Virtualization to Enable Exascale Applications Ninth Workshop on Virtualization in High-Performance Cloud Computing https://www.osti.gov/search/identifier:1502301 Document ID: 154651

Ryan Eric Grant, Stephen Lecler Olivier, James H. Laros, Ronald B. Brightwell, Allan K. Porterfield, (2014). Metrics for Evaluating Energy Saving Techniques for Resilient HPC Systems Workshop on High-Performance Power-Aware Computing https://www.osti.gov/search/identifier:1145896 Document ID: 5336394

Ryan Eric Grant, Stephen Lecler Olivier, James H. Laros, Ronald B. Brightwell, Allan K. Porterfield, (2014). Metrics for Evaluating Energy Saving Techniques for Resilient HPC Systems Workshop on High-Performance Power-Aware Computing https://www.osti.gov/search/identifier:1140455 Document ID: 5332263

Ryan Eric Grant, Ronald B. Brightwell, (2013). Metrics for Evaluating Energy Saving Techniques for Resilient HPC Systems International Conference on Parallel and Distributed Systems https://www.osti.gov/search/identifier:1143847 Document ID: 5324313

Kirk A. Rackow, Scott Larson Nicoll Levy, Ronald B. Brightwell, Dorian Arnold, Patrick Bridges, (2013). A Holistic Approach to Modeling and Simulation for Resilience and Power Configuration DOE/ASCR A Holistic Approach to Modeling and Simulation for Resilience and Power Configuration https://www.osti.gov/search/identifier:1111081 Document ID: 5324016

Patrick Widener, Kurt Brian Ferreira, Scott Larson Nicoll Levy, Ronald B. Brightwell, Patrick G. Bridges, Dorian Arnold, (2013). Asking the right questions: benchmarking fault-tolerant extreme-scale systems Workshop on Resiliency in High-Performance Computing https://www.osti.gov/search/identifier:1083655 Document ID: 5323574

Brian Barrett, Ronald B. Brightwell, Kevin Pedretti, Karl Scott Hemmert, Kyle Wheeler, Rolf Riesen, Keith Underwood, Arthur Maccabe, Trammell Hudson, (2013). The Portals 4.0.1 Network Programming Interface https://www.osti.gov/search/identifier:1095958 Document ID: 5321486

Brian Barrett, Ronald B. Brightwell, Karl Scott Hemmert, Simon David Hammond, (2013). The Impact of Hybrid-Core Processors on MPI Message Rate EuroMPI 2013 https://www.osti.gov/search/identifier:1078721 Document ID: 5321391

Brian Barrett, Ronald B. Brightwell, Simon David Hammond, Karl Scott Hemmert, (2013). The Impact of Hybrid-Core Processors on MPI Message Rate 20th European MPI Users’ Group Meeting https://www.osti.gov/search/identifier:1078760 Document ID: 5321392

Brian Barrett, Ronald B. Brightwell, Amin Hassani, Anthony Skjellum, (2013). Design, Implementation, and Performance Evaluation of MPI 3.0 on Portals 4.0 20th European MPI Users’ Group Meeting https://www.osti.gov/search/identifier:1078745 Document ID: 5321393

Ryan Eric Grant, Brian Barrett, Ronald B. Brightwell, Torsten Hoefler, Timo Schneider, (2013). Protocols for Fully Offloaded Collective Operations on Accelerated Network Adapters International Conference on Parallel Processing https://www.osti.gov/search/identifier:1078462 Document ID: 5321142

Ryan Eric Grant, Brian Barrett, Ronald B. Brightwell, Timo Schneider, Torsten Hoefler, (2013). Protocols for Fully Offloaded Collective Operations on Accelerated Network Adapters International Conference on Parallel Processing https://www.osti.gov/search/identifier:1072473 Document ID: 5320764

Kirk A. Rackow, Ronald B. Brightwell, Dewan Ibtesham, Dorian Arnold, (2013). A Comparison of Compression and Increment-based Checkpoint Optimizations 19th International European Conference on Parallel and Distributed Computing (EuroPar ’13) https://www.osti.gov/search/identifier:1063390 Document ID: 5318939

Kirk A. Rackow, Ronald B. Brightwell, Rolf Riesen, Patrick Bridges, Dorian Arnold, (2013). Accelerating Incremental Checkpointing for Extreme-Scale Computing Future Generation Computer Systems https://www.osti.gov/search/identifier:1063314 Document ID: 5318940

Kirk A. Rackow, Ronald B. Brightwell, Dewan Ibtesham, Dorian Arnold, (2013). A GPU-based Checkpoint Compression Study Size Does Matter – More Than Speed, Anyway GPGPU 6, Sixth Workshop on General Purpose Processing Using GPUs https://www.osti.gov/search/identifier:1063556 Document ID: 5317652

Brian Barrett, Ronald B. Brightwell, Karl Scott Hemmert, Keith D Underwood, (2012). Portals 4 Network Programming Interface SC12 https://www.osti.gov/search/identifier:1063493 Document ID: 5315818

Kirk A. Rackow, Kevin Pedretti, Ronald B. Brightwell, Patrick Bridges, David Fiala, Frank Mueller, (2012). An Operating System Resilient to DRAM Failures Workshop on Exascale Operating Systems and Runtime Software https://www.osti.gov/search/identifier:1064154 Document ID: 5310396

Kirk A. Rackow, Ronald B. Brightwell, Rolf Riesen, Dorian Arnold, Dewan Ibtesham, (2012). The Viability of Using Compression to Decrease Message Log Sizes 5th Workshop on Resiliency in High Performance Computing (Resilience) in Clusters, Clouds, and Grid https://www.osti.gov/search/identifier:1064339 Document ID: 5309733

Kirk A. Rackow, Ronald B. Brightwell, Dewan Ibtesham, Dorian Arnold, Patrick Bridges, (2012). On the Viability of Compression for Reducing the Overheads of Checkpoint/Restart-based Fault Tolerance The 41st International Conference on Parallel Processing https://www.osti.gov/search/identifier:1064288 Document ID: 5309062

Kirk A. Rackow, Ronald B. Brightwell, Dewan Ibtesham, Dorian Arnold, (2012). Checkpoint Compression for Improved Checkpoint/Restart The 19th European MPI Users’ Group Meeting https://www.osti.gov/search/identifier:1067599 Document ID: 5308253

Deepesh K. Kholwadwala, Kurt Brian Ferreira, Michael A. Heroux, Ronald B. Brightwell, Patrick Bridges, (2012). Cooperative Application/OS DRAM Fault Recovery https://www.osti.gov/search/identifier:1044954 Document ID: 5308123

Kirk A. Rackow, Kevin Pedretti, Ronald B. Brightwell, Patrick Bridges, David Fiala, Frank Mueller, (2012). Evaluating Operating System Vulnerability to Memory Errors https://www.osti.gov/search/identifier:1044952 Document ID: 5308124

Kirk A. Rackow, Ronald B. Brightwell, David Fiala, Frank Mueller, Christian Engelmann, Rolf Riesen, (2012). Detection and Correction of Silent Data Corruption for Large-Scale High-Performance Computing The International Conference for High-Performance Computing, Networking, Storage, and Analysis https://www.osti.gov/search/identifier:1067809 Document ID: 5307805

Kirk A. Rackow, Kevin Pedretti, Ronald B. Brightwell, Patrick Bridges, David Fiala, Frank Mueller, (2012). Evaluating Operating System Vulnerability to Memory Errors International Workshop on Runtime and Operating Systems for Supercomputers ROSS 2012 held in conjunc https://www.osti.gov/search/identifier:1067552 Document ID: 5306589

Brian Barrett, Richard Frederick Barrett, James M. Brandt, Ronald B. Brightwell, Matthew Leon Curry, Nathan D. Fabian, Kurt Brian Ferreira, Ann C. Gentile, Karl Scott Hemmert, Suzanne M. Kelly, Ruth Ann Klundt, James H. Laros, Vitus J. Leung, Michael J. Levenhagen, Gerald Fredrick Lofstead, Kenneth D. Moreland, Ron A. Oldfield, Kevin Pedretti, Arun F. Rodrigues, David Thompson, Harry Lee Ward, John P. Vandyke, Courtenay T. Vaughan, Kyle Bruce Wheeler, Tom Tucker, (2012). Report of Experiments and Evidence for ASC L2 Milestone 4467 – Demonstration of a Legacy Applications Path to Exascale https://www.osti.gov/search/identifier:1039013 Document ID: 5305233

Brian Barrett, Richard Frederick Barrett, James M. Brandt, Ronald B. Brightwell, Matthew Leon Curry, Nathan D. Fabian, Kurt Brian Ferreira, Ann C. Gentile, Karl Scott Hemmert, Suzanne M. Kelly, Ruth Ann Klundt, James H. Laros, Vitus J. Leung, Michael J. Levenhagen, Gerald Fredrick Lofstead, Kenneth D. Moreland, Ron A. Oldfield, Kevin Pedretti, Arun F. Rodrigues, David Thompson, Harry Lee Ward, John P. Vandyke, Courtenay T. Vaughan, Kyle Bruce Wheeler, Tom Tucker, (2012). Demonstration of a Legacy Applications Path to Exascale – ASC L2 Milestone 4467 Presentation to L2 Milestone Review Panel https://www.osti.gov/search/identifier:1688616 Document ID: 5305236

James A. Ang, Ronald B. Brightwell, Sudip S. Dosanjh, Karl Scott Hemmert, Arun F. Rodrigues, David Donofrio, John Shalf, (2012). Exascale Computing and the Role of Co-Design Advances in Parallel Computing https://www.osti.gov/search/identifier:1068484 Document ID: 5305105

Kirk A. Rackow, Ronald B. Brightwell, David Fiala, Frank Mueller, Christian Engelmann, (2012). FlipSphere: A Software-based DRAM Error Detection and Correction Library for HPC 18th International European Conference on Parallel and Distributed Computing (Euro-Par 2012) https://www.osti.gov/search/identifier:1145355 Document ID: 5304717

Suzanne M. Kelly, Robert A. Ballance, Ronald B. Brightwell, James H. Laros, Ronald G. Minnich, Chris Dunlap, Jim Garlick, Maya Gokhale, Pam Hamilton, Mike Lang, Scott Pakin, (2011). System Software Working Group Supercomputing 2011 https://www.osti.gov/search/identifier:1111613 Document ID: 5301881

Kirk A. Rackow, Ronald B. Brightwell, Dewan Ibtesham, Dorian Arnold, Patrick G. Bridges, (2011). Checkpoint Compression for Extreme Scale Fault Tolerance IEEE International Parallel and Distributed Processing Symposium https://www.osti.gov/search/identifier:1106330 Document ID: 5300307

Kirk A. Rackow, Ron A. Oldfield, Jon Stearley, James H. Laros, Kevin Pedretti, Ronald B. Brightwell, Rolf Riesen, (2011). Keeping Checkpoint/Restart Viable for Exascale Systems https://www.osti.gov/search/identifier:1029780 Document ID: 5299850

Kevin Pedretti, Michael J. Levenhagen, Ronald B. Brightwell, Trammell Hudson, Peter Dinda, Zheng Cui, Lei Xia, Patrick Bridges, Steven Jaconette, Patrick Widener, (2011). Palacios and Kitten: High Performance Operating Systems For Scalable Virtualized and Native Supercomputing SC09 https://www.osti.gov/search/identifier:1142272 Document ID: 5271316

Kevin Pedretti, Ronald B. Brightwell, Douglas W. Doerfler, Karl Scott Hemmert, James H. Laros, (2011). The Impact of Injection Bandwidth Performance on Application Scalability 18th EuroMPI conference https://www.osti.gov/search/identifier:1107219 Document ID: 5296543

Deepesh K. Kholwadwala, Kurt Brian Ferreira, Michael A. Heroux, Ronald B. Brightwell, Patrick G. Bridges, Philip Soltero, (2011). Cooperative Application/OS DRAM Fault Recovery 4th Workshop on Resiliency in High Performance Computing @ EuroPar https://www.osti.gov/search/identifier:1107189 Document ID: 5296256

Kirk A. Rackow, Ronald B. Brightwell, Rolf Riesen, Patrick G. Bridges, Dorian Arnold, (2011). Libhashckpt: Hash-based Incremental Checkpointing Using GPU’s The 18th European MPI Users’ Group (EuroMPI 2011) https://www.osti.gov/search/identifier:1109273 Document ID: 5295126

Brian Barrett, Ronald B. Brightwell, Karl Scott Hemmert, Kyle Bruce Wheeler, Keith D. Underwood, (2011). Using Triggered Operations to Offload Rendezvous Messages EuroMPI 2011 https://www.osti.gov/search/identifier:1143913 Document ID: 5294988

Brian Barrett, Ronald B. Brightwell, Karl Scott Hemmert, Kevin Pedretti, Kyle Bruce Wheeler, Keith D. Underwood, (2011). Enhanced Support for PGAS Communication in Portals IEEE Symposium on High-Performance Interconnects https://www.osti.gov/search/identifier:1109338 Document ID: 5294990

Karl Scott Hemmert, Brian Barrett, Ronald B. Brightwell, Michael J. Levenhagen, Keith D. Underwood, Jerrie Coffman, Roy Larsen, (2011). Enabling Flexible Collective Communication Offload with Triggered Operations IEEE Symposium on High-Performance Interconnects https://www.osti.gov/search/identifier:1109261 Document ID: 5294992

Kirk A. Rackow, Ron A. Oldfield, Jon Stearley, James H. Laros, Kevin Pedretti, Ronald B. Brightwell, Rolf Riesen, (2011). rMPI: Increasing Fault Resiliency in a Message-Passing Environment https://www.osti.gov/search/identifier:1012733 Document ID: 5293699

Kirk A. Rackow, Jon Stearley, James H. Laros, Ron A. Oldfield, Kevin Pedretti, Ronald B. Brightwell, Rolf Riesen, Patrick G Bridges, Dorian Arnold, (2011). Evaluating the Viability of Process Replication Reliability for Exascale Systems The International Conference for High Performance Computing, Networking, Storage and Analysis https://www.osti.gov/search/identifier:1108309 Document ID: 5293967

Matthew Leon Curry, Harry Lee Ward, Ronald B. Brightwell, Anthony Skjellum, (2011). Gibraltar RAID – 2011 R&D 100 Awards Entry Form https://www.osti.gov/search/identifier:1671516 Document ID: 5293112

Suzanne M. Kelly, Ronald B. Brightwell, Robert A. Ballance, Jim Garlick, Chris Dunlap, Maya B. Gokhale, Pam Hamilton, Scott Pakin, Mike Lang, (2011). Systems Software https://www.osti.gov/search/identifier:1671593 Document ID: 5292384

Ronald G. Minnich, Ronald B. Brightwell, Suzanne M. Kelly, Robert A. Ballance, Chris Dunlap, Jim Garlick, Maya Gokhale, Pam Hamilton, Mike Lang, Scott Pakin, (2011). System Software Report from ASC/CSSE-FOUS Exascale Planning ASC/CSSE Exascale Environment Planning Workshop https://www.osti.gov/search/identifier:1671729 Document ID: 5291387

Kirk A. Rackow, Jon Stearley, Ron A. Oldfield, James H. Laros, Kevin Pedretti, Ronald B. Brightwell, (2010). Redundant Computing for Exascale Systems https://www.osti.gov/search/identifier:1011662 Document ID: 5290105

Kevin Pedretti, Michael J. Levenhagen, Kurt Brian Ferreira, Ronald B. Brightwell, Suzanne M. Kelly, Patrick G. Bridges, Trammell Hudson, (2010). LDRD Final Report: A Lightweight Operating System for Multi-core Capability Class Supercomputers https://www.osti.gov/search/identifier:1007323 Document ID: 5286932

Kirk A. Rackow, Ron A. Oldfield, Jon Stearley, James H. Laros, Kevin Pedretti, Ronald B. Brightwell, Todd Kordenbrock, (2010). Increasing Fault Resiliency in a Message-Passing Environment https://www.osti.gov/search/identifier:1001015 Document ID: 5276698

Kevin Pedretti, Michael J. Levenhagen, Ronald B. Brightwell, John Lange, Trammell Hudson, Peter Dinda, Zheng Cui, Lei Xia, Patrick Bridges, Steven Jaconette, Patrick Widener, (2010). Palacios and Kitten: High Performance Operating Systems For Scalable Virtualized and Native Supercomputing https://www.osti.gov/search/identifier:1028948 Document ID: 5276024

Kenneth F. Alvin, Brian Barrett, Ronald B. Brightwell, Sudip S. Dosanjh, Karl Scott Hemmert, Richard C. Murphy, Ron A. Oldfield, Arun F. Rodrigues, Al Geist, Doug Kothe, Jeff Nichols, Jeffrey S. Vetter, (2010). On the Path to Exascale International Journal of Distributed Systems and Technologies (IJDST) https://www.osti.gov/search/identifier:1123730 Document ID: 5282712

Keith D. Underwood, Ronald B. Brightwell, Brian Barrett, Karl Scott Hemmert, (2010). Challenges for High-Performance Networking for Exascale Computing International Conference on Computer Communications and Networks https://www.osti.gov/search/identifier:1012446 Document ID: 5282380

Kirk A. Rackow, Rolf E. Riesen, Ron A. Oldfield, James H. Laros, Kevin Pedretti, Jon Stearley, Ronald B. Brightwell, (2010). rMPI: Increasing Fault Resiliency in a Message-Passing Environment International Conference for High Performance Computing, Networking, Storage, and Analysis https://www.osti.gov/search/identifier:1002112 Document ID: 5281778

Ron A. Oldfield, Ronald B. Brightwell, Kevin Pedretti, Rolf E. Riesen, Kurt Brian Ferreira, Suzanne M. Kelly, Todd H. Kordenbrock, James H. Laros, (2010). System Software Research for Extreme-Scale Computing Leadership Computing Facility Seminar https://www.osti.gov/search/identifier:1673292 Document ID: 5280913

Karl Scott Hemmert, Ronald B. Brightwell, Michael J. Levenhagen, Keith D. Underwood, Jerrie Coffman, Roy Larsen, (2010). Enabling Flexible Collective Communication Offload with Triggered Operations IEEE Cluster 2010 https://www.osti.gov/search/identifier:988546 Document ID: 5280420

Kirk A. Rackow, Rolf E. Riesen, Ron A. Oldfield, Ronald B. Brightwell, James H. Laros, Kevin Pedretti, (2009). HPC Application Fault-Tolerance Using Transparent Redundant Computation International Conference for High Performance Computing, Networking, Storage, and Analysis https://www.osti.gov/search/identifier:971418 Document ID: 5274791

Kirk A. Rackow, Kevin Pedretti, Michael J. Levenhagen, Ronald B. Brightwell, (2008). Exploring Memory Management Strategies in Catamount Cray User Group Conference https://www.osti.gov/search/identifier:1143277 Document ID: 5262678

Kirk A. Rackow, Kevin Pedretti, Ronald B. Brightwell, (2008). Exploring Memory Management Strategies in Catamount Cray User Group Conference https://www.osti.gov/search/identifier:1272175 Document ID: 5262233

William Richard Burcham, Karl Scott Hemmert, Ronald B. Brightwell, Keith D. Underwood, (2008). High Message Rate, NIC-Based Atomics: Design and Performance Considerations IEEE International Conference on Cluster Computing https://www.osti.gov/search/identifier:1145790 Document ID: 5261707

Kirk A. Rackow, Ronald B. Brightwell, Patrick G. Bridges, (2008). Characterizing Application Sensitivity to OS Interference Using Kernel-Level Noise Injection International Conference for High-Performance Computing, Networking, Storage, and Analysis (SC’08) https://www.osti.gov/search/identifier:1145507 Document ID: 5261400

Michael A. Butler, Ronald B. Brightwell, Kurt Brian Ferreira, Patrick G. Bridges, Trammell Hudson, Arthur B. Maccabe, Patrick M. Widener, (2008). Designing and Implementing Lightweight Kernels for Capability Computing Concurrency and Computation: Practice and Experience https://www.osti.gov/search/identifier:1141189 Document ID: 5260331

Karl Scott Hemmert, Ronald B. Brightwell, Arun F. Rodrigues, Richard C. Murphy, Keith D. Underwood, (2007). A Hardware Acceleration Unit for MPI Queue Processing International Parallel and Distributed Processing Symposium https://www.osti.gov/search/identifier:947842 Document ID: 5226257

David R. Bronowski, Ronald B. Brightwell, Richard Murphy, (2007). Enhancing NIC Performance for MPI using Processing-in-Memory Communications Architectures for Clusters Workshop https://www.osti.gov/search/identifier:1144132 Document ID: 5227060

Mark A. Gonzales, Ronald B. Brightwell, Michael J. Levenhagen, (2007). Evaluating NIC Hardware Requirements to Achieve High Message Rate PGAS Support on Multi-Core Processors SC 2007 https://www.osti.gov/search/identifier:908857 Document ID: 5251942

Steven J. Plimpton, Ronald B. Brightwell, Courtenay T. Vaughan, Keith D. Underwood, Mike Davis, (2006). A Simple Synchronous Distributed-Memory Algorithm for the HPCC RandomAccess Benchmark IEEE International Conference on Cluster Computing https://www.osti.gov/search/identifier:1266062 Document ID: 5244288

Gary L. Hennigan, Ronald B. Brightwell, (2006). Measuring MPI Send and Receive Overhead and Application Availability in High Performance Network Interfaces EuroPVM/MPI 2006 https://www.osti.gov/search/identifier:1264538 Document ID: 5242596

Mark A. Gonzales, Steven J. Plimpton, Ronald B. Brightwell, Courtenay T. Vaughan, Mike Davis, (2006). A Simple Synchronous Distributed-Memory Algorithm for the HPCC RandomAccess Benchmark IEEE Conference on Cluster Computing (Cluster, 2006) https://www.osti.gov/search/identifier:1266063 Document ID: 5242698

Michael A. Butler, Ronald B. Brightwell, Kevin Pedretti, Arthur B. Maccabe, Trammell Hudson, (2006). The Portals 3.3 Message Passing Interface https://www.osti.gov/search/identifier:882925 Document ID: 5238864

Suzanne M. Kelly, Ronald B. Brightwell, (2005). Software Architecture of the Light Weight Kernel, Catamount Cray User Group https://www.osti.gov/search/identifier:882925 Document ID: 5232223

William Lawry, Christopher W. Wilson, Arthur B. Maccabe, Ronald B. Brightwell, (2002). COMB: A Portable Benchmark Suite for Assessing MPI Overlap 2002 IEEE International Conference on Cluster Computing https://www.osti.gov/search/identifier:882925 Document ID: 4310800

Murray Steven Rodgers, Ronald B. Brightwell, Garth M. Reese, (2002). Scalability and Performance of Salinas on the Computational Plant The Third LCI International Conference on Linux Clusters https://www.osti.gov/search/identifier:882925 Document ID: 4477500

Sue S. Collins, Nathan Weinthaler Dauchy, Ronald B. Brightwell, Ruth Ann Klundt, (2002). An Extensible, Portable, Scalable, Cluster Management Software Architecture IEEE International Conference on Cluster Computing https://www.osti.gov/search/identifier:882925 Document ID: 4296000

Kevin Pedretti, Ronald B. Brightwell, (2002). Cplant Runtime System Support for Multi-Processor and Heterogeneous Compute Nodes Cluster 2002 IEEE International Conference on Cluster Computing https://www.osti.gov/search/identifier:882925 Document ID: 4310900

Ronald B. Brightwell, (2002). Portals and Networking for the Lustre File System 2002 IEEE International Conference on Cluster Computing https://www.osti.gov/search/identifier:882925 Document ID: 4310700

Awards & Recognition

2011

James H. Laros, Kevin T. Pedretti, Suzanne M. Kelly, James A. Ang, Ron Brightwell, John P. Vandyke, Courtenay T. Vaughan, Robert A. Ballance, 2011 National Nuclear Security Administration Defense Programs Award of Excellence, National Nuclear Security Administration, September 1, 2011

James H. Laros, Kevin T. Pedretti, Suzanne M. Kelly, John P. Vandyke, Ron Brightwell, Robert B Ballance, NNSA Environmental Stewardship Award Recipient, National Nuclear Security Administration, March 1, 2011