Center for Computing Research (CCR)

Message passing paradigms provide for many to one messaging patterns that result in receive side resource exhaustion. Traditionally, MPI implementations layered over the Portals network programming interface provided a large default unexpected receive buffer space, the user was expected to configure the buffer size to the application demand, and the application was aborted when the buffer space was overrun. The Portals 4 design provides a set of primitives for implementing scalable resource exhaustion recovery without negatively impacting normal operation. A resource exhaustion recovery protocol for MPI implementations is presented, as well as performance results for an Open MPI implementation of the protocol. © 2012 Springer-Verlag.

More Details

TYPE Conference YEAR 2012

Scopus OSTI

A NIC-offload implementation of portals for quadrics QsNet

Brightwell, Ronald B.

Abstract not provided.

More Details

TYPE Conference YEAR 2004

OSTI

A Perspective on Operating and Runtime Systems for Exascale Computing

Brightwell, Ronald B.

Abstract not provided.

More Details

TYPE Presentation YEAR 2011

OSTI

A prototype implementation of MPI for SMARTMAP

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

Brightwell, Ronald B.

Recently the Catamount lightweight kernel was extended to support direct access shared memory between processes running on the same compute node. This extension, called SMARTMAP, allows each process read/write access to another process' memory by extending the virtual address mapping. Simple virtual address bit manipulation can be used to access the same virtual address in a different process' address space. This paper describes a prototype implementation of MPI that uses SMARTMAP for intra-node message passing. SMARTMAP has several advantages over POSIX shared memory techniques for implementing MPI. We present performance results comparing MPI using SMARTMAP to the existing MPI transport layer on a quad-core Cray XT platform. © 2008 Springer-Verlag Berlin Heidelberg.

More Details

TYPE Conference YEAR 2008

Scopus OSTI

A Simple Synchronous Distributed-Memory Algorithm for the HPCC RandomAccess Benchmark

Plimpton, Steven J.; Brightwell, Ronald B.; Vaughan, Courtenay T.; Underwood, Keith

Abstract not provided.

More Details

TYPE Conference YEAR 2006

OSTI

A Simple Synchronous Distributed-Memory Algorithm for the HPCC RandomAccess Benchmark

Underwood, Keith; Plimpton, Steven J.; Brightwell, Ronald B.; Vaughan, Courtenay T.

Abstract not provided.

More Details

TYPE Conference YEAR 2006

OSTI

A System So*ware Approach for Unifying Simula8on and Analysis at Extreme--Scale

Brightwell, Ronald B.

Abstract not provided.

More Details

TYPE Presentation YEAR 2013

OSTI

A Tale of Two Systems: Using Containers to Deploy HPC Applications on Supercomputers and Clouds

Younge, Andrew J.; Pedretti, Kevin P.; Grant, Ryan E.; Brightwell, Ronald B.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2017

OSTI DOI

A Tale of Two Systems: Using Containers to Deploy HPC Applications on Supercomputers and Clouds

Younge, Andrew J.; Pedretti, Kevin P.; Grant, Ryan E.; Brightwell, Ronald B.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2018

OSTI DOI

A Transactional Model for Fault-Tolerant MPI for Petascale and Exascale Systems

Brightwell, Ronald B.

Abstract not provided.

More Details

TYPE Presentation YEAR 2013

OSTI

Accelerating Incremental Checkpointing for Extreme-Scale Computing

Proposed for publication in Future Generation Computer Systems.

Ferreira, Kurt; Brightwell, Ronald B.

Abstract not provided.

More Details

TYPE Journal Article YEAR 2013

OSTI

Addressing the System Software Challenges for Converged Simulation and Analysis on Extreme-Scale Systems

Brightwell, Ronald B.

Abstract not provided.

More Details

TYPE Conference YEAR 2013

OSTI

Advanced Architectures for Exascale Computing

Brightwell, Ronald B.

Abstract not provided.

More Details

TYPE Presentation YEAR 2010

OSTI

Advanced parallel programming models research and development opportunities

Brightwell, Ronald B.; Wen, Zhaofang W.

There is currently a large research and development effort within the high-performance computing community on advanced parallel programming models. This research can potentially have an impact on parallel applications, system software, and computing architectures in the next several years. Given Sandia's expertise and unique perspective in these areas, particularly on very large-scale systems, there are many areas in which Sandia can contribute to this effort. This technical report provides a survey of past and present parallel programming model research projects and provides a detailed description of the Partitioned Global Address Space (PGAS) programming model. The PGAS model may offer several improvements over the traditional distributed memory message passing model, which is the dominant model currently being used at Sandia. This technical report discusses these potential benefits and outlines specific areas where Sandia's expertise could contribute to current research activities. In particular, we describe several projects in the areas of high-performance networking, operating systems and parallel runtime systems, compilers, application development, and performance evaluation.

More Details

TYPE SAND Report YEAR 2004

OSTI DOI

ALAMO: Autonomous Lightweight Allocation Management and Optimization

Brightwell, Ronald B.; Ferreira, Kurt B.; Grant, Ryan E.; Levy, Scott L.; Lofstead, Gerald F.; Olivier, Stephen L.; Pedretti, Kevin P.; Younge, Andrew J.; Gentile, Ann C.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2020

OSTI

An evaluation of MPI message rate on hybrid-core processors

International Journal of High Performance Computing Applications

Barrett, Brian W.; Brightwell, Ronald B.; Grant, Ryan E.; Hammond, Simon D.; Hemmert, Karl S.

Power and energy concerns are motivating chip manufacturers to consider future hybrid-core processor designs that may combine a small number of traditional cores optimized for single-thread performance with a large number of simpler cores optimized for throughput performance. This trend is likely to impact the way in which compute resources for network protocol processing functions are allocated and managed. In particular, the performance of MPI match processing is critical to achieving high message throughput. In this paper, we analyze the ability of simple and more complex cores to perform MPI matching operations for various scenarios in order to gain insight into how MPI implementations for future hybrid-core processors should be designed.

More Details

TYPE Journal Article YEAR 2014

Scopus OSTI DOI

An evaluation of open MPI's matching transport layer on the cray XT

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

Graham, Richard L.; Brightwell, Ronald B.; Barrett, Brian; Bosilca, George; Pješivac-Grbović, Jelena

Open MPI was initially designed to support a wide variety of high-performance networks and network programming interfaces. Recently, Open MPI was enhanced to support networks that have full support for MPI matching semantics. Previous Open MPI efforts focused on networks that require the MPI library to manage message matching, which is sub-optimal for some networks that inherently support matching. We describes a new matching transport layer in Open MPI, present results of micro-benchmarks and several applications on the Cray XT platform, and compare performance of the new and the existing transport layers, as well as the vendor-supplied implementation of MPI. © Springer-Verlag Berlin Heidelberg 2007.

More Details

TYPE Conference YEAR 2007

Scopus OSTI

An Infrastructure for Characterizing the Sensitivity of Parallel Applications to OS Noise

Brightwell, Ronald B.

Abstract not provided.

More Details

TYPE Conference YEAR 2006

OSTI

An Intra-Node Implementation of OpenSHMEM Using Virtual Address Space Mapping

Brightwell, Ronald B.; Pedretti, Kevin P.

Abstract not provided.

More Details

TYPE Conference YEAR 2011

OSTI

An MPI tool to measure application sensitivity to variation in communication parameters

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

León, Edgar A.; Maccabe, Arthur B.; Brightwell, Ronald B.

This work describes an apparatus which can be used to vary communication performance parameters for MPI applications, and provides a tool to analyze the impact of communication performance on parallel applications. Our tool is based on Myrinet (along with GM). We use an extension of the LogP model to allow greater flexibility in determining the parameter(s) to which parallel applications may be sensitive. We show that individual communication parameters can be independently controlled within a small percentage error. We also present the results of using our tool on a suite of parallel benchmarks. © Springer-Verlag Berlin Heidelberg 2003.

More Details

TYPE Conference YEAR 2003

Scopus OSTI

Publications