Publications Search

We present a new method for mapping applications' MPI tasks to cores of a parallel computer such that applications' communication time is reduced. We address the case of sparse node allocation, where the nodes assigned to a job are not necessarily located in a contiguous block nor within close proximity to each other in the network, although our methods generalize to contiguous allocations as well. The goal is to assign tasks to cores so that interdependent tasks are performed by "nearby" cores, thus lowering the distance messages must travel, the amount of congestion in the network, and the overall cost of communication. Our new method applies a geometric partitioning algorithm to both the tasks and the processors, and assigns task parts to the corresponding processor parts. We also present a number of algorithmic optimizations that exploit specific features of the network or application. We show that, for the structured finite difference mini-application MiniGhost, our mapping methods reduced communication time by up to 75% relative to MiniGhost's default mapping on 128K cores of a Cray XK7 with sparse allocation. For the atmospheric modeling code E3SM/HOMME, our methods reduced communication time by up to 31% on 32K cores of an IBM BlueGene/Q with contiguous allocation.
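The mapping idea in this abstract can be sketched as follows. This is only an illustration, not the authors' implementation: it assumes recursive coordinate bisection as the geometric partitioner (the abstract does not name one), a power-of-two core count, and a task count divisible by the core count. The names `rcb` and `map_tasks_to_cores` and the sample coordinates are hypothetical.

```python
def rcb(points, ids, nparts):
    """Recursive coordinate bisection (assumed partitioner).

    Splits `ids` into `nparts` equal-size parts, cutting each time
    along the dimension with the largest coordinate extent.
    """
    if nparts == 1:
        return [list(ids)]
    dims = len(points[ids[0]])

    def extent(d):
        values = [points[i][d] for i in ids]
        return max(values) - min(values)

    cut = max(range(dims), key=extent)
    # Sort along the cut dimension; ties are broken by id for determinism.
    ordered = sorted(ids, key=lambda i: (points[i][cut], i))
    half = len(ordered) // 2
    return (rcb(points, ordered[:half], nparts // 2)
            + rcb(points, ordered[half:], nparts // 2))


def map_tasks_to_cores(task_coords, core_coords):
    """Partition tasks and cores the same way, then pair part k with part k."""
    ncores = len(core_coords)
    if ncores == 0 or ncores & (ncores - 1) != 0:
        raise ValueError("this sketch assumes a power-of-two core count")
    if len(task_coords) % ncores != 0:
        raise ValueError("this sketch assumes tasks divide evenly among cores")
    task_parts = rcb(task_coords, list(range(len(task_coords))), ncores)
    core_parts = rcb(core_coords, list(range(ncores)), ncores)
    mapping = {}
    for task_part, core_part in zip(task_parts, core_parts):
        for task in task_part:
            mapping[task] = core_part[0]  # each core part holds exactly one core
    return mapping


if __name__ == "__main__":
    # Four tasks on a line, four cores scattered along one network dimension.
    tasks = [(0,), (1,), (2,), (3,)]
    cores = [(30,), (10,), (0,), (20,)]
    print(map_tasks_to_cores(tasks, cores))
```

Because both sets are cut in the same order, tasks that are neighbors in the application's coordinate space tend to land on cores that are neighbors in the network's coordinate space, even when the allocated cores are sparse.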

TYPE Other Report YEAR 2018

DOI OSTI

Advanced Power Measurement and Control for the Trinity Supercomputer

Younge, Andrew J.; Grant, Ryan; Bays, Nathan R.; Levenhagen, Michael; Olivier, Stephen L.; Ward, Harry L.

Abstract not provided.

TYPE Conference Poster YEAR 2018

OSTI

Subtrajectorization: Efficiently Sub-dividing the Paths of Moving Objects for Better Analysis

Newton, Benjamin D.; Bays, Nathan R.; Valicka, Christopher G.; Wilson, Andrew T.

Abstract not provided.

TYPE Conference Poster YEAR 2018

OSTI

Maturing the ARM Software Ecosystem for U.S. DOE/ASC Supercomputing

Bays, Nathan R.

Abstract not provided.

TYPE Conference Poster YEAR 2018

OSTI

CMOS compatible processing for phosphorous delta-layer nanoscale electronics

Campbell, Deanna M.; Marshall, Michael; Maurer, Leon; Bays, Nathan R.; Lu, T.M.; Ward, Daniel R.; Misra, Shashank

Abstract not provided.

TYPE Conference Poster YEAR 2018

OSTI

Variability of atomically precise tunnel junctions

Marshall, Michael; Campbell, Deanna M.; Maurer, Leon; Bays, Nathan R.; Lu, T.M.; Ward, Daniel R.; Misra, Shashank

Abstract not provided.

TYPE Conference Poster YEAR 2018

OSTI

Vanguard: DOE Technical Advisory Team Information

Bays, Nathan R.

Abstract not provided.

TYPE Conference Poster YEAR 2018

OSTI

Concurrent multiscale modeling of microstructural effects on localization behavior in finite deformation solid mechanics

Computational Mechanics

Alleman, Coleman; Bays, Nathan R.; Mota, Alejandro; Lim, Hojun; Littlewood, David J.

The heterogeneity in mechanical fields introduced by microstructure plays a critical role in the localization of deformation. To resolve this incipient stage of failure, it is therefore necessary to incorporate microstructure with sufficient resolution. On the other hand, computational limitations make it infeasible to represent the microstructure in the entire domain at the component scale. In this study, the authors demonstrate the use of concurrent multiscale modeling to incorporate explicit, finely resolved microstructure in a critical region while resolving the smoother mechanical fields outside this region with a coarser discretization to limit computational cost. The microstructural physics is modeled with a high-fidelity model that incorporates anisotropic crystal elasticity and rate-dependent crystal plasticity to simulate the behavior of a stainless steel alloy. The component-scale material behavior is treated with a lower fidelity model incorporating isotropic linear elasticity and rate-independent J2 plasticity. The microstructural and component scale subdomains are modeled concurrently, with coupling via the Schwarz alternating method, which solves boundary-value problems in each subdomain separately and transfers solution information between subdomains via Dirichlet boundary conditions. In this study, the framework is applied to model incipient localization in tensile specimens during necking.
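The Schwarz alternating method named in this abstract can be illustrated on a toy problem. The sketch below is not the authors' solid-mechanics framework: it assumes a 1D Laplace problem, u'' = 0 on [0, 1] with u(0) = 0 and u(1) = 1, split into two overlapping subdomains whose end points (0.6 and 0.4) are arbitrary choices. Each subdomain is solved exactly, and the value at the artificial boundary of one subdomain is passed to the other as a Dirichlet condition. The function names are hypothetical.

```python
def solve_linear(a, b, ua, ub):
    """Exact solution of u'' = 0 on [a, b] with u(a) = ua and u(b) = ub."""
    return lambda x: ua + (ub - ua) * (x - a) / (b - a)


def schwarz_alternating(left_end=0.6, right_start=0.4, tol=1e-12, max_iter=100):
    """Alternate solves on [0, left_end] and [right_start, 1] until the
    Dirichlet value exchanged at x = left_end stops changing."""
    g = 0.0  # initial guess for u at x = left_end
    for iteration in range(1, max_iter + 1):
        # Solve the left subdomain with the current guess on its right boundary.
        u_left = solve_linear(0.0, left_end, 0.0, g)
        # Transfer: the left solution supplies the right subdomain's left boundary.
        h = u_left(right_start)
        u_right = solve_linear(right_start, 1.0, h, 1.0)
        # Transfer back: the right solution updates the left subdomain's boundary.
        g_new = u_right(left_end)
        if abs(g_new - g) < tol:
            return u_left, u_right, iteration
        g = g_new
    raise RuntimeError("Schwarz iteration did not converge")


if __name__ == "__main__":
    u_left, u_right, iterations = schwarz_alternating()
    print(iterations, u_left(0.3), u_right(0.8))
```

For this toy problem both subdomain solutions converge to the exact solution u(x) = x; the overlap between the subdomains is what drives the exchanged boundary values toward a common limit.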

TYPE Journal Article YEAR 2018

DOI OSTI Scopus

Trilinos Performance on Knights Landing

Bays, Nathan R.; Siefert, Christopher; Hu, Jonathan J.

Abstract not provided.

TYPE Conference Poster YEAR 2018

OSTI

A Tale of Two Systems: Using Containers to Deploy HPC Applications on Supercomputers and Clouds

Younge, Andrew J.; Bays, Nathan R.; Grant, Ryan; Brightwell, Ronald B.

Abstract not provided.

TYPE Conference Poster YEAR 2018

DOI OSTI

Shapes: The Next Frontier of Data Mining

Bays, Nathan R.

Abstract not provided.

TYPE Conference Poster YEAR 2018

OSTI

Vanguard Pre-bid Conference

Bays, Nathan R.

Abstract not provided.

TYPE Conference Poster YEAR 2017

OSTI

NSRD-15: Computational Capability to Substantiate DOE-HDBK-3010 Data

Bays, Nathan R.; Bignell, John; Dingreville, Remi P.M.; Zepper, Ethan T.; Brien, Timothy J.; Busch, Robert D.; Skinner, Corey

Safety basis analysts throughout the U.S. Department of Energy (DOE) complex rely heavily on the information provided in the DOE Handbook, DOE-HDBK-3010, Airborne Release Fractions/Rates and Respirable Fractions for Nonreactor Nuclear Facilities, to determine radionuclide source terms from postulated accident scenarios. In calculating source terms, analysts tend to use the DOE Handbook’s bounding values on airborne release fractions (ARFs) and respirable fractions (RFs) for various categories of insults (representing potential accident release categories). This is typically due to both time constraints and the avoidance of regulatory critique. Unfortunately, these bounding ARFs/RFs represent extremely conservative values. Moreover, they were derived from very limited small-scale bench/laboratory experiments and/or from engineering judgment. Thus, the basis for the data may not be representative of the actual unique accident conditions and configurations being evaluated. The goal of this research is to develop a more accurate and defensible method to determine bounding values for the DOE Handbook using state-of-the-art multi-physics-based computer codes.

TYPE SAND Report YEAR 2017

DOI OSTI

ATDM Operating Systems and On-Node Runtime

Olivier, Stephen L.; Bays, Nathan R.; Brightwell, Ronald B.

Abstract not provided.

TYPE Presentation YEAR 2017

OSTI

A Tale of Two Systems: Using Containers to Deploy HPC Applications on Supercomputers and Clouds

Proceedings of the International Conference on Cloud Computing Technology and Science, CloudCom

Younge, Andrew J.; Bays, Nathan R.; Grant, Ryan; Brightwell, Ronald B.

Containerization, or OS-level virtualization, has taken root within the computing industry. However, container utilization and its impact on performance and functionality within High Performance Computing (HPC) is still relatively undefined. This paper investigates the use of containers with advanced supercomputing and HPC system software. With this, we define a model for parallel MPI application DevOps and deployment using containers to enhance development effort and provide container portability from laptop to clouds or supercomputers. In this endeavor, we extend the use of Singularity containers to a Cray XC-series supercomputer. We use the HPCG and IMB benchmarks to investigate potential points of overhead and scalability with containers on a Cray XC30 testbed system. Furthermore, we also deploy the same containers with Docker on Amazon's Elastic Compute Cloud (EC2), and compare against our Cray supercomputer testbed. Our results indicate that Singularity containers operate at native performance when dynamically linking Cray's MPI libraries on a Cray supercomputer testbed, and that while Amazon EC2 may be useful for initial DevOps and testing, scaling HPC applications better fits supercomputing resources like a Cray.

TYPE Conference Poster YEAR 2017

DOI OSTI Scopus