Publications Search

Trinity Benchmarks on the Intel Xeon Phi (Knights Corner)

Rajan, Mahesh R.; Doerfler, Douglas W.; Hammond, Simon D.

This report documents early experiences with porting and performance analysis of the Tri-Lab Trinity benchmark applications on the Intel Xeon Phi (Knights Corner, KNC) processor. KNC, the second generation of the Intel Many Integrated Core (MIC) architecture, uses a large number of small P54C-based x86 cores with wide vector units and is deployed as a PCI bus attached accelerator. Sandia has experimental test beds of small InfiniBand clusters and workstations to investigate the performance of the MIC architecture. On these test beds the programming models that may be investigated are "offload", "symmetric" and "native". Among these usage models our primary interest is in the so-called "native" mode, because the planned Trinity system, to be deployed in 2016 using the next-generation MIC processor architecture called Knights Landing, would be self-hosted. The Trinity / NERSC-8 benchmark programs cover a variety of scientific disciplines and were used to guide the procurement of these systems. Architectures such as the Intel MIC are well suited to studying evolving processor architectures and a usage model, commonly referred to as MPI + X, that facilitates migration of our applications to use both coarse-grain and fine-grain parallelism. Our focus with the selected applications is on how effectively their algorithms take advantage of features such as a large number of cores, wide vector units, and a higher-bandwidth, deeper memory subsystem. This is a first step towards understanding applications, algorithms and programming environments for Trinity and future exascale computing systems.

TYPE SAND Report YEAR 2015

DOI OSTI

CoE Meeting Tools Discussion

Rajan, Mahesh R.; Dinge, Dennis D.

Abstract not provided.

TYPE Presentation YEAR 2015

OSTI

Preparation of Codes for Trinity

Vaughan, Courtenay T.; Rajan, Mahesh R.; Dinge, Dennis D.; Dohrmann, Clark R.; Franko, Kenneth J.; Glass, Micheal W.; Pierson, Kendall H.; Tupek, Michael R.

Abstract not provided.

TYPE Conference Poster YEAR 2015

OSTI

SIERRA Solid Mechanics Trinity CoE Meeting: SIERRA/SM Profiling

Tupek, Michael R.; Pierson, Kendall H.; Rajan, Mahesh R.

Abstract not provided.

TYPE Presentation YEAR 2015

OSTI

Trinity Benchmarks on Xeon Phi (Knights Corner)

Rajan, Mahesh R.; Doerfler, Douglas W.; Hammond, Simon D.; Trott, Christian R.; Barrett, Richard F.

Abstract not provided.

TYPE Conference Poster YEAR 2014

OSTI

Experiences with Sandia National Laboratories HPC applications and MPI performance

Rajan, Mahesh R.; Doerfler, Douglas W.; Barrett, Richard F.; Stevenson, Joel O.; Agelastos, Anthony M.; Shaw, Ryan P.; Meyer, Harold E.

Abstract not provided.

TYPE Conference Poster YEAR 2014

OSTI

The Lightweight Distributed Metric Service: A Scalable Infrastructure for Continuous Monitoring of Large Scale Computing Systems and Applications

Agelastos, Anthony M.; Allan, Benjamin A.; Brandt, James M.; Gentile, Ann C.; Monk, Stephen T.; Ogden, Jeffry B.; Rajan, Mahesh R.; Stevenson, Joel O.

Abstract not provided.

TYPE Conference YEAR 2014

OSTI DOI

Toward Rapid Understanding of Production HPC Applications and Systems

Agelastos, Anthony M.; Allan, Benjamin A.; Brandt, James M.; Gentile, Ann C.; Monk, Stephen T.; Ogden, Jeffry B.; Rajan, Mahesh R.; Stevenson, Joel O.

Abstract not provided.

TYPE Conference YEAR 2014

OSTI DOI

The Lightweight Distributed Metric Service: A Scalable Infrastructure for Continuous Monitoring of Large Scale Computing Systems and Applications

International Conference for High Performance Computing, Networking, Storage and Analysis, SC

Agelastos, Anthony M.; Allan, Benjamin A.; Brandt, James M.; Cassella, Paul; Enos, Jeremy; Fullop, Joshi; Gentile, Ann C.; Monk, Stephen T.; Naksinehaboon, Nichamon; Ogden, Jeffry B.; Rajan, Mahesh R.; Showerman, Michael; Stevenson, Joel O.; Taerat, Narate; Tucker, Tom

Understanding how resources of High Performance Compute platforms are utilized by applications both individually and as a composite is key to application and platform performance. Typical system monitoring tools do not provide sufficient fidelity while application profiling tools do not capture the complex interplay between applications competing for shared resources. To gain new insights, monitoring tools must run continuously, system wide, at frequencies appropriate to the metrics of interest while having minimal impact on application performance. We introduce the Lightweight Distributed Metric Service for scalable, lightweight monitoring of large scale computing systems and applications. We describe issues and constraints guiding deployment in Sandia National Laboratories' capacity computing environment and on the National Center for Supercomputing Applications' Blue Waters platform including motivations, metrics of choice, and requirements relating to the scale and specialized nature of Blue Waters. We address monitoring overhead and impact on application performance and provide illustrative profiling results.

TYPE Conference Poster YEAR 2014

DOI OSTI Scopus

Unprecedented Scalability and Performance of the new NNSA Tri-Lab Capacity Cluster 2 (TLCC2)

Rajan, Mahesh R.; Doerfler, Douglas W.; Lin, Paul L.; Hammond, Simon D.; Barrett, Richard F.; Vaughan, Courtenay T.

Abstract not provided.

TYPE Conference YEAR 2012

OSTI

Application Performance and Scaling on the new Tri-Lab Capacity Cluster:TLCC2

Rajan, Mahesh R.

Abstract not provided.

TYPE Conference YEAR 2012

OSTI

Reliable Computation Using Unpredictable Components

Wilke, Jason W.; Ballance, Robert A.; Rajan, Mahesh R.; Kelly, Suzanne M.; Noe, John P.

Abstract not provided.

TYPE Conference YEAR 2012

OSTI

Application-Driven Analysis of Two Generations of Capability Computing Platforms: The Transition to Multicore Processors

Concurrency and Computation: Practice and Experience

Rajan, Mahesh R.; Vaughan, Courtenay T.; Doerfler, Douglas W.; Barrett, Richard F.; Lin, Paul L.; Pedretti, Kevin; Hemmert, Karl S.

Abstract not provided.

TYPE Journal Article YEAR 2011

OSTI

Application Driven Analysis of Two Generations of Capability Computing Platforms: Purple and Cielo

Rajan, Mahesh R.; Vaughan, Courtenay T.; Barrett, Richard F.; Doerfler, Douglas W.; Lin, Paul L.; Pedretti, Kevin; Hemmert, Karl S.

Abstract not provided.

TYPE Conference YEAR 2011

OSTI OSTI

From Red Storm to Cielo: Performance Analysis of ASC Simulation Programs Across an Evolution of Multicore Architectures

Parallel Processing Letters

Barrett, Richard F.; Vaughan, Courtenay T.; Rajan, Mahesh R.; Doerfler, Douglas W.; Pedretti, Kevin

Abstract not provided.

TYPE Journal Article YEAR 2011

OSTI

Application-Driven Acceptance of Cielo an XE6 Petascale Capability Platform

Doerfler, Douglas W.; Rajan, Mahesh R.

Abstract not provided.

TYPE Conference YEAR 2011

OSTI

Investigating the Impact of the Cielo Cray XT6 Architecture on Scientific Application Codes

Vaughan, Courtenay T.; Rajan, Mahesh R.; Barrett, Richard F.; Doerfler, Douglas W.; Pedretti, Kevin

Abstract not provided.

TYPE Conference YEAR 2011

OSTI

Application-Driven Acceptance of Cielo an XE6 Petascale Capability Platform

Doerfler, Douglas W.; Rajan, Mahesh R.

Abstract not provided.

TYPE Conference YEAR 2011

OSTI

Copy of Application-driven Analysis of Two Generations of Capability Computing Platforms: Purple and Cielo

Rajan, Mahesh R.; Vaughan, Courtenay T.; Doerfler, Douglas W.; Lin, Paul L.; Pedretti, Kevin; Hemmert, Karl S.

Abstract not provided.

TYPE Conference YEAR 2011

OSTI OSTI

A Comparison of the Performance Characteristics of Capability and Capacity Class HPC Systems

Doerfler, Douglas W.; Rajan, Mahesh R.; Epperson, Marcus E.; Vaughan, Courtenay T.; Pedretti, Kevin; Barrett, Richard F.; Barrett, Brian B.

Abstract not provided.

TYPE Conference YEAR 2011

OSTI

Copy of Investigating the Impact of the Cielo Cray XE6 Architecture on Scientific Application Codes

Vaughan, Courtenay T.; Rajan, Mahesh R.; Barrett, Richard F.; Doerfler, Douglas W.; Pedretti, Kevin

Abstract not provided.

TYPE Conference YEAR 2011

OSTI OSTI

Capability vs Capacity; HPC Systems Application Performance Comparisons: Cielo vs. Red Sky

Doerfler, Douglas W.; Rajan, Mahesh R.

Abstract not provided.

TYPE Presentation YEAR 2011

OSTI OSTI

Application-driven analysis of two generations of capability computing platforms :

Rajan, Mahesh R.; Vaughan, Courtenay T.; Doerfler, Douglas W.; Lin, Paul L.; Pedretti, Kevin P.

Abstract not provided.

TYPE Conference YEAR 2011

OSTI

Application-driven Analysis of Two Generations of Capability Computing Platforms: Purple and Cielo

Rajan, Mahesh R.; Vaughan, Courtenay T.; Doerfler, Douglas W.; Lin, Paul L.; Pedretti, Kevin; Barrett, Richard F.; Hemmert, Karl S.

Abstract not provided.

TYPE Conference YEAR 2011

OSTI

Investigating the Impact of the Cielo Cray XE6 Architecture on Scientific Application Codes

Vaughan, Courtenay T.; Rajan, Mahesh R.; Barrett, Richard F.; Doerfler, Douglas W.; Pedretti, Kevin P.

Cielo, a Cray XE6, is the Department of Energy NNSA Advanced Simulation and Computing (ASC) campaign's newest capability machine. Rated at 1.37 PFLOPS, it consists of 8,944 dual-socket oct-core AMD Magny-Cours compute nodes, linked using Cray's Gemini interconnect. Its primary mission objective is to enable a suite of the ASC applications implemented using MPI to scale to tens of thousands of cores. Cielo is an evolutionary improvement to a successful architecture previously available to many of our codes, thus enabling a basis for understanding the capabilities of this new architecture. Using three codes strategically important to the ASC campaign, and supplemented with some micro-benchmarks that expose the fundamental capabilities of the XE6, we report on the performance characteristics and capabilities of Cielo.

TYPE Conference YEAR 2010

OSTI
