Publications Search

As computer systems grow in both size and complexity, the need for applications and run-time systems to adjust to their dynamic environment also grows. The goal of the RAAMP LDRD was to combine static architecture information and real-time system state with algorithms to conserve power, reduce communication costs, and avoid network contention. We devel- oped new data collection and aggregation tools to extract static hardware information (e.g., node/core hierarchy, network routing) as well as real-time performance data (e.g., CPU uti- lization, power consumption, memory bandwidth saturation, percentage of used bandwidth, number of network stalls). We created application interfaces that allowed this data to be used easily by algorithms. Finally, we demonstrated the benefit of integrating system and application information for two use cases. The first used real-time power consumption and memory bandwidth saturation data to throttle concurrency to save power without increasing application execution time. The second used static or real-time network traffic information to reduce or avoid network congestion by remapping MPI tasks to allocated processors. Results from our work are summarized in this report; more details are available in our publications [2, 6, 14, 16, 22, 29, 38, 44, 51, 54].

More Details

TYPE SAND Report YEAR 2014

DOI OSTI

Demonstrating Improved Application Performance Using Dynamic Monitoring and Task Mapping

Brandt, James M.; Devine, Karen; Gentile, Ann C.; Foulk, James W.

Abstract not provided.

More Details

TYPE Presentation YEAR 2014

DOI OSTI

Albany on Next-Generation Systems

Devine, Karen; Salinger, Andrew G.; Demeshko, Irina; Hansen, Glen; Edwards, Harold C.

Abstract not provided.

More Details

TYPE Presentation YEAR 2014

OSTI

Installing the Anasazi Eigensolver Package with Application to Some Graph Eigenvalue Problems

Lehoucq, Rich; Boman, Erik G.; Devine, Karen; Thornquist, Heidi K.; Slattengren, Nicole L.

The purpose of this report is to document a basic installation of the Anasazi eigensolver package and provide a brief discussion on the numerical solution of some graph eigenvalue problems.

More Details

TYPE SAND Report YEAR 2014

DOI OSTI

FASTMath Partitioning and Task Placement

Devine, Karen; Diamond, Gerrett; Ibanez, Dan; Leung, Vitus J.; Prokopenko, Andrey V.; Rajamanickam, Sivasankaran; Shephard, Mark; Smith, Cameron

Abstract not provided.

More Details

TYPE Presentation YEAR 2014

OSTI

Zoltan Three-Slide Overview for ATPESC 2014

Devine, Karen; Rajamanickam, Sivasankaran; Prokopenko, Andrey V.; Boman, Erik G.

Abstract not provided.

More Details

TYPE Presentation YEAR 2014

OSTI

Demonstrating Improved Application Performance Using Dynamic Monitoring and Task Mapping

Brandt, James M.; Devine, Karen; Gentile, Ann C.; Foulk, James W.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2014

DOI OSTI

Zoltan2: Exploiting Geometric Partitioning in Task Mapping for Parallel Computers

Leung, Vitus J.; Rajamanickam, Sivasankaran; Pedretti, Kevin; Olivier, Stephen L.; Devine, Karen

Abstract not provided.

More Details

TYPE Conference YEAR 2014

OSTI

Using 2D Matrix Distributions in Trilinos

Devine, Karen; Boman, Erik G.; Rajamanickam, Sivasankaran

Abstract not provided.

More Details

TYPE Conference YEAR 2013

OSTI

A computational spectral graph theory tutorial

Boman, Erik G.; Devine, Karen; Lehoucq, Rich

Abstract not provided.

More Details

TYPE Conference YEAR 2013

OSTI

Exploiting Geometric Partitioning in Task Mapping for Parallel Computers

Rajamanickam, Sivasankaran; Leung, Vitus J.; Pedretti, Kevin P.; Olivier, Stephen L.; Devine, Karen

Abstract not provided.

More Details

TYPE Conference YEAR 2013

OSTI

The Zoltan Toolkits: Parallel Partitioning Load Balancing Coloring and Ordering

Devine, Karen; Boman, Erik G.; Rajamanickam, Sivasankaran; Leung, Vitus J.

Abstract not provided.

More Details

TYPE Presentation YEAR 2013

OSTI

Multi-jagged: A Scalable Multi-section based Spatial Partitioning Algorithm

Rajamanickam, Sivasankaran; Devine, Karen

Abstract not provided.

More Details

TYPE Conference YEAR 2013

OSTI

Scalable Matrix Computations on Large Scale-Free Graphs Using 2D Graph Partitioning

Boman, Erik G.; Devine, Karen; Rajamanickam, Sivasankaran

Abstract not provided.

More Details

TYPE Conference YEAR 2013

DOI OSTI

Combinatorial Scientific Computing for Exascale Systems and Applications

Devine, Karen; Rajamanickam, Sivasankaran; Boman, Erik G.

Abstract not provided.

More Details

TYPE Conference YEAR 2013

OSTI

Using the Cray Gemini Performance Counters

Pedretti, Kevin; Vaughan, Courtenay T.; Barrett, Richard F.; Devine, Karen; Hemmert, Karl S.

Abstract not provided.

More Details

TYPE Conference YEAR 2013

OSTI

Scalable Matrix Computations on Large Scale-Free Graphs Using 2D Graph Partitioning

Boman, Erik G.; Devine, Karen; Rajamanickam, Sivasankaran

Abstract not provided.

More Details

TYPE Conference YEAR 2013

DOI OSTI

Trilinos-based Software for Eigenanalysis of Graphs

Boman, Erik G.; Devine, Karen; Lehoucq, Rich; Slattengren, Nicole L.

Abstract not provided.

More Details

TYPE Presentation YEAR 2013

OSTI

Efficient Computation of Eigenpairs for Large Scale-free Graphs

Boman, Erik G.; Devine, Karen; Lehoucq, Rich; Slattengren, Nicole L.

Abstract not provided.

More Details

TYPE Conference YEAR 2013

OSTI

Scalable matrix computations on large scale-free graphs using 2D graph partitioning

International Conference for High Performance Computing, Networking, Storage and Analysis, SC

Boman, Erik G.; Devine, Karen; Rajamanickam, Sivasankaran

Scalable parallel computing is essential for processing large scale-free (power-law) graphs. The distribution of data across processes becomes important on distributed-memory computers with thousands of cores. It has been shown that two dimensional layouts (edge partitioning) can have significant advantages over traditional one-dimensional layouts. However, simple 2D block distribution does not use the structure of the graph, and more advanced 2D partitioning methods are too expensive for large graphs. We propose a new two-dimensional partitioning algorithm that combines graph partitioning with 2D block distribution. The computational cost of the algorithm is essentially the same as 1D graph partitioning. We study the performance of sparse matrix-vector multiplication (SpMV) for scale-free graphs from the web and social networks using several different partitioners and both 1D and 2D data layouts. We show that SpMV run time is reduced by exploiting the graph's structure. Contrary to popular belief, we observe that current graph and hypergraph partitioners often yield relatively good partitions on scale-free graphs. We demonstrate that our new 2D partitioning method consistently outperforms the other methods considered, for both SpMV and an eigensolver, on matrices with up to 1.6 billion nonzeros using up to 16,384 cores. Copyright 2013 ACM.

More Details

TYPE Conference YEAR 2013

DOI OSTI Scopus

Publications

Search results