Publications

Results 1–25 of 41
Date Inputs. Currently set to enter a start and end date.
Current Filters Clear all
Publication Type Year

ALBADross: Active Learning Based Anomaly Diagnosis for Production HPC Systems

Cluster 2022

Burak Aksar, Efe Sencan, Benjamin Schwaller, Omar Raad Aaziz, Brian Kulis, Ayse K. Coskun, Vitus J. Leung, James M. Brandt

Conference Proceeding – 2022 Conference Proceeding 2022

Using Monitoring Data to Improve HPC Performance via Network-Data-Driven Allocation

Ieee Hpec

Yijia Zhang, Burak Aksar, Omar Raad Aaziz, Benjamin Schwaller, James M. Brandt, Vitus J. Leung, Manuel Egele, Ayse K. Coskun

Conference Presentation – 2021 Conference Presentation 2021

L2 Milestone #7842 SAND Report

Omar Raad Aaziz, Benjamin A. Allan, James M. Brandt, Jeanine Cook, Karen D. Devine, James Elliott, Ann C. Gentile, Simon David Hammond, Brian Michael Kelley, Lena Lopatina, Stan Gerald Moore, Stephen Lecler Olivier, Kevin Pedretti, David Zoeller Poliakoff, Roger P. Pawlowski, Phillip A Regier, Mark E Schmitz, Benjamin Schwaller, Vanessa Surjadidjaja, Matthew Scot Swan, Nick Tucker, Tom Tucker, Courtenay T. Vaughan, Sara Petra Walton

https://www.osti.gov/search/identifier:1819812

SAND Report – 2021 SAND Report 2021

Integrated System and Application Continuous Performance Monitoring and Analysis Capability

FY20 ASC FOUS L2 Milestone 7842 Final Review

James M. Brandt, Jeanine Cook, Omar Raad Aaziz, Benjamin A. Allan, Karen D. Devine, James John Elliott, Ann C. Gentile, Simon David Hammond, Brian Michael Kelley, Lena Lopatina, Stan Gerald Moore, Stephen Lecler Olivier, Kevin Pedretti, David Zoeller Poliakoff, Roger P. Pawlowski, Phillip A Regier, Mark E Schmitz, Benjamin Schwaller, Vanessa Surjadidjaja, Matthew Scot Swan, Tom Tucker, Nick Tucker, Courtenay T. Vaughan, Sara Petra Walton

Presentation (non-conference) – 2021 Presentation (non-conference) 2021

Using Monitoring Data to Improve HPC Performance via Network-Data-Driven Allocation

Ieee Hpec

Yijia Zhang, Burak Aksar, Omar Raad Aaziz, Benjamin Schwaller, James M. Brandt, Vitus J. Leung, Manuel Egele, Ayse K. Coskun

Conference Proceeding – 2021 Conference Proceeding 2021

E2EWatch: End-to-end Anomaly Diagnosis Framework for Production HPC Systems

Euro-Par 2021

Burak Aksar, Yijia Zhang, Emre Ates, Omar Raad Aaziz, Benjamin Schwaller, James M. Brandt, Vitus J. Leung, Manuel Egele, Ayse K. Coskun

Conference Presentation – 2021 Conference Presentation 2021

E2EWatch: End-to-end Anomaly Diagnosis Framework for Production HPC Systems

Euro-Par 2021

Burak Aksar, Yijia Zhang, Emre Ates, Omar Raad Aaziz, Benjamin Schwaller, James M. Brandt, Vitus J. Leung, Manuel Egele, Ayse K. Coskun

https://www.osti.gov/search/identifier:1873069

Conference Paper – 2021 Conference Paper 2021

Proctor: A Semi-Supervised Performance Anomaly Diagnosis Framework for Production HPC Systems

ISC High Performance

Burak Aksar, Yijia Zhang, Emre Ates, Benjamin Schwaller, Omar Raad Aaziz, Vitus J. Leung, James M. Brandt, Manuel Egele, Ayse K. Coskun

https://www.osti.gov/search/identifier:1866057

Conference Proceeding – 2021 Conference Proceeding 2021

Enabling Application and System Data Fusion

ECP Annual Meeting

Ann C. Gentile, James M. Brandt, Jeanine Cook, Simon David Hammond, David Zoeller Poliakoff, Benjamin Schwaller, Vanessa Surjadidjaja, Thomas Owen Tucker

https://www.osti.gov/search/identifier:1863505

Conference Presentation – 2021 Conference Presentation 2021

Attributing Performance Variation from Integrated Application and System Data

Applied Computer Science Meeting

Omar Raad Aaziz, Benjamin A. Allan, James M. Brandt, Jeanine Cook, Karen D. Devine, James John Elliott, Ann C. Gentile, Stephen Lecler Olivier, Kevin Pedretti, Tom Tucker

https://www.osti.gov/search/identifier:1765520

Conference Paper – 2020 Conference Paper 2020

A Machine Learning Approach to Understanding HPC Application Performance Variation

Supercomputing Conference

Benjamin Schwaller, Burak Aksar, Omar Raad Aaziz, Emre Ates, James M. Brandt, Ayse Coskun, Manuel Egele, Vitus J. Leung

https://www.osti.gov/search/identifier:1642784

Conference Paper – 2019 Conference Paper 2019

AD for Machine Learning Approach to Understanding HPC Application Performance Variation Poster

Supercomputing Conference

Burak Aksar, Benjamin Schwaller, Omar Raad Aaziz, Emre Ates, James M. Brandt, Ayse Coskun, Manuel Egele, Vitus J. Leung

https://www.osti.gov/search/identifier:1642788

Conference Paper – 2019 Conference Paper 2019

Design, Installation, and Operation of the Vortex ART Platform

Nathan Edward Gauntt, Kevin Duvall Davis, Jason John Repik, James M. Brandt, Ann C. Gentile, Simon David Hammond

https://www.osti.gov/search/identifier:1562796

Report – 2019 Report 2019

Taxonomist: Application Detection through Rich Monitoring Data

Machine Learning Course

Emre Ates, Ozan Tuncer, Ata Turk, Vitus J. Leung, James M. Brandt, Manuel Egele, Ayse K. Coskun

https://www.osti.gov/search/identifier:1645517

Presentation (non-conference) – 2019 Presentation (non-conference) 2019

Large-Scale System Monitoring Experiences and Recommendations

Workshop on Monitoring and Analysis for High Performance Compute Systems Plus Applications

V. Ahlgren, S. Andersson, James M. Brandt, N. Cardo, S. Chunduri, J. Enos, P. Fields, Ann C. Gentile, R. Gerber, M. Gienger, J. Greenseid, A. Greiner, B. Hadri, Y. He, D. Hoppe, U. Kaila, K. Kelly, M. Klein, A. Kristiansen, S. Leak, M. Mason, Kevin Pedretti, J-G. Piccinali, Jason John Repik, J. Rogers, S. Salminen, M. Showerman, C. Whitney, J. Williams

https://www.osti.gov/search/identifier:1592263

Conference Paper – 2018 Conference Paper 2018

Online Diagnosis of Performance Variation in HPC Systems Using Machine Learning

IEEE Transactions on Parallel and Distributed Systems

Ozan Tuncer, Emre Ates, Yijia Zhang, Ata Turk, James M. Brandt, Vitus J. Leung, Manuel Egele, Ayse K. Coskun

https://www.osti.gov/search/identifier:1474092

Journal Article – 2018 Journal Article 2018

Large-Scale System Monitoring Experiences and Recommendations

Workshop on Monitoring and Analysis for High Performance Compute Systems Plus Applications

V. Ahlgren, S. Andersson, James M. Brandt, N. Cardo, S. Chunduri, J. Enos, P. Fields, Ann C. Gentile, R. Gerber, M. Gienger, J. Greenseid, A. Greiner, B. Hadri, Y. He, D. Hoppe, U. Kaila, K. Kelly, M. Klein, A. Kristiansen, S. Leak, M. Mason, Kevin Pedretti, J-G. Piccinali, Jason John Repik, J. Rogers, S. Salminen, M. Showerman, C. Whitney, J. Williams

https://www.osti.gov/search/identifier:1576168

Conference Paper – 2018 Conference Paper 2018

Application Performance Insights via System Monitoring

Jowog-34

James M. Brandt, Ann C. Gentile, Simon David Hammond, Jeanine Cook, Benjamin A. Allan, Thomas Tucker, Nichamon Naksinehaboon, Narate Taerat, Jonathan Cook, Omar Raad Aaziz, Emre Ates, Ozan Tuncer, Manuel Egele, Ata Turk, Ayse Coskun, Ramin Izadpanah, Damian Dechev

https://www.osti.gov/search/identifier:1532642

Presentation (non-conference) – 2018 Presentation (non-conference) 2018

Taxonimist: Application Detection through Rich Monitoring Data

Euro-Par

Emre Ates, Ozan Tuncer, Ata Turk, Vitus J. Leung, James M. Brandt, Manuel Egele, Ayse K. Coskun

https://www.osti.gov/search/identifier:1526818

Conference Paper – 2018 Conference Paper 2018

Runtime HPC System and Application Performance Assessment and Diagnostics

Conference on Data Analysis

James M. Brandt, Ann C. Gentile, Jonathan Edwin Cook, Benjamin A. Allan, Jeanine Cook, Omar Raad Aaziz, Thomas Tucker, Naksinehaboon Nichamon, Narate Taerat, Emre Ates, Ozan Tuncer, Manuel Egele, Ata Turk, Ayse Coskun

https://www.osti.gov/search/identifier:1500155

Conference Paper – 2018 Conference Paper 2018

Detection and Diagnosis of Performance Variations

Potential discussion with Susan Seestrom on AI/ML

Ozan Tuncer, Emre Ates, Yijia Zhang, Ata Turk, James M. Brandt, Vitus J. Leung, Manuel Egele, Ayse K. Coskun

https://www.osti.gov/search/identifier:1497230

Presentation (non-conference) – 2018 Presentation (non-conference) 2018

Enhanced Profiling for Kokkos Applications

Applied Computer Science Multi-Lab Meeting

Simon David Hammond, Christian Robert Trott, Daniel Alejandro Ibanez-Granados, Harold C. Edwards, Daniel Sunderland, Nathan David Ellingwood, James M. Brandt, Ann C. Gentile, Jeanine Cook, Robert J. Hoekstra

https://www.osti.gov/search/identifier:1495756

Conference Paper – 2018 Conference Paper 2018

Continuous Performance Tracking for Kokkos Applications Using LDMS

Programming Models and Co-Design Meeting

James M. Brandt, Simon David Hammond, Thomas Tucker, Ann C. Gentile, Jeanine Cook

https://www.osti.gov/search/identifier:1495760

Presentation (non-conference) – 2018 Presentation (non-conference) 2018

Diagnosing Performance Variations in HPC Applications Using Machine Learning

Csss

Ozan Tuncer, Emre Ates, Yijia Zhang, Ata Turk, James M. Brandt, Vitus J. Leung, Manuel Egele, Ayse Coskun

https://www.osti.gov/search/identifier:1510194

Presentation (non-conference) – 2017 Presentation (non-conference) 2017

Task Placement to Reduce Application Communication Costs

Sandia CIS External Review Board

Karen D. Devine, James M. Brandt, Mehmet Deveci, Ann C. Gentile, Vitus J. Leung, Stephen Lecler Olivier, Kevin Pedretti, Sivasankaran Rajamanickam, Mark Alan Taylor

https://www.osti.gov/search/identifier:1467790

Presentation (non-conference) – 2017 Presentation (non-conference) 2017
Document Title Type Year
Results 1–25 of 41