Publications

Results 1–50 of 166

Search results

Brandt, J.M., & Gentile, A.C. (2022). AppSysFusion: CoMingling of appropriate data to drive Codesign of Applications, HPC Platforms, and Monitoring, Analysis, and Feedback Infrastructure [Conference Presentation]. https://doi.org/10.2172/2006042

Goponenko, A., Lamar, K., Peterson, C., Allan, B., Brandt, J.M., & Dechev, D. (2022). Metrics for Packing Efficiency and Fairness of HPC Cluster Batch Job Scheduling [Conference Presentation]. https://doi.org/10.2172/2005924

Aaziz, O., Allan, B., Brandt, J.M., Cook, J., Devine, K., Elliott, J., Gentile, A.C., Hammond, S., Kelley, B., Lopatina, L., Moore, S.G., Olivier, S.L., Bachman, W.B., Poliakoff, D., Pawlowski, R., Regier, P., Schmitz, M.E., Schwaller, B., Surjadidjaja, V., … Walton, S.P. (2021). Integrated System and Application Continuous Performance Monitoring and Analysis Capability. https://doi.org/10.2172/1819812

Brandt, J.M., Cook, J., Aaziz, O., Allan, B., Devine, K., Bachman, W.B., Gentile, A.C., Hammond, S., Kelley, B., Lopatina, L., Moore, S.G., Olivier, S.L., Bachman, W.B., Poliakoff, D., Pawlowski, R., Regier, P., Schmitz, M.E., Schwaller, B., Surjadidjaja, V., … Walton, S.P. (2021). Integrated System and Application Continuous Performance Monitoring and Analysis Capability [Presentation]. https://www.osti.gov/biblio/1886175

Aksar, B., Zhang, Y., Ates, E., Aaziz, O., Schwaller, B., Brandt, J.M., Leung, V.J., Egele, M., & Coskun, A.K. (2021). E2EWatch: End-to-end Anomaly Diagnosis Framework for Production HPC Systems [Conference Presentation]. https://doi.org/10.2172/1891960

Costa, E., Patel, T., Schwaller, B., Brandt, J.M., & Tiwari, D. (2021). Lessons From Examining Repetitive Job Behavior and I/O Performance Variability on a Production HPC System [Conference Paper]. https://www.osti.gov/biblio/1884199

Gentile, A.C., Brandt, J.M., Cook, J., Hammond, S., Poliakoff, D., Schwaller, B., Surjadidjaja, V., & Tucker, T.O. (2021). Enabling Application and System Data Fusion [Conference Presentation]. https://doi.org/10.2172/1863505

Aksar, B., Zhang, Y., Ates, E., Schwaller, B., Aaziz, O., Leung, V.J., Brandt, J.M., Egele, M., & Coskun, A.K. (2021). Proctor: A Semi-Supervised Performance Anomaly Diagnosis Framework for Production HPC Systems [Conference Proceeding]. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). https://doi.org/10.1007/978-3-030-78713-4_11

Zhang, Y., Aksar, B., Aaziz, O., Schwaller, B., Brandt, J.M., Leung, V.J., Egele, M., & Coskun, A.K. (2021). Using Monitoring Data to Improve HPC Performance via Network-Data-Driven Allocation [Conference Presentation]. 2021 IEEE High Performance Extreme Computing Conference, HPEC 2021. https://doi.org/10.2172/1888952

Schwaller, B., Allan, B., Brandt, J.M., Tucker, T., & Tucker, N. (2020). HPC System Data Pipeline to Enable Meaningful Insights through Analytic-Driven Visualizations [Conference Poster]. https://www.osti.gov/biblio/1814415

Aaziz, O., Allan, B., Brandt, J.M., Cook, J., Devine, K., Bachman, W.B., Gentile, A.C., Olivier, S.L., Bachman, W.B., & Tucker, T. (2020). Attributing Performance Variation from Integrated Application and System Data [Conference Poster]. https://www.osti.gov/biblio/1765520

Aksar, B., Schwaller, B., Aaziz, O., Ates, E., Brandt, J.M., Coskun, A., Egele, M., & Leung, V.J. (2019). AD for Machine Learning Approach to Understanding HPC Application Performance Variation Poster [Conference Poster]. https://www.osti.gov/biblio/1642788

Schwaller, B., Aksar, B., Aaziz, O., Ates, E., Brandt, J.M., Coskun, A., Egele, M., & Leung, V.J. (2019). A Machine Learning Approach to Understanding HPC Application Performance Variation [Conference Poster]. https://www.osti.gov/biblio/1642784

Brandt, J.M., Brown, C.J., Bachman, W.B., Gentile, A.C., Greenseid, J., Kramer, W., Langer, P., Rashid, A., Rhem, K., & Showerman, M. (2019). Exploring New Monitoring and Analysis Capabilities on Cray's Software Preview System (Final Version) [Conference Poster]. https://www.osti.gov/biblio/1640116

Brandt, J.M., Brown, C.J., Bachman, W.B., Gentile, A.C., Greenseid, J., Kramer, W., Langer, P., Rashid, A., Rhem, K., & Showerman, M. (2019). Exploring New Monitoring and Analysis Capabilities on Cray's Software Preview System [Conference Poster]. https://www.osti.gov/biblio/1639961

Tuncer, O., Ates, E., Zhang, Y., Turk, A., Brandt, J.M., Leung, V.J., Egele, M., & Coskun, A.K. (2019). Online Diagnosis of Performance Variation in HPC Systems Using Machine Learning. IEEE Transactions on Parallel and Distributed Systems, 30(4), pp. 883-896. https://doi.org/10.1109/TPDS.2018.2870403

Izadpanah, R., Allan, B., Dechev, D., & Brandt, J.M. (2019). Production application performance data streaming for system monitoring. ACM Transactions on Modeling and Performance Evaluation of Computing Systems, 4(2). https://doi.org/10.1145/3319498

Kramer, B., Bauer, G., Bode, B., Showerman, M., Enos, J., Saxton, A., Jha, S., Kalbarczyk, Z., Iyer, R., Brandt, J.M., & Gentile, A.C. (2019). Holistic Measurement Driven System Assessment [Presentation]. https://www.osti.gov/biblio/1592279

Ahlgren, V., Andersson, S., Brandt, J.M., Cardo, N., Chunduri, S., Enos, J., Fields, P., Gentile, A.C., Gerber, R., Gienger, M., Greenseid, J., Greiner, A., Hadri, B., He, Y., Hoppe, D., Kaila, U., Kelly, K., Klein, M., Kristiansen, A., … Williams, J. (2018). Large-Scale System Monitoring Experiences and Recommendations [Conference Poster]. https://doi.org/10.1109/CLUSTER.2018.00069
