Schwaller, B., Brandt, J.M., & Leung, V.J. (2023). Towards Practical Machine Learning Frameworks for Performance Diagnostics in Supercomputers [Conference Paper]. https://www.osti.gov/biblio/2431456
Publications
Aksar, B., Sencan, E., Schwaller, B., Aaziz, O.R., Kulis, B., Coskun, A.K., Leung, V.J., & Brandt, J.M. (2022). ALBADross: Active Learning Based Anomaly Diagnosis for Production HPC Systems [Conference Proceeding]. 10.1109/CLUSTER51413.2022.00048
Leung, V.J. (2022). Concise ML Explanations [Presentation]. https://www.osti.gov/biblio/2002248
Aksar, B., Zhang, Y., Ates, E., Aaziz, O.R., Schwaller, B., Brandt, J.M., Leung, V.J., Egele, M., & Coskun, A.K. (2021). E2EWatch: End-to-end Anomaly Diagnosis Framework for Production HPC Systems [Conference Presentation]. 10.2172/1891960
Hart, W.E., & Leung, V.J. (2021). Strategies for Matching Process Models to Observational Data [Conference Presentation]. 10.2172/1876708
Aksar, B., Zhang, Y., Ates, E., Aaziz, O.R., Schwaller, B., Brandt, J.M., Leung, V.J., Egele, M., & Coskun, A.K. (2021). E2EWatch: End-to-end Anomaly Diagnosis Framework for Production HPC Systems [Conference Paper]. 10.1007/978-3-030-85665-6_5
Ates, E., Aksar, B., Leung, V.J., & Coskun, A.K. (2021). Counterfactual Explanations for Multivariate Time Series [Conference Proceeding]. 2021 International Conference on Applied Artificial Intelligence, ICAPAI 2021. 10.1109/ICAPAI49758.2021.9462056
Ates, E., Aksar, B., Coskun, A.K., & Leung, V.J. (2021). CoMTE: Counterfactual Explanations for Multivariate Time Series [Conference Presentation]. 10.2172/1866905
Aksar, B., Zhang, Y., Ates, E., Schwaller, B., Aaziz, O.R., Leung, V.J., Brandt, J.M., Egele, M., & Coskun, A.K. (2021). Proctor: A Semi-Supervised Performance Anomaly Diagnosis Framework for Production HPC Systems [Conference Proceeding]. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 10.1007/978-3-030-78713-4_11
Zhang, Y., Aksar, B., Aaziz, O.R., Schwaller, B., Brandt, J.M., Leung, V.J., Egele, M., & Coskun, A.K. (2021). Using Monitoring Data to Improve HPC Performance via Network-Data-Driven Allocation [Conference Proceeding]. 2021 IEEE High Performance Extreme Computing Conference, HPEC 2021. 10.1109/HPEC49654.2021.9622783
Zhang, Y., Aksar, B., Aaziz, O.R., Schwaller, B., Brandt, J.M., Leung, V.J., Egele, M., & Coskun, A.K. (2021). Using Monitoring Data to Improve HPC Performance via Network-Data-Driven Allocation [Conference Presentation]. 2021 IEEE High Performance Extreme Computing Conference, HPEC 2021. https://doi.org/10.2172/1888952
Ates, E., Aksar, B., Leung, V.J., & Coskun, A.K. (2020). Explainable Machine Learning Frameworks for Managing HPC Systems [Conference Presentation]. https://doi.org/10.2172/1829224
Ates, E., Aksar, B., Leung, V.J., & Coskun, A.K. (2020). Explainable Machine Learning Frameworks for Managing HPC Systems [Conference Paper]. https://www.osti.gov/biblio/1830968
Schwaller, B., Aksar, B., Aaziz, O.R., Ates, E., Brandt, J.M., Coskun, A., Egele, M., & Leung, V.J. (2019). A Machine Learning Approach to Understanding HPC Application Performance Variation [Conference Poster]. https://www.osti.gov/biblio/1642784
Aksar, B., Schwaller, B., Aaziz, O.R., Ates, E., Brandt, J.M., Coskun, A., Egele, M., & Leung, V.J. (2019). AD for Machine Learning Approach to Understanding HPC Application Performance Variation Poster [Conference Poster]. https://www.osti.gov/biblio/1642788
Ang, J.A., Barrett, R.F., Benner, R.E., Burke, D., Chan, C., Cook, J., Daley, C.S., Donofrio, D., Hammond, S., Hemmert, K.S., Hoekstra, R.J., Ibrahim, K., Kelly, S.M., le, H., Leung, V.J., Michelogiannakis, G., Resnick, D.R., Rodrigues, A., Shalf, J., … Voskuilen, G.R. (2019). Abstract Machine Models and Proxy Architectures for Exascale Computing. 10.2172/1561498
Vineyard, C.M., Green, S., Foulk, J.W., Younge, A.J., & Leung, V.J. (2019). Machine Learning for System Software - Can Computers Manage Computers? [Presentation]. https://www.osti.gov/biblio/1645723
Ates, E., Tuncer, O., Turk, A., Leung, V.J., Brandt, J.M., Egele, M., & Coskun, A.K. (2019). Taxonomist: Application Detection through Rich Monitoring Data [Presentation]. 10.1007/978-3-319-96983-1_7
Tuncer, O., Ates, E., Zhang, Y., Turk, A., Brandt, J.M., Leung, V.J., Egele, M., & Coskun, A.K. (2019). Online Diagnosis of Performance Variation in HPC Systems Using Machine Learning. IEEE Transactions on Parallel and Distributed Systems, 30(4), pp. 883-896. 10.1109/TPDS.2018.2870403
Link, H.E., Richter, S.N., Leung, V.J., Brost, R., Phillips, C.A., & Staid, A. (2019). Statistical models of dengue fever [Conference Poster]. Communications in Computer and Information Science. 10.1007/978-981-13-6661-1_14
Brost, R., Carrier, E.E., Carroll, M.J., Groth, K.M., Kegelmeyer, W.P., Leung, V.J., Link, H.E., Patterson, A.J., Phillips, C.A., Richter, S., Robinson, D.G., Staid, A., & Woodbridge, D.M.K. (2018). Adverse Event Prediction Using Graph-Augmented Temporal Analysis (Final Report). 10.2172/1530166
Link, H.E., Richter, S.N., Leung, V.J., Brost, R., Phillips, C.A., & Staid, A. (2018). Statistical Models of Dengue Fever [Conference Poster]. 10.1007/978-981-13-6661-1_14
Zhang, Y., Tuncer, O., Kaplan, F., Olcoz, K., Leung, V.J., & Coskun, A.K. (2018). Level-spread: A new job allocation policy for dragonfly networks [Conference Poster]. Proceedings - 2018 IEEE 32nd International Parallel and Distributed Processing Symposium, IPDPS 2018. 10.1109/IPDPS.2018.00121
Zhang, Y., Tuncer, O., Kaplan, F., Olcoz, K., Leung, V.J., & Coskun, A.K. (2018). Level-Spread: A New Job Allocation Policy for Dragonfly Networks [Conference Poster]. 10.1109/IPDPS.2018.00121
Brost, R., Leung, V.J., Link, H.E., Phillips, C.A., & Staid, A. (2018). Event Prediction Using Graph-Augmented Temporal Analysis [Conference Poster]. https://www.osti.gov/biblio/1498238