Publications Details

Publications / Conference Paper

User-Centric System Fault Identification Using IO500 Benchmark

Liem, Radita; Povaliaiev, Dmytro; Lofstead, Gerald F.; Kunkel, Julian; Terboven, Christian

I/O performance in a multi-user environment is difficult to predict. Users do not know what I/O performance to expect when running and tuning applications. We propose to use the IO500 benchmark as a way to guide user expectations on their application's performance and to aid identifying root causes of their I/O problems that might come from the system. Our experiments describe how we manage user expectation with IO500 and provide a mechanism for system fault identification. This work also provides us with information of the tail latency problem that needs to be addressed and granular information about the impact of I/O technique choices (POSIX and MPI-IO).