Table of Contents

An Insider's Guide: Janus Optimization and Troubleshooting

Overview
Sigportals - Message Passing Failures
Sigportal Problems by Number
More Comments on Sigportals
Overview - the Debugger
What to do when large production jobs stop making progress and you'd like to figure out why (or gather enough information to beat on your code team, at least.)
Other Useful Debug Commands After Attaching
Debug’s ‘where’ Command Bug
Debug: Using Watchpoints
Debug: Using Watchpoints Example
Debug: Compiling and Running the Watchpoint Example
Overview - NQS
What NQS Topics We Can Cover
Grabnodes.ksh -- Your Personal Interactive Partition
Using NQS Warnlimit for an Orderly Shutdown before Time Expires
Example: Using NQS Warnlimit, Sample Program
Example: Using NQS Warnlimit, Compile Line and qsub Script
Example: Using NQS Warnlimit, Output and Results
Overview - OpenMP
OMP Introduction
OMP Usage/Compiling/Linking
OMP Refs. & Directives (http://www.openmp.org; PGI Users Guide: http://www.pgroup.com/ppro_docs/pgiws_ug/pgi30u.htm)
OMP C Pragmas
OMP Clauses
OMP Sample Program, Compile and Run
OMP Need-to-do’s!
Virtual Nodes
Overview - the Profiler
Profiler Introduction
Profiler Overview
Profiler Features: CPU time/HW counters
Profiler Features: Memory usage/Env. variables
Profiler Features: More Env. Variables
Profiler Features: Call Tree/Ancestor Tree/Load Bal.
Profiler: Compiling/Running/Getting Results
Profiler Example: exam1.c - the basics on 1 node
Profiler Example: exam1.c - Profile output
Profiler Example: exam1.c - Call Tree
Profiler Example: exam1.c - Running on 2 nodes
Profiler Example: exam1.c - Call Tree for 2 nodes
Profiler Example: exam2.c - Using the HW Counters to Get Flops by Routine
Profiler Example: exam2.c - Display the flops
Profiler & Load Balance: Sample Code
First Look with Profiler, Basic and PROFILE_GSYNC_ROUTINE
Looking at wait1() & wait2()
Interpreting the Output
Profiler Notes - 2nd CPU and Useful HW Counters
Manual Profiling - genperf and perfrep
Overview - CTH Profiler Example
Another Profile Example -- CTH
Taking the next step… More OMP
CTH w/ “full” OMP utilization
Overview - IO
Cliff Notes Optimizing IO
Rules for best IO performance
IO Bottleneck - fyods
Rules for Stream IO and Shared File IO
File systems - NFS (slowest) & UFS (still slow)
File systems: PFS file system
C and Fortran IO APIs
IO Results Impacted by Many Factors
IO Results - C Streams
IO Results - C write/read/cwrite/cread
IO Results - C Shared File - Without ‘yod -masync’
IO Results: C Shared File with ‘yod -masync’ and Best API
IO Results: F77 read/write
IO Results: F90 using Async IO
Sample programs using sync IO and async IO
Overview - Math Libs
ASCI Red Math Libraries
What Math Libraries Exist
LIBCSMATH
Libcsmath R2.8 enhancements
LIBWC
LAPACK, PBLAS, ScaLAPACK
A new Optimization Tool
Example

Authors: Ben Cole, Greg Henry and Pat Fay
Email: pfay@lanl.gov