While many application tuning guides focus on understanding the performance of an application on a single node, a very scalable application will use multiple node to achieve very high performance.
OpenMP.org has posted the slides and audio from a day-long tutorial on Hybrid MPI and OpenMP Parallel Programming from SC13. OpenMP is the de facto standard for writing parallel applications for ...