How do I achieve high performance using ScaLAPACK?
ScaLAPACK performance depends on an efficient low-level message-passing layer and a high-speed interconnection network for communication, and on an optimized BLAS library for local computation. For a detailed discussion of performance-related issues, see Chapter 5 of the ScaLAPACK Users’ Guide.
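
As a rough illustration of why both communication and local computation matter, the sketch below uses a simple latency/bandwidth cost model. It is not the model from the Users' Guide. The function name estimated_time, the workload figures, and the machine parameters are all hypothetical values chosen for illustration.

```python
def estimated_time(flops, messages, words, flop_time, latency, word_time):
    """Return (compute, communication, total) seconds for one process.

    flops      -- floating-point operations done locally (by the BLAS)
    messages   -- number of messages sent
    words      -- total number of words communicated
    flop_time  -- seconds per floating-point operation (BLAS quality)
    latency    -- seconds of startup cost per message (message-passing layer)
    word_time  -- seconds per word transferred (interconnect bandwidth)
    """
    compute = flops * flop_time
    communication = messages * latency + words * word_time
    return compute, communication, compute + communication


# Hypothetical per-process workload (assumed numbers, not measurements).
flops = 2.0e9
messages = 1.0e4
words = 5.0e6

# Same network, two assumed BLAS speeds: a slow reference BLAS
# and an optimized BLAS that is ten times faster per operation.
slow_blas = estimated_time(flops, messages, words,
                           flop_time=1e-9, latency=5e-5, word_time=1e-8)
fast_blas = estimated_time(flops, messages, words,
                           flop_time=1e-10, latency=5e-5, word_time=1e-8)

for label, (comp, comm, total) in (("slow BLAS", slow_blas),
                                   ("fast BLAS", fast_blas)):
    print(f"{label}: compute={comp:.3f}s  communication={comm:.3f}s  "
          f"total={total:.3f}s")
```

Under these assumed numbers, speeding up the BLAS leaves communication as the larger share of the run time, which is why the message-passing layer and the network need attention as well.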