Can PETSc use GPUs to speedup computations?
PETSc-dev has some support for running portions of the computation on Nvidia GPUs. See installation.html for how to install PETSc to use GPUs. PETSc has a Vec class VECCUDA that performs almost all the vector operations on the GPU. The Mat class MATCUDA performs matrix-vector products on the GPU but does not have matrix assembly on the GPU yet. Both of these classes run in parallel with MPI. All KPS methods, except KSPIBCGS, run all their vector operations on the GPU thus, for example Jacobi preconditioned Krylov methods run completely on the GPU. Preconditioners are a problem, we could do with some help for these. The example src/snes/examples/tutorials/ex47cu.cu demonstates how the nonlinear function evaluation can be done on the GPU.
PETSc-dev has some support for running portions of the computation on Nvidia GPUs. See installation.html for how to install PETSc to use GPUs. PETSc has a Vec class VECCUSP that performs almost all the vector operations on the GPU. The Mat class MATCUSP performs matrix-vector products on the GPU but does not have matrix assembly on the GPU yet. Both of these classes run in parallel with MPI. All KPS methods, except KSPIBCGS, run all their vector operations on the GPU thus, for example Jacobi preconditioned Krylov methods run completely on the GPU. Preconditioners are a problem, we could do with some help for these. The example src/snes/examples/tutorials/ex47cu.cu demonstates how the nonlinear function evaluation can be done on the GPU.
PETSc-dev has some support for running portions of the computation on Nvidia GPUs. See installation.html for how to install PETSc to use GPUs. PETSc has a Vec class VECCUSP that performs almost all the vector operations on the GPU. The Mat class MATCUSP performs matrix-vector products on the GPU but does not have matrix assembly on the GPU yet. Both of these classes run in parallel with MPI. All KPS methods, except KSPIBCGS, run all their vector operations on the GPU thus, for example Jacobi preconditioned Krylov methods run completely on the GPU. Preconditioners are a problem, we could do with some help for these. The example src/snes/examples/tutorials/ex47cu.cu demonstates how the nonlinear function evaluation can be done on the GPU. Can I run PETSc with exended precision Yes, with gcc 4.6 and later (and gfortran 4.6 and later) ./configure PETSc using the options –with-precision=__float128 –download-qblaslapack. External packages cannot be used in this mode and some print statemen