A Massively Parallel Adaptive Fast Multipole Method on Heterogeneous Architectures
We describe a parallel fast multipole method for highly nonuniform distributions of particles. We employ both distributed memory parallelism and shared memory parallelism to rapidly evaluate two-body nonoscillatory potentials in three dimensions on heterogeneous high performance computing architectures.