pith. sign in

arxiv: 1705.02843 · v1 · pith:SIZDM65Gnew · submitted 2017-05-08 · 💻 cs.AI · cs.DC

Block-Parallel IDA* for GPUs (Extended Manuscript)

classification 💻 cs.AI cs.DC
keywords block-parallelbpidaparallelizationaccessachievesassignsblockcompared
0
0 comments X
read the original abstract

We investigate GPU-based parallelization of Iterative-Deepening A* (IDA*). We show that straightforward thread-based parallelization techniques which were previously proposed for massively parallel SIMD processors perform poorly due to warp divergence and load imbalance. We propose Block-Parallel IDA* (BPIDA*), which assigns the search of a subtree to a block (a group of threads with access to fast shared memory) rather than a thread. On the 15-puzzle, BPIDA* on a NVIDIA GRID K520 with 1536 CUDA cores achieves a speedup of 4.98 compared to a highly optimized sequential IDA* implementation on a Xeon E5-2670 core.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.