Jradi, W., do Nascimento, H., & Martins, W. 2018 Oct 1. A Fast and Generic GPU-Based Parallel Reduction Implementation. Proceedings of the Symposium on High Performance Computing Systems (SSCAD). [Online] :