Cited By
View all- De Gonzalo SHuang SGómez-Luna JHammond SMutlu OHwu WKandemir MJimborean AMoseley T(2019)Automatic generation of warp-level primitives and atomic instructions for fast and portable parallel reduction on GPUsProceedings of the 2019 IEEE/ACM International Symposium on Code Generation and Optimization10.5555/3314872.3314884(73-84)Online publication date: 16-Feb-2019
- Gonzalo SHuang SGomez-Luna JHammond SMutlu OHwu W(2019)Automatic Generation of Warp-Level Primitives and Atomic Instructions for Fast and Portable Parallel Reduction on GPUs2019 IEEE/ACM International Symposium on Code Generation and Optimization (CGO)10.1109/CGO.2019.8661187(73-84)Online publication date: Feb-2019
- Chang LGómez-Luna JEl Hajj IHuang SChen DHwu WBinder WCortellessa VKoziolek ASmirni EPoess M(2017)Collaborative Computing for Heterogeneous Integrated SystemsProceedings of the 8th ACM/SPEC on International Conference on Performance Engineering10.1145/3030207.3030244(385-388)Online publication date: 17-Apr-2017