Cited By
View all- Hanindhito BJohn LBalsamo SKnottenbelt WAbad CShang W(2024)Accelerating ML Workloads using GPU Tensor Cores: The Good, the Bad, and the UglyProceedings of the 15th ACM/SPEC International Conference on Performance Engineering10.1145/3629526.3653835(178-189)Online publication date: 7-May-2024
- Lu YLiu WMohror KArnold DBadia R(2023)DASP: Specific Dense Matrix Multiply-Accumulate Units Accelerated General Sparse Matrix-Vector MultiplicationProceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis10.1145/3581784.3607051(1-14)Online publication date: 12-Nov-2023
- Zouzias AMcColl W(2023)A Parallel Scan Algorithm in the Tensor Core Unit ModelEuro-Par 2023: Parallel Processing10.1007/978-3-031-39698-4_33(489-502)Online publication date: 28-Aug-2023
- Show More Cited By