Related papers: |
| "Automatic Generation of Warp-Level Primitives and Atomic Instructions for Fast and Portable Parallel Reduction on GPUs", Simon Garcia de Gonzalo, Sitao Huang, Juan Gómez-Luna, Simon D. Hammond, Onur Mutlu, Wen-mei Hwu, In the proceedings of the 2019 IEEE International Symposium on Code Generation and Optimization (CGO19). |
  |
| "Revisiting Online Autotuning for Sparse-Matrix Vector Multiplication Kernels on Next-Generation Architectures", Simon Garcia de Gonzalo, Simon D. Hammond, Christian R. Trott, Wen-mei Hwu, In the 19th IEEE International Conference on High Performance Computing and Communications (HPCC17). |
  |
| "Efficient Kernel Synthesis for Performance Portable Programming", Li-Wen Chang, Izzat El Hajj, Christopher I. Rodrigues, Juan Gómez-Luna, Wen-mei Hwu, Proceedings of the 49th Annual IEEE/ACM International Symposium on Microarchitecture, 2016. |
  |
| "DySel: Lightweight Dynamic Selection for Kernel-based Data-parallel Programming Model", Li-Wen Chang, Hee-Seok Kim, Wen-mei Hwu, Proceedings of the 21st ACM International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS '16). |
  |
| "A Programming System for Future Proofing Performance Critical Libraries", Li-Wen Chang, Izzat El Hajj, Hee-Seok Kim, Juan Gómez-Luna, Abdul Dakkak, Wen-mei Hwu, Proceedings of the 21st ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPoPP 2016) [poster]. |
  |
| "Transitioning HPC software to exascale heterogeneous computing", Wen-mei Hwu, Li-Wen Chang, Hee-Seok Kim, Abdul Dakkak, Izzat El Hajj, Computational Electromagnetics International Workshop (CEM), 2015. |
  |
| "Tangram: a High-level Language for Performance Portable Code Synthesis", Li-Wen Chang, Abdul Dakkak, Christopher I. Rodrigues, Wen-mei Hwu, the Eighth Workshop on Programmability Issues for Heterogeneous Multicores (MULTIPROG-2015). |
  |