Offloading support for OpenMP in Clang and LLVM SF Antao, A Bataev, AC Jacob, GT Bercea, AE Eichenberger, G Rokos, ... 2016 Third Workshop on the LLVM Compiler Infrastructure in HPC (LLVM-HPC), 1-11, 2016 | 91 | 2016 |
Integrating GPU support for OpenMP offloading directives into Clang C Bertolli, SF Antao, GT Bercea, AC Jacob, AE Eichenberger, T Chen, ... Proceedings of the Second Workshop on the LLVM Compiler Infrastructure in …, 2015 | 73 | 2015 |
A fast and scalable graph coloring algorithm for multi-core and many-core architectures G Rokos, G Gorman, PHJ Kelly Euro-Par 2015: Parallel Processing: 21st International Conference on …, 2015 | 41 | 2015 |
Performance analysis of OpenMP on a GPU using a CORAL proxy application GT Bercea, C Bertolli, SF Antao, AC Jacob, AE Eichenberger, T Chen, ... Proceedings of the 6th International Workshop on Performance Modeling …, 2015 | 38 | 2015 |
Hybrid OpenMP/MPI anisotropic mesh smoothing GJ Gorman, J Southern, PE Farrell, MD Piggott, G Rokos, PHJ Kelly Procedia Computer Science 9, 1513-1522, 2012 | 34 | 2012 |
Performance analysis and optimization of Clang's OpenMP 4.5 GPU support M Martineau, S McIntosh-Smith, C Bertolli, AC Jacob, SF Antao, ... 2016 7th International Workshop on Performance Modeling, Benchmarking and …, 2016 | 28 | 2016 |
Efficient fork-join on GPUs through warp specialization AC Jacob, AE Eichenberger, H Sung, SF Antao, GT Bercea, C Bertolli, ... 2017 IEEE 24th International Conference on High Performance Computing (HiPC …, 2017 | 21 | 2017 |
Thread-parallel anisotropic mesh adaptation GJ Gorman, G Rokos, J Southern, PHJ Kelly New challenges in grid generation and adaptivity for scientific computing …, 2015 | 16 | 2015 |
Thread parallelism for highly irregular computation in anisotropic mesh adaptation G Rokos, GJ Gorman, KE Jensen, PHJ Kelly arXiv preprint arXiv:1505.04694, 2015 | 13 | 2015 |
A thread-parallel algorithm for anisotropic mesh adaptation G Rokos, GJ Gorman, J Southern, PHJ Kelly arXiv preprint arXiv:1308.2480, 2013 | 10 | 2013 |
Pragmatic–parallel anisotropic adaptive mesh toolkit G Rokos, G Gorman Facing the Multicore-Challenge III: Aspects of New Paradigms and …, 2013 | 9 | 2013 |
Implementing implicit OpenMP data sharing on GPUs GT Bercea, C Bertolli, AC Jacob, A Eichenberger, A Bataev, G Rokos, ... Proceedings of the Fourth Workshop on the LLVM Compiler Infrastructure in …, 2017 | 6 | 2017 |
Accelerating Optimisation-Based Anisotropic Mesh Adaptation using nVIDIA’s CUDA Architecture G Rokos Msc thesis, Imperial College London, 2010 | 4 | 2010 |
Towards performance portable gpu programming with raja A Jacob, SF Antao, H Sung, AE Eichenberger, C Bertolli, GT Bercea, ... Workshop on Portability Among HPC Architectures for Scientific Applications, 2015 | 3 | 2015 |
Accelerating anisotropic mesh adaptivity on nVIDIA’s CUDA using texture interpolation G Rokos, G Gorman, PHJ Kelly Euro-Par 2011 Parallel Processing: 17th International Conference, Euro-Par …, 2011 | 3 | 2011 |
Solving the advection PDE on the Cell Broadband Engine G Rokos, G Peteinatos, G Kouveli, G Goumas, K Kourtis, N Koziris 2010 IEEE International Symposium on Parallel & Distributed Processing …, 2010 | 2 | 2010 |
Scalable multithreaded algorithms for mutable irregular data with application to anisotropic mesh adaptivity G Rokos Imperial College London, 2014 | 1 | 2014 |
An Interrupt-Driven Work-Sharing For-Loop Scheduler G Rokos, GJ Gorman, PHJ Kelly arXiv preprint arXiv:1505.04134, 2015 | | 2015 |
Palchaudhuri, Ayan 104 Panda, Dhabaleswar K.(DK) 84, 213, 62 Panyala, Ajay 23 Park, Yoonho 94 V Pascucci, K Komatsu, K Kothapalli, S Krishnamoorthy, S Kumar, SE Kurt, ... | | |
Offloading Support for OpenMP in Clang and LLVM C Bertolli, AEE Bercea, G Rokos, M Martineau, T Jin, G Ozen, Z Sura, ... | | |