Adaptive precision in block‐Jacobi preconditioning for iterative sparse linear system solvers H Anzt, J Dongarra, G Flegar, NJ Higham, ES Quintana‐Ortí Concurrency and Computation: Practice and Experience 31 (6), e4460, 2019 | 91 | 2019 |
Ginkgo: A Modern Linear Operator Algebra Framework for High Performance Computing H Anzt, T Cojean, G Flegar, F Göbel, T Grützmacher, P Nayak, T Ribizel, ... ACM Transactions on Mathematical Software (TOMS) 48 (1), 1-33, 2022 | 85 | 2022 |
Load-balancing sparse matrix vector product kernels on gpus H Anzt, T Cojean, C Yen-Chen, J Dongarra, G Flegar, P Nayak, S Tomov, ... ACM Transactions on Parallel Computing (TOPC) 7 (1), 1-26, 2020 | 55 | 2020 |
Adaptive precision block-Jacobi for high performance preconditioning in the Ginkgo linear algebra software G Flegar, H Anzt, T Cojean, ES Quintana-Orti ACM Transactions on Mathematical Software (TOMS) 47 (2), 1-28, 2021 | 36 | 2021 |
Ginkgo: A high performance numerical linear algebra library H Anzt, T Cojean, YC Chen, G Flegar, F Göbel, T Grützmacher, P Nayak, ... Journal of Open Source Software 5 (52), 2260, 2020 | 31 | 2020 |
FloatX: A C++ Library for Customized Floating-Point Arithmetic G Flegar, F Scheidegger, V Novaković, G Mariani, AE Tomás, ACI Malossi, ... ACM Transactions on Mathematical Software (TOMS) 45 (4), 1-23, 2019 | 31 | 2019 |
Batched Gauss-Jordan elimination for block-Jacobi preconditioner generation on GPUs H Anzt, J Dongarra, G Flegar, ES Quintana-Ortí Proceedings of the 8th International Workshop on Programming Models and …, 2017 | 30 | 2017 |
Toward a modular precision ecosystem for high-performance computing H Anzt, G Flegar, T Grützmacher, ES Quintana-Ortí The International Journal of High Performance Computing Applications 33 (6 …, 2019 | 28 | 2019 |
Variable-size batched Gauss–Jordan elimination for block-Jacobi preconditioning on graphics processors H Anzt, J Dongarra, G Flegar, ES Quintana-Ortí Parallel Computing 81, 131-146, 2019 | 25 | 2019 |
Balanced CSR sparse matrix-vector product on graphics processors G Flegar, ES Quintana-Ortí Euro-Par 2017: Parallel Processing: 23rd International Conference on …, 2017 | 24 | 2017 |
Overcoming load imbalance for irregular sparse matrices G Flegar, H Anzt Proceedings of the Seventh Workshop on Irregular Applications: Architectures …, 2017 | 23 | 2017 |
ParILUT-a parallel threshold ILU for GPUs H Anzt, T Ribizel, G Flegar, E Chow, J Dongarra 2019 IEEE International Parallel and Distributed Processing Symposium (IPDPS …, 2019 | 20 | 2019 |
Towards continuous benchmarking: An automated performance evaluation framework for high performance software H Anzt, YC Chen, T Cojean, J Dongarra, G Flegar, P Nayak, ... Proceedings of the platform for advanced scientific computing conference, 1-11, 2019 | 18 | 2019 |
A customized precision format based on mantissa segmentation for accelerating sparse linear algebra T Grützmacher, T Cojean, G Flegar, F Göbel, H Anzt Concurrency and Computation: Practice and Experience 32 (15), e5418, 2020 | 17 | 2020 |
Acceleration of PageRank with customized precision based on mantissa segmentation T Grützmacher, T Cojean, G Flegar, H Anzt, ES Quintana-Ortí ACM Transactions on Parallel Computing (TOPC) 7 (1), 1-19, 2020 | 16 | 2020 |
Variable-size batched LU for small matrices and its integration into block-Jacobi preconditioning H Anzt, J Dongarra, G Flegar, ES Quintana-Ortí 2017 46th International Conference on Parallel Processing (ICPP), 91-100, 2017 | 14 | 2017 |
Crediting pull requests to open source research software as an academic contribution H Anzt, E Kuehn, G Flegar Journal of Computational Science 49, 101278, 2021 | 9 | 2021 |
Customized-precision Block-Jacobi preconditioning for Krylov iterative solvers on data-parallel manycore processors G Flegar, H Anzt, T Cojean, ES Quintana-Ortı ACM Trans. Math. Softw, 2020 | 8 | 2020 |
Flexible batched sparse matrix-vector product on gpus H Anzt, G Collins, J Dongarra, G Flegar, ES Quintana-Ortí Proceedings of the 8th Workshop on Latest Advances in Scalable Algorithms …, 2017 | 8 | 2017 |
Communication in task‐parallel ILU‐preconditioned CG solvers using MPI+ OmpSs JI Aliaga, M Barreda, G Flegar, M Bollhöfer, ES Quintana‐Ortí Concurrency and Computation: Practice and Experience 29 (21), e4280, 2017 | 8 | 2017 |