Accelerating linear system solutions using randomization techniques M Baboulin, J Dongarra, J Herrmann, S Tomov ACM Transactions on Mathematical Software (TOMS) 39 (2), 1-13, 2013 | 58 | 2013 |
Multilevel algorithms for acyclic partitioning of directed acyclic graphs J Herrmann, MY Özkaya, B Uçar, K Kaya, ÜV Çatalyürek SIAM Journal on Scientific Computing 41 (4), A2117-A2145, 2019 | 53 | 2019 |
A scalable clustering-based task scheduler for homogeneous processors using DAG partitioning MY Özkaya, A Benoit, B Uçar, J Herrmann, ÜV Çatalyürek 2019 IEEE International Parallel and Distributed Processing Symposium (IPDPS …, 2019 | 45 | 2019 |
Acyclic partitioning of large directed acyclic graphs J Herrmann, J Kho, B Uçar, K Kaya, ÜV Çatalyürek 2017 17th IEEE/ACM international symposium on cluster, cloud and grid …, 2017 | 38 | 2017 |
Optimal multistage algorithm for adjoint computation G Aupy, J Herrmann, P Hovland, Y Robert SIAM Journal on Scientific Computing 38 (3), C232-C255, 2016 | 37 | 2016 |
Optimal checkpointing for heterogeneous chains: how to train deep neural networks with limited memory J Herrmann, O Beaumont, L Eyraud-Dubois, J Hermann, A Joly, A Shilova arXiv preprint arXiv:1911.13214, 2019 | 30 | 2019 |
Bridging the gap between performance and bounds of Cholesky factorization on heterogeneous platforms E Agullo, O Beaumont, L Eyraud-Dubois, J Herrmann, S Kumar, ... 2015 IEEE International Parallel and Distributed Processing Symposium …, 2015 | 25 | 2015 |
Optimal memory-aware backpropagation of deep join networks O Beaumont, J Herrmann, G Pallez, A Shilova Philosophical Transactions of the Royal Society A 378 (2166), 20190049, 2020 | 22 | 2020 |
Periodicity in optimal hierarchical checkpointing schemes for adjoint computations G Aupy, J Herrmann Optimization Methods and Software 32 (3), 594-624, 2017 | 12 | 2017 |
Acyclic partitioning of large directed acyclic graphs. In 2017 17th IEEE/ACM international symposium on cluster, cloud and grid computing (CCGRID) J Herrmann, J Kho, B Uçar, K Kaya, ÜV Çatalyürek IEEE, 371-380, 2017 | 12 | 2017 |
Memory-aware list scheduling for hybrid platforms J Herrmann, L Marchal, Y Robert 2014 IEEE international parallel & distributed processing symposium …, 2014 | 12 | 2014 |
Mixing LU and QR factorization algorithms to design high-performance dense linear algebra solvers M Faverge, J Herrmann, J Langou, B Lowery, Y Robert, J Dongarra Journal of Parallel and Distributed Computing 85, 32-46, 2015 | 10 | 2015 |
Designing LU-QR hybrid solvers for performance and stability M Faverge, J Herrmann, J Langou, BR Lowery, Y Robert, J Dongarra 2014 IEEE 28th International Parallel and Distributed Processing Symposium …, 2014 | 10 | 2014 |
Assessing the cost of redistribution followed by a computational kernel: Complexity and performance results J Herrmann, G Bosilca, T Hérault, L Marchal, Y Robert, J Dongarra Parallel Computing 52, 22-41, 2016 | 9 | 2016 |
Task-based parallel programming for scalable algorithms: Application to matrix multiplication E Agullo, A Buttari, A Guermouche, J Herrmann, A Jego Inria Bordeaux-Sud-Ouest, 2022 | 7 | 2022 |
H-Revolve: A Framework for Adjoint Computation on Synchronous Hierarchical Platforms J Herrmann ACM Transactions on Mathematical Software (TOMS) 46 (2), 1-25, 2020 | 7* | 2020 |
Model and complexity results for tree traversals on hybrid platforms J Herrmann, L Marchal, Y Robert Euro-Par 2013 Parallel Processing: 19th International Conference, Aachen …, 2013 | 6 | 2013 |
Determining the optimal redistribution for a given data partition T Herault, J Herrmann, L Marchal, Y Robert 2014 IEEE 13th International Symposium on Parallel and Distributed Computing …, 2014 | 5* | 2014 |
Computing the expected makespan of task graphs in the presence of silent errors H Casanova, J Herrmann, Y Robert Parallel Computing 75, 41-60, 2018 | 4 | 2018 |
Memory-aware tree traversals with pre-assigned tasks J Herrmann, L Marchal, Y Robert Journal of Parallel and Distributed Computing 75, 53-66, 2015 | 4 | 2015 |