Publications

Export 12 results:
Filters: Author is Thomas Hérault  [Clear All Filters]
2015
Bouteiller, A., T. Hérault, G. Bosilca, P. Du, and J. Dongarra, "Algorithm-based Fault Tolerance for Dense Matrix Factorizations, Multiple Failures, and Accuracy", ACM Transactions on Parallel Computing, vol. 1, pp. 10:1-10:28, 02/2015.
Bosilca, G., A. Bouteiller, T. Hérault, Y. Robert, and J. Dongarra, "Composing Resilience Techniques: ABFT, Periodic, and Incremental Checkpointing", International Journal of Networking and Computing, vol. 5, pp. 2-15, 01/2015.
Cao, C., G. Bosilca, T. Hérault, and J. Dongarra, "Design for a Soft Error Resilient Dynamic Task-based Runtime", 29th IEEE International Parallel & Distributed Processing Symposium (IPDPS), Hyderabad, India, IEEE, 05/2015.
2013
Bouteiller, A., T. Hérault, G. Bosilca, and J. J. Dongarra, "Correlated set coordination in fault tolerant message logging protocols for many-core clusters", Concurrency and Computation: Practice and Experience, vol. 25, issue 4, pp. 572-585, March, 2013.
2012
Bland, W., P. Du, A. Bouteiller, T. Hérault, G. Bosilca, and J. Dongarra, "A Checkpoint-on-Failure Protocol for Algorithm-Based Recovery in Standard MPI", Euro -Par International Conference on parallel processing ,27-31 August 2012, Rhodes Island, Greece, Springer Berlin Heidelberg, 2012.
Bosilca, G., A. Bouteiller, A. Danalis, T. Hérault, and J. Dongarra, "From Serial Loops to Parallel Execution on Distributed Systems", Euro -Par International Conference on parallel processing ,27-31 August 2012, Rhodes Island, Greece, Springer Berlin Heidelberg, 2012.