"Enabling Fair Pricing on HPC Systems with Node haring", SC'13, Denver, Colorado, November, 2013.
"Soft Error Vulnerability of Iterative Linear Algebra Methods", Proceedings of the 22nd annual International Conference on Supercomputing, New York, NY, pp. 155-164, 2008.
"Automatic fault characterization via abnormality-enhanced classification", Dependable Systems and Networks (DSN- W ) , 25-28 June 2012, Boston , Massachusetts - USA, IEEE, 2012.
"AutomaDeD: Automata-Based Debugging for Dissimilar Parallel Tasks", 2010 IEEE/IFIP International Conference on Dependable Systems and Networks (DSN), Chicago, IL, pp. 231 -240, 2010.
"Design for a Soft Error Resilient Dynamic Task-based Runtime", 29th IEEE International Parallel & Distributed Processing Symposium (IPDPS), Hyderabad, India, IEEE, 05/2015.
"A Performance Prediction Framework for Scientific Applications", Future Generation Computer Systems, vol. 22, no. 3, pp. 336-346, february" note =, 2006.
"Toward Multi-target Autotuning for Accelerators", The 20th IEEE International Conference on Parallel and Distributed Systems (ICPADS 2014), Hsinchu, Taiwan, IEEE Computer Society Press, 12/2014.
"Integration and Synthess for Automated Performance Tuning: the SYNAPT Project", The International Workshop on Automatic Performance Tuning, Eugene, OR, USA, 07/2014.
"Tools for machine-learning-based empirical autotuning and specialization", International Journal of High Performance Computing Applications, vol. 27, issue 4, pp. 403-411, November, 2013.
"Towards fully automatic auto-tuning: Leveraging language features of Chapel", International Journal of High Performance Computing Applications, vol. 27, issue 4, pp. 394-402, November, 2013.
"Algorithm-Based Checkpoint-Free Fault Tolerance for Parallel Matrix Computations on Volatile Resources", 20th IEEE International Parallel and Distributed Processing Symposium (IPDPS 2006), Rhodes Island, Greece, 2006.
"Polyhedra scanning revisited", ACM SIGPLAN conference on Programming Language Design and Implementation (PLDI), pp. 499-508, 06/2012.
"Algorithm-Based Fault Tolerance for Fail-Stop Failures", IEEE Transactions on Parallel and Distributed Systems, vol. 19, no. 12, 2008.
"ANGEL: A Hierarchical Approach to Multi-Objective Online Auto-Tuning", Proceedings of the 5th International Workshop on Runtime and Operating Systems for Supercomputers, New York, NY, USA, ACM, 2015.
"Efficient Search for Inputs Causing High Floating-point Errors", 19th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPoPP'14), Orlando, Florida, February, 2014.