"Multi-Threaded Library for Many-Core Systems", Workshop on Multithreaded Architectures and Applications, Rome, Italy, IEEE, 2009.
"OpenMP and MPI Application Energy Measurement Variation", 1st International Workshop on Energy Efficient SuperComputing (E2SC), Denver, CO, 11/2013.
"Optimizing Sparse Matrix-Multiple Vector Multiplication for Nuclear Configuration Interaction Calculations", International Parallel and Distributed Processing Symposium (IPDPS 2014), 05/2014.
"Parallel implementation and performance optimization of the configuration-interaction method", SC'15, Austin, TX, 11/2015.
"Parallel Performance Measurement of Heterogeneous Parallel Systems with GPUs", International Conference on Parallel Processing (ICPP 2011), Taipei, Taiwan, sep, 2011.
"Parallel Reduction to Hessenberg Form with Algorithm-Based Fault Tolerance", SC'13, Denver, Colorado, November, 2013.
"Particle advection performance over varied architectures and workloads", High Performance Computing (HiPC), 2014 21st International Conference on: IEEE, 2014.
"Performance Analysis of Applications in the Context of Architectural Rooflines", Proceedings of the 8th ACM/SPEC on International Conference on Performance Engineering, New York, NY, USA, ACM, 2017.
"A Performance Analysis of SIMD Algorithms for Monte Carlo Simulations of Nuclear Reactor Cores", Parallel and Distributed Processing Symposium (IPDPS), 2015 IEEE International: IEEE, 2015.
"Performance counter monitoring for the Blue Gene/Q architecture", ScicomP 2012, Toronto, Ontario, Canada, May, 2012.
"Performance portability of a GPU enabled factorization with the DAGuE framework", IEEE Cluster Workshop on Parallel Programming on Accelerator Clusters (PPAC), Austin, Texas, September, 2011.
"Performance Tuning of Fock Matrix and Two-Electron Integral Calculations for NWChem on Leading HPC Platforms", 4th International Workshop on Performance Modeling, Benchmarking, and Simulation of HPC Systems (PMBS13), Denver, Colorado, November, 2013.
"Performance Tuning of Fock Matrix and Two-Electron Integral Calculations for NWChem on Leading HPC Platforms", Performance Modeling, Benchmarking and Simulation of High Performance Computer Systems (PMBS13) held as part of SC13, 11/2013.
"Pinpointing and Exploiting Opportunities for Enhancing Data Reuse", Proceedings of the 2008 IEEE International Symposium on Performance Analysis of Systems and Software, Austin, TX, USA, apr, 2008.
"The PMaC Binary Instrumentation Library for PowerPC", Workshop on Binary Instrumentation and Applications, San Jose, CA, 2006.