Publications
"Autotuned Parallel I/O for Highly Scalable Biosequence Analysis",
TeraGrid'11, Salt Lake City, Utah, 07/2011.
"Auto-tuning Full Applications: A Case Study",
International Journal of High Performance Computing Applications, vol. 25, no. 3, 2011.
"Can search algorithms save large-scale automatic performance tuning?",
Sixth International Workshop on Automatic Performance Tuning (iWAPT2011), Singapore, pp. 2136-2145, 06/2011.
"The Design of an Auto-tuning I/O Framework on Cray XT5 System",
Cray Users Group Conference (CUG'11) (Best Paper Finalist), Fairbanks, Alaska, May, 2011.
"Energy and performance characteristics of different parallel implementations of scientific applications on multicore systems",
International Journal of High Performance Computing Applications, vol. 25, no. 3, pp. 342-350, 2011.
"Extracting ultra-scale Lattice Boltzmann performance via hierarchical and distributed auto-tuning",
International Conference for High Performance Computing, Networking, Storage and Analysis (SC11), Seattle, Washington, ACM/IEEE, 11/2011.
"Fault Resilience of the Algebraic Multi-Grid Solver",
In Submission, 2011.
"Formal analysis of MPI-based parallel programs",
Communications of the ACM, vol. 54, issue 12, pp. 82-91, December, 2011.
"Fully empirical autotuned dense QR factorization for multicore architectures",
Euro-Par 2011, Bordeaux, France, August, 2011.
"Gyrokinetic Particle-in-Cell Optimization on Emerging Multi- and Manycore Platforms",
Parallel Computing, vol. 37, no. 9, pp. 501-520, September, 2011.
"Gyrokinetic Toroidal Simulations on Leading Multi- and Manycore HPC Systems",
(submitted to) Supercomputing, April, 2011.
"High performance dense linear system solver with soft error resilience",
IEEE Cluster 2011, Austin, Texas, 09/2011.
"Parallel Performance Measurement of Heterogeneous Parallel Systems with GPUs",
International Conference on Parallel Processing (ICPP 2011), Taipei, Taiwan, September, 2011.
"Performance portability of a GPU enabled factorization with the DAGuE framework",
IEEE Cluster Workshop on Parallel Programming on Accelerator Clusters (PPAC), Austin, Texas, September, 2011.
