Publications

Export 12 results:
Conference Paper
Agullo, E., J. Dongarra, R. Nath, and S. Tomov, "Fully empirical autotuned dense QR factorization for multicore architectures", EuroPar-2011, Bordeaux, France, August, 2011.
Malony, A. D., and K. A. Huck, "General Hybrid Parallel Profiling", Parallel, Distributed, and Network-Based Processing (PDP 2014), Turin, Italy, 02/2014.
Kamil, S., C. Chan, S. Williams, L. Oliker, J. Shalf, M. Howison, W. E. Bethel, and Prabhat, "A Generalized Framework for Auto-tuning Stencil Computations", In Proceedings of the Cray User Group Conference, 2009, Atlanta, GA, May, 2009.
Narayanan, S H K., B. Norris, and P. D. Hovland, "Generating Performance Bounds from Source Code", Proceedings of the First International Workshop on Parallel Software Tools and Tool Infrastructures (PSTI 2010), sep, 2010.
Tikir, M., L. Carrington, A. Snavely, and E. Strohmaier, "Genetic Algorithm Approach to Modeling the Performance of Memory-bound Codes", Supercomputing, 2007. The Proceeding of the ACM/IEEE Conference on High Performance Networking and Computing, Reno, NV, november, 2007.
Tiwari, A., M. Laurenzano, J. Peraza, L. Carrington, and A. Snavely, "Green Queue: Customized Large-Scale Clock Frequency Scaling", Cloud and Green Computing (CGC) , 1-3 November , 2012, Xiangtan, Hunan, China, IEEE, 2012.
Madduri, K., K. Z. Ibrahim, S. Williams, E-J. Im, S. Ethier, J. Shalf, and L. Oliker, "Gyrokinetic Toroidal Simulations on Leading Mult- and Manycore HPC Systems", (submitted to) Supercomputing, April, 2011.
Ozog, D., A. Kamil, Y. Zheng, P. Hargrove, J. R. Hammond, A. Malony, W. de Jong, and K. Yelick, "A Hartree-Fock Application Using UPC++ and the New DArray Library", Parallel and Distributed Processing Symposium, 2016 IEEE International: IEEE, 2016.
Du, P., P. Luszczek, and J. Dongarra, "High performance dense linear system solver with soft error resilience", IEEE Cluster 2011, Austin, Texas, 09/2011.
Li, D., B. R. de Supinski, M. Schulz, K. Cameron, and D. S. Nikolopoulos, "Hybrid MPI/OpenMP Power-Aware Computing", Parallel Distributed Processing (IPDPS), 2010 IEEE International Symposium on, Atlanta, GA, pp. 1 -12, april, 2010.
Nataraj, A., A. Malony, A. Morris, D. Arnold, and B. Miller, "In Search of Sweet-Spots in Parallel Performance Monitoring", IEEE International Conference on Cluster Computing (Cluster 2008): IEEE, pp. 69-78, 2008.
Ramalingam, S., M. W. Hall, and C. Chen, "Improving High-Performance Sparse Libraries Using Compiler-Assisted Specialization: A PETSc Case Study", Parallel and Distributed Processing Symposium Workshops and PhD Forum (IPDPSW) , 21-25 May 2012, Shanghai, China, IEEE Computing Society, Washington D.C , USA , 2012.
Behzad, B., S. Byna, S. M. Wild, Prabhat, and M. Snir, "Improving parallel I/O autotuning with performance modeling", The 23rd International Symposium on High-Performance Parallel and Distributed Computing, HPDC'14, Vancouver, BC, Canada - June 23 - 27, 2014, 2014.
Huck, K. A., A. D. Malony, S. Shende, and D. W. Jacobsen, "Integrated Measurement for Cross-Platform OpenMP Performance Analysis", International Workshop on OpenMP, Salvador, Bahia, Brazil, Springer, 09/2014.
Long, R., S. Moore, and B. Rountree, "Iso-power-efficiency: an approach to scaling application codes with a power budget", Eleventh Workshop on High-Performance Power-Aware Computing (HPPAC), Hyderabad, India, 2015.

Pages