Publications
"Modeling the Impact of Reduced Memory Bandwidth on HPC Applications",
Euro-Par 2014 Parallel Processing - 20th International Conference, Porto, Portugal, August 25-29, 2014. Proceedings, 2014.
"Multi-Objective Optimization of HPC Kernels for Performance, Power, and Energy",
4th International Workshop on Performance Modeling, Benchmarking, and Simulation of HPC Systems (PMBS13), Denver, Colorado, November 2013.
"Measuring energy and power with PAPI",
International Workshop on Power-Aware Systems and Architectures (PASA 2012), Pittsburgh, PA, September 10, 2012.
"Mechanisms and evaluation of cross-layer fault-tolerance for supercomputing",
41st International Conference on Parallel Processing (ICPP), Pittsburgh, PA, September 2012.
"Model-based, memory-centric performance and power optimization on NUMA multiprocessors",
IEEE International Symposium on Workload Characterization (IISWC), La Jolla, California, November 4-6, 2012.
"Modeling and predicting performance of high performance computing applications on hardware accelerators",
International Journal of High Performance Computing Applications (IJHPCA), vol. 27, pp. 89-108, 2012.
"Modeling Power and Energy Usage of HPC Kernels",
International Parallel and Distributed Processing Symposium Workshops & PhD Forum (IPDPSW), Shanghai, China, May 21-25, 2012. IEEE Computer Society, Washington, D.C., USA, 2012.
"Memory-efficient Optimization of Gyrokinetic Particle-to-Grid Interpolation for Multicore Processors",
Proc. ACM/IEEE Conf. on Supercomputing (SC 2009): The Parallel Computing Laboratory, pp. 48:1–48:12, 2009.
"Multi-Threaded Library for Many-Core Systems",
Workshop on Multithreaded Architectures and Applications, Rome, Italy, IEEE, 2009.
"Managing Locality in Grand Challenge Applications: A Case Study of the Gyrokinetic Toroidal Code",
Proc. of SciDAC 2008, J. of Physics: Conference Series, Institute of Physics Publishing, June 2008.