Publications
Export 12 results:
Filters: Author is S. Williams [Clear All Filters]
Auto-tuning Performance on Multicore Computers,
, Berkeley, CA, EECS Department, University of California, Berkeley, December, 2008.
"A Generalized Framework for Auto-tuning Stencil Computations",
In Proceedings of the Cray User Group Conference, 2009, Atlanta, GA, May, 2009.
"Gyrokinetic Particle-in-Cell Optimization on Emerging Multi- and Manycore Platforms",
Parallel Computing, vol. 37, no. 9, pp. 501-520, sept, 2011.
"Lattice Boltzmann Simulation Optimization on Leading Multicore Platforms}",
Interational Conference on Parallel and Distributed Computing Systems (IPDPS), Miami, Florida, pp. 1-14, 2008.
"Lattice Boltzmann Simulation Optimization on Leading Multicore Platforms",
Journal of Parallel and Distributed Computing, vol. 69, no. 9, pp. 762-777, 2009.
"Memory-efficient Optimization of Gyrokinetic Particle-to-Grid Interpolation for Multicore Processors",
Proc.\ ACM/IEEE Conf.\ on Supercomputing (SC 2009): The Parallel Computing Laboratory, pp. 48:1–48:12, 2009.
"Optimizing Sparse Matrix-Multiple Vector Multiplication for Nuclear Configuration Interaction Calculations",
International Parallel and Distributed Processing Symposium (IPDPS 2014), 05/2014.
"Resource-Efficient, Hierarchical Auto-Tuning of a Hybrid Lattice Boltzmann Computation on the Cray XT4",
Proc. CUG09: Cray User Group meeting: The Parallel Computing Laboratory, 2009.