Export 12 results:
Filters: Author is S. Williams [Clear All Filters]
Auto-tuning Performance on Multicore Computers, , Berkeley, CA, EECS Department, University of California, Berkeley, December, 2008.
"A Generalized Framework for Auto-tuning Stencil Computations", In Proceedings of the Cray User Group Conference, 2009, Atlanta, GA, May, 2009.
"Gyrokinetic Particle-in-Cell Optimization on Emerging Multi- and Manycore Platforms", Parallel Computing, vol. 37, no. 9, pp. 501-520, sept, 2011.
"Lattice Boltzmann Simulation Optimization on Leading Multicore Platforms}", Interational Conference on Parallel and Distributed Computing Systems (IPDPS), Miami, Florida, pp. 1-14, 2008.
"Lattice Boltzmann Simulation Optimization on Leading Multicore Platforms", Journal of Parallel and Distributed Computing, vol. 69, no. 9, pp. 762-777, 2009.
"Memory-efficient Optimization of Gyrokinetic Particle-to-Grid Interpolation for Multicore Processors", Proc.\ ACM/IEEE Conf.\ on Supercomputing (SC 2009): The Parallel Computing Laboratory, pp. 48:1–48:12, 2009.
"Optimizing Sparse Matrix-Multiple Vector Multiplication for Nuclear Configuration Interaction Calculations", International Parallel and Distributed Processing Symposium (IPDPS 2014), 05/2014.
"Resource-Efficient, Hierarchical Auto-Tuning of a Hybrid Lattice Boltzmann Computation on the Cray XT4", Proc. CUG09: Cray User Group meeting: The Parallel Computing Laboratory, 2009.