Publications

Export 12 results:
Filters: Author is Samuel Williams  [Clear All Filters]
2015
Shan, H., S. Williams, W. De Jong, and L. Oliker, "Thread-level parallelization and optimization of NWChem for the Intel MIC architecture", Proceedings of the Sixth International Workshop on Programming Models and Applications for Multicores and Manycores, PMAM@PPoPP 2015, San Francisco, CA, USA, February 7-8, 2015, 2015.
2012
Madduri, K., J. Su, S. Williams, L. Oliker, S. Ethier, and K. A. Yelick, "Optimization of Parallel Particle-to-Grid Interpolation on Leading Multicore Platforms", IEEE Transactions on Parallel Distributed Systems, vol. 23, issue 10, pp. 1915 - 1922, October 2012.
2011
Williams, S., L. Oliker, J. Carter, and J. Shalf, "Extracting ultra-scale Lattice Boltzmann performance via hierarchical and distributed auto-tuning", International Conference for High Performance Computing, Networking, Storage and Analysis (SC11), Seattle, Washington, ACM/IEEE, 11/2011.
Madduri, K., K. Z. Ibrahim, S. Williams, E-J. Im, S. Ethier, J. Shalf, and L. Oliker, "Gyrokinetic Toroidal Simulations on Leading Mult- and Manycore HPC Systems", (submitted to) Supercomputing, April, 2011.
2009
Datta, K., S. Williams, V. Volkov, J. Carter, L. Oliker, J. Shalf, and K. Yelick, "Auto-tuning the 27-point stencil for multicore", In Proc. iWAPT2009: The Fourth International Workshop on Automatic Performance Tuning, Tokyo, Japan, The Parallel Computing Laboratory, 2009.