Publications

Export 12 results:
A B C D E F G H I J K L M N O P Q R S T U V W X Y Z 
Y
You, H., Q. Liu, Z. Li, and S. Moore, "The Design of an Auto-tuning I/O Framework on Cray XT5 System", Cray Users Group Conference (CUG'11) (Best Paper Finalist), Fairbanks, Alaska, may, 2011.
You, H., B. Rekapalli, Q. Liu, and S. Moore, "Autotuned Parallel I/O for Highly Scalable Biosequence Analysis", TeraGrid'11, Salt Lake City, Utah, 07/2011.
W
Williams, S., J. Carter, L. Oliker, J. Shalf, and K. Yelick, "Lattice Boltzmann Simulation Optimization on Leading Multicore Platforms}", Interational Conference on Parallel and Distributed Computing Systems (IPDPS), Miami, Florida, pp. 1-14, 2008.
Williams, S W., A. Waterman, and D. A. Patterson, Roofline: An Insightful Visual Performance Model for Floating-Point Programs and Multicore Architectures, , no. UCB/EECS-2008-134: EECS Department, University of California, Berkeley, Oct, 2008.
Williams, S., Auto-tuning Performance on Multicore Computers, , Berkeley, CA, EECS Department, University of California, Berkeley, December, 2008.
Williams, S., J. Carter, L. Oliker, J. Shalf, and K. Yelick, "Resource-Efficient, Hierarchical Auto-Tuning of a Hybrid Lattice Boltzmann Computation on the Cray XT4", Proc. CUG09: Cray User Group meeting: The Parallel Computing Laboratory, 2009.
Williams, S., L. Oliker, J. Carter, and J. Shalf, "Extracting ultra-scale Lattice Boltzmann performance via hierarchical and distributed auto-tuning", International Conference for High Performance Computing, Networking, Storage and Analysis (SC11), Seattle, Washington, ACM/IEEE, 11/2011.
Williams, S., J. Carter, L. Oliker, J. Shalf, and K. Yelick, "Lattice Boltzmann Simulation Optimization on Leading Multicore Platforms", Journal of Parallel and Distributed Computing, vol. 69, no. 9, pp. 762-777, 2009.
Weaver, V., M. Johnson, K. Kasichayanula, J. Ralph, P. Luszczek, D. Terpstra, and S. Moore, "Measuring energy and power with PAPI", International Workshop on Power-Aware Systems and Architectures (PASA 2012), Pittsburgh, PA, September 10, 2012.
Wang, B., S. Ethier, W. Thang, T. Williams, K. Ibrahim, K. Madduri, S. Wililiams, and L. Oliker, "Kinetic Turbulence Simulations at Extreme Scale on Leadership-Class Systems", SC'13, Denver, Colorado, November, 2013.
V
Vo, A., S. Aananthakrishnan, G. Gopalakrishnan, B. R. de Supinski, M. Schulz, and G. Bronevetsky, "A Scalable and Distributed Dynamic Formal Verifier for MPI Programs", Proceedings of the 2010 ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis SC '10: IEEE Computer Society Washington, DC, pp. 1-10, nov, 2010.
Vetter, J., A. Malony, W. - M. Hwui, and R. Vuduc, Vancouver: Designing a Next-Generation Software Infrastructure for Productive Heterogeneous Exascale Computing, , 2011.
Vetter, J.., S.. Lee, D.. Li, G.. Marin, C.. McCurdy, J.. Meredith, P.. Roth, and K.. Spafford, "Quantifying Architectural Requirements of Contemporary Extreme-Scale Scientific Applications", 4th International Workshop on Performance Modeling, Benchmarking and Simulation of HPC Systems (PMBS13), Denver, Colorado, November, 2013.
Venkat, A., M. W. Hall, and M. Strout, "Loop and data transformations for sparse matrix code", Proceedings of the 36th {ACM} {SIGPLAN} Conference on Programming Language Design and Implementation, Portland, OR, USA, June 15-17, 2015, 2015.
Venkat, A., M. Shantharam, M. Hall, and M. Strout, "Non-affine Extensions to Polyhedral Code Generation", IEEE/ACM International Symposium on Code Generation and Optimization (CGO): ACM, pp. 185, 02/2014.

Pages