Publications

Export 12 results:
Filters: First Letter Of Last Name is S  [Clear All Filters]
A B C D E F G H I J K L M N O P Q R S T U V W X Y Z 
S
Su, CY., D. Li, D. S. Nikolopoulos, K. W. Cameron, B. R. de Supinski, and E. A. Leon, "Model-based, memory-centric performance and power optimization on NUMA multiprocessors", IEEE International Symposium on Workload Characterization (IISWC) , 4-6 Novemeber ,2012 , La Jolla . California, 2012.
Sreepathi, S., V. Sripathi, R. Mills, G. Hammond, and G.. Mahinthakumar, "SCORPIO: A Scalable Two-Phase Parallel I/O Library With Application to a Large Scale Subsurface Simulator", IEEE Conference on High Performance Computing (HiPC), Bengaluru, India, 12/2013.
Sottile, M. J., C. Edward Rasmussen, W. N. Weseloh, R. W. Robey, D. J. Quinlan, and J. Overbey, "Transformations exploiting array syntax in Fortran for accelerator programming", International Journal of Computational Science and Engineering (IJCSE), vol. 8, pp. 47-57, February,2013.
Song, S., and J. K. Hollingsworth, "Designing and Auto-Tuning Parallel 3-D FFT for Computation-Communication Overlap", 19th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, Orlando, Florida, February, 2014.
Shende, S., and A. Malony, "The TAU Parallel Performance System", International Journal of High Performance Computing Applications, vol. 20, no. 2, Summer, pp. 287–311, 2006.
Shan, H., S. Williams, W. De Jong, and L. Oliker, "Thread-level parallelization and optimization of NWChem for the Intel MIC architecture", Proceedings of the Sixth International Workshop on Programming Models and Applications for Multicores and Manycores, PMAM@PPoPP 2015, San Francisco, CA, USA, February 7-8, 2015, 2015.
Shan, H.., W.. Jong, L.. Oliker, N.. Wright, and B.. Austin, "Performance Tuning of Fock Matrix and Two-Electron Integral Calculations for NWChem on Leading HPC Platforms", 4th International Workshop on Performance Modeling, Benchmarking, and Simulation of HPC Systems (PMBS13), Denver, Colorado, November, 2013.
Shan, H., K. McElvain, C. Johnson, S. Williams, and E. W. Ormand, "Parallel implementation and performance optimization of the configuration-interaction method", SC'15, Austin, TX, 11/2015.
Shan, H., B. Austin, W. De Jong, L. Oliker, N. Wright, and E. Apra, "Performance Tuning of Fock Matrix and Two-Electron Integral Calculations for NWChem on Leading HPC Platforms", Performance Modeling, Benchmarking and Simulation of High Performance Computer Systems (PMBS13) held as part of SC13, 11/2013.
Seymour, K., H. You, and J. Dongarra, "A Comparison of Search Heuristics for Empirical Code Optimization", The 3rd international Workshop on Automatic Performance Tuning, Tsukuba, Japan, IEEE, pp. 421-429, 2008.
Sato, K., A. Moody, K. Mohror, T. Gamblin, B. R. de Supinski, N. Maruyama, and S. Matsuoka, "FMI: Fault Tolerant Messaging Interface for Fast and Transparent Recovery", 28th IEEE International Parallel & Distributed Processing Symposium (IPDPS 2014), 05/2014.
Sato, K., A. Moody, K. Mohror, T. Gamblin, B. R. de Supinski, N. Maruyama, and S. Matsuoka, "A User-level Infiniband-based File System and Checkpoint Strategy for Burst Buffers", 14th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid 2014), 05/2014.
Sarje, A., S. Song, D. Jacobsen, K. Huck, J. Hollingsworth, A. Malony, S. Williams, and L. Oliker, "Parallel performance optimizations on unstructured mesh-based simulations", Procedia Computer Science, vol. 51, pp. 2016–2025, 2015.