default search action
SC 2009: Portland, Oregon, USA
- Proceedings of the ACM/IEEE Conference on High Performance Computing, SC 2009, November 14-20, 2009, Portland, Oregon, USA. ACM 2009, ISBN 978-1-60558-744-8
Plenary Speakers
- Justin R. Rattner:
Opening address: The rise of the 3D internet - advancements in collaborative and immersive sciences. - Lee Hood:
Systems medicine, transformational technologies and the emergence of predictive, personalized, preventive and participatory (P4) medicine. - Francine Berman:
Kennedy award: Laying the groundwork for success in the information age. - Kenichi Miura:
Cray award: My adventures in parallel computing. - Roberto Car, Michele Parrinello:
Fernbach award.
Technical papers
- Anshul Gupta, Seid Koric, Thomas George:
Sparse matrix factorization on massively parallel computers. - George Michelogiannakis, William J. Dally:
Router designs for elastic buffer on-chip networks. - Edgar A. León, Rolf Riesen, Arthur B. Maccabe, Patrick G. Bridges:
Instruction-level simulation of a cluster at scale. - Tom Peterka, David Goodell, Robert B. Ross, Han-Wei Shen, Rajeev Thakur:
A configurable algorithm for parallel image-compositing applications. - Cy P. Chan, Jason Ansel, Yee Lok Wong, Saman P. Amarasinghe, Alan Edelman:
Autotuning multigrid with PetaBricks. - Shekhar Srikantaiah, Reetuparna Das, Asit K. Mishra, Chita R. Das, Mahmut T. Kandemir:
A case for integrated processor-cache partitioning in chip multiprocessors. - John C. Linford, John Michalakes, Manish Vachharajani, Adrian Sandu:
Multi-core acceleration of chemical kinetics for simulation and prediction. - Ramya Prabhakar, Shekhar Srikantaiah, Christina M. Patrick, Mahmut T. Kandemir:
Dynamic storage cache allocation in multi-server architectures. - Florian Ries, Tommaso DeMarco, Matteo Zivieri, Roberto Guerrieri:
Triangular matrix inversion on Graphics Processing Unit. - Yu Hua, Hong Jiang, Yifeng Zhu, Dan Feng, Lei Tian:
SmartStore: a new metadata organization paradigm with semantic-awareness for next-generation file systems. - Mark Silberstein, Artyom Sharov, Dan Geiger, Assaf Schuster:
GridBot: execution of bags of tasks in multiple grids. - Marghoob Mohiyuddin, Mark Murphy, Leonid Oliker, John Shalf, John Wawrzynek, Samuel Williams:
A design methodology for domain-optimized power-efficient supercomputing. - David Isaac Wolinsky, Yonggang Liu, Pierre St. Juste, Girish Venkatasubramanian, Renato J. O. Figueiredo:
On the design of scalable, self-configuring virtual networks. - Jiang Lin, Qingda Lu, Xiaoning Ding, Zhao Zhang, Xiaodong Zhang, P. Sadayappan:
Enabling software management for multicore caches with a lightweight hardware support. - Wesley Kendall, Markus Glatter, Jian Huang, Tom Peterka, Robert Latham, Robert B. Ross:
Terascale data organization for discovering multivariate climatic trends. - David Pugmire, Hank Childs, Christoph Garth, Sean Ahern, Gunther H. Weber:
Scalable computation of streamlines on very large datasets. - Wolfgang Frings, Felix Wolf, Ventsislav Petkov:
Scalable massively parallel I/O to task-local files. - Nathan Bell, Michael Garland:
Implementing sparse matrix-vector multiplication on throughput-oriented processors. - Fengguang Song, Asim YarKhan, Jack J. Dongarra:
Dynamic task scheduling for linear algebra algorithms on distributed-memory multicore systems. - Emmanuel Agullo, Bilel Hadri, Hatem Ltaief, Jack J. Dongarra:
Comparative study of one-sided factorizations with multiple software packages on multi-core hardware. - John Bent, Garth A. Gibson, Gary Grider, Ben McClelland, Paul Nowoczynski, James Nunez, Milo Polte, Meghan Wingate:
PLFS: a checkpoint filesystem for parallel applications. - David Tarjan, Jiayuan Meng, Kevin Skadron:
Increasing memory miss tolerance for SIMD cores. - Cliff Young, Joseph A. Bank, Ron O. Dror, J. P. Grossman, John K. Salmon, David E. Shaw:
A 32x32x32, spatially distributed 3D FFT in four microseconds on Anton. - Subhash Saini, Andrey Naraikin, Rupak Biswas, David Barkai, Timothy Sandstrom:
Early performance evaluation of a "Nehalem" cluster using scientific and engineering applications. - Nagesh B. Lakshminarayana, Jaekyu Lee, Hyesoon Kim:
Age based scheduling for asymmetric multiprocessors. - Jing Xing, Jin Xiong, Ninghui Sun, Jie Ma:
Adaptive and scalable metadata management to support a trillion files. - Jidong Zhai, Tianwei Sheng, Jiangzhou He, Wenguang Chen, Weimin Zheng:
FACT: fast communication trace collection for parallel applications through program slicing. - Takashi Soga, Akihiro Musa, Youichi Shimomura, Ryusuke Egawa, Ken'ichi Itakura, Hiroyuki Takizawa, Koki Okabe, Hiroaki Kobayashi:
Performance evaluation of NEC SX-9 using real science and engineering applications. - Zizhong Chen:
Optimal real number codes for fault tolerant matrix operations. - Akira Nukada, Satoshi Matsuoka:
Auto-tuning 3-D FFT library for CUDA GPUs. - Mohamed E. Hussein, Wael Abd-Almageed:
Efficient band approximation of Gram matrices for large scale kernel methods on GPUs. - Qian Zhu, Gagan Agrawal:
Supporting fault-tolerance for time-critical events in distributed environments. - Farrukh Nadeem, Thomas Fahringer:
Predicting the execution time of grid workflow applications through local learning. - Ezra Kissel, D. Martin Swany, Aaron Brown:
Improving GridFTP performance using the Phoebus session layer. - André Brinkmann, Dominic Eschweiler:
A microdriver architecture for error correcting codes inside the Linux kernel. - Marghoob Mohiyuddin, Mark Hoemmen, James Demmel, Katherine A. Yelick:
Minimizing communication in sparse matrix solvers. - Zoltán Szebenyi, Felix Wolf, Brian J. N. Wylie:
Space-efficient time-series call-path profiling of parallel applications. - Alvin AuYoung, Amin Vahdat, Alex C. Snoeren:
Evaluating the impact of inaccurate information in utility-based scheduling. - David E. Shaw, Ron O. Dror, John K. Salmon, J. P. Grossman, Kenneth M. Mackenzie, Joseph A. Bank, Cliff Young, Martin M. Deneroff, Brannon Batson, Kevin J. Bowers, Edmond Chow, Michael P. Eastwood, Doug Ierardi, John L. Klepeis, Jeffrey Kuskin, Richard H. Larson, Kresten Lindorff-Larsen, Paul Maragakis, Mark A. Moraes, Stefano Piana, Yibing Shan, Brian Towles:
Millisecond-scale molecular dynamics simulations on Anton. - Samuel Lang, Philip H. Carns, Robert Latham, Robert B. Ross, Kevin Harms, William E. Allcock:
I/O performance challenges at leadership scale. - Jung Ho Ahn, Nathan L. Binkert, Al Davis, Moray McLaren, Robert S. Schreiber:
HyperX: topology, routing, and packaging of efficient large-scale networks. - Jung Ho Ahn, Norman P. Jouppi, Christos Kozyrakis, Jacob Leverich, Robert S. Schreiber:
Future scaling of processor-memory interfaces. - Prabhanjan Kambadur, Anshul Gupta, Amol Ghoting, Haim Avron, Andrew Lumsdaine:
PFunc: modern task parallelism for modern high performance computing. - Dong H. Ahn, Bronis R. de Supinski, Ignacio Laguna, Gregory L. Lee, Ben Liblit, Barton P. Miller, Martin Schulz:
Scalable temporal order analysis for large scale debugging. - Lakshminarayanan Renganarayanan, Uday Bondhugula, Salem Derisavi, Alexandre E. Eichenberger, Kevin O'Brien:
Compact multi-dimensional kernel extraction for register tiling. - Fabrizio Petrini, Virat Agarwal, Davide Pasetto:
SCAMPI: a scalable CAM-based algorithm for multiple pattern inspection. - Lavanya Ramakrishnan, Charles Koelbel, Yang-Suk Kee, Richard Wolski, Daniel Nurmi, Dennis Gannon, Graziano Obertelli, Asim YarKhan, Anirban Mandal, T. Mark Huang, Kiran Thyagaraja, Dmitrii Zagorodnov:
VGrADS: enabling e-Science workflows on grids and clouds with fault tolerance. - Kamesh Madduri, Samuel Williams, Stéphane Ethier, Leonid Oliker, John Shalf, Erich Strohmaier, Katherine A. Yelick:
Memory-efficient optimization of Gyrokinetic particle-to-grid interpolation for multicore processors. - Doe Hyun Yoon, Mattan Erez:
Flexible cache error protection using an ECC FIFO. - Tanzima Zerin Islam, Saurabh Bagchi, Rudolf Eigenmann:
FALCON: a system for reliable checkpoint recovery in shared grid environments. - Nathan R. Tallent, John M. Mellor-Crummey, Laksono Adhianto, Michael W. Fagan, Mark Krentel:
Diagnosing performance bottlenecks in emerging petascale applications. - Daniel U. Becker, William J. Dally:
Allocator implementations for network-on-chip routers. - James Dinan, D. Brian Larkins, P. Sadayappan, Sriram Krishnamoorthy, Jarek Nieplocha:
Scalable work stealing. - David M. Kunzman, Laxmikant V. Kalé:
Towards a framework for abstracting accelerators in parallel applications: experience with cell. - Kathryn M. Mohror, Karen L. Karavanic:
Evaluating similarity-based trace reduction techniques for scalable performance analysis. - Shih-wei Liao, Tzu-Han Hung, Donald Nguyen, Chinyen Chou, Chia-Heng Tu, Hucheng Zhou:
Machine learning-based prefetch optimization for data center applications. - Xiangyu Dong, Naveen Muralimanohar, Norman P. Jouppi, Richard Kaufmann, Yuan Xie:
Leveraging 3D PCRAM technologies to reduce checkpoint overhead for future exascale systems. - Ilya Lashuk, Aparna Chandramowlishwaran, Harper Langston, Tuan-Anh Nguyen, Rahul S. Sampath, Aashay Shringarpure, Richard W. Vuduc, Lexing Ying, Denis Zorin, George Biros:
A massively parallel adaptive fast-multipole method on heterogeneous architectures. - Geoffrey Belter, Elizabeth R. Jessup, Ian Karlin, Jeremy G. Siek:
Automating the generation of composed linear algebra kernels.
Gordon Bell finalists
- David F. Richards, James N. Glosli, Bor Chan, Milo R. Dorr, Erik W. Draeger, Jean-Luc Fattebert, William D. Krauss, Thomas E. Spelce, Frederick H. Streitz, Michael P. Surh, John A. Gunnels:
Beyond homogeneous decomposition: scaling long-range forces on Massively Parallel Systems. - Amol Ghoting, Konstantin Makarychev:
Indexing genomic sequences on the IBM Blue Gene. - Tsuyoshi Hamada, Tetsu Narumi, Rio Yokota, Kenji Yasuoka, Keigo Nitadori, Makoto Taiji:
42 TFlops hierarchical N-body simulations on GPUs with applications in both astrophysics and turbulence. - Rajagopal Ananthanarayanan, Steven K. Esser, Horst D. Simon, Dharmendra S. Modha:
The cat is out of the bag: cortical simulations with 109 neurons, 1013 synapses. - Markus Eisenbach, C.-G. Zhou, Donald M. C. Nicholson, G. Brown, Jeffrey M. Larkin, Thomas C. Schulthess:
A scalable method for ab initio computation of free energies in nanoscale systems. - David E. Shaw, Ron O. Dror, John K. Salmon, J. P. Grossman, Kenneth M. Mackenzie, Joseph A. Bank, Cliff Young, Martin M. Deneroff, Brannon Batson, Kevin J. Bowers, Edmond Chow, Michael P. Eastwood, Doug Ierardi, John L. Klepeis, Jeffrey Kuskin, Richard H. Larson, Kresten Lindorff-Larsen, Paul Maragakis, Mark A. Moraes, Stefano Piana, Yibing Shan, Brian Towles:
Millisecond-scale molecular dynamics simulations on Anton. - Edoardo Aprà, Alistair P. Rendell, Robert J. Harrison, Vinod Tipparaju, Wibe A. de Jong, Sotiris S. Xantheas:
Liquid water: obtaining the right answer for the right reasons. - Dinesh K. Kaushik, Micheal Smith, Allan B. Wollaber, Barry F. Smith, Andrew R. Siegel, Won Sik Yang:
Enabling high-fidelity neutron transport simulations on petascale architectures. - Onkar Sahni, Min Zhou, Mark S. Shephard, Kenneth E. Jansen:
Scalable implicit finite element solver for massively parallel processing with demonstration to 160K cores.
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.