default search action
IPDPS 2017: Orlando / Buena Vista, FL, USA - Workshops
- 2017 IEEE International Parallel and Distributed Processing Symposium Workshops, IPDPS Workshops 2017, Orlando / Buena Vista, FL, USA, May 29 - June 2, 2017. IEEE Computer Society 2017, ISBN 978-1-5386-3408-0
HCW: Heterogeneity in Computing Workshop
- Erik Saule, Emmanuel Jeannot:
Introduction to HCW Workshop. 1 - Behrooz A. Shirazi:
Message from the HCW Steering Committee Chair. 2 - Erik Saule:
Message from the HCW General Chair. 3 - Emmanuel Jeannot:
Message from the HCW Program Committee Chair. 4 - Ricky Yu-Kwong Kwok:
HCW Keynote Talk. 5
Session 1: Managing the Different Components of Heterogeneous Systems
- Oliver Jakob Arndt, Fabian David Trager, Tobias Moß, Holger Blume:
Portable Implementation of Advanced Driver-Assistance Algorithms on Heterogeneous Architectures. 6-17 - Siddharth Rai, Mainak Chaudhuri:
Improving CPU Performance Through Dynamic GPU Access Throttling in CPU-GPU Heterogeneous Processors. 18-29 - Benjamin Marks, Tia Newhall:
Transparent Heterogeneous Backing Store for File Systems. 30-41
Session 2: Scheduling and Resource Allocation
- Sonia López, Stavan Satish Karia:
Alternative Processor Within Threshold: Flexible Scheduling on Heterogeneous Systems. 42-53 - Dylan Machovec, Sudeep Pasricha, Anthony A. Maciejewski, Howard Jay Siegel, Gregory A. Koenig, Michael Wright, Marcia Hilton, Rajendra Rambharos, Thomas J. Naughton, Neena Imam:
Preemptive Resource Management for Dynamically Arriving Tasks in an Oversubscribed Heterogeneous Computing System. 54-64 - Lilia Zaourar, Massinissa Ait Aba, David Briand, Jean-Marc Philippe:
Modeling of Applications and Hardware to Explore Task Mapping and Scheduling Strategies on a Heterogeneous Micro-Server System. 65-76 - Thibaud Ecarot, Djamal Zeghlache, Cedric Brandily:
Consumer-and-Provider-Oriented Efficient IaaS Resource Allocation. 77-85
RAW: Reconfigurable Architectures Workshop
- Marco D. Santambrogio, Ramachandran Vaidyanathan:
Introduction to RAW Workshop. 86-87 - Ronald F. DeMara, Georgi Gaydadjiev:
RAW Keynote Speakers. 88-89
Session 1: Architectures for Convolutional Neural Networks and Sliding Window
- Marco Bacis, Giuseppe Natale, Emanuele Del Sozzo, Marco Domenico Santambrogio:
A Pipelined and Scalable Dataflow Implementation of Convolutional Neural Networks on FPGA. 90-97 - Haruyoshi Yonekawa, Hiroki Nakahara:
On-Chip Memory Based Binarized Convolutional Deep Neural Network Applying Batch Normalization Free Technique on an FPGA. 98-105 - Murad Qasaimeh, Joseph Zambreno, Phillip H. Jones:
A Modified Sliding Window Architecture for Efficient BRAM Resource Utilization. 106-114
Session 2: Design and Programming Methods
- Gary Gréwal, Shawki Areibi, Matthew Westrik, Ziad Abuowaimer, Betty Zhao:
Automatic Flow Selection and Quality-of-Result Estimation for FPGA Placement. 115-123 - Javier Alejandro Varela, Norbert Wehn, Qian Liang, Songyin Tang:
Exploiting Decoupled OpenCL Work-Items with Data Dependencies on FPGAs: A Case Study. 124-131 - Luca Stornaiuolo, Alberto Parravicini, Gianluca Durelli, Marco D. Santambrogio:
Exploiting FPGAs from Higher Level Languages A Signal Analysis Case Study. 132-140 - Philip Gottschling, Christian Hochberger:
ReEP: A Toolset for Generation and Programming of Reconfigurable Datapaths for Event Processing. 141-149
Session 3: Acceleration of Curran's Approximation and Elliptic Curve Crypto
- Anna Maria Nestorov, Enrico Reggiani, Hristina Palikareva, Pavel Burovskiy, Tobias Becker, Marco D. Santambrogio:
A Scalable Dataflow Implementation of Curran's Approximation Algorithm. 150-157 - Rabia Shahid, Ted Winograd, Kris Gaj:
A Generic Approach to the Development of Coprocessors for Elliptic Curve Cryptosystems. 158-167
Session 4: Acceleration of Biological Signal Processing
- Luca Cerina, Pierandrea Cancian, Giuseppe Franco, Marco Domenico Santambrogio:
A Hardware Acceleration for Surface EMG Non-Negative Matrix Factorization. 168-174 - Giovanni Pietro Seu, Gian Nicola Angotzi, Giuseppe Tuveri, Luigi Raffo, Luca Berdondini, Alessandro Maccione, Paolo Meloni:
On-FPGA Real-Time Processing of Biological Signals From High-Density MEAs: a Design Space Exploration. 175-183
Session 5: Design Methods
- Yosi Ben-Asher, Esti Stein, Ramachandran Vaidyanathan:
Combining Boolean Gates and Branching Programs in One Model can Lead to Faster Circuits. 184-191 - Utsav Agarwal, Ramachandran Vaidyanathan:
Efficient Totally-Ordered Subset Generation, with Application in Partial Reconfiguration. 192-201
Short Papers
- Godwin Enemali, Adewale Adetomi, Tughrul Arslan:
FAReP: Fragmentation-Aware Replacement Policy for Task Reuse on Reconfigurable FPGAs. 202-206 - Tejaswini Ananthanarayana, Sonia López, Marcin Lukowiak:
Power Analysis of HLS-Designed Customized Instruction Set Architectures. 207-212 - Tajas Ruschke, Lukas Johannes Jung, Christian Hochberger:
A Near Optimal Integrated Solution for Resource Constrained Scheduling, Binding and Routing on CGRAs. 213-218 - Adewale Adetomi, Godwin Enemali, Tughrul Arslan:
Clock Buffers, Nets, and Trees for On-Chip Communication: A Novel Network Access Technique in FPGAs. 219-222 - Enrico Reggiani, Eleonora D'Arnese, Andrea Purgato, Marco D. Santambrogio:
Pearson Correlation Coefficient Acceleration for Modeling and Mapping of Neural Interconnections. 223-228 - Tripti Jain, Klaus Schneider, Frederik Walk:
Out-of-Order Execution of Buffered Function Units in Exposed Data Path Architectures. 229-234 - Andres Jacoby, Daniel Llamocca:
Dynamic Dual Fixed-Point CORDIC Implementation. 235-240 - Emanuele Del Sozzo, Lorenzo Di Tucci, Marco D. Santambrogio:
A Highly Scalable and Efficient Parallel Design of N-Body Simulation on FPGA. 241-246 - Francesca Palumbo, Carlo Sau, Danilo Pani, Paolo Meloni, Luigi Raffo:
Feasibility Study of Real-Time Spiking Neural Network Simulations on a Swarm Intelligence Based Digital Architecture. 247-250
HiCOMB: 16th IEEE International Workshop on High Performance Computational Biology
- Alex Pothen, Ananth Grama:
Introduction to HiCOMB Workshop. 251 - Radu Marculescu:
HiCOMB Keynote. 252
Session 1
- Cyrus Cousins, Christopher M. Pietras, Donna K. Slonim:
Scalable FRaC Variants: Anomaly Detection for Precision Medicine. 253-262 - Jae-Seung Yeom, Tanya Kostova-Vassilevska, Peter D. Barnes Jr., David R. Jefferson, Tomas Oppelstrup:
Exploratory Modeling and Simulation of the Evolutionary Dynamics of Single-Stranded RNA Virus Populations. 263-272
Session 2
- Julia D. Warnke-Sommer, Hesham H. Ali:
Parallel NGS Assembly Using Distributed Assembly Graphs Enriched with Biological Knowledge. 273-282 - Vasudevan Rengasamy, Paul Medvedev, Kamesh Madduri:
Parallel and Memory-Efficient Preprocessing for Metagenome Assembly. 283-292
Session 3
- Philip E. Davis, Adam M. Terwilliger, David Zeitler, Gregory Wolffe:
Scalable Parallelization of a Markov Coalescent Genealogy Sampler. 293-302 - Mücahid Kutlu, Gagan Agrawal, James S. Blachly:
Par-eXpress: A Tool for Analysis of Sequencing Experiments With Ambiguous Assignment of Fragments in Parallel. 303-310
EduPar: NSF/TCPP Workshop on Parallel and Distributed Computing Education
- Sheikh K. Ghafoor, Sushil K. Prasad, Satish Puri:
Introduction to EduPar Workshop. 311-313 - Jack J. Dongarra:
EduPar Keynote. 314
Session 1: Tools and Programming Environment
- Abdul Dakkak, Carl Pearson, Cheng Li, Wen-mei W. Hwu:
RAI: A Scalable Project Submission System for Parallel Programming Courses. 315-322 - Brian Broll, Ákos Lédeczi, Péter Völgyesi, János Sallai, Miklós Maróti, Chris Vanags:
Introducing Parallel and Distributed Computing to K12. 323-330 - Tianyi Bao, William B. Gardner:
Log Visualization Tool for Message-Passing Programming in Pilot. 331-338 - David A. Richie, James A. Ross:
I Can Has Supercomputer? A Novel Approach to Teaching Parallel and Distributed Computing Concepts Using a Meme-Based Programming Language. 339-345
Session 2: Pedagogy and Experience
- Joshua Eckroth:
Teaching Future Big Data Analysts: Curriculum and Experience Report. 346-351 - Jane Wyngaard, Heather J. Lynch, Jaroslaw Nabrzyski, Allen Pope, Shantenu Jha:
Hacking at the Divide Between Polar Science and HPC: Using Hackathons as Training Tools. 352-359 - Vivek Sarkar, Max Grossman, Zoran Budimlic, Shams Imam:
Preparing an Online Java Parallel Computing Course. 360-366 - Jawwad Ahmed Shamsi:
A Laboratory Based Course on GPU Programming: Methods, Practices, and Lessons. 367-374
ParLearning: The 6th International Workshop on Parallel and Distributed Computing for Large Scale Machine Learning and Big Data Analytics
- Anand Panangadan:
Introduction to ParLearning Workshop. 375-376 - John Feo, Wei Tan:
ParLearning Keynotes. 377-378
Session 1
- Azalia Mirhoseini, Bita Darvish Rouhani, Ebrahim M. Songhori, Farinaz Koushanfar:
ExtDict: Extensible Dictionaries for Data- and Platform-Aware Large-Scale Learning. 379-388 - Songze Li, Sucha Supittayapornpong, Mohammad Ali Maddah-Ali, Salman Avestimehr:
Coded TeraSort. 389-398 - Nitin A. Gawande, Joshua B. Landwehr, Jeff A. Daily, Nathan R. Tallent, Abhinav Vishnu, Darren J. Kerbyson:
Scaling Deep Learning Workloads: NVIDIA DGX-1/Pascal and Intel Knights Landing. 399-408 - Jing Chen, Jianbin Fang, Weifeng Liu, Tao Tang, Xuhao Chen, Canqun Yang:
Efficient and Portable ALS Matrix Factorization for Recommender Systems. 409-418
Session 2
- Thomas P. Parnell, Celestine Dünner, Kubilay Atasu, Manolis Sifalakis, Haris Pozidis:
Large-Scale Stochastic Learning Using GPUs. 419-428 - Amaury Durand, Yanik Ngoko, Christophe Cérin:
Distributed and in-Situ Machine Learning for Smart-Homes and Buildings: Application to Alarm Sounds Detection. 429-432 - DeJiao Niu, Rui Xue, Tao Cai, Hai Li, Kingsley Effah, Hang Zhang:
The New Large-Scale RNNLM System Based on Distributed Neuron. 433-436 - Yuchen Qiao, Kazuma Hashimoto, Akiko Eriguchi, Haixia Wang, Dongsheng Wang, Yoshimasa Tsuruoka, Kenjiro Taura:
Cache Friendly Parallelization of Neural Encoder-Decoder Models Without Padding on Multi-core Architecture. 437-440
PDCO: 7th IEEE Workshop Parallel / Distributed Computing and Optimization
- Grégoire Danoy, Didier El Baz:
Introduction to PDCO Workshop. 441
Session 1: Scheduling I
- Laleh Ghalami, Daniel Grosu:
A Parallel Approximation Algorithm for Scheduling Parallel Identical Machines. 442-451 - Hadrien Croubois, Eddy Caron:
Communication Aware task Placement for Workflow Scheduling on DaaS-Based Cloud. 452-461 - Muhammad Qasim, Touseef Iqbal, Ehsan Ullah Munir, Nikos Tziritas, Samee U. Khan, Laurence T. Yang:
Dynamic Mapping of Application Workflows in Heterogeneous Computing Environments. 462-471
Session 2: Scheduling II
- Jorge M. Cortés-Mendoza, Andrei Tchernykh, Igor V. Bychkov, Alexander G. Feoktistov, Pascal Bouvry, Loic Didelot:
Load-Aware Strategies for Cloud-Based VoIP Optimization with VM Startup Prediction. 472-481 - David Pena, Andrei Tchernykh, Sergio Nesmachnow, Renzo Massobrio, Alexander G. Feoktistov, Igor V. Bychkov:
Multiobjective Vehicle-type Scheduling in Urban Public Transport. 482-491
Session 3: Parallel Metaheuristics and Machine Learning
- Emmanuel Kieffer, Grégoire Danoy, Pascal Bouvry, Anass Nagih:
A new Co-evolutionary Algorithm Based on Constraint Decomposition. 492-500 - Javier A. Cruz-Lopez, Vincent Boyer, Didier El Baz:
Training Many Neural Networks in Parallel via Back-Propagation. 501-509 - Amir Nakib, Mohamed Hilia, Frederic Heliodore, El-Ghazali Talbi:
Design of Metaheuristic Based on Machine Learning: A Unified Approach. 510-518
Session 4: Graphs, Networks and Algorithms
- Raphael Kimmig, Henning Meyerhenke, Darren Strash:
Shared Memory Parallel Subgraph Enumeration. 519-529 - Julien Collet, Tanguy Sassolas, Yves Lhuillier, Renaud Sirdey, Jacques Carlier:
Exploration of de Bruijn Graph Filtering for de novo Assembly Using GraphLab. 530-539 - He Li, Robson Eduardo De Grande, Azzedine Boukerche:
An Efficient CPP Solution for Resilience-Oriented SDN Controller Deployment. 540-549
Session 5: Parallel Algorithms
- Chris Rohlfs, Mohamed Zahran:
Optimal Bandwidth Selection for Kernel Regression Using a Fast Grid Search and a GPU. 550-556 - Numair Khan, Mohamed Zahran:
Space-Efficient Pointwise Computation of the Distance Transform on GPUs. 557-566 - Christian Herold, Olaf Krzikalla, Andreas Knüpfer:
Optimizing One-Sided Communication of Parallel Applications Using Critical Path Methods. 567-576
GABB: Graph Algorithms Building Blocks
- Aydin Buluç, Tim Mattson:
Introduction to GABB Workshop. 577 - Ümit V. Çatalyürek:
GABB Keynote. 578
Session 1
- Maryia Belova, Ming Ouyang:
Breadth-First Search with A Multi-Core Computer. 579-587 - George M. Slota, Sivasankaran Rajamanickam, Kamesh Madduri:
Order or Shuffle: Empirically Evaluating Vertex Order Impact on Parallel Graph Computations. 588-597 - Sayyad Nayyaroddeen, Mahak Gambhir, Kishore Kothapalli:
A Study of Graph Decomposition Algorithms for Parallel Symmetry Breaking. 598-607
Session 2
- Hayden Jananthan, Karia Dibert, Jeremy Kepner:
Constructing Adjacency Arrays from Incidence Arrays. 608-615 - Yangzihao Wang, Sean Baxter, John D. Owens:
Mini-Gunrock: A Lightweight Graph Analytics Framework on the GPU. 616-626 - Charles Colley, Junyuan Lin, Xiaozhe Hu, Shuchin Aeron:
Algebraic Multigrid for Least Squares Problems on Graphs with Applications to HodgeRank. 627-636
Session 3
- David Ediger, James P. Fairbanks:
Deriving Streaming Graph Algorithms from Static Definitions. 637-642
Session 4
- Aydin Buluç, Tim Mattson, Scott McMillan, José E. Moreira, Carl Yang:
Design of the GraphBLAS API for C. 643-652 - William Horn, Gabriel Tanase, Hao Yu, Pratap Pattnaik:
A Linear Algebra-Based Programming Interface for Graph Computations in Scala and Spark. 653-659
AsHES: The Seventh International Workshop on Accelerators and Hybrid Exascale Systems
- Sunita Chandrasekaran:
Introduction to AsHES Workshop. 660 - Tim Mattson:
AsHES Keynote. 661
Session 1: Programming Models and Runtime Systems
- Michael Wolfe, Seyong Lee, Jungwon Kim, Xiaonan Tian, Rengan Xu, Sunita Chandrasekaran, Barbara M. Chapman:
Implementing the OpenACC Data Model. 662-672 - Sergio Pino, Lori L. Pollock, Sunita Chandrasekaran:
Exploring Translation of OpenMP to OpenACC 2.5: Lessons Learned. 673-682 - Ivy Bo Peng, Roberto Gioiosa, Gokcen Kestor, Pietro Cicotti, Erwin Laure, Stefano Markidis:
Exploring the Performance Benefit of Hybrid Memory System on HPC Environments. 683-692
Session 2: Algorithms
- Mehmet Deveci, Christian Trott, Sivasankaran Rajamanickam:
Performance-Portable Sparse Matrix-Matrix Multiplication for Many-Core Architectures. 693-702 - Antonio Gómez-Iglesias, Miguel Cárdenas-Montes:
Time and Energy to Solution Evaluation for the Three-Point Angular Correlation Function. 703-712 - Kaixi Hou, Wu-chun Feng, Shuai Che:
Auto-Tuning Strategies for Parallelizing Sparse Matrix-Vector (SpMV) Multiplication on Multi- and Many-Core Processors. 713-722
Session 3: Scheduling and Architectures
- Max Grossman, Vivek Kumar, Nick Vrvilo, Zoran Budimlic, Vivek Sarkar:
A Pluggable Framework for Composable HPC Scheduling Libraries. 723-732 - Sandra Catalán, Rafael Rodríguez-Sánchez, Enrique S. Quintana-Ortí, José R. Herrero:
Static Versus Dynamic Task Scheduling of the Lu Factorization on ARM big. LITTLE Architectures. 733-742 - Zhigeng Xu, James Lin, Satoshi Matsuoka:
Benchmarking SW26010 Many-Core Processor. 743-752
HIPS: 22nd International Workshop on High Level Programming Models and Supportive Environments
- Bo Wu, Andreas Knüpfer:
Introduction to HIPS Workshop. 753-754 - Zizhong Chen:
HIPS Keynote. 755
Session 1
- Dana Akhmetova, Roman Iakymchuk, Örjan Ekeberg, Erwin Laure:
Performance Study of Multithreaded MPI and OpenMP Tasking in a Large Scientific Code. 756-765 - Solmaz Salehian, Jiawen Liu, Yonghong Yan:
Comparison of Threading Programming Models. 766-774 - Mostafa Mehrabi, Nasser Giacaman, Oliver Sinnen:
Annotation-Based Parallelization of Java Code. 775-784
Session 2
- Alexis Engelke, Josef Weidendorfer:
Using LLVM for Optimized Lightweight Binary Re-Writing at Runtime. 785-794 - Nathan Zhang, Michael B. Driscoll, Charles Markley, Samuel Williams, Protonu Basu, Armando Fox:
Snowflake: A Lightweight Portable Stencil DSL. 795-804 - Pavel Shamis, M. Graham Lopez, Gilad Shainer:
Enabling One-Sided Communication Semantics on ARM. 805-813
Session 3
- Jari-Matti Mäkelä, Martti Forsell, Ville Leppänen:
Towards a Language Framework for Thick Control Flows. 814-823 - Benjamin J. L. Wang, Uwe R. Zimmer:
Pure Concurrent Programming. 824-831
APDCM: 19th Workshop on Advances in Parallel and Distributed Computational Models
- Oscar H. Ibarra, Koji Nakano:
Introduction to APDCM Workshop. 832 - Hong Shen:
APDCM Keynote. 833
Session 1: Distributed Computing
- Aisha Aljohani, Gokarna Sharma:
Complete Visibility for Mobile Agents with Lights Tolerating a Faulty Agent. 834-843 - Yonghwan Kim, Haruka Ohno, Yoshiaki Katayama, Toshimitsu Masuzawa:
A Self-Stabilizing Algorithm for Constructing (1, 1)-Maximal Directed Acyclic Graph. 844-853 - Jonas Posner, Claudia Fohry:
Fault Tolerance for Cooperative Lifeline-Based Global Load Balancing in Java with APGAS and Hazelcast. 854-863 - Debarshi Dutta, Meher Chaitanya, Kishore Kothapalli, Debajyoti Bera:
Applications of Ear Decomposition to Efficient Heterogeneous Algorithms for Shortest Path/Cycle Problems. 864-873
Session 2: Scheduling and Hardware Models
- Guillaume Aupy, Anne Benoit, Loïc Pottier, Padma Raghavan, Yves Robert, Manu Shantharam:
Co-Scheduling Algorithms for Cache-Partitioned Systems. 874-883 - Loris Marchal, Samuel McCauley, Bertrand Simon, Frédéric Vivien:
Minimizing I/Os in Out-of-Core Task Tree Scheduling. 884-893 - Basem Assiri, Costas Busch:
Approximate Count and Queue Objects in Transactional Memory. 894-903 - Max Plauth, Christoph Sterz, Felix Eberhardt, Frank Feinbube, Andreas Polze:
Assessing NUMA Performance Based on Hardware Event Counters. 904-913
Session 3: Parallel Computing
- Daniel Dauwe, Sudeep Pasricha, Anthony A. Maciejewski, Howard Jay Siegel:
An Analysis of Resilience Techniques for Exascale Computing Platforms. 914-923 - Tomoki Kawamura, Yoneda Kazunori, Takashi Yamazaki, Takashi Iwamura, Masahiro Watanabe, Yasushi Inoguchi:
A Compression Method for Storage Formats of a Sparse Matrix in Solving the Large-Scale Linear Systems. 924-931 - Takahiro Nishimura, Jacir Luiz Bordim, Yasuaki Ito, Koji Nakano:
Accelerating the Smith-Waterman Algorithm Using Bitwise Parallel Bulk Computation Technique on GPU. 932-941 - Yi Yang, Yasuaki Ito, Koji Nakano:
Photomosaic Generation by Rearranging Subimages, with GPU Acceleration. 942-951
HPPAC: 13th Workshop on High-Performance, Power-Aware Computing
- Shuaiwen Leon Song, Richard W. Vuduc:
HPPAC Workshop Introduction. 952 - Kirk W. Cameron:
HPPAC Keynote Talk. 953
Session 1
- Hayk Shoukourian, Torsten Wilde, Detlef Labrenz, Arndt Bode:
Using Machine Learning for Data Center Cooling Infrastructure Efficiency Prediction. 954-963 - Wissam Abu Ahmad, Andrea Bartolini, Francesco Beneventi, Luca Benini, Andrea Borghesi, Marco Cicala, Privato Forestieri, Cosimo Gianfreda, Daniele Gregori, Antonio Libri, Filippo Spiga, Simone Tinti:
Design of an Energy Aware Petaflops Class High Performance Cluster Based on Power Architecture. 964-973 - Aniruddha Marathe, Ghaleb Abdulla, Barry L. Rountree, Kathleen Shoga:
Towards a Unified Monitoring Framework for Power, Performance and Thermal Metrics: A Case Study on the Evaluation of HPC Cooling Systems. 974-983
Session 2
- Xinning Hui, Zhihui Du, Jason Liu, Hongyang Sun, Yuxiong He, David A. Bader:
When Good Enough Is Better: Energy-Aware Scheduling for Multicore Servers. 984-993 - Shouq Alsubaihi, Jean-Luc Gaudiot:
A Runtime Workload Distribution with Resource Allocation for CPU-GPU Heterogeneous Systems. 994-1003
Session 3
- Vladimir A. Mironov, Alexander A. Moskovsky, Yuri Alexeev:
Power Measurements of Hartree-Fock Algorithms Using Different Storage Devices. 1004-1011 - Mohak Chadha, Thomas Ilsche, Mario Bielert, Wolfgang E. Nagel:
A Statistical Approach to Power Estimation for x86 Processors. 1012-1019
HPBDC: 3rd IEEE International Workshop on High-Performance Big Data Computing
- Xiaoyi Lu, Jianfeng Zhan, Dhabaleswar K. Panda:
Introduction to HPBDC Workshop. 1020
Session 1: High-Performance Graph Processing
- Manu Shantharam, Keita Iwabuchi, Pietro Cicotti, Laura Carrington, Maya B. Gokhale, Roger A. Pearce:
Performance Evaluation of Scale-Free Graph Algorithms in Low Latency Non-volatile Memory. 1021-1028 - Vito Giovanni Castellana, Marco Minutoli, Shreyansh Bhatt, Khushbu Agarwal, Arthur Bleeker, John Feo, Daniel G. Chavarría-Miranda, David J. Haglin:
High-Performance Data Analytics Beyond the Relational and Graph Data Models with GEMS. 1029-1038 - Peter M. Kogge:
Graph Analytics: Complexity, Scalability, and Architectures. 1039-1047
Session 2: Benchmarking and Performance Analysis
- Saba Sehrish, Jim Kowalkowski, Marc F. Paterno:
Spark and HPC for High Energy Physics Data Analyses. 1048-1057 - Houliang Qi, Xu Chang, Xingwu Liu, Li Zha:
The Consistency Analysis of Secondary Index on Distributed Ordered Tables. 1058-1067 - Xinhui Tian, Shaopeng Dai, Zhihui Du, Wanling Gao, Rui Ren, Yaodong Cheng, Zhifei Zhang, Zhen Jia, Peijian Wang, Jianfeng Zhan:
BigDataBench-S: An Open-Source Scientific Big Data Benchmark Suite. 1068-1077 - Paras Jain, Chirag Tailor, Sam Ford, Liexiao Ding, Michael Phillips, Fang (Cherry) Liu, Nagi Gebraeel, Duen Horng Chau:
Scalable Architecture for Anomaly Detection and Visualization in Power Generating Assets. 1078-1082
CHIUW: The Fourth Annual Chapel Implementers and Users Workshop
- Tom MacDonald, Michael Ferguson:
Introduction to CHIUW Workshop. 1083-1084 - Jonathan Dursi:
CHIUW Keynote. 1085 - Jyothi Krishna V. S, Vassily Litvinov:
Identifying Use-After-Free Variables in Fire-and-Forget Tasks. 1086-1094 - Ariful Azad, Aydin Buluç:
Towards a GraphBLAS Library in Chapel. 1095-1104 - Engin Kayraklioglu, Wo Chang, Tarek A. El-Ghazawi:
Comparative Performance and Optimization of Chapel in Modern Manycore Architectures. 1105-1114
PDSEC: 18th IEEE International Workshop on Parallel and Distributed Scientific and Engineering Computing
- Peter E. Strazdins, Keita Teranishi, Raphaël Couturier, Joseph Antony, Thomas Rauber, Gudula Rünger, Laurence T. Yang:
Introduction to PDSEC Workshop. 1115-1116 - Pavan Balaji:
PDSEC Keynote. 1117
Session 1: Best Paper
- Ichitaro Yamazaki, Mark Hoemmen, Piotr Luszczek, Jack J. Dongarra:
Improving Performance of GMRES by Reducing Communication and Pipelining Global Collectives. 1118-1127
Session 2: Linear Algebra
- Bryce Adelstein-Lelbach, Hans Johansen, Samuel Williams:
Simultaneously Solving Swarms of Small Sparse Systems on SIMD Silicon. 1128-1137 - Gregoire Pichon, Eric Darve, Mathieu Faverge, Pierre Ramet, Jean Roman:
Sparse Supernodal Solver Using Block Low-Rank Compression. 1138-1147 - José Ignacio Aliaga, Rocío Carratalá-Sáez, Ronald Kriemann, Enrique S. Quintana-Ortí:
Task-Parallel LU Factorization of Hierarchical Matrices Using OmpSs. 1148-1157
Session 3: Applications
- Ramachandran Kodanganallur Narayanan, Kamesh Madduri:
Parallel Particle-in-Cell Performance Optimization: A Case Study of Electrospray Simulation. 1158-1167 - Yann Barsamian, Sever A. Hirstoaga, Eric Violard:
Efficient Data Structures for a Hybrid Parallel and Vectorized Particle-in-Cell Code. 1168-1177 - Hongzhang Shan, Samuel Williams, Calvin W. Johnson, Kenneth S. McElvain:
A Locality-Based Threading Algorithm for the Configuration-Interaction Method. 1178-1187 - Yunfan Xiao, Min Huang, Qinghai Miao, Jun Xiao, Ying Wang:
Architecting the Discontinuous Deformation Analysis Method Pipeline on the GPU. 1188-1197
Session 4: Parallel Techniques
- Zahra Khatami, Hartmut Kaiser, J. Ramanujam:
Redesigning OP2 Compiler to Use HPX Runtime Asynchronous Techniques. 1198-1207 - Thomas Marrinan, Joseph A. Insley, Silvio Rizzi, Francois Tessier, Michael E. Papka:
Automated Dynamic Data Redistribution. 1208-1215 - Lina Yu, Hongfeng Yu, Hong Jiang, Jun Wang:
An Application-Aware Data Replacement Policy for Interactive Large-Scale Scientific Visualization. 1216-1225 - Jackson DeBuhr, Bo Zhang, Luke Dalessandro:
Scalable Hierarchical Multipole Methods Using an Asynchronous Many-Tasking Runtime System. 1226-1234
JSSPP: 21st Workshop on Job Scheduling Strategies for Parallel Processing
- Walfredo Cirne, Narayan Desai, Dalibor Klusácek:
Introduction to JSSPP Workshop. 1235-1236
DPDNS: 22nd IEEE Workshop on Dependable Parallel, Distributed and Network-Centric Systems
- Dimiter R. Avresky, Erik Maehle:
Introduction to DPDNS Workshop. 1237
Session 1
- Satoshi Fujita:
Reliability Calculation of P2P Streaming Systems with Bottleneck Links. 1238-1244 - Chaoyang Li, Anu G. Bourgeois:
Lifetime and Full-View Coverage Guarantees Through Distributed Algorithms in Camera Sensor Networks. 1245-1250
Session 2
- Jason St. John, Thomas J. Hacker:
A Small-Scale Testbed for Large-Scale Reliable Computing. 1251-1258 - Santosh Aditham, Nagarajan Ranganathan, Srinivas Katkoori:
LSTM-Based Memory Profiling for Predicting Data Attacks in Distributed Big Data Systems. 1259-1267 - Salvatore Distefano, Samuele Rodi:
An Outlook on Volunteer and Croudsourcing Based Computing. 1268-1273 - Rizwan A. Ashraf, Roberto Gioiosa, Gokcen Kestor, Ronald F. DeMara:
Exploring the Effect of Compiler Optimizations on the Reliability of HPC Applications. 1274-1283
IPDRM: Second Annual Workshop on Emerging Parallel and Distributed Runtime Systems and Middleware
- Shuaiwen Leon Song, Torsten Hoefler:
IPDRM Workshop Introduction. 1284
Session 1
- Jaume Bosch, Xubin Tan, Carlos Álvarez, Daniel Jiménez-González, Xavier Martorell, Eduard Ayguadé:
Characterizing and Improving the Performance of Many-Core Task-Based Parallel Programming Runtimes. 1285-1292 - Kavitha Chandrasekar, Xiang Ni, Laxmikant V. Kalé:
A Memory Heterogeneity-Aware Runtime System for Bandwidth-Sensitive HPC Applications. 1293-1300 - Alexis Champsaur, Jay F. Lofstead, Jai Dayal, Matthew Wolf, Greg Eisenhauer, Patrick M. Widener, Ada Gavrilovska:
SmartBlock: An Approach to Standardizing In Situ Workflow Components. 1301-1308
Session 2
- John Jenkins, Galen M. Shipman, Jamaludin Mohd-Yusof, Kipton Barros, Philip H. Carns, Robert B. Ross:
A Case Study in Computational Caching Microservices for HPC. 1309-1316 - Zahra Khatami, Sungpack Hong, Jinsoo Lee, Siegfried Depner, Hassan Chafi, J. Ramanujam, Hartmut Kaiser:
A Load-Balanced Parallel and Distributed Sorting Algorithm Implemented with PGX.D. 1317-1324
Session 3
- Carlos Rosales, Antonio Gómez-Iglesias, Si Liu, Feng Chen, Lei Huang, Hang Liu, Antia Lamas-Linares, John Cazes:
Performance Prediction of HPC Applications on Intel Processors. 1325-1332 - Stefanos Gerangelos, Nectarios Koziris:
vPHI: Enabling Xeon Phi Capabilities in Virtual Machines. 1333-1340
iWAPT: 12th International Workshop on Automatic Performance Tuning
- Osni Marques, Reiji Suda:
Introduction to iWAPT Workshop. 1341
Session 1: New Methodology of Auto-Tuning
- Wilson Feng, Tarek S. Abdelrahman:
A Sampling Based Strategy to Automatic Performance Tuning of GPU Programs. 1342-1349 - Tianyi David Han, Tarek S. Abdelrahman:
Use of Synthetic Benchmarks for Machine-Learning-Based Performance Auto-Tuning. 1350-1361
Session 2: Auto-Tuning Software and Environment
- Tharindu Rusira, Mary W. Hall, Protonu Basu:
Automating Compiler-Directed Autotuning for Phased Performance Behavior. 1362-1371 - Hiroyuki Takizawa, Daichi Sato, Shoichi Hirasawa, Daisuke Takahashi:
A Customizable Auto-Tuning Scenario with User-Defined Code Transformations. 1372-1378 - Philip Pfaffe, Martin Peter Tillmann, Sigmar Walter, Walter F. Tichy:
Online-Autotuning in the Presence of Algorithmic Choice. 1379-1388
Session 3: Case-Study of Auto-Tuning and Optimization
- Athena Elafrou, Georgios I. Goumas, Nectarios Koziris:
Performance Analysis and Optimization of Sparse Matrix-Vector Multiplication on Intel Xeon Phi. 1389-1398 - Takahiro Katagiri, Satoshi Ohshima, Masaharu Matsumoto:
Auto-Tuning on NUMA and Many-Core Environments with an FDM Code. 1399-1407 - Mark Gates, Jakub Kurzak, Piotr Luszczek, Yu Pei, Jack J. Dongarra:
Autotuning Batch Cholesky Factorization in CUDA with Interleaved Layout of Matrices. 1408-1417
Session 4: Scientific Applications by Auto-Tuning
- Susumu Yamada, Toshiyuki Imamura, Takuya Ina, Narimasa Sasa, Yasuhiro Idomura, Masahiko Machida:
Quadruple-Precision BLAS Using Bailey's Arithmetic with FMA Instruction: Its Performance and Applications. 1418-1425 - Masayoshi Mochizuki, Akihiro Fujii, Teruo Tanaka:
Fast Multidimensional Performance Parameter Estimation with Multiple One-Dimensional d-Spline Parameter Search. 1426-1433 - Luigi Nardi, Bruno Bodin, Sajad Saeedi, Emanuele Vespa, Andrew J. Davison, Paul H. J. Kelly:
Algorithmic Performance-Accuracy Trade-off in 3D Vision Applications Using HyperMapper. 1434-1443
ParSocial: 2nd IEEE Workshop on Parallel and Distributed Processing for Computational Social System
- Eunice E. Santos, John Korah:
Introduction to ParSocial Workshop. 1444-1445 - Boleslaw K. Szymanski:
ParSocial Keynote. 1446
Session 1
- Xiaoyan Lu, Boleslaw K. Szymanski:
Predicting Viral News Events in Online Media. 1447-1456 - Julia Buwaya, José D. P. Rolim:
Mobile Crowdsensing from a Selfish Routing Perspective. 1457-1463 - George Cybenko:
Parallel Computing for Machine Learning in Social Network Analysis. 1464-1471
Session 2
- Gennaro Cordasco, Carmine Spagnuolo, Vittorio Scarano:
Work Partitioning on Parallel and Distributed Agent-Based Simulation. 1472-1481 - Humayun Kabir, Kamesh Madduri:
Parallel k-Core Decomposition on Multicore Platforms. 1482-1491 - Eric Tatara, Nicholson T. Collier, Jonathan Ozik, Charles M. Macal:
Endogenous Social Networks from Large-Scale Agent-Based Models. 1492-1499
Session 3
- Sindhuja Parimalarangan, George M. Slota, Kamesh Madduri:
Fast Parallel Graph Triad Census and Triangle Counting on Shared-Memory Platforms. 1500-1509 - Eunice E. Santos, John Korah, Vairavan Murugappan, Suresh Subramanian:
Efficient Anytime Anywhere Algorithms for Vertex Additions in Large and Dynamic Graphs. 1510-1519 - Wen-Jing Hsu, You Lu, Zhuo Qi Lee:
Accelerating Topic Exploration of Multi-Dimensional Documents. 1520-1527
BigDataEco: Big Data Regional Innovation Hubs and Spokes Workshop
- Chaitan Baru, Fen Zhao, Joanna Chan:
Introduction to BigDataEco Workshop. 1528
GraML: First Workshop on the Intersection of Graph Algorithms and Machine Learning
- Antonino Tumeo, Mahantesh Halappanavar, John Feo:
Introduction to GraML Workshop. 1529-1530 - Sujith Ravi:
GraML Keynote. 1531 - Hristo N. Djidjev, Daniel O'Malley, Hari S. Viswanathan, Jeffrey D. Hyman, Satish Karra, Gowri Srinivasan:
Learning on Graphs for Predictions of Fracture Propagation, Flow and Transport. 1532-1539 - Hongyuan Zhan, Kamesh Madduri:
Analyzing Community Structure in Networks. 1540-1549 - Ronald D. Hagan, Charles A. Phillips, Bradley J. Rhodes, Michael A. Langston:
Compound Analytics: Templates for Integrating Graph Algorithms and Machine Learning. 1550-1556
EMBRACE: Evolvable Methods for Benchmarking Realism and Community Engagement
- David A. Bader:
Introduction to EMBRACE Workshop. 1557 - Torsten Hoefler:
EMBRACE Keynote. 1558
REPPAR: Workshop on Reproducibility in Parallel Computing
- Sascha Hunold, Arnaud Legrand, Lucas Nussbaum:
Introduction to REPPAR Workshop. 1559 - Todd Gamblin:
REPPAR Keynote. 1560
Session 1
- Ivo Jimenez, Michael Sevilla, Noah Watkins, Carlos Maltzahn, Jay F. Lofstead, Kathryn M. Mohror, Andrea C. Arpaci-Dusseau, Remzi H. Arpaci-Dusseau:
The Popper Convention: Making Reproducible Systems Evaluation Practical. 1561-1570 - Lucas Nussbaum:
Towards Trustworthy Testbeds Thanks to Throughout Testing. 1571-1578 - Franziska Hoffeins, Florina M. Ciorba, Ioana Banicescu:
Examining the Reproducibility of Using Dynamic Loop Scheduling Techniques in Scientific Applications. 1579-1587
Session 2
- Luka Stanisic, Lucas Mello Schnorr, Augustin Degomme, Franz C. Heinrich, Arnaud Legrand, Brice Videau:
Characterizing the Performance of Modern Architectures Through Opaque Benchmarks: Pitfalls Learned the Hard Way. 1588-1597 - Roman Iakymchuk, Enrique S. Quintana-Ortí, Erwin Laure, Stef Graillat:
Towards Reproducible Blocked LU Factorization. 1598-1607
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.