default search action

combined dblp search
author search
venue search
publication search

ask others

Gennady Pekhimenko

> Home > Persons

Person information

affiliation: University of Toronto
affiliation: Microsoft Research
affiliation (former): Carnegie Mellon University

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

2020 – today

see FAQ

What is the meaning of the colors in the publication lists?

2024
[c61]
- view
  authority control:
- export record
  dblp key:
  - conf/IEEEpact/SuYP24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/IEEEpact/SuYP24
Qidong Su, Jiacheng Yang, Gennady Pekhimenko:
BOOM: Use your Desktop to Accurately Predict the Performance of Large Deep Neural Networks. PACT 2024: 284-296
[c60]
- view
  authority control:
- export record
  dblp key:
  - conf/eurosys/YangGWE0P24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/eurosys/YangGWE0P24
Jiacheng Yang, Christina Giannoula, Jun Wu, Mostafa Elhoushi, James Gleeson, Gennady Pekhimenko:
Minuet: Accelerating 3D Sparse Convolutions on GPUs. EuroSys 2024: 786-802
[c59]
- view
  - electronic edition @ openreview.net (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/iclr/TuWKBPAA24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/TuWKBPAA24
Renbo Tu, Colin White, Jean Kossaifi, Boris Bonev, Gennady Pekhimenko, Kamyar Azizzadenesheli, Anima Anandkumar:
Guaranteed Approximation Bounds for Mixed-Precision Neural Operators. ICLR 2024
[c58]
- view
  authority control:
- export record
  dblp key:
  - conf/ics/MuG0P24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ics/MuG0P24
Baorun Mu, Christina Giannoula, Shang Wang, Gennady Pekhimenko:
Sylva: Sparse Embedded Adapters via Hierarchical Approximate Second-Order Information. ICS 2024: 485-497
[c57]
- view
  - electronic edition @ mlsys.org (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/mlsys/GaoHGTPV24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mlsys/GaoHGTPV24
Yubo Gao, Maryam Haghifam, Christina Giannoula, Renbo Tu, Gennady Pekhimenko, Nandita Vijaykumar:
Proteus: Preserving Model Confidentiality during Graph Optimizations. MLSys 2024
[e1]
- view
- export record
  dblp key:
  - conf/mlsys/2024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mlsys/2024
Phillip B. Gibbons, Gennady Pekhimenko, Christopher De Sa:
Proceedings of the Seventh Annual Conference on Machine Learning and Systems, MLSys 2024, Santa Clara, CA, USA, May 13-16, 2024. mlsys.org 2024 [contents]
[i47]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2401-06145
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2401-06145
Jiacheng Yang, Christina Giannoula, Jun Wu, Mostafa Elhoushi, James Gleeson, Gennady Pekhimenko:
Minuet: Accelerating 3D Sparse Convolutions on GPUs. CoRR abs/2401.06145 (2024)
[i46]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2402-16731
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2402-16731
Christina Giannoula, Peiming Yang, Ivan Fernandez Vega, Jiacheng Yang, Yu Xin Li, Juan Gómez-Luna, Mohammad Sadrosadati, Onur Mutlu, Gennady Pekhimenko:
Accelerating Graph Neural Networks on Real Processing-In-Memory Systems. CoRR abs/2402.16731 (2024)
[i45]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2404-12512
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2404-12512
Yubo Gao, Maryam Haghifam, Christina Giannoula, Renbo Tu, Gennady Pekhimenko, Nandita Vijaykumar:
Proteus: Preserving Model Confidentiality during Graph Optimizations. CoRR abs/2404.12512 (2024)
[i44]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-13161
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-13161
Honghua Dong, Qidong Su, Yubo Gao, Zhaoyu Li, Yangjun Ruan, Gennady Pekhimenko, Chris J. Maddison, Xujie Si:
APPL: A Prompt Programming Language for Harmonious Integration of Programs and Large Language Model Prompts. CoRR abs/2406.13161 (2024)
2023
[j9]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/natmi/KarargyrisUSAGWPKZBMCGNRKXBCBEATS23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/natmi/KarargyrisUSAGWPKZBMCGNRKXBCBEATS23
Alexandros Karargyris, Renato Umeton, Micah J. Sheller, Alejandro Aristizabal, Johnu George, Anna Wuest, Sarthak Pati, Hasan Kassem, Maximilian Zenk, Ujjwal Baid, Prakash Narayana Moorthy, Alexander Chowdhury, Junyi Guo, Sahil S. Nalawade, Jacob Rosenthal, David Kanter, Maria Xenochristou, Daniel J. Beutel, Verena Chung, Timothy Bergquist, James A. Eddy, Abubakar Abid, Lewis Tunstall, Omar Sanseviero, Dimitrios Dimitriadis, Yiming Qian, Xinxing Xu, Yong Liu, Rick Siow Mong Goh, Srini Bala, Victor Bittorf, Sreekar Reddy Puchala, Biagio Ricciuti, Soujanya Samineni, Eshna Sengupta, Akshay Chaudhari, Cody Coleman, Bala Desinghu, Gregory F. Diamos, Debo Dutta, Diane Feddema, Grigori Fursin, Xinyuan Huang, Satyananda Kashyap, Nicholas D. Lane, Indranil Mallick, Pietro Mascagni, Virendra Mehta, Cassiano Ferro Moraes, Vivek Natarajan, Nikola Nikolov, Nicolas Padoy, Gennady Pekhimenko, Vijay Janapa Reddi, G. Anthony Reina, Pablo Ribalta, Abhishek Singh, Jayaraman J. Thiagarajan, Jacob Albrecht, Thomas Wolf, Geralyn Miller, Huazhu Fu, Prashant Shah, Daguang Xu, Poonam Yadav, David Talby, Mark M. Awad, Jeremy P. Howard, Michael Rosenthal, Luigi Marchionni, Massimo Loda, Jason M. Johnson, Spyridon Bakas, Peter Mattson:
Federated benchmarking of medical artificial intelligence with MedPerf. Nat. Mac. Intell. 5(7): 799-810 (2023)
[c56]
- view
  authority control:
- export record
  dblp key:
  - conf/aplas/SuGPS23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aplas/SuGPS23
Qidong Su, Chuqin Geng, Gennady Pekhimenko, Xujie Si:
TorchProbe: Fuzzing Dynamic Deep Learning Compilers. APLAS 2023: 310-331
[c55]
- view
  authority control:
- export record
  dblp key:
  - conf/asplos/DingYZLWP23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asplos/DingYZLWP23
Yaoyao Ding, Cody Hao Yu, Bojian Zheng, Yizhi Liu, Yida Wang, Gennady Pekhimenko:
Hidet: Task-Mapping Programming Paradigm for Deep Learning Tensor Programs. ASPLOS (2) 2023: 370-384
[c54]
- view
  authority control:
- export record
  dblp key:
  - conf/asplos/JayarajanZSP23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asplos/JayarajanZSP23
Anand Jayarajan, Wei Zhao, Yudi Sun, Gennady Pekhimenko:
TiLT: A Time-Centric Approach for Stream Query Optimization and Parallelization. ASPLOS (2) 2023: 818-832
[c53]
- view
  authority control:
- export record
  dblp key:
  - conf/micro/ZhengYWDLWP23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/micro/ZhengYWDLWP23
Bojian Zheng, Cody Hao Yu, Jie Wang, Yaoyao Ding, Yizhi Liu, Yida Wang, Gennady Pekhimenko:
Grape: Practical and Efficient Graphed Execution for Dynamic Deep Neural Networks on GPUs. MICRO 2023: 1364-1380
[c52]
- view
  - electronic edition @ mlsys.org (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/mlsys/SniderCP23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mlsys/SniderCP23
Daniel Snider, Fanny Chevalier, Gennady Pekhimenko:
Hotline Profiler: Automatic Annotation and A Multi-Scale Timeline for Visualizing Time-Use in DNN Training. MLSys 2023
[c51]
- view
  - electronic edition @ usenix.org (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/usenix/JiangJLP23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/usenix/JiangJLP23
Chenhao Jiang, Anand Jayarajan, Hao Lu, Gennady Pekhimenko:
Arbitor: A Numerically Accurate Hardware Emulation Tool for DNN Accelerators. USENIX ATC 2023: 519-536
[i43]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2301-12030
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2301-12030
Anand Jayarajan, Wei Zhao, Yudi Sun, Gennady Pekhimenko:
TiLT: A Time-Centric Approach for Stream Query Optimization and Parallelization. CoRR abs/2301.12030 (2023)
[i42]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2307-15034
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2307-15034
Colin White, Renbo Tu, Jean Kossaifi, Gennady Pekhimenko, Kamyar Azizzadenesheli, Anima Anandkumar:
Speeding up Fourier Neural Operators via Mixed Precision. CoRR abs/2307.15034 (2023)
[i41]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2310-18813
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2310-18813
Qidong Su, Christina Giannoula, Gennady Pekhimenko:
The Synergy of Speculative Decoding and Batching in Serving Large Language Models. CoRR abs/2310.18813 (2023)
[i40]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2310-20078
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2310-20078
Qidong Su, Chuqin Geng, Gennady Pekhimenko, Xujie Si:
TorchProbe: Fuzzing Dynamic Deep Learning Compilers. CoRR abs/2310.20078 (2023)
[i39]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2312-04789
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2312-04789
Kevin Song, Jiacheng Yang, Sihang Liu, Gennady Pekhimenko:
Lightweight Frequency-Based Tiering for CXL Memory Systems. CoRR abs/2312.04789 (2023)
2022
[c50]
- view
  authority control:
- export record
  dblp key:
  - conf/IEEEpact/Qiu0SKP22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/IEEEpact/Qiu0SKP22
Han Jie Qiu, Sihang Liu, Xinyang Song, Samira Manabi Khan, Gennady Pekhimenko:
Pavise: Integrating Fault Tolerance Support for Persistent Memory Applications. PACT 2022: 109-123
[c49]
- view
  authority control:
- export record
  dblp key:
  - conf/IEEEpact/TanGVP22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/IEEEpact/TanGVP22
Xiaodan Serina Tan, Pavel Golikov, Nandita Vijaykumar, Gennady Pekhimenko:
GPUPool: A Holistic Approach to Fine-Grained GPU Sharing in the Cloud. PACT 2022: 317-332
[c48]
- view
  authority control:
- export record
  dblp key:
  - conf/cgo/LiZPL22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/cgo/LiZPL22
Ao Li, Bojian Zheng, Gennady Pekhimenko, Fan Long:
Automatic Horizontal Fusion for GPU Kernels. CGO 2022: 14-27
[c47]
- view
  - electronic edition @ mlr.press (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/chil/TonekaboniMAPHJ22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/chil/TonekaboniMAPHJ22
Sana Tonekaboni, Gabriela Morgenshtern, Azadeh Assadi, Aslesha Pokhrel, Xi Huang, Anand Jayarajan, Robert Greer, Gennady Pekhimenko, Melissa D. McCradden, Mjaye Mazwi, Anna Goldenberg:
How to validate Machine Learning Models Prior to Deployment: Silent trial protocol for evaluation of real-time models at ICU. CHIL 2022: 169-182
[c46]
- view
  authority control:
- export record
  dblp key:
  - conf/ipps/Pekhimenko22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ipps/Pekhimenko22
Gennady Pekhimenko:
Keynote Talk 1: Efficient DNN Training at Scale: from Algorithms to Hardware. IPDPS Workshops 2022: 1244
[c45]
- view
  - electronic edition @ mlsys.org (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/mlsys/ZhengJYSFLWCCP22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mlsys/ZhengJYSFLWCCP22
Bojian Zheng, Ziheng Jiang, Cody Hao Yu, Haichen Shen, Joshua Fromm, Yizhi Liu, Yida Wang, Luis Ceze, Tianqi Chen, Gennady Pekhimenko:
DietCode: Automatic Optimization for Dynamic Tensor Programs. MLSys 2022
[c44]
- view
  - electronic edition @ nips.cc (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/nips/AndoorveeduZZP22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/AndoorveeduZZP22
Muralidhar Andoorveedu, Zhanda Zhu, Bojian Zheng, Gennady Pekhimenko:
Tempo: Accelerating Transformer-Based Model Training through Memory Footprint Reduction. NeurIPS 2022
[c43]
- view
  - electronic edition @ usenix.org (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/osdi/ZhuWDKLZXMXC0YZ22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/osdi/ZhuWDKLZXMXC0YZ22
Hongyu Zhu, Ruofan Wu, Yijia Diao, Shanbin Ke, Haoyu Li, Chen Zhang, Jilong Xue, Lingxiao Ma, Yuqing Xia, Wei Cui, Fan Yang, Mao Yang, Lidong Zhou, Asaf Cidon, Gennady Pekhimenko:
ROLLER: Fast and Efficient Tensor Compilation for Deep Learning. OSDI 2022: 233-248
[i38]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2207-07736
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2207-07736
James Gleeson, Daniel Snider, Yvonne Yang, Moshe Gabel, Eyal de Lara, Gennady Pekhimenko:
Optimizing Data Collection in Deep Reinforcement Learning. CoRR abs/2207.07736 (2022)
[i37]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2210-09603
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2210-09603
Yaoyao Ding, Cody Hao Yu, Bojian Zheng, Yizhi Liu, Yida Wang, Gennady Pekhimenko:
Hidet: Task Mapping Programming Paradigm for Deep Learning Tensor Programs. CoRR abs/2210.09603 (2022)
[i36]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2210-10246
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2210-10246
Muralidhar Andoorveedu, Zhanda Zhu, Bojian Zheng, Gennady Pekhimenko:
Tempo: Accelerating Transformer-Based Model Training through Memory Footprint Reduction. CoRR abs/2210.10246 (2022)
2021
[j8]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/taco/KaushikPP21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taco/KaushikPP21
Anirudh Mohan Kaushik, Gennady Pekhimenko, Hiren D. Patel:
Gretch: A Hardware Prefetcher for Graph Analytics. ACM Trans. Archit. Code Optim. 18(2): 18:1-18:25 (2021)
[c42]
- view
  authority control:
- export record
  dblp key:
  - conf/asplos/JayarajanHGP21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asplos/JayarajanHGP21
Anand Jayarajan, Kimberly Hau, Andrew Goodwin, Gennady Pekhimenko:
LifeStream: a high-performance stream processing engine for periodic streams. ASPLOS 2021: 107-122
[c41]
- view
  authority control:
- export record
  dblp key:
  - conf/isca/WangCKMPS021
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/isca/WangCKMPS021
Ziqi Wang, Chul-Hwan Choo, Michael A. Kozuch, Todd C. Mowry, Gennady Pekhimenko, Vivek Seshadri, Dimitrios Skarlatos:
NVOverlay: Enabling Efficient and Scalable High-Frequency Snapshotting to NVM. ISCA 2021: 498-511
[c40]
- view
  authority control:
- export record
  dblp key:
  - conf/micro/AwadMEZBJPM21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/micro/AwadMEZBJPM21
Omar Mohamed Awad, Mostafa Mahmoud, Isak Edo, Ali Hadi Zadeh, Ciaran Bannon, Anand Jayarajan, Gennady Pekhimenko, Andreas Moshovos:
FPRaker: A Processing Element For Accelerating Neural Network Training. MICRO 2021: 857-869
[c39]
- view
  - electronic edition @ mlsys.org (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/mlsys/0001GPLKR21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mlsys/0001GPLKR21
James Gleeson, Moshe Gabel, Gennady Pekhimenko, Eyal de Lara, Srivatsan Krishnan, Vijay Janapa Reddi:
RL-Scope: Cross-stack Profiling for Deep Reinforcement Learning Workloads. MLSys 2021
[c38]
- view
  - electronic edition @ mlsys.org (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/mlsys/0002YZLP21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mlsys/0002YZLP21
Shang Wang, Peiming Yang, Yuxuan Zheng, Xin Li, Gennady Pekhimenko:
Horizontally Fused Training Array: An Effective Hardware Utilization Squeezer for Training Novel Deep Learning Models. MLSys 2021
[c37]
- view
  - electronic edition @ mlsys.org (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/mlsys/DingZJP021
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mlsys/DingZJP021
Yaoyao Ding, Ligeng Zhu, Zhihao Jia, Gennady Pekhimenko, Song Han:
IOS: Inter-Operator Scheduler for CNN Acceleration. MLSys 2021
[c36]
- view
  - electronic edition @ mlsys.org (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/mlsys/VivancosSLABNML21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mlsys/VivancosSLABNML21
Isak Edo Vivancos, Sayeh Sharify, Daniel Ly-Ma, Ameer Abdelhadi, Ciaran Bannon, Milos Nikolic, Mostafa Mahmoud, Alberto Delmas Lascorz, Gennady Pekhimenko, Andreas Moshovos:
Boveda: Building an On-Chip Deep Learning Memory Hierarchy Brick by Brick. MLSys 2021
[c35]
- view
  - electronic edition @ neurips.cc (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/nips/DiskinBRSLSPPKB21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/DiskinBRSLSPPKB21
Michael Diskin, Alexey Bukhtiyarov, Max Ryabinin, Lucile Saulnier, Quentin Lhoest, Anton Sinitsin, Dmitry Popov, Dmitry V. Pyrkin, Maxim Kashirin, Alexander Borzunov, Albert Villanova del Moral, Denis Mazur, Ilia Kobelev, Yacine Jernite, Thomas Wolf, Gennady Pekhimenko:
Distributed Deep Learning In Open Collaborations. NeurIPS 2021: 7879-7897
[c34]
- view
  - electronic edition @ neurips.cc (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/nips/RyabininGPP21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/RyabininGPP21
Max Ryabinin, Eduard Gorbunov, Vsevolod Plokhotnyuk, Gennady Pekhimenko:
Moshpit SGD: Communication-Efficient Decentralized Training on Heterogeneous Unreliable Devices. NeurIPS 2021: 18195-18211
[c33]
- view
  - electronic edition @ usenix.org (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/usenix/YuGGP21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/usenix/YuGGP21
Geoffrey X. Yu, Yubo Gao, Pavel Golikov, Gennady Pekhimenko:
Habitat: A Runtime-Based Computational Performance Predictor for Deep Neural Network Training. USENIX ATC 2021: 503-521
[i35]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2102-00527
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2102-00527
Geoffrey X. Yu, Yubo Gao, Pavel Golikov, Gennady Pekhimenko:
Computational Performance Predictions for Deep Neural Network Training: A Runtime-Based Approach. CoRR abs/2102.00527 (2021)
[i34]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2102-02344
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2102-02344
Shang Wang, Peiming Yang, Yuxuan Zheng, Xin Li, Gennady Pekhimenko:
Horizontally Fused Training Array: An Effective Hardware Utilization Squeezer for Training Novel Deep Learning Models. CoRR abs/2102.02344 (2021)
[i33]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2102-04285
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2102-04285
James Gleeson, Srivatsan Krishnan, Moshe Gabel, Vijay Janapa Reddi, Eyal de Lara, Gennady Pekhimenko:
RL-Scope: Cross-Stack Profiling for Deep Reinforcement Learning Workloads. CoRR abs/2102.04285 (2021)
[i32]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2103-03239
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2103-03239
Max Ryabinin, Eduard Gorbunov, Vsevolod Plokhotnyuk, Gennady Pekhimenko:
Moshpit SGD: Communication-Efficient Decentralized Training on Heterogeneous Unreliable Devices. CoRR abs/2103.03239 (2021)
[i31]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2106-10207
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2106-10207
Michael Diskin, Alexey Bukhtiyarov, Max Ryabinin, Lucile Saulnier, Quentin Lhoest, Anton Sinitsin, Dmitry Popov, Dmitry V. Pyrkin, Maxim Kashirin, Alexander Borzunov, Albert Villanova del Moral, Denis Mazur, Ilia Kobelev, Yacine Jernite, Thomas Wolf, Gennady Pekhimenko:
Distributed Deep Learning in Open Collaborations. CoRR abs/2106.10207 (2021)
[i30]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2110-01406
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2110-01406
Alexandros Karargyris, Renato Umeton, Micah J. Sheller, Alejandro Aristizabal, Johnu George, Srini Bala, Daniel J. Beutel, Victor Bittorf, Akshay Chaudhari, Alexander Chowdhury, Cody Coleman, Bala Desinghu, Gregory F. Diamos, Debo Dutta, Diane Feddema, Grigori Fursin, Junyi Guo, Xinyuan Huang, David Kanter, Satyananda Kashyap, Nicholas D. Lane, Indranil Mallick, Pietro Mascagni, Virendra Mehta, Vivek Natarajan, Nikola Nikolov, Nicolas Padoy, Gennady Pekhimenko, Vijay Janapa Reddi, G. Anthony Reina, Pablo Ribalta, Jacob Rosenthal, Abhishek Singh, Jayaraman J. Thiagarajan, Anna Wuest, Maria Xenochristou, Daguang Xu, Poonam Yadav, Michael Rosenthal, Massimo Loda, Jason M. Johnson, Peter Mattson:
MedPerf: Open Benchmarking Platform for Medical Artificial Intelligence using Federated Evaluation. CoRR abs/2110.01406 (2021)
2020
[c32]
- view
  authority control:
- export record
  dblp key:
  - conf/isca/ReddiCKMSWABCCC20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/isca/ReddiCKMSWABCCC20
Vijay Janapa Reddi, Christine Cheng, David Kanter, Peter Mattson, Guenther Schmuelling, Carole-Jean Wu, Brian Anderson, Maximilien Breughe, Mark Charlebois, William Chou, Ramesh Chukka, Cody Coleman, Sam Davis, Pan Deng, Greg Diamos, Jared Duke, Dave Fick, J. Scott Gardner, Itay Hubara, Sachin Idgunji, Thomas B. Jablin, Jeff Jiao, Tom St. John, Pankaj Kanwar, David Lee, Jeffery Liao, Anton Lokhmotov, Francisco Massa, Peng Meng, Paulius Micikevicius, Colin Osborne, Gennady Pekhimenko, Arun Tejusve Raghunath Rajan, Dilip Sequeira, Ashish Sirasao, Fei Sun, Hanlin Tang, Michael Thomson, Frank Wei, Ephrem Wu, Lingjie Xu, Koichi Yamada, Bing Yu, George Yuan, Aaron Zhong, Peizhao Zhang, Yuchen Zhou:
MLPerf Inference Benchmark. ISCA 2020: 446-459
[c31]
- view
  authority control:
- export record
  dblp key:
  - conf/isca/ZhengVP20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/isca/ZhengVP20
Bojian Zheng, Nandita Vijaykumar, Gennady Pekhimenko:
Echo: Compiler-based GPU Memory Footprint Reduction for LSTM RNN Training. ISCA 2020: 1089-1102
[c30]
- view
  authority control:
- export record
  dblp key:
  - conf/micro/MahmoudEZAPAM20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/micro/MahmoudEZAPAM20
Mostafa Mahmoud, Isak Edo, Ali Hadi Zadeh, Omar Mohamed Awad, Gennady Pekhimenko, Jorge Albericio, Andreas Moshovos:
TensorDash: Exploiting Sparsity to Accelerate Deep Neural Network Training. MICRO 2020: 781-795
[c29]
- view
  - electronic edition @ mlsys.org (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/mlsys/0002BP20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mlsys/0002BP20
Shang Wang, Yifan Bai, Gennady Pekhimenko:
BPPSA: Scaling Back-propagation by Parallel Scan Algorithm. MLSys 2020
[c28]
- view
  - electronic edition @ mlsys.org (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/mlsys/MattsonCDCMPTWB20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mlsys/MattsonCDCMPTWB20
Peter Mattson, Christine Cheng, Gregory F. Diamos, Cody Coleman, Paulius Micikevicius, David A. Patterson, Hanlin Tang, Gu-Yeon Wei, Peter Bailis, Victor Bittorf, David Brooks, Dehao Chen, Debo Dutta, Udit Gupta, Kim M. Hazelwood, Andy Hock, Xinyuan Huang, Daniel Kang, David Kanter, Naveen Kumar, Jeffery Liao, Deepak Narayanan, Tayo Oguntebi, Gennady Pekhimenko, Lillian Pentecost, Vijay Janapa Reddi, Taylor Robie, Tom St. John, Carole-Jean Wu, Lingjie Xu, Cliff Young, Matei Zaharia:
MLPerf Training Benchmark. MLSys 2020
[c27]
- view
  authority control:
- export record
  dblp key:
  - conf/uist/YuGP20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/uist/YuGP20
Geoffrey X. Yu, Tovi Grossman, Gennady Pekhimenko:
Skyline: Interactive In-Editor Computational Performance Profiling for Deep Neural Network Training. UIST 2020: 126-139
[c26]
- view
  - electronic edition @ usenix.org (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/usenix/ZhuPP20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/usenix/ZhuPP20
Hongyu Zhu, Amar Phanishayee, Gennady Pekhimenko:
Daydream: Accurately Estimating the Efficacy of Optimizations for DNN Training. USENIX ATC 2020: 337-352
[i29]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2006-03318
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2006-03318
Hongyu Zhu, Amar Phanishayee, Gennady Pekhimenko:
Daydream: Accurately Estimating the Efficacy of Optimizations for DNN Training. CoRR abs/2006.03318 (2020)
[i28]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2007-01277
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2007-01277
Ao Li, Bojian Zheng, Gennady Pekhimenko, Fan Long:
Automatic Horizontal Fusion for GPU Kernels. CoRR abs/2007.01277 (2020)
[i27]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2008-00177
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2008-00177
Jiahuang Lin, Xin Li, Gennady Pekhimenko:
Multi-node Bert-pretraining: Cost-efficient Approach. CoRR abs/2008.00177 (2020)
[i26]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2008-06798
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2008-06798
Geoffrey X. Yu, Tovi Grossman, Gennady Pekhimenko:
Skyline: Interactive In-Editor Computational Performance Profiling for Deep Neural Network Training. CoRR abs/2008.06798 (2020)
[i25]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2009-00748
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2009-00748
Mostafa Mahmoud, Isak Edo Vivancos, Ali Hadi Zadeh, Omar Mohamed Awad, Gennady Pekhimenko, Jorge Albericio, Andreas Moshovos:
TensorDash: Exploiting Sparsity to Accelerate Deep Neural Network Training and Inference. CoRR abs/2009.00748 (2020)
[i24]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2010-08065
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2010-08065
Omar Mohamed Awad, Mostafa Mahmoud, Isak Edo Vivancos, Ali Hadi Zadeh, Ciaran Bannon, Anand Jayarajan, Gennady Pekhimenko, Andreas Moshovos:
FPRaker: A Processing Element For Accelerating Neural Network Training. CoRR abs/2010.08065 (2020)
[i23]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2011-01302
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2011-01302
Yaoyao Ding, Ligeng Zhu, Zhihao Jia, Gennady Pekhimenko, Song Han:
IOS: Inter-Operator Scheduler for CNN Acceleration. CoRR abs/2011.01302 (2020)
[i22]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2012-00192
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2012-00192
Anand Jayarajan, Kimberly Hau, Andrew Goodwin, Gennady Pekhimenko:
LifeStream: A High-performance Stream Processing Engine for Waveform Data. CoRR abs/2012.00192 (2020)

2010 – 2019

see FAQ

What is the meaning of the colors in the publication lists?

2019
[c25]
- view
  authority control:
- export record
  dblp key:
  - conf/asplos/MiaoJPML19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asplos/MiaoJPML19
Hongyu Miao, Myeongjae Jeon, Gennady Pekhimenko, Kathryn S. McKinley, Felix Xiaozhu Lin:
StreamBox-HBM: Stream Analytics on High Bandwidth Hybrid Memory. ASPLOS 2019: 167-181
[c24]
- view
  authority control:
- export record
  dblp key:
  - conf/isca/0001SPKK19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/isca/0001SPKK19
Sihang Liu, Korakit Seemakhupt, Gennady Pekhimenko, Aasheesh Kolli, Samira Manabi Khan:
Janus: optimizing memory and storage support for non-volatile memory systems. ISCA 2019: 143-156
[c23]
- view
  - electronic edition @ mlsys.org (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/mlsys/JayarajanWGFP19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mlsys/JayarajanWGFP19
Anand Jayarajan, Jinliang Wei, Garth Gibson, Alexandra Fedorova, Gennady Pekhimenko:
Priority-based Parameter Propagation for Distributed DNN Training. SysML 2019
[p2]
- view
  authority control:
- export record
  dblp key:
  - books/sp/19/YazdanbakhshPEMM19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/books/sp/19/YazdanbakhshPEMM19
Amir Yazdanbakhsh, Gennady Pekhimenko, Hadi Esmaeilzadeh, Onur Mutlu, Todd C. Mowry:
Towards Breaking the Memory Bandwidth Wall Using Approximate Value Prediction. Approximate Circuits 2019: 417-441
[i21]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1901-01328
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1901-01328
Hongyu Miao, Myeongjae Jeon, Gennady Pekhimenko, Kathryn S. McKinley, Felix Xiaozhu Lin:
StreamBox-HBM: Stream Analytics on High Bandwidth Hybrid Memory. CoRR abs/1901.01328 (2019)
[i20]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1904-03257
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1904-03257
Alexander Ratner, Dan Alistarh, Gustavo Alonso, David G. Andersen, Peter Bailis, Sarah Bird, Nicholas Carlini, Bryan Catanzaro, Eric S. Chung, Bill Dally, Jeff Dean, Inderjit S. Dhillon, Alexandros G. Dimakis, Pradeep Dubey, Charles Elkan, Grigori Fursin, Gregory R. Ganger, Lise Getoor, Phillip B. Gibbons, Garth A. Gibson, Joseph E. Gonzalez, Justin Gottschlich, Song Han, Kim M. Hazelwood, Furong Huang, Martin Jaggi, Kevin G. Jamieson, Michael I. Jordan, Gauri Joshi, Rania Khalaf, Jason Knight, Jakub Konecný, Tim Kraska, Arun Kumar, Anastasios Kyrillidis, Jing Li, Samuel Madden, H. Brendan McMahan, Erik Meijer, Ioannis Mitliagkas, Rajat Monga, Derek Gordon Murray, Dimitris S. Papailiopoulos, Gennady Pekhimenko, Theodoros Rekatsinas, Afshin Rostamizadeh, Christopher Ré, Christopher De Sa, Hanie Sedghi, Siddhartha Sen, Virginia Smith, Alex Smola, Dawn Song, Evan Randall Sparks, Ion Stoica, Vivienne Sze, Madeleine Udell, Joaquin Vanschoren, Shivaram Venkataraman, Rashmi Vinayak, Markus Weimer, Andrew Gordon Wilson, Eric P. Xing, Matei Zaharia, Ce Zhang, Ameet Talwalkar:
SysML: The New Frontier of Machine Learning Systems. CoRR abs/1904.03257 (2019)
[i19]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1905-03960
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1905-03960
Anand Jayarajan, Jinliang Wei, Garth Gibson, Alexandra Fedorova, Gennady Pekhimenko:
Priority-based Parameter Propagation for Distributed DNN Training. CoRR abs/1905.03960 (2019)
[i18]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1907-10134
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1907-10134
Shang Wang, Yifan Bai, Gennady Pekhimenko:
Scaling Back-propagation by Parallel Scan Algorithm. CoRR abs/1907.10134 (2019)
[i17]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1910-01500
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1910-01500
Peter Mattson, Christine Cheng, Cody Coleman, Greg Diamos, Paulius Micikevicius, David A. Patterson, Hanlin Tang, Gu-Yeon Wei, Peter Bailis, Victor Bittorf, David Brooks, Dehao Chen, Debojyoti Dutta, Udit Gupta, Kim M. Hazelwood, Andrew Hock, Xinyuan Huang, Bill Jia, Daniel Kang, David Kanter, Naveen Kumar, Jeffery Liao, Guokai Ma, Deepak Narayanan, Tayo Oguntebi, Gennady Pekhimenko, Lillian Pentecost, Vijay Janapa Reddi, Taylor Robie, Tom St. John, Carole-Jean Wu, Lingjie Xu, Cliff Young, Matei Zaharia:
MLPerf Training Benchmark. CoRR abs/1910.01500 (2019)
[i16]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1911-02549
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1911-02549
Vijay Janapa Reddi, Christine Cheng, David Kanter, Peter Mattson, Guenther Schmuelling, Carole-Jean Wu, Brian Anderson, Maximilien Breughe, Mark Charlebois, William Chou, Ramesh Chukka, Cody Coleman, Sam Davis, Pan Deng, Greg Diamos, Jared Duke, Dave Fick, J. Scott Gardner, Itay Hubara, Sachin Idgunji, Thomas B. Jablin, Jeff Jiao, Tom St. John, Pankaj Kanwar, David Lee, Jeffery Liao, Anton Lokhmotov, Francisco Massa, Peng Meng, Paulius Micikevicius, Colin Osborne, Gennady Pekhimenko, Arun Tejusve Raghunath Rajan, Dilip Sequeira, Ashish Sirasao, Fei Sun, Hanlin Tang, Michael Thomson, Frank Wei, Ephrem Wu, Lingjie Xu, Koichi Yamada, Bing Yu, George Yuan, Aaron Zhong, Peizhao Zhang, Yuchen Zhou:
MLPerf Inference Benchmark. CoRR abs/1911.02549 (2019)
2018
[c22]
- view
  - electronic edition @ acm.org
  - no references & citations available
- export record
  dblp key:
  - conf/cascon/PekhimenkoT08
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/cascon/PekhimenkoT08
Gennady Pekhimenko, Ettore Tiotto:
Compiler-driven performance workshop. CASCON 2018: 374-376
[c21]
- view
  authority control:
- export record
  dblp key:
  - conf/iiswc/ZhuAZPJPSP18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iiswc/ZhuAZPJPSP18
Hongyu Zhu, Mohamed Akrout, Bojian Zheng, Andrew Pelegris, Anand Jayarajan, Amar Phanishayee, Bianca Schroeder, Gennady Pekhimenko:
Benchmarking and Analyzing Deep Neural Network Training. IISWC 2018: 88-100
[c20]
- view
  authority control:
- export record
  dblp key:
  - conf/isca/VijaykumarJMHPE18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/isca/VijaykumarJMHPE18
Nandita Vijaykumar, Abhilasha Jain, Diptesh Majumdar, Kevin Hsieh, Gennady Pekhimenko, Eiman Ebrahimi, Nastaran Hajinazar, Phillip B. Gibbons, Onur Mutlu:
A Case for Richer Cross-Layer Abstractions: Bridging the Semantic Gap with Expressive Memory. ISCA 2018: 207-220
[c19]
- view
  authority control:
- export record
  dblp key:
  - conf/isca/JainPMTP18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/isca/JainPMTP18
Animesh Jain, Amar Phanishayee, Jason Mars, Lingjia Tang, Gennady Pekhimenko:
Gist: Efficient Data Encoding for Deep Neural Network Training. ISCA 2018: 776-789
[c18]
- view
  - electronic edition @ usenix.org (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/usenix/PekhimenkoGJHZ18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/usenix/PekhimenkoGJHZ18
Gennady Pekhimenko, Chuanxiong Guo, Myeongjae Jeon, Peng Huang, Lidong Zhou:
TerseCades: Efficient Data Compression in Stream Processing. USENIX ATC 2018: 307-320
[i15]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1802-02573
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1802-02573
Nandita Vijaykumar, Kevin Hsieh, Gennady Pekhimenko, Samira Manabi Khan, Ashish Shrestha, Saugata Ghose, Phillip B. Gibbons, Onur Mutlu:
Zorua: Enhancing Programming Ease, Portability, and Performance in GPUs by Decoupling Programming Models from Resource Management. CoRR abs/1802.02573 (2018)
[i14]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1803-06905
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1803-06905
Hongyu Zhu, Mohamed Akrout, Bojian Zheng, Andrew Pelegris, Amar Phanishayee, Bianca Schroeder, Gennady Pekhimenko:
TBD: Benchmarking and Analyzing Deep Neural Network Training. CoRR abs/1803.06905 (2018)
[i13]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1805-02498
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1805-02498
Nandita Vijaykumar, Kevin Hsieh, Gennady Pekhimenko, Samira Manabi Khan, Ashish Shrestha, Saugata Ghose, Adwait Jog, Phillip B. Gibbons, Onur Mutlu:
Decoupling GPU Programming Models from Resource Management for Enhanced Programming Ease, Portability, and Performance. CoRR abs/1805.02498 (2018)
[i12]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1805-03047
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1805-03047
Donghyuk Lee, Yoongu Kim, Gennady Pekhimenko, Samira Manabi Khan, Vivek Seshadri, Kevin K. Chang, Onur Mutlu:
Adaptive-Latency DRAM: Reducing DRAM Latency by Exploiting Timing Margins. CoRR abs/1805.03047 (2018)
[i11]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1805-03154
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1805-03154
Kevin K. Chang, Abhijith Kashyap, Hasan Hassan, Saugata Ghose, Kevin Hsieh, Donghyuk Lee, Tianshi Li, Gennady Pekhimenko, Samira Manabi Khan, Onur Mutlu:
Flexible-Latency DRAM: Understanding and Exploiting Latency Variation in Modern DRAM Chips. CoRR abs/1805.03154 (2018)
[i10]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1805-03195
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1805-03195
Hasan Hassan, Nandita Vijaykumar, Samira Manabi Khan, Saugata Ghose, Kevin K. Chang, Gennady Pekhimenko, Donghyuk Lee, Oguz Ergin, Onur Mutlu:
SoftMC: Practical DRAM Characterization Using an FPGA-Based Infrastructure. CoRR abs/1805.03195 (2018)
[i9]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1805-03502
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1805-03502
Vivek Seshadri, Yoongu Kim, Chris Fallin, Donghyuk Lee, Rachata Ausavarungnirun, Gennady Pekhimenko, Yixin Luo, Onur Mutlu, Phillip B. Gibbons, Michael A. Kozuch, Todd C. Mowry:
RowClone: Accelerating Data Movement and Initialization Using DRAM. CoRR abs/1805.03502 (2018)
[i8]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1805-03969
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1805-03969
Hasan Hassan, Gennady Pekhimenko, Nandita Vijaykumar, Vivek Seshadri, Donghyuk Lee, Oguz Ergin, Onur Mutlu:
Exploiting Row-Level Temporal Locality in DRAM to Reduce the Memory Access Latency. CoRR abs/1805.03969 (2018)
[i7]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1805-08899
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1805-08899
Bojian Zheng, Akshay Nair, Qiongsi Wu, Nandita Vijaykumar, Gennady Pekhimenko:
EcoRNN: Fused LSTM RNN Implementation with Data Layout Optimization. CoRR abs/1805.08899 (2018)
2017
[j7]
- view
  authority control:
- export record
  dblp key:
  - journals/pomacs/LeeKSGAPSM17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/pomacs/LeeKSGAPSM17
Donghyuk Lee, Samira Manabi Khan, Lavanya Subramanian, Saugata Ghose, Rachata Ausavarungnirun, Gennady Pekhimenko, Vivek Seshadri, Onur Mutlu:
Design-Induced Latency Variation in Modern DRAM Chips: Characterization, Analysis, and Latency Reduction Mechanisms. Proc. ACM Meas. Anal. Comput. Syst. 1(1): 26:1-26:36 (2017)
[c17]
- view
  authority control:
- export record
  dblp key:
  - conf/hpca/HassanVKGCPLEM17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/hpca/HassanVKGCPLEM17
Hasan Hassan, Nandita Vijaykumar, Samira Manabi Khan, Saugata Ghose, Kevin K. Chang, Gennady Pekhimenko, Donghyuk Lee, Oguz Ergin, Onur Mutlu:
SoftMC: A Flexible and Practical Open-Source Infrastructure for Enabling Experimental DRAM Studies. HPCA 2017: 241-252
[c16]
- view
  authority control:
- export record
  dblp key:
  - conf/sigmetrics/LeeKSGAPSM17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/sigmetrics/LeeKSGAPSM17
Donghyuk Lee, Samira Manabi Khan, Lavanya Subramanian, Saugata Ghose, Rachata Ausavarungnirun, Gennady Pekhimenko, Vivek Seshadri, Onur Mutlu:
Design-Induced Latency Variation in Modern DRAM Chips: Characterization, Analysis, and Latency Reduction Mechanisms. SIGMETRICS (Abstracts) 2017: 54
[c15]
- view
  - electronic edition @ usenix.org (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/usenix/MiaoPJPML17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/usenix/MiaoPJPML17
Hongyu Miao, Heejin Park, Myeongjae Jeon, Gennady Pekhimenko, Kathryn S. McKinley, Felix Xiaozhu Lin:
StreamBox: Modern Stream Processing on a Multicore Machine. USENIX ATC 2017: 617-629
2016
[j6]
- view
  authority control:
- export record
  dblp key:
  - journals/bioinformatics/XinNZEPKAM16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/bioinformatics/XinNZEPKAM16
Hongyi Xin, Sunny Nahar, Richard L. Zhu, John Emmons, Gennady Pekhimenko, Carl Kingsford, Can Alkan, Onur Mutlu:
Optimal seed solver: optimizing seed selection in read mapping. Bioinform. 32(11): 1632-1642 (2016)
[j5]
- view
  authority control:
- export record
  dblp key:
  - journals/dt/YazdanbakhshTEP16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/dt/YazdanbakhshTEP16
Amir Yazdanbakhsh, Bradley Thwaites, Hadi Esmaeilzadeh, Gennady Pekhimenko, Onur Mutlu, Todd C. Mowry:
Mitigating the Memory Bottleneck With Approximate Load Value Prediction. IEEE Des. Test 33(1): 32-42 (2016)
[j4]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/taco/YazdanbakhshPTE16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taco/YazdanbakhshPTE16
Amir Yazdanbakhsh, Gennady Pekhimenko, Bradley Thwaites, Hadi Esmaeilzadeh, Onur Mutlu, Todd C. Mowry:
RFVP: Rollback-Free Value Prediction with Safe-to-Approximate Loads. ACM Trans. Archit. Code Optim. 12(4): 62:1-62:26 (2016)
[j3]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/taco/LeeGPKM16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taco/LeeGPKM16
Donghyuk Lee, Saugata Ghose, Gennady Pekhimenko, Samira Manabi Khan, Onur Mutlu:
Simultaneous Multi-Layer Access: Improving 3D-Stacked Memory Bandwidth at Low Cost. ACM Trans. Archit. Code Optim. 12(4): 63:1-63:29 (2016)
[c14]
- view
  authority control:
- export record
  dblp key:
  - conf/hpca/PekhimenkoBVMMK16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/hpca/PekhimenkoBVMMK16
Gennady Pekhimenko, Evgeny Bolotin, Nandita Vijaykumar, Onur Mutlu, Todd C. Mowry, Stephen W. Keckler:
A case for toggle-aware compression for GPU systems. HPCA 2016: 188-200
[c13]
- view
  authority control:
- export record
  dblp key:
  - conf/hpca/HassanPVSLEM16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/hpca/HassanPVSLEM16
Hasan Hassan, Gennady Pekhimenko, Nandita Vijaykumar, Vivek Seshadri, Donghyuk Lee, Oguz Ergin, Onur Mutlu:
ChargeCache: Reducing DRAM latency by exploiting row access locality. HPCA 2016: 581-593
[c12]
- view
  authority control:
- export record
  dblp key:
  - conf/micro/VijaykumarHPKSG16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/micro/VijaykumarHPKSG16
Nandita Vijaykumar, Kevin Hsieh, Gennady Pekhimenko, Samira Manabi Khan, Ashish Shrestha, Saugata Ghose, Adwait Jog, Phillip B. Gibbons, Onur Mutlu:
Zorua: A holistic approach to resource virtualization in GPUs. MICRO 2016: 15:1-15:14
[c11]
- view
  authority control:
- export record
  dblp key:
  - conf/sigmetrics/ChangKHGHLLPKM16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/sigmetrics/ChangKHGHLLPKM16
Kevin K. Chang, Abhijith Kashyap, Hasan Hassan, Saugata Ghose, Kevin Hsieh, Donghyuk Lee, Tianshi Li, Gennady Pekhimenko, Samira Manabi Khan, Onur Mutlu:
Understanding Latency Variation in Modern DRAM Chips: Experimental Characterization, Analysis, and Optimization. SIGMETRICS 2016: 323-336
[i6]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/VijaykumarPJG0A16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/VijaykumarPJG0A16
Nandita Vijaykumar, Gennady Pekhimenko, Adwait Jog, Saugata Ghose, Abhishek Bhowmick, Rachata Ausavarungnirun, Chita R. Das, Mahmut T. Kandemir, Todd C. Mowry, Onur Mutlu:
A Framework for Accelerating Bottlenecks in GPU Execution with Assist Warps. CoRR abs/1602.01348 (2016)
[i5]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/LeeKPKSCM16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/LeeKPKSCM16
Donghyuk Lee, Yoongu Kim, Gennady Pekhimenko, Samira Manabi Khan, Vivek Seshadri, Kevin Kai-Wei Chang, Onur Mutlu:
Adaptive-Latency DRAM (AL-DRAM). CoRR abs/1603.08454 (2016)
[i4]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/Pekhimenko16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/Pekhimenko16
Gennady Pekhimenko:
Practical Data Compression for Modern Memory Hierarchies. CoRR abs/1609.02067 (2016)
[i3]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/LeeKSAPSGM16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/LeeKSAPSGM16
Donghyuk Lee, Samira Manabi Khan, Lavanya Subramanian, Rachata Ausavarungnirun, Gennady Pekhimenko, Vivek Seshadri, Saugata Ghose, Onur Mutlu:
Reducing DRAM Latency by Exploiting Design-Induced Latency Variation in Modern DRAM Chips. CoRR abs/1610.09604 (2016)
2015
[j2]
- view
  authority control:
- export record
  dblp key:
  - journals/bioinformatics/XinGEPKAM15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/bioinformatics/XinGEPKAM15
Hongyi Xin, John Greth, John Emmons, Gennady Pekhimenko, Carl Kingsford, Can Alkan, Onur Mutlu:
Shifted Hamming distance: a fast and accurate SIMD-friendly filter to accelerate alignment verification in read mapping. Bioinform. 31(10): 1553-1560 (2015)
[j1]
- view
  authority control:
- export record
  dblp key:
  - journals/cal/PekhimenkoBOMMK15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/cal/PekhimenkoBOMMK15
Gennady Pekhimenko, Evgeny Bolotin, Mike O'Connor, Onur Mutlu, Todd C. Mowry, Stephen W. Keckler:
Toggle-Aware Compression for GPUs. IEEE Comput. Archit. Lett. 14(2): 164-168 (2015)
[c10]
- view
  authority control:
- export record
  dblp key:
  - conf/hpca/PekhimenkoHCMGK15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/hpca/PekhimenkoHCMGK15
Gennady Pekhimenko, Tyler Huberty, Rui Cai, Onur Mutlu, Phillip B. Gibbons, Michael A. Kozuch, Todd C. Mowry:
Exploiting compressed block size as an indicator of future reuse. HPCA 2015: 51-63
[c9]
- view
  authority control:
- export record
  dblp key:
  - conf/hpca/LeeKPKSCM15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/hpca/LeeKPKSCM15
Donghyuk Lee, Yoongu Kim, Gennady Pekhimenko, Samira Manabi Khan, Vivek Seshadri, Kevin Kai-Wei Chang, Onur Mutlu:
Adaptive-latency DRAM: Optimizing DRAM timing for the common-case. HPCA 2015: 489-501
[c8]
- view
  authority control:
- export record
  dblp key:
  - conf/isca/VijaykumarPJ0AD15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/isca/VijaykumarPJ0AD15
Nandita Vijaykumar, Gennady Pekhimenko, Adwait Jog, Abhishek Bhowmick, Rachata Ausavarungnirun, Chita R. Das, Mahmut T. Kandemir, Todd C. Mowry, Onur Mutlu:
A case for core-assisted bottleneck acceleration in GPUs: enabling flexible data compression with assist warps. ISCA 2015: 41-53
[c7]
- view
  authority control:
- export record
  dblp key:
  - conf/isca/SeshadriPRMGKMC15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/isca/SeshadriPRMGKMC15
Vivek Seshadri, Gennady Pekhimenko, Olatunji Ruwase, Onur Mutlu, Phillip B. Gibbons, Michael A. Kozuch, Todd C. Mowry, Trishul M. Chilimbi:
Page overlays: an enhanced virtual memory framework to enable fine-grained memory management. ISCA 2015: 79-91
[c6]
- view
  authority control:
- export record
  dblp key:
  - conf/www/PekhimenkoLRSB15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/www/PekhimenkoLRSB15
Gennady Pekhimenko, Dimitrios Lymberopoulos, Oriana Riva, Karin Strauss, Doug Burger:
PocketTrend: Timely Identification and Delivery of Trending Search Content to Mobile Users. WWW 2015: 842-852
[i2]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/LeePKGM15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/LeePKGM15
Donghyuk Lee, Gennady Pekhimenko, Samira Manabi Khan, Saugata Ghose, Onur Mutlu:
Simultaneous Multi Layer Access: A High Bandwidth and Low Cost 3D-Stacked Memory Interface. CoRR abs/1506.03160 (2015)
[i1]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/XinZNEPKAM15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/XinZNEPKAM15
Hongyi Xin, Richard L. Zhu, Sunny Nahar, John Emmons, Gennady Pekhimenko, Carl Kingsford, Can Alkan, Onur Mutlu:
Optimal Seed Solver: Optimizing Seed Selection in Read Mapping. CoRR abs/1506.08235 (2015)
2014
[c5]
- view
  authority control:
- export record
  dblp key:
  - conf/IEEEpact/ThwaitesPEYMPMM14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/IEEEpact/ThwaitesPEYMPMM14
Bradley Thwaites, Gennady Pekhimenko, Hadi Esmaeilzadeh, Amir Yazdanbakhsh, Onur Mutlu, Jongse Park, Girish Mururu, Todd C. Mowry:
Rollback-free value prediction with approximate loads. PACT 2014: 493-494
2013
[c4]
- view
  authority control:
- export record
  dblp key:
  - conf/micro/PekhimenkoSKXMGKM13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/micro/PekhimenkoSKXMGKM13
Gennady Pekhimenko, Vivek Seshadri, Yoongu Kim, Hongyi Xin, Onur Mutlu, Phillip B. Gibbons, Michael A. Kozuch, Todd C. Mowry:
Linearly compressed pages: a low-complexity, low-latency main memory compression framework. MICRO 2013: 172-184
[c3]
- view
  authority control:
- export record
  dblp key:
  - conf/micro/SeshadriKFLAPLMGKM13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/micro/SeshadriKFLAPLMGKM13
Vivek Seshadri, Yoongu Kim, Chris Fallin, Donghyuk Lee, Rachata Ausavarungnirun, Gennady Pekhimenko, Yixin Luo, Onur Mutlu, Phillip B. Gibbons, Michael A. Kozuch, Todd C. Mowry:
RowClone: fast and energy-efficient in-DRAM bulk data copy and initialization. MICRO 2013: 185-197
2012
[c2]
- view
  authority control:
- export record
  dblp key:
  - conf/IEEEpact/PekhimenkoSMGKM12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/IEEEpact/PekhimenkoSMGKM12
Gennady Pekhimenko, Vivek Seshadri, Onur Mutlu, Phillip B. Gibbons, Michael A. Kozuch, Todd C. Mowry:
Base-delta-immediate compression: practical data compression for on-chip caches. PACT 2012: 377-388
[c1]
- view
  authority control:
- export record
  dblp key:
  - conf/IEEEpact/PekhimenkoMM12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/IEEEpact/PekhimenkoMM12
Gennady Pekhimenko, Todd C. Mowry, Onur Mutlu:
Linearly compressed pages: a main memory compression framework with low complexity and low latency. PACT 2012: 489-490
2010
[p1]
- view
  authority control:
- export record
  dblp key:
  - books/daglib/p/PekhimenkoB10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/books/daglib/p/PekhimenkoB10
Gennady Pekhimenko, Angela Demke Brown:
Efficient Program Compilation Through Machine Learning Techniques. Software Automatic Tuning, From Concepts to State-of-the-Art Results 2010: 335-351

Coauthor Index

see FAQ

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.