Publications
Journal Publications
TCAD 2020 “DLUX: a LUT-based Near-Bank Accelerator for Data Center Deep Learning Training Workloads”.
Peng Gu, Xinfeng Xie, Shuangchen Li, Krishna T. Malladi, Dimin Niu, Hongzhong Zheng, Yuan Xie. IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems (TCAD), 2020 PDFTACO 2020 “NNBench-X: A Benchmarking Methodology for Neural Network Accelerator Designs”.
Xinfeng Xie, Xing Hu, Peng Gu, Shuangchen Li, Yu Ji, and Yuan Xie. ACM Transactions on Architecture and Code Optimization (TACO), 2020CAL 2020 “NMTSim: Transaction-Command based Simulator for New Memory Technology Devices”.
Peng Gu, Benjamin Lim, Wenqin Huangfu, Krishna T. Malladi, Andrew Chang, Yuan Xie. IEEE Computer Architecture Letters (CAL), 2020 PDF Talk SlideCAL 2019 “NNBench-X: Benchmarking and Understanding Neural Network Workloads for Accelerator Designs”.
Xinfeng Xie, Xing Hu, Peng Gu, Shuangchen Li, Yu Ji, and Yuan Xie. IEEE Computer Architecture Letters (CAL), 2019 PDFJCST 2016 “Technological Exploration of RRAM Crossbar Array for Matrix-vector Multiplication”.
Lixue Xia, Peng Gu, Boxun Li, Tianqi Tang, Xiling Yin, Wenqin Huangfu, Shimeng Yu, Yu Cao, Yu Wang, Huazhong Yang. Journal of Computer Science and Technology (JCST), 2016 PDFIEEE Design & Test 2016 “Exploring the Precision Limitation for RRAM-Based Analog Approximate Computing”.
Boxun Li, Peng Gu, Yu Wang, Huazhong Yang. IEEE Design & Test of Computers, 2016 PDFTCAD 2015 “RRAM-based Analog Approximate Computing”.
Boxun Li, Peng Gu, Yi Shan, Yu Wang, Yiran Chen, Huazhong Yan. IEEE Transaction on Computer-Aided Design of Integrated Circuits and Systems (TCAD), 2015 PDF
Refereed Conference Publications
HPCA 2021 “SpaceA: Sparse Matrix Vector Multiplication on Processing-in-Memory Accelerator”.
Xinfeng Xie, Zheng Liang, Peng Gu, Abanti Basak, Lei Deng, Ling Liang, Xing Hu, Yuan Xie. Proceedings of the 27th IEEE International Symposium on High-Performance Computer Architecture (HPCA), 2021 PDFISCA 2020 “iPIM: Programmable In-Memory Image Processing Accelerator Using Near-Bank Architecture”.
Peng Gu+, Xinfeng Xie+, Yufei Ding, Guoyang Chen, Weifeng Zhang, Dimin Niu, Yuan Xie. (+ equal contribution) International Symposium on Computer Architecture (ISCA), 2020 PDF Talk SlideICCAD 2020 “NEST: DIMM based Near-Data-Processing Accelerator for K-mer Counting”.
Wenqin Huangfu, Krishna T. Malladi, Shuangchen Li, Peng Gu, Yuan Xie. Proceedings of the 39th International Conference On Computer Aided Design (ICCAD), 2020MICRO 2019 “MEDAL: Scalable DIMM based Near Data Processing Accelerator for DNA Seeding Algorithm”.
Wenqin Huangfu, Xueqi Li, Shuangchen Li, Xing Hu, Peng Gu, Yuan Xie. International Symposium on Microarchitecture (MICRO), 2019 PDFMICRO 2019 “Alleviating Irregularity in Graph Analytics Acceleration: a Hardware/Software Co-Design Approach”.
Mingyu Yan, Xing Hu, Shuangchen Li, Abanti Basak, Han Li, Xin Ma, Itir Akgun, Yujing Feng, Peng Gu, Lei Deng, Xiaochun Ye, Zhimin Zhang, Dongrui Fan, Yuan Xie. International Symposium on Microarchitecture (MICRO), 2019 PDFMICRO 2018 “SCOPE: A Stochastic Computing Engine for DRAM-based In-situ Accelerator”.
Shuangchen Li, Alvin Oliver Glova, Xing Hu, Peng Gu, Dimin Niu, Krishna T. Malladi, Hongzhong Zheng, Bob Bren-nan, Yuan Xie. International Symposium on Microarchitecture (MICRO), 2018 PDFAsianHOST 2018 “Cost-efficient 3D Integration to Hinder Reverse Engineering During and After Manufacturing”.
Peng Gu, Dylan Stow, Prashansa Mukim, Shuangchen Li, Yuan Xie. Asian Hardware Oriented Security and Trust Symposium (Asian HOST), 2018 PDF SlideGLVLSI 2017 “Security Threats and Countermeasures in Three-Dimensional Integrated Circuits”.
Jaya Dofe, Peng Gu, Dylan Stow, Qiaoyan Yu, Eren Kursun, Yuan Xie. Proceedings of the 27th Great Lakes Symposium on VLSI (GLSVLSI), 2017 PDFICCD 2016 “Thermal-aware 3D Design for Side-channel Information Leakage”.
Peng Gu, Dylan Stow, Russell Barnes, Eren Kursun, Yuan Xie. Proceedings of the 34th IEEE International Conference on Computer Design (ICCD), 2016 PDF SlideGLVLSI 2016 “Leveraging 3D Integration Technologies to Improve Hardware Security: Opportunities and Challenge”.
Peng Gu, Shuangchen Li, Dylan Stow, Russell Barnes, Liu Liu, Eren Kursun, Yuan Xie. Proceedings of the 26th GreatLakes Symposium on VLSI (GLSVLSI), 2016 PDFICCAD 2016 “NVSim-CAM: A Circuit-Level Simulator for Emerging Nonvolatile Memory based Content-Addressable Memory”.
Shuangchen Li, Liu Liu, Peng Gu, Cong Xu, Yuan Xie. Proceedings of the 35th International Conference On Computer Aided Design (ICCAD), 2016 PDFICCAD 2016 “Cost Analysis and Cost-Driven IP Reuse Methodology for SoC design Based on 2.5D/3D Integration”.
Dylan Stow, Itir Akgun, Russell Barnes, Peng Gu, Yuan Xie. Proceedings of the 35th International Conference On Computer Aided Design (ICCAD), 2016 PDFDATE 2016 “MNSIM: Simulation Platform for Memristor-based Neuromorphic Computing System”.
Lixue Xia, Boxun Li, Tianqi Tang, Peng Gu, Xiling Yin, Wenqin Huangfu, Pai-yu Chen, Shimeng Yu, Yu Cao, YuWang, Yuan Xie, Huangzhong Yang. Proceedings of IEEE/ACM Design Automation and Test in Europe (DATE), 2016 PDFASPDAC 2015 “Technological Exploration of RRAM Crossbar Array for Matrix-vector Multiplication”.
Peng Gu, Boxun Li, Tianqi Tang, Shimeng Yu, Yu Cao, Yu Wang, Huazhong Yang. Proceedings of the 20th Asia and South Pacific Design Automation Conference (ASP-DAC), 2015 PDFDAC 2015 “Merging the Interface: Power, Area and Accuracy Co-optimization for RRAM Crossbar-based Mixed-signal Computing System”.
Boxun Li, Lixue Xia, Peng Gu, Yu Wang, Huazhong Yang. Proceedings of the 52nd Design Automation Conference (DAC), 2015 PDFGLVLSI 2015 “Energy Efficient RRAM Spiking Neural Network for Real Time Classification”.
Yu Wang, Tianqi Tang, Lixue Xia, Boxun Li, Peng Gu, Huazhong Yang, Hai Li, Yuan Xie. Proceedings of the 25th Great Lakes Symposium on VLSI (GLSVLSI), 2015 PDF
Patents
“Memory Lookup Computing Mechanisms”. Krishna Malladi, Peng Gu, Hongzhong Zheng, Robert Brennan. US Patent App. 15/913,758.
“Computing Accelerator Using a Lookup Table”. Peng Gu, Krishna Malladi, Hongzhong Zheng. US Patent App. 15/916,196
““HBM Silicon Photonic TSV Architecture for Lookup Computing AI Accelerator”. Peng Gu, Krishna Malladi, Hongzhong Zheng. US Patent App. 15/911,063
“Scale-out High Bandwidth Memory System”. Krishna Malladi, Hongzhong Zheng, Dimin Niu, Peng Gu. US Patent App. 16/194,219
“The method for parameter configuration of memristor crossed array”. Yu Wang, Boxun Li, Peng Gu, Tianqi Tang, Lixue Xia, Huazhong Yang. CN Patent CN105,390,520B
“Digital-to-analogue mixed signal processing system for Imprecise computation”. Yu Wang, Boxun Li, Lixue Xia, Peng Gu, Tianqi Tang, Huazhong Yang. CN Patent CN105,184,365B