Most of the papers are copyrighted by ACM or IEEE. They are posted here for your personal use, to ensure timely dissemination of research work with no commercial purpose.

Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.

For other papers listed below but not included in ACM digital library, I will add links to PDF as soon as possible. Or you may email me for a copy of PDF file.

2020

[CAL 2020] Peng Gu, Benjamin Lim, Wenqin Huangfu, Krishna Malladi, Andrew Chang, Yuan Xie, "NMTSim: Transaction-Command based Simulator for New Memory Technology Devices", IEEE Computer Architecture Letters (CAL), 2020
[ISCA 2020] Peng Gu, Xinfeng Xie, Yufei Ding, Guoyang Chen, Weifeng Zhang, Dimin Niu, Yuan Xie, "iPIM: Programmable In-Memory Image Processing Accelerator Using Near-Bank Architecture", in Proceedings of 47th International Symposium on Computer Architecture (ISCA), 2020
[ISCA 2020] Weitao Li, Pengfei Xu, Yang Zhao, Haitong Li, Yuan Xie, Yingyan Lin, "TIMELY: Pushing Data Movements and Interfaces in PIM Accelerators towards Local and in Time Domain", in Proceedings of 47th International Symposium on Computer Architecture (ISCA), 2020
[ISCA 2020] Yang Zhao, Xiaohan Chen, Yue Wang, Chaojian Li, Yuan Xie, Zhangyang Wang, Yingyan Lin, "SmartExchange: Trading Higher-cost Memory Storage/Access for Lower-cost Computation", in Proceedings of 47th International Symposium on Computer Architecture (ISCA), 2020
[ISPASS 2020] Abanti Basak, Jilan Lin, Ryan Lorica, Xinfeng Xie, Zeshan Chishti, Alaa Alameldeen, Yuan Xie, "SAGA-Bench: Software and Hardware Characterization of Streaming Graph Analytics Workloads", (https://github.com/abasak24/SAGA-Bench) to appear in International Symposium on Performance Analysis of Systems and Software, 2020
[DAC 2020] Gushu Li, Yufei Ding, Yuan Xie, "Eliminating Redundant Computation in Noisy Quantum Computing Simulation", to appear in Design Automation Conference, 2020
[DAC 2020] Maohua Zhu, Yuan Xie, "Taming Unstructured Sparsity on GPUs via Latency-Aware Optimization", to appear in Design Automation Conference, 2020
[HPCA 2020] Mingyu Yan, Lei Deng, Xing Hu, Ling Liang, Yujing Feng, Xiaochun Ye, Zhimin Zhang, Dongrui Fan, Yuan Xie, "HyGCN: A GCN Accelerator with Hybrid Architecture", to appear in the 26th IEEE International Symposium on High-Performance Computer Architecture (HPCA), 2020
[HPCA 2020] Jianbo Dong, Zheng Cao, Tao Zhang, Jianxi Ye, Shaochuang Wang, Fei Feng, Li Zhao, Xiaoyong Liu, Liuyihan Song, Liwei Peng, Yiqun Guo, Xiaowei Jiang, Lingbo Tang, Yin Du, Yingya Zhang, Pan Pan , Yuan Xie, "EFLOPS: Algorithm and System Co-design for a High Performance Distributed Training Platform", to appear in the 26th IEEE International Symposium on High-Performance Computer Architecture (HPCA), 2020
[HPCA 2020] Marzieh Lenjani, Patricia Gonzalez, Elaheh Sadredini, Shuangchen Li, Yuan Xie, Ameen Akel, Sean Eilert, Mircea R.Stan, Kevin Skadron, "Fulcrum: a Simplified Control and Access Mechanism toward Flexible and Practical in-situ Accelerators", to appear in the 26th IEEE International Symposium on High-Performance Computer Architecture (HPCA), 2020
[ASPLOS 2020] Xing Hu, Ling Liang, Shuangchen Li, Lei Deng, Pengfei Zuo, Yu Ji, Xinfeng Xie, Yufei Ding, Chang Liu, Timothy Sherwood, Yuan Xie, "DeepSniffer: A DNN Model Extraction Framework Based on Learning Architectural Hints", to appear in the 25th ACM International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS), 2020
[ASPLOS 2020] Gushu Li, Yufei Ding, Yuan Xie, "Towards Efficient Superconducting Quantum Processor Architecture Design", to appear in the 25th ACM International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS), 2020
 
[TC 2020] Yijin Guan, Guangyu Sun, Zhihang Yuan, Xingchen Li, Ningyi Xu, Shu Chen, Jason Cong, and Yuan Xie, "Crane: Mitigating Accelerator Under-utilization Caused by Sparsity Irregularities in CNNs", IEEE TRANSACTIONS ON COMPUTERS, 2020
[TNNLS 2020] Zhaodong Chen, Lei Deng, Guoqi Li, Jiawei Sun, Ling Liang, Xing Hu, Yufei Ding, Yuan Xie., "Effective and Efficient Batch Normalization Using Few Uncorrelated Data for Statistics Estimation", IEEE Transactions on Neural Networks and Learning Systems
[Engineering 2020] Yiran Chen, Yuan Xie, Linghao Song, Fan Chen, and Tianqi Tang, "A Survey of Accelerator Architectures for Deep Neural Networks", Engineering, 2020. 
[NN 2020] Lei Deng, Yujie Wu, Xing Hu, Ling Liang, Yufei Ding, Guoqi Li, Guangshe Zhao, Peng Li, Yuan Xie, "Rethinking the performance comparison between SNNS and ANNS", Neural Networks, Vol. 21, Page:294-307, 2020.
[JSSC 2020] Lei Deng, Guanrui Wang, Guoqi Li, Shuangchen Li, Ling Liang, Maohua Zhu, Yujie Wu, Zheyu Yang, Zhe Zou, Jing Pei, Zhenzhi Wu, Xing Hu, Yufei Ding, Wei He, Yuan Xie, Luping Shi., "Tianjic: A Unified and Scalable Chip Bridging Spike-Based and Continuous Neural Computation", IEEE Journal of Solid-State Circuits, 2020.
[Proceedings of the IEEE 2020] Lei Deng, Guoqi Li, Song Han, Luping Shi, Yuan Xie., "Model Compression and Hardware Acceleration for Neural Networks: A Comprehensive Survey.", Proceedings of the IEEE, 2020.
[NN 2020] Yukuan Yang, Lei Deng, Shuang Wu, Tianyi Yan, Yuan Xie, Guoqi Li, "Training high-performance and large-scale deep neural networks with full 8-bit integers", Neural Networks, 2020.

2019

[HCS2019] Jiansong Zhang, Lixue Xia, Zhao Jiang, Hao Liang, Jiaoyan Chen, Shouda Liu, Wei Lin, Yuan Xie, "Ouroboros: An Inference Engine for Deep Learning Based TTS on Embedded Devices", IEEE Hot Chips Symposium (HCS), 2019
[IEEE Micro2019] Itir Akgun, Dylan Stow, Yuan Xie., "Network-on-Chip Design Guidelines for Monolithic 3-D Integration", IEEE Micro, vol. 39, no. 6, pages:46-53, 2019
[TVLSI2019] Liang Chang, Xin Ma, Zhaohao Wang, Youguang Zhang, Yufei Ding, Weisheng Zhao, Yuan Xie, "DASM: Data-Streaming-Based Computing in Nonvolatile Memory Architecture for Embedded System", IEEE Transactions on Very Large Scale Integration (VLSI) Systems, 2019
[TVLSI2019] Liang Chang, Xin Ma, Zhaohao Wang, Youguang Zhang, Yuan Xie, Weisheng Zhao, "PXNOR-BNN: In/With Spin-Orbit Torque MRAM Preset-xnor Operation-Based Binary Neural Networks", IEEE Transactions on Very Large Scale Integration (VLSI) Systems, 2019
[TOCS2019] Yunji Chen, Huiying Lan, Zidong Du, Shaoli Liu, Jinhua Tao, Dong Han, Tao Luo, Qi Guo, Ling Li, Yuan Xie, and Tianshi Chen., "An Instruction Set Architecture for Machine Learning", ACM Transactions on Computer Systems (TOCS), 36, 3, article 9, 2019.
[Nature2019] Jing Pei, Lei Deng, Sen Song, Mingguo Zhao, Youhui Zhang, Shuang Wu, Guanrui Wang, Zhe Zou, Zhenzhi Wu, Wei He, Feng Chen, Ning Deng, Si Wu, Yu Wang, Yujie Wu, Zheyu Yang, Cheng Ma, Guoqi Li, Wentao Han, Huanglong Li, Huaqiang Wu, Rong Zhao, Yuan Xie, and Luping Shi, "Towards artificial general intelligence with hybrid Tianjic chip architecture", Nature, vol. 572, pages:106-111, 2019
[CAL2019] D. Stow, A. Farmahini-Farahani, S. Gurumurthi, M. Ignatowski and Y. Xie., "Power Profiling of Modern Die-Stacked Memory", IEEE Computer Architecture Letter (CAL), vol. 18, no. 2, pages:132-135, 2019.
[Physical Review2019] Xu Jianyu, Ling Liang, Lei Deng, Changyun Wen, Yuan Xie, and Guoqi L, "Towards a polynomial algorithm for optimal contraction sequence of tensor networks from trees", Physical Review E 100, no. 4, 2019
[SSI2019] Jiale YAN, Ying ZHANG, Fengbin TU, Jianxun YANG, Shixuan ZHENG, Peng OUYANG, Leibo LIU, Yuan XIE, Shaojun WEI, Shouyi YIN., "Research on low-power neural network computing accelerator", SCIENTIA SINICA Informationis, 2019.
[CAL 2019] Mingyu Yan, Zhaodong Chen, Lei Deng, Xiaochun Ye, Zhimin Zhang, Dongrui Fan, and Yuan Xie, "Characterizing and Understanding GCNs on GPU", IEEE Computer Architecture Letters, 2019
[SLIP 2019] Dylan Stow, Itir Akgun, Yuan Xie, "Investigation of Cost-Optimal Network-on-Chip for Passive and Active Interposer Systems", System Level Interconnect Prediction Workshop (SLIP), June 2019
[DAC 2019] Dylan Stow, Itir Akgun, Wenqin Huangfu, Xueqi Li, Yuan Xie, Gabriel H. Loh , "Efficient System Architecture in the Era of Monolithic 3D: Dynamic Inter-tier Interconnect and Processing-in-Memory", Design Automation Conference (DAC), June 2019, Invited Paper
[ISLPED 2019] Mingyu Yan, Xing Hu, Shuangchen Li, Itir Akgun, Han Li, Xin Ma, Lei Deng, Xiaochun Ye, Zhimin Zhang, Dongrui Fan, Yuan Xie, "Balancing Memory Accesses for Energy-Efficient Graph Analytics Accelerators", International Symposium on Low Power Electronics and Design (ISLPED), July 2019
[TCAD 2019] Yi Cai, Tianqi Tang, Lixue Xia, Boxun Li, Yu Wang, Huazhong Yang, "Low Bit-width Convolutional Neural Network on RRAM", IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems
[ICLR 2019] Liu Liu, Lei Deng, Xing Hu, Maohua Zhu, Guoqi Li, Yufei Ding, Yuan Xie , "Dynamic Sparse Graph for Efficient Deep Learning", in the Seventh International Conference on Learning Representations (ICLR), May 2019
[MICRO 2019] Wenqin Huangfu, Xueqi Li, Shuangchen Li, Xing Hu, Peng Gu, Yuan Xie, "MEDAL: Scalable DIMM based Near Data Processing Accelerator for DNA Seeding Algorithm", In the Proceedings of the 52nd Annual IEEE/ACM International Symposium on Microarchitecture (MICRO), 2019.
[MICRO 2019] Mingyu Yan, Xing Hu, Shuangchen Li, Abanti Basak, Han Li, Xin Ma, Itir Akgun, Yujing Feng, Peng Gu, Lei Deng, Xiaochun Ye, Zhimin Zhang, Dongrui Fan, Yuan Xie, "Alleviating Irregularity in Graph Analytics Acceleration: a Hardware/Software Co-Design Approach", In the Proceedings of the 52nd Annual IEEE/ACM International Symposium on Microarchitecture (MICRO), 2019.
[MICRO 2019] Maohua Zhu, Tao Zhang, Zhenyu Gu, Yuan Xie, "Sparse Tensor Core: Algorithm and Hardware Co-Design for Vector-wise Sparse Neural Networks on Modern GPUs", In the Proceedings of the 52nd Annual IEEE/ACM International Symposium on Microarchitecture (MICRO), 2019. 
[CAL2019] Xinfeng Xie, Xing Hu, Peng Gu, Shuangchen Li, Yu Ji, Yuan Xie, "NNBench-X: Benchmarking and Understanding Neural Network Workloads for Accelerator Designs", IEEE Computer Architecture Letter (CAL)
[DATE 2019] Liang Chang, Xin Ma, Zhaohao Wang, Youguang Zhang, Weisheng Zhao, and Yuan Xie, "CORN: In-Buffer Computing for Binary Neural Network", Design Automation and Test In Europe (DATE), 2019
[TCAD2019] Deng, Lei, Ling Liang, Guanrui Wang, Liang Chang, Xing Hu, Xin Ma, Liu Liu, Jing Pei, Guoqi Li, and Yuan Xie, "SemiMap: A Semi-folded Convolution Mapping for Speed-Overhead Balance on Crossbars", IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems
[HPCA2019] Abanti Basak, Shuangchen Li, Xing Hu, Sang Min Oh, Xinfeng Xie, Li Zhao, Xiaowei Jiang, Yuan Xie, "Analysis and Optimization of the Memory Hierarchy for Graph Processing Workloads", The 25th International Symposium on High-Performance Computer Architecture (HPCA), 2019
[DATE2019] Alvin Oliver Glova, Itir Akgun, Shuangchen Li, Xing Hu, Yuan Xie, "Near-Data Acceleration of Privacy-Preserving Biomarker Search with 3D-Stacked Memory", Design Automation and Test In Europe (DATE), 2019
[ASPLOS2019] Yu Ji, Youyang Zhang, Xinfeng Xie, Shuangchen Li, Peiqi Wang, Xing Hu, Youhui Zhang, and Yuan Xie, "FPSA: A Full System Stack Solution for Reconfigurable ReRAM-based NN Accelerator Architecture", The 24th ACM International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS), 2019
[ASPLOS2019] Gushu Li, Yufei Ding, and Yuan Xie, "Tackling the Qubit Mapping Problem for NISQ-Era Quantum Devices", The 24th ACM International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS), 2019
[ASPDAC2019] Jilan Lin, Zhenhua Zhu, Yu Wang, and Yuan Xie, "Learning the sparsity for ReRAM: mapping and pruning sparse neural network for ReRAM based accelerator", The 24th Asia and South Pacific Design Automation Conference (ASPDAC), 2019
[AAAI2019] Yujie Wu, Lei Deng, Guoqi Li, Jun Zhu, Yuan Xie, and Luping Shi, "Direct Training for Spiking Neural Networks: Faster, Larger, Better", The 33rd AAAI Conference on Artificial Intelligence (AAAI-19), 2019
[DATE2019] Yang Zhao, Xing Hu, Shuangchen Li, Jing Ye, Lei Deng, Yu Ji, Jianyu Xu, Dong Wu, and Yuan Xie, "Memory Trojan Attack on Neural Network Accelerators", Design Automation and Test In Europe (DATE), 2019

2018

[NeurIPS2018] Ji, Yu, Ling Liang, Lei Deng, Youyang Zhang, Youhui Zhang, and Yuan Xie, "TETRIS: TilE-matching the TRemendous Irregular Sparsity", Advances in Neural Information Processing Systems 31 (NIPS 2018)
[NeurIPS2018] Peiqi Wang, Xinfeng Xie, Lei Deng, Guoqi Li, Dongsheng Wang, and Yuan Xie, "HitNet: Hybrid Ternary Recurrent Neural Network", Advances in Neural Information Processing Systems 31 (NIPS 2018)
[TNNLS2018] Shuang Wu, Guoqi Li, Lei Deng, Liu Liu, Dong Wu, Yuan Xie, and Luping Shi, "L1-norm batch normalization for efficient training of deep neural networks", IEEE Transactions on Neural Networks and Learning Systems (TNNLS)

Pages