Most of the papers are copyrighted by ACM or IEEE. They are posted here for your personal use, to ensure timely dissemination of research work with no commercial purpose.

Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.

For other papers listed below but not included in ACM digital library, I will add links to PDF as soon as possible. Or you may email me for a copy of PDF file.

2022

[CSUR 2022] Nan Wu, Yuan Xie, "A Survey of Machine Learning for Computer Architecture and Systems", ACM Computing Surveys (CSUR), 2022
[TC] Xing Hu, Ling Liang, Xiaobing Chen, Lei Deng, Yu Ji, Yufei Ding, Zidong Du, Qi Guo, Timothy Sherwood, and Yuan Xie, "A Systematic View of Model Leakage Risks in Deep Neural Network Systems", IEEE Transactions on Computers (TC), 2022
[ASPLOS 2022] Gushu Li, Anbang Wu, Yunong Shi, Ali Javadi-Abhari, Yufei Ding, and Yuan Xie, "Paulihedral: A Generalized Block-Wise Compiler Optimization Framework For Quantum Simulation Kernels", in Proceedings of ACM International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS), 2022
[ISSCC 2022] D. Niu, S. Li, Y. Wang, W. Han, Z. Zhang, Y. Guan, T. Guan, F. Sun, F. Xue, L. Duan, Y. Fang, H. Zheng, X. Jiang, S. Wang, F. Zuo, Y. Wang, B. Yu, Q. Ren, and Y. Xie, "184QPS/W 64Mb/mm^2 3D Logic-to-DRAM Hybrid Bonding with Process-Near Memory Engine for Recommendation System", in Proceedings of International Solid-State Circuits Conference (ISSCC), 2022
[ASPLOS 2022] Zheng Qu, Liu Liu, Fengbin Tu, Zhaodong Chen, Yufei Ding, Yuan Xie, "DOTA: Detect and Omit Weak Attentions for Scalable Transformer Acceleration", in Proceedings of ACM International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS), 2022
[ISSCC 2022] Fengbin Tu,  Yiqi Wang, Zihan Wu, Ling Liang, Yufei Ding, Bongjin Kim, Leibo Liu, Shaojun Wei, Yuan Xie, and Shouyi Yin, "A 28nm 29.2TFLOPS/W BF16 and 36.5TOPS/W INT8 Reconfigurable Digital CIM Processor with Unified FP/INT Pipeline and Bitwise in-Memory Booth Multiplication for Cloud Deep Learning Acceleration", in Proceedings of International Solid-State Circuits Conference (ISSCC), 2022
[ISSCC 2022] Fengbin Tu, Zihan Wu,  Yiqi Wang, Ling Liang, Liu Liu, Yufei Ding, Leibo Liu, Shaojun Wei, Yuan Xie, and Shouyi Yin, "A 28nm 15.59uJ/Token Full-Digital Bitline-Transpose CIM-based Sparse Transformer Accelerator with Pipeline/Parallel Reconfigurable Modes", in Proceedings of International Solid-State Circuits Conference (ISSCC), 2022
[ASPLOS 2022] Bangyan Wang, Lei Deng, Fei Sun, Guohao Dai, Liu Liu, Yu Wang, and Yuan Xie, "A One-for-All and O(Vlog(V))-cost Solution for Parallel Merge Style Operations on Sorted Key-Value Arrays", in Proceedings of ACM International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS), 2022
[ISSCC22] H. Zhu, B. Jiao, J. Zhang, X. Jia, Y. Wang, T. Guan, S. Wang, D. Niu, H. Zheng, C. Chen, M. Wang, L. Zhang, X. Zeng, Q. Liu, Y. Xie, and M. Liu, "COMB-MCM: Computing-on-Memory-Boundary NN Processor with Bipolar Bitwise Sparsity Optimization for Scalable Multi-Chiplet-Module Edge Machine Learning", in Proceedings of International Solid-State Circuits Conference (ISSCC), 2022

2021

[Neurocomputing] Hengnu Chen, Lei Deng, Zheng Qu, Ling Liang, Tianyi Yan, Yuan Xie, and Guoqi Li, "Tensor Train Decomposition for Solving Large-scale Linear Equations", Neurocomputing, 2021
[TNNLS] Lei Deng, Yujie Wu, Yifan Hu, Ling Liang, Guoqi Li, Xing Hu, Yufei Ding, Peng Li, and Yuan Xie, "Comprehensive snn compression using admm optimization and activity regularization", IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2021
[TCAD] Ling Liang, Zheng Qu, Zhaodong Chen, Fengbin Tu, Yujie Wu, Lei Deng, Guoqi Li, Peng Li, and Yuan Xie, "H2Learn: High-Efficiency Learning Accelerator for High-Accuracy Spiking Neural Networks", IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems (TCAD), 2021
[CAL] Xinfeng Xie, Peng Gu, Jiayi Huang, Yufei Ding, and Yuan Xie, "MPU-Sim: A Simulator for In-DRAM Near-Bank Processing Architectures", IEEE Computer Architecture Letters (CAL), 2021
[Neurocomputing] Jiayi Yang, Lei Deng, Yukuan Yang, Yuan Xie, and Guoqi Li, "Training and Inference for Integer-based Semantic Segmentation Network", Neurocomputing, 2021
[ICCAD 2021] Hussam Amrouch, Jian-Jia Chen, Kaushik Roy, Yuan Xie, Indranil Chakraborty, Wenqin Huangfu, Ling Liang, Fengbin Tu, Cheng Wang, and Mikail Yayla, "Brain-Inspired Computing: Adventure from Beyond CMOS Technologies to Beyond von Neumann Architectures ICCAD Special Session Paper", in Proceedings of International Conference on Computer Aided Design (ICCAD), 2021
[ICCAD 2021] Jilan Lin, Shuangchen Li, Yufei Ding, Yuan Xie, "Overcoming the Memory Hierarchy Inefficiencies in Graph Processing Applications", in Proceedings of International Conference on Computer Aided Design (ICCAD), 2021
[CAL] Han Li, Mingyu Yan, Xiaocheng Yang, Lei Deng, Wenming Li, Xiaochun Ye, Dongrui Fan, and Yuan Xie, "Hardware Acceleration for GCNs via Bidirectional Fusion", IEEE Computer Architecture Letters (CAL), 2021
[JSTSP 2021] Ling Liang, Jianyu Xu, Lei Deng, Mingyu Yan, Xing Hu, Zheng Zhang, Guoqi Li, and Yuan Xie, "Fast Search of the Optimal Contraction Sequence in Tensor Networks", IEEE Journal of Selected Topics in Signal Processing (JSTSP), 2021
[TNNLS] Ling Liang, Xing Hu, Lei Deng, Yujie Wu, Guoqi Li, Yufei Ding, Peng Li, Yuan Xie., "Exploring Adversarial Attack in Spiking Neural Networks with Spike-Compatible Gradient", IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2021
[TCAD 2021] Xiaobing Chen, Yuke Wang, Xinfeng Xie, Xing Hu, Abanti Basak, Ling Liang, Mingyu Yan, Lei Deng, Yufei Ding, Zidong Du, Yunji Chen, Yuan Xie, "Rubik: A Hierarchical Architecture for Efficient Graph Learning", IEEE Transactions on Computer Aided Design of Integrated Circuits & Systems
[TCAD 2021] Yuke Wang, Boyuan Feng, Gushu Li, Lei Deng, Yuan Xie, Yufei Ding, "STPAcc: Structural TI-based Pruning for Accelerating Distance-related Algorithms on CPU-FPGA Platforms", IEEE Transactions on Computer Aided Design of Integrated Circuits & Systems
[MICRO 2021] Abanti Basak, Zheng Qu, Jilan Lin, Alaa R. Alameldeen, Zeshan Chishti, Yufei Ding, Yuan Xie, "Improving Streaming Graph Processing Performance Using Input Knowledge", 54th IEEE/ACM International Symposium on Microarchitecture
[SC 2021] Zhaodong Chen*, Zheng Qu*, Liu Liu, Yufei Ding, Yuan Xie, "Efficient Tensor Core-based GPU Kernels for Structured Sparsity under Reduced Precision", 2021 Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis
[SC 2021] Boyuan Feng*, Yuke Wang*, Tong Geng, Ang Li, Yufei Ding, "APNN-TC: Accelerating Arbitrary-Precision Neural Networks on Tensor Cores", 2021 Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis
[USENIX ATC 2021] Boyuan Feng, Yuke Wang, Gushu Li, Yuan Xie, Yufei Ding, "Palleon: A Runtime System for Efficient Video Processing toward Dynamic Class Skew", 2021 USENIX Annual Technical Conference
[ACM NanoCom 2021] Gushu Li, Anbang Wu, Yunong Shi, Ali Javadi-Abhari, Yufei Ding, Yuan Xie, "On the Co-Design of Quantum Software and Hardware", 8th ACM International Conference on Nanoscale Computing and Communication
[ICCAD 2021] Jilan Lin, Shuangchen Li, Yufei Ding and Yuan Xie, "Overcoming the Memory Hierarchy Inefficiencies in Graph Processing Applications", 2021 International Conference On Computer Aided Design
[MICRO 2021] Liu Liu*, Jilan Lin*, Zheng Qu, Yufei Ding, Yuan Xie, "ENMC: Extreme Near-Memory Classification via Approximate Screening", 54th IEEE/ACM International Symposium on Microarchitecture
[CIKM 2021] Yuke Wang, Boyuan Feng, Xueqiao Peng, Yufei Ding, "An Efficient Quantitative Approach for Optimizing Convolutional Neural Networks", 30th ACM International Conference on Information and Knowledge Management
[GLSVLSI 2021] Nan Wu, Yuan Xie, Cong Hao, "IRONMAN: GNN-assisted Design Space Exploration in High-Level Synthesis via Reinforcement Learning", Proceedings of the 2021 on Great Lakes Symposium on VLSI
[JSTSP] Ling Liang, Jianyu Xu, Lei Deng, Mingyu Yan, Xing Hu, Zheng Zhang, Guoqi Li, Yuan Xie, "Fast Search of the Optimal Contraction Sequence in Tensor Networks", IEEE Journal of Selected Topics in Signal Processing
[TCAD] Zheng Qu, Lei Deng, Bangyan Wang, Hengnu Chen, Jilan Lin, Ling Liang, Guoqi Li, Zheng Zhang, Yuan Xie, "Hardware-Enabled Efficient Data Processing with Tensor-Train Decomposition", IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems
[IEEE Trans. Conput.] Bangyan Wang, Lei Deng, Zheng Qu, Shuangchen Li, Zheng Zhang, Yuan Xie, "Efficient Processing of Sparse Tensor Decomposition via Unified Abstraction and PE-interactive Architecture", IEEE Transactions on Computers
[DAC 2021] Pengfei Zuo, Yu Hua, Ling Liang, Xinfeng Xie, Xing Hu, Yuan Xie, "SEALing Neural Network Models in Encrypted Deep Learning Accelerators", IEEE/ACM Design Automation Conference (DAC), 2021
[OSDI 21] Yuke Wang, Boyuan Feng, Gushu Li, Shuangchen Li, Lei Deng, Yuan Xie, and Yufei Ding, "GNNAdvisor: An Adaptive and Efficient Runtime System for GNN Acceleration on GPUs.", 15th USENIX Symposium on Operating Systems Design and Implementation (OSDI), 2021
[PPoPP 2021] Boyuan Feng, Yuke Wang, Guoyang Chen, Weifeng Zhang, Yuan Xie, Yufei Ding, "EGEMM-TC: Accelerating Scientific Computing on Tensor Cores with Extended Precision", Principles and Practice of Parallel Programming 2021
[HPCA 2021] Tianqi Tang, Sheng Li, Lifeng Nai, Norman P. Jouppi, Yuan Xie, "NeuroMeter: An Integrated Power, Area, and Timing Modeling Framework for Machine Learning Accelerators", 27th IEEE International Symposium on High-Performance Computer Architecture (HPCA 2021)
[HPCA 2021] Xinfeng Xie, Zheng Liang, Peng Gu, Abanti Basak, Lei Deng, Ling Liang, Xing Hu, and Yuan Xie, "SpaceA: Sparse Matrix Vector Multiplication on Processing-in-Memory Accelerator", 27th IEEE International Symposium on High-Performance Computer Architecture (HPCA 2021)

2020

[OOPSLA 2020] Gushu Li, Li Zhou, Nengkun Yu, Yufei Ding, Mingsheng Ying, and Yuan Xie, "Projection-Based Runtime Assertions for Testing and Debugging Quantum Programs", the ACM SIGPLAN Conference on Object-Oriented Programming, Systems, Languages, and Applications (OOPSLA), 2020 (Distinguished Paper Award)
[JSSC] Fengbin Tu, Weiwei Wu, Yang Wang, Hongjiang Chen, Feng Xiong, Man Shi, Ning Li, Jinyi Deng, Tianbao Chen, Leibo Liu, Shaojun Wei, Yuan Xie, Shouyi Yin, "Evolver: A Deep Learning Processor with On-Device Quantization-Voltage-Frequency Tuning", IEEE Journal of Solid-State Circuits (JSSC)
[Neural Networks] Weihua He, Yujie Wu, Lei Deng, Guoqi Li, Haoyu Wang, Yang Tian, Wei Ding, Wenhui Wang, Yuan Xie, "Comparing SNNs and RNNs on neuromorphic vision datasets: Similarities and differences", Neural Networks
[ICCAD 2020] Wenqin Huangfu, Krishna T.Malladi, Shuangchen Li, Peng Gu, Yuan Xie, "NEST: DIMM based Near-Data-Processing Accelerator for K-mer Counting", 2020 International Conference on Computer Aided Design
[ICML 2020] Liu Liu, Lei Deng, Zhaodong Chen, Yuke Wang, Shuangchen Li, Jingwei Zhang, Yihua Yang, Zhenyu Gu, Yufei Ding, Yuan Xie, "Boosting Deep Neural Network Efficiency with Dual-Module Inference", The 37th International Conference on Machine Learning
[TCAD] Peng Gu, Xinfeng Xie, Shuangchen Li, Dimin Niu, Hongzhong Zheng, Krishna T. Malladi, and Yuan Xie, "DLUX: a LUT-based Near-Bank Accelerator for Data Center Deep Learning Training Workloads", IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems
[TCAD] Jilan Lin, Cheng-Da Wen, Xing Hu, Tianqi Tang, Ing-Chao Lin, Yu Wang, and Yuan Xie, "Rescuing RRAM-based Computing from Static and Dynamic Faults", IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems (TCAD), 2020
[TODAES] Nan Wu, Lei Deng, Guoqi Li, and Yuan Xie, "Core Placement Optimization for Multi-Chip Many-Core Neural Network Systems with Reinforcement Learning", ACM Transactions on Design Automation of Electronic Systems
[TACO] Xinfeng Xie, Xing Hu, Peng Gu, Shuangchen Li, Yu Ji, Yuan Xie, "NNBench-X: A Benchmarking Methodology for Neural Network Accelerator Designs", ACM Transactions on Architecture and Code Optimization
[IEEE Computer] Liu Liu, Jie Tang, Shaoshan Liu, Bo Yu, Jean-Luc Gaudiot, Yuan Xie, "Π-RT: A Runtime Framework to Enable Energy-Efficient Real-Time Robotic Vision Applications on Heterogeneous Architectures", IEEE Computer
[ICCAD 2020] Zhaodong Chen, Mingyu Yan, Maohua Zhu, Guoqi Li, Shuangchen Li, Yuan Xie, "fuseGNN: Accelerating Graph Convolutional Neural Network Training on GPGPU", 2020 International Conference on Computer Aided Design
[MICRO 2020] Liu Liu, Zheng Qu, Lei Deng, Fengbin Tu, Shuangchen Li, Xing Hu, Zhenyu Gu, Yufei Ding, and Yuan Xie, "DUET: Boosting Deep Neural Network Efficiency on Dual-Module Architecture", The 53rd IEEE/ACM International Symposium on Microarchitecture

Pages