Most of the papers are copyrighted by ACM or IEEE. They are posted here for your personal use, to ensure timely dissemination of research work with no commercial purpose.

Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.

For other papers listed below but not included in ACM digital library, I will add links to PDF as soon as possible. Or you may email me for a copy of PDF file.

2022

[TC] Liu Liu, Zheng Qu, Zhaodong Chen, Fengbin Tu, Yufei Ding, and Yuan Xie, "Dynamic Sparse Attention for Scalable Transformer Acceleration", IEEE Transactions on Computers
[MICRO'22] Anbang Wu, Hezi Zhang, Gushu Li , Alireza Shabani, Yuan Xie, and Yufei Ding, "AutoComm: A Framework for Enabling Efficient Communication in Distributed Quantum Program", 55th IEEE/ACM International Symposium on Microarchitecture
[USENIX ATC'22] Boyuan Feng, Tianqi Tang, Yuke Wang, Zhaodong Chen, Zheng Wang, Shu Yang, Yuan Xie, and Yufei Ding, "Faith: An Efficient Framework for Transformer Verification on GPUs", 2022 USENIX Annual Technical Conference
[SC'22] Zheng Wang, Yuke Wang, Boyuan Feng, Dheevatsa Mudigere, Bharath Muthiah, and Yufei Ding, "EL-Rec: Efficient Large-scale Recommendation Model Training via Tensor-train Embedding", The International Conference for High Performance Computing, Networking, Storage, and Analysis, 2022
[SC'22] Xiaohui Wang, Yang Wei, Guyue Huang, Ying Xiong, Xian Qian, Yufei Ding, Lei Li, Mingxuan Wang, "ATL: Accelerated Training for Transformer-based Models on GPUs", The International Conference for High Performance Computing, Networking, Storage, and Analysis, 2022
[MICRO'22] Anbang Wu, Hezi Zhang, Gushu Li, Alireza Shabani, Yuan Xie, and Yufei Ding, "AutoComm: A Framework for Enabling Efficient Communication in Distributed Quantum Programs", Proceeding of 55th IEEE/ACM International Symposium on Microarchitecture, 2022
[ISCA 22] Shuangchen Li, Dimin Niu, Yuhao Wang, Wei Han, Zhe Zhang, Tianchan Guan, yijin Guan, Heng Liu, Linyong Huang, Zhaoyang Du, Fei Xue, Yuanwei Fang, Hongzhong Zheng, and Yuan Xie, "Hyperscale FPGA-as-a-Service Architecture for Large-Scale Distributed Graph Neural Network", Proceedings of 49th International Symposium on Computer Architecture, 2022
[DAC 22] Guohao Dai, Guyue Huang, Shang Yang, Zhongming Yu, Hengrui Zhang, Yufei Ding, Yuan Xie, Huazhong Yang, and Yu Wang, "Heuristic Adaptability to Input Dynamics for SpMM on GPUs", 59th Design Automation Conference (DAC), 2022
[ISCA 22] Guohao Dai, Zhenhua Zhu, Tianyu Fu, Chiyue Wei, Bangyan Wang, Xiangyu Li, Yuan Xie, Huazhong Yang, and Yu Wang, "DIMMining: Pruning-Efficient and Parallel Graph Mining on Near-Memory Computing", Proceedings of 49th International Symposium on Computer Architecture, 2022
[DAC 22] Haiyang Lin, Mingyu Yan, Duo Wang, Mo Zuo, Fengbin Tu, Xiaochun Ye, Dongrui Fang, and Yuan Xie, "Alleviating Datapath Conflicts and Design Centralization in Graph Analytics Acceleration", 59th Design Automation Conference, 2022
[DAC 22] Guyue Huang, Haoran Li, Minghai Qin, Fei Sun, Yufei Ding, and Yuan Xie, "Shfl-BW: Accelerating Deep Neural Network Inference with Tensor-Core Aware Weight Pruning", 59th Design Automation Conference, 2022
[ISCA 22] Jilan Lin, Ling Liang, Zheng Qu, Ishtiyaque Ahmad, Liu Liu, Fengbin Tu, Trinabh Gupta, Yufei Ding, and Yuan Xie, "INSPIRE: In-Storage Private Information Retrieval via Protocol and Architecture Co-design", Proceedings of 49th International Symposium on Computer Architecture
[DAC 22] Nan Wu, Hang Yang, Yuan Xie, Pan Li, and Hao Cong, "High-Level Synthesis Performance Prediction using GNNs: Benchmarking, Modeling, and Advancing", 59th Design Automation Conference, 2022
[ISCA 22] Anbang Wu, Gushu Li, Hezi Zhang, Gian Giacomo Guerreschi, Yufei Ding, and Yuan Xie, "A Synthesis Framework for Stitching Source Code with Superconducting Quantum Devices", Preceedings of 49th International Symposium on Computer Architecture, 2022
[CSUR 2022] Nan Wu, Yuan Xie, "A Survey of Machine Learning for Computer Architecture and Systems", ACM Computing Surveys (CSUR), 2022
[TC] Xing Hu, Ling Liang, Xiaobing Chen, Lei Deng, Yu Ji, Yufei Ding, Zidong Du, Qi Guo, Timothy Sherwood, and Yuan Xie, "A Systematic View of Model Leakage Risks in Deep Neural Network Systems", IEEE Transactions on Computers (TC), 2022
[ASPLOS 2022] Gushu Li, Anbang Wu, Yunong Shi, Ali Javadi-Abhari, Yufei Ding, and Yuan Xie, "Paulihedral: A Generalized Block-Wise Compiler Optimization Framework For Quantum Simulation Kernels", in Proceedings of ACM International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS), 2022
[ISSCC 2022] D. Niu, S. Li, Y. Wang, W. Han, Z. Zhang, Y. Guan, T. Guan, F. Sun, F. Xue, L. Duan, Y. Fang, H. Zheng, X. Jiang, S. Wang, F. Zuo, Y. Wang, B. Yu, Q. Ren, and Y. Xie, "184QPS/W 64Mb/mm^2 3D Logic-to-DRAM Hybrid Bonding with Process-Near Memory Engine for Recommendation System", in Proceedings of International Solid-State Circuits Conference (ISSCC), 2022
[ASPLOS 2022] Zheng Qu, Liu Liu, Fengbin Tu, Zhaodong Chen, Yufei Ding, Yuan Xie, "DOTA: Detect and Omit Weak Attentions for Scalable Transformer Acceleration", in Proceedings of ACM International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS), 2022
[ISSCC 2022] Fengbin Tu,  Yiqi Wang, Zihan Wu, Ling Liang, Yufei Ding, Bongjin Kim, Leibo Liu, Shaojun Wei, Yuan Xie, and Shouyi Yin, "A 28nm 29.2TFLOPS/W BF16 and 36.5TOPS/W INT8 Reconfigurable Digital CIM Processor with Unified FP/INT Pipeline and Bitwise in-Memory Booth Multiplication for Cloud Deep Learning Acceleration", in Proceedings of International Solid-State Circuits Conference (ISSCC), 2022
[ISSCC 2022] Fengbin Tu, Zihan Wu,  Yiqi Wang, Ling Liang, Liu Liu, Yufei Ding, Leibo Liu, Shaojun Wei, Yuan Xie, and Shouyi Yin, "A 28nm 15.59uJ/Token Full-Digital Bitline-Transpose CIM-based Sparse Transformer Accelerator with Pipeline/Parallel Reconfigurable Modes", in Proceedings of International Solid-State Circuits Conference (ISSCC), 2022
[ASPLOS 2022] Bangyan Wang, Lei Deng, Fei Sun, Guohao Dai, Liu Liu, Yu Wang, and Yuan Xie, "A One-for-All and O(Vlog(V))-cost Solution for Parallel Merge Style Operations on Sorted Key-Value Arrays", in Proceedings of ACM International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS), 2022
[ISSCC22] H. Zhu, B. Jiao, J. Zhang, X. Jia, Y. Wang, T. Guan, S. Wang, D. Niu, H. Zheng, C. Chen, M. Wang, L. Zhang, X. Zeng, Q. Liu, Y. Xie, and M. Liu, "COMB-MCM: Computing-on-Memory-Boundary NN Processor with Bipolar Bitwise Sparsity Optimization for Scalable Multi-Chiplet-Module Edge Machine Learning", in Proceedings of International Solid-State Circuits Conference (ISSCC), 2022

2021

[Neurocomputing] Hengnu Chen, Lei Deng, Zheng Qu, Ling Liang, Tianyi Yan, Yuan Xie, and Guoqi Li, "Tensor Train Decomposition for Solving Large-scale Linear Equations", Neurocomputing, 2021
[TNNLS] Lei Deng, Yujie Wu, Yifan Hu, Ling Liang, Guoqi Li, Xing Hu, Yufei Ding, Peng Li, and Yuan Xie, "Comprehensive snn compression using admm optimization and activity regularization", IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2021
[TCAD] Ling Liang, Zheng Qu, Zhaodong Chen, Fengbin Tu, Yujie Wu, Lei Deng, Guoqi Li, Peng Li, and Yuan Xie, "H2Learn: High-Efficiency Learning Accelerator for High-Accuracy Spiking Neural Networks", IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems (TCAD), 2021
[CAL] Xinfeng Xie, Peng Gu, Jiayi Huang, Yufei Ding, and Yuan Xie, "MPU-Sim: A Simulator for In-DRAM Near-Bank Processing Architectures", IEEE Computer Architecture Letters (CAL), 2021
[Neurocomputing] Jiayi Yang, Lei Deng, Yukuan Yang, Yuan Xie, and Guoqi Li, "Training and Inference for Integer-based Semantic Segmentation Network", Neurocomputing, 2021
[ICCAD 2021] Hussam Amrouch, Jian-Jia Chen, Kaushik Roy, Yuan Xie, Indranil Chakraborty, Wenqin Huangfu, Ling Liang, Fengbin Tu, Cheng Wang, and Mikail Yayla, "Brain-Inspired Computing: Adventure from Beyond CMOS Technologies to Beyond von Neumann Architectures ICCAD Special Session Paper", in Proceedings of International Conference on Computer Aided Design (ICCAD), 2021
[ICCAD 2021] Jilan Lin, Shuangchen Li, Yufei Ding, Yuan Xie, "Overcoming the Memory Hierarchy Inefficiencies in Graph Processing Applications", in Proceedings of International Conference on Computer Aided Design (ICCAD), 2021
[CAL] Han Li, Mingyu Yan, Xiaocheng Yang, Lei Deng, Wenming Li, Xiaochun Ye, Dongrui Fan, and Yuan Xie, "Hardware Acceleration for GCNs via Bidirectional Fusion", IEEE Computer Architecture Letters (CAL), 2021
[JSTSP 2021] Ling Liang, Jianyu Xu, Lei Deng, Mingyu Yan, Xing Hu, Zheng Zhang, Guoqi Li, and Yuan Xie, "Fast Search of the Optimal Contraction Sequence in Tensor Networks", IEEE Journal of Selected Topics in Signal Processing (JSTSP), 2021
[TNNLS] Ling Liang, Xing Hu, Lei Deng, Yujie Wu, Guoqi Li, Yufei Ding, Peng Li, Yuan Xie., "Exploring Adversarial Attack in Spiking Neural Networks with Spike-Compatible Gradient", IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2021
[TCAD 2021] Xiaobing Chen, Yuke Wang, Xinfeng Xie, Xing Hu, Abanti Basak, Ling Liang, Mingyu Yan, Lei Deng, Yufei Ding, Zidong Du, Yunji Chen, Yuan Xie, "Rubik: A Hierarchical Architecture for Efficient Graph Learning", IEEE Transactions on Computer Aided Design of Integrated Circuits & Systems
[TCAD 2021] Yuke Wang, Boyuan Feng, Gushu Li, Lei Deng, Yuan Xie, Yufei Ding, "STPAcc: Structural TI-based Pruning for Accelerating Distance-related Algorithms on CPU-FPGA Platforms", IEEE Transactions on Computer Aided Design of Integrated Circuits & Systems
[MICRO 2021] Abanti Basak, Zheng Qu, Jilan Lin, Alaa R. Alameldeen, Zeshan Chishti, Yufei Ding, Yuan Xie, "Improving Streaming Graph Processing Performance Using Input Knowledge", 54th IEEE/ACM International Symposium on Microarchitecture
[SC 2021] Zhaodong Chen*, Zheng Qu*, Liu Liu, Yufei Ding, Yuan Xie, "Efficient Tensor Core-based GPU Kernels for Structured Sparsity under Reduced Precision", 2021 Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis
[SC 2021] Boyuan Feng*, Yuke Wang*, Tong Geng, Ang Li, Yufei Ding, "APNN-TC: Accelerating Arbitrary-Precision Neural Networks on Tensor Cores", 2021 Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis
[USENIX ATC 2021] Boyuan Feng, Yuke Wang, Gushu Li, Yuan Xie, Yufei Ding, "Palleon: A Runtime System for Efficient Video Processing toward Dynamic Class Skew", 2021 USENIX Annual Technical Conference
[ACM NanoCom 2021] Gushu Li, Anbang Wu, Yunong Shi, Ali Javadi-Abhari, Yufei Ding, Yuan Xie, "On the Co-Design of Quantum Software and Hardware", 8th ACM International Conference on Nanoscale Computing and Communication
[ICCAD 2021] Jilan Lin, Shuangchen Li, Yufei Ding and Yuan Xie, "Overcoming the Memory Hierarchy Inefficiencies in Graph Processing Applications", 2021 International Conference On Computer Aided Design
[MICRO 2021] Liu Liu*, Jilan Lin*, Zheng Qu, Yufei Ding, Yuan Xie, "ENMC: Extreme Near-Memory Classification via Approximate Screening", 54th IEEE/ACM International Symposium on Microarchitecture
[CIKM 2021] Yuke Wang, Boyuan Feng, Xueqiao Peng, Yufei Ding, "An Efficient Quantitative Approach for Optimizing Convolutional Neural Networks", 30th ACM International Conference on Information and Knowledge Management
[GLSVLSI 2021] Nan Wu, Yuan Xie, Cong Hao, "IRONMAN: GNN-assisted Design Space Exploration in High-Level Synthesis via Reinforcement Learning", Proceedings of the 2021 on Great Lakes Symposium on VLSI
[JSTSP] Ling Liang, Jianyu Xu, Lei Deng, Mingyu Yan, Xing Hu, Zheng Zhang, Guoqi Li, Yuan Xie, "Fast Search of the Optimal Contraction Sequence in Tensor Networks", IEEE Journal of Selected Topics in Signal Processing
[TCAD] Zheng Qu, Lei Deng, Bangyan Wang, Hengnu Chen, Jilan Lin, Ling Liang, Guoqi Li, Zheng Zhang, Yuan Xie, "Hardware-Enabled Efficient Data Processing with Tensor-Train Decomposition", IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems
[IEEE Trans. Conput.] Bangyan Wang, Lei Deng, Zheng Qu, Shuangchen Li, Zheng Zhang, Yuan Xie, "Efficient Processing of Sparse Tensor Decomposition via Unified Abstraction and PE-interactive Architecture", IEEE Transactions on Computers
[DAC 2021] Pengfei Zuo, Yu Hua, Ling Liang, Xinfeng Xie, Xing Hu, Yuan Xie, "SEALing Neural Network Models in Encrypted Deep Learning Accelerators", IEEE/ACM Design Automation Conference (DAC), 2021
[OSDI 21] Yuke Wang, Boyuan Feng, Gushu Li, Shuangchen Li, Lei Deng, Yuan Xie, and Yufei Ding, "GNNAdvisor: An Adaptive and Efficient Runtime System for GNN Acceleration on GPUs.", 15th USENIX Symposium on Operating Systems Design and Implementation (OSDI), 2021
[PPoPP 2021] Boyuan Feng, Yuke Wang, Guoyang Chen, Weifeng Zhang, Yuan Xie, Yufei Ding, "EGEMM-TC: Accelerating Scientific Computing on Tensor Cores with Extended Precision", Principles and Practice of Parallel Programming 2021

Pages