Publications

2025

  1. ICLR
    topk_kernel_0325.png
    RTop-K: Ultra-Fast Row-Wise Top-K Selection for Neural Network Acceleration on GPUs
    Xi Xie, Yuebo Luo, Hongwu Peng, and 1 more author
    In The Thirteenth International Conference on Learning Representations, 2025

2024

  1. ASPLOS
    maxk_forward_kernel.png
    MaxK-GNN: Extremely Fast GPU Kernel Design for Accelerating Graph Neural Networks Training
    Xi Xie*, Hongwu Peng*, Kaustubh Shivdikar, and 6 more authors
    In Proceedings of the 29th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 2, La Jolla, CA, USA, 2024
  2. ICCAD
    adapi_overview_2.png
    AdaPI: Facilitating DNN Model Adaptivity for Efficient Private Inference in Edge Computing
    Tong Zhou, Jiahui Zhao, Yukui Luo, and 4 more authors
    In 2024 IEEE/ACM International Conference on Computer Aided Design (ICCAD), 2024
  3. Preprint
    allm_code_generation.png
    Advanced Large Language Model (LLM)-Driven Verilog Development: Enhancing Power, Performance, and Area Optimization in Code Synthesis
    Kiran Thorat, Jiahui Zhao, Yaotian Liu, and 5 more authors
    arXiv preprint arXiv:2312.01022, 2024

2023

  1. ICCAD
    block_level_partition0517_new.png
    Accel-GCN: High-Performance GPU Accelerator Design for Graph Convolution Networks
    Xi Xie, Hongwu Peng, Amit Hasan, and 7 more authors
    In 2023 IEEE/ACM International Conference on Computer Aided Design (ICCAD), 2023
  2. ICCV
    autorep_framework.png
    AutoReP: Automatic ReLU Replacement for Fast Private Network Inference
    Hongwu Peng, Shaoyi Huang, Tong Zhou, and 11 more authors
    In Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), 2023
  3. AAAI Workshop
    rrnet_overview.png
    RRNet: Towards ReLU-Reduced Neural Network for Two-party Computation Based Private Inference
    Hongwu Peng, Shanglin Zhou, Yukui Luo, and 11 more authors
    AAAI 2023 Workshop on DL-Hardware Co-Design for AI Acceleration, 2023