
Publication

Xiaowei Xu, Zhiding Liang, Zichang He, Yue Sun, Dylan Herman, Qingyue Jiao, Yanzhang Zhu, Weiwen Jiang, Di Wu, Marco Pistoia, Yiyu Shi. Synergizing Quantum Techniques with Machine Learning for Advancing Drug Discovery Challenge. Scientific Reports, 2025.

Dewan Saiham, Di Wu, Sazadur Rahman. Leveraging Photonic Interconnects for Scalable and Efficient Fully Homomorphic Encryption. GOMACTech, 2025.

Ruokai Yin, Youngeun Kim, Di Wu, Priyadarshini Panda. LoAS: Fully Temporal-Parallel Dataflow for Dual-Sparse Spiking Neural Networks. MICRO, 2024.

Yanzhang Zhu, Siyuan Niu, Di Wu. Synergizing Error Suppression, Mitigation and Correction for Fault-Tolerant Quantum Computing. QUILLS, 2024.

Prabhu Vellaisamy, Harideep Nair, Di Wu, Shawn Blanton, John Paul Shen. Evaluating Unary GEMM for Low-Precision AI: Toward Scalable Energy-Efficient DL Accelerators. ISVLSI, 2024.

Youpeng Zhao, Di Wu, Jun Wang. ALISA: Accelerating Large Language Model Inference via Sparsity-Aware KV Caching. ISCA, 2024.

Queenly Xie, Prabhu Vellaisamy, Di Wu. xBrain: Brain-Like Computing for Explainable Brain-Computer Interfaces. YArch, 2024.

Prabhu Vellaisamy, Harideep Nair, Di Wu, Shawn Blanton, John Paul Shen. Exploration of Unary Arithmetic-Based Matrix Multiply Units for Low Precision DL Accelerators. WUC, 2024.

Zhewen Pan, Joshua San Miguel, Di Wu. Carat: Unlocking Value-Level Parallelism for Multiplier-Free GEMMs. ASPLOS, 2024.
🏅 Distinguished Artifact Evaluation Award
[paper]

Di Wu, Jingjie Li, Zhewen Pan, Younghyun Kim, Joshua San Miguel. uBrain: A Unary Brain Computer Interface. ISCA, 2022.
[paper]

Zhewen Pan, Di Wu, Joshua San Miguel. T-MAC: Temporal Multiplication with Accumulation. YArch, 2022.
[paper]

Di Wu, Joshua San Miguel. uSystolic: Byte-Crawling Unary Systolic Array. HPCA, 2022.
[paper]

Di Wu, Joshua San Miguel. Special Session: When Dataflows Converge: Reconfigurable and Approximate Computing for Emerging Neural Networks. ICCD, 2021.
[paper] [slide]

Di Wu, Jingjie Li, Setareh Behroozi, Younghyun Kim, Joshua San Miguel. UNO: Virtualizing and Unifying Nonlinear Operations for Emerging Neural Networks. ISLPED, 2021.
[paper] [slide]

Di Wu, Jingjie Li, Ruokai Yin, Hsuan Hsiao, Younghyun Kim, Joshua San Miguel. uGEMM: Unary Computing for GEMM Applications. IEEE Micro, 2021.
[paper]

Di Wu, Ruokai Yin, Joshua San Miguel. Normalized Stability: A Cross-Level Design Metric for Early Termination in Stochastic Computing. ASP-DAC, 2021.
[paper] [slide]

Di Wu, Ruokai Yin, Joshua San Miguel. In-Stream Correlation-Based Division and Bit-Inserting Square Root in Stochastic Computing. IEEE D&T, 2021.
[paper]

Di Wu, Jingjie Li, Ruokai Yin, Hsuan Hsiao, Younghyun Kim, Joshua San Miguel. uGEMM: Unary Computing Architecture for GEMM Applications. ISCA, 2020.
🏅 IEEE Micro Top Pick
[paper] [slide]

Younghyun Kim, Joshua San Miguel, Setareh Behroozi, Tianen Chen, Kyuin Lee, Yongwoo Lee, Jingjie Li, Di Wu. Approximate Hardware Techniques for Energy-Quality Scaling Across the System. ICEIC, 2020.
[paper]

Di Wu, Tianen Chen, Chienfu Chen, Oghenefego Ahia, Joshua San Miguel, Mikko Lipasti, Younghyun Kim. SECO: A Scalable Accuracy Approximate Exponential Function Via Cross-Layer Optimization. ISLPED, 2019.
[paper] [slide] [poster]

Di Wu, Joshua San Miguel. In-Stream Stochastic Division and Square Root via Correlation. DAC, 2019.
[paper] [slide] [poster]

Qichen Zhang, Yun Chen, Di Wu, Xiaoyang Zeng, Yeong-luh Ueng. Convergence-Optimized Variable Node Structure for Stochastic LDPC Decoder. ICASSP, 2016.
[paper]

Di Wu, Yun Chen, Qichen Zhang, Yeong-luh Ueng, Xiaoyang Zeng. Strategies for Reducing Decoding Cycles in Stochastic LDPC Decoders. IEEE TCAS-II, 2016.
[paper]

Qichen Zhang, Yun Chen, Di Wu, Xiaoyang Zeng, Yeong-luh Ueng. An Area-Efficient Architecture for Stochastic LDPC Decoder. DSP, 2015.
[paper]

Di Wu, Yun Chen, Qichen Zhang, Lirong Zheng, Xiaoyang Zeng, Yeong-luh Ueng. Latency-Optimized Stochastic LDPC Decoder for High-Throughput Applications. ISCAS, 2015.
[paper]

Yun Chen, Qichen Zhang, Di Wu, Changsheng Zhou, Xiaoyang Zeng. An Efficient Multirate LDPC-CC Decoder with a Layered Decoding Algorithm for the IEEE 1901 Standard. IEEE TCAS-II, 2014.
[paper]

Di Wu, Yun Chen, Yuebin Huang, Yeong-luh Ueng, Lirong Zheng, Xiaoyang Zeng. A High-Throughput LDPC Decoder for Optical Communication. ASICON, 2013.
[paper]
