
Publication

Ruokai Yin, Youngeun Kim, Di Wu, Priyadarshini Panda. LoAS: Fully Temporal-Parallel Dataflow for Dual-Sparse Spiking Neural Networks. MICRO, 2024.

Yanzhang Zhu, Siyuan Niu, Di Wu. Synergizing Error Suppression, Mitigation and Correction for Fault-Tolerant Quantum Computing. QUILLS, 2024.

Prabhu Vellaisamy, Harideep Nair, Di Wu, Shawn Blanton, John Paul Shen. Evaluating Unary GEMM for Low-Precision AI: Toward Scalable Energy-Efficient DL Accelerators. ISVLSI, 2024.

Youpeng Zhao, Di Wu, Jun Wang. ALISA: Accelerating Large Language Model Inference via Sparsity-Aware KV Caching. ISCA, 2024.

Queenly Xie, Prabhu Vellaisamy, Di Wu. xBrain: Brain-Like Computing for Explainable Brain-Computer Interfaces. YArch, 2024.

Prabhu Vellaisamy, Harideep Nair, Di Wu, Shawn Blanton, John Paul Shen. Exploration of Unary Arithmetic-Based Matrix Multiply Units for Low Precision DL Accelerators. WUC, 2024.

Zhewen Pan, Joshua San Miguel, Di Wu. Carat: Unlocking Value-Level Parallelism for Multiplier-Free GEMMs. ASPLOS, 2024.
🏅 Distinguished Artifact Evaluation Award
[paper]

Di Wu, Jingjie Li, Zhewen Pan, Younghyun Kim, Joshua San Miguel. uBrain: A Unary Brain Computer Interface. ISCA, 2022.
[paper]

Zhewen Pan, Di Wu, Joshua San Miguel. T-MAC: Temporal Multiplication with Accumulation. YArch, 2022.
[paper]

Di Wu, Joshua San Miguel. uSystolic: Byte-Crawling Unary Systolic Array. HPCA, 2022.
[paper]

Di Wu, Joshua San Miguel. Special Session: When Dataflows Converge: Reconfigurable and Approximate Computing for Emerging Neural Networks. ICCD, 2021.
[paper] [slide]

Di Wu, Jingjie Li, Setareh Behroozi, Younghyun Kim, Joshua San Miguel. UNO: Virtualizing and Unifying Nonlinear Operations for Emerging Neural Networks. ISLPED, 2021.
[paper] [slide]

Di Wu, Jingjie Li, Ruokai Yin, Hsuan Hsiao, Younghyun Kim, Joshua San Miguel. uGEMM: Unary Computing for GEMM Applications. IEEE Micro, 2021.
[paper]

Di Wu, Ruokai Yin, Joshua San Miguel. Normalized Stability: A Cross-Level Design Metric for Early Termination in Stochastic Computing. ASP-DAC, 2021.
[paper] [slide]

Di Wu, Ruokai Yin, Joshua San Miguel. In-Stream Correlation-Based Division and Bit-Inserting Square Root in Stochastic Computing. IEEE D&T, 2021.
[paper]

Di Wu, Jingjie Li, Ruokai Yin, Hsuan Hsiao, Younghyun Kim, Joshua San Miguel. uGEMM: Unary Computing Architecture for GEMM Applications. ISCA, 2020.
🏅 IEEE Micro Top Pick
[paper] [slide]

Younghyun Kim, Joshua San Miguel, Setareh Behroozi, Tianen Chen, Kyuin Lee, Yongwoo Lee, Jingjie Li, Di Wu. Approximate Hardware Techniques for Energy-Quality Scaling Across the System. ICEIC, 2020.
[paper]

Di Wu, Tianen Chen, Chienfu Chen, Oghenefego Ahia, Joshua San Miguel, Mikko Lipasti, Younghyun Kim. SECO: A Scalable Accuracy Approximate Exponential Function Via Cross-Layer Optimization. ISLPED, 2019.
[paper] [slide] [poster]

Di Wu, Joshua San Miguel. In-Stream Stochastic Division and Square Root via Correlation. DAC, 2019.
[paper] [slide] [poster]

Qichen Zhang, Yun Chen, Di Wu, Xiaoyang Zeng, Yeong-luh Ueng. Convergence-Optimized Variable Node Structure for Stochastic LDPC Decoder. ICASSP, 2016.
[paper]

Di Wu, Yun Chen, Qichen Zhang, Yeong-luh Ueng, Xiaoyang Zeng. Strategies for Reducing Decoding Cycles in Stochastic LDPC Decoders. IEEE TCAS-II, 2016.
[paper]

Qichen Zhang, Yun Chen, Di Wu, Xiaoyang Zeng, Yeong-luh Ueng. An Area-Efficient Architecture for Stochastic LDPC Decoder. DSP, 2015.
[paper]

Di Wu, Yun Chen, Qichen Zhang, Lirong Zheng, Xiaoyang Zeng, Yeong-luh Ueng. Latency-Optimized Stochastic LDPC Decoder for High-Throughput Applications. ISCAS, 2015.
[paper]

Yun Chen, Qichen Zhang, Di Wu, Changsheng Zhou, Xiaoyang Zeng. An Efficient Multirate LDPC-CC Decoder with a Layered Decoding Algorithm for the IEEE 1901 Standard. IEEE TCAS-II, 2014.
[paper]

Di Wu, Yun Chen, Yuebin Huang, Yeong-luh Ueng, Lirong Zheng, Xiaoyang Zeng. A High-Throughput LDPC Decoder for Optical Communication. ASICON, 2013.
[paper]
