About me

Cheng Zhang

I am Cheng Zhang, a first-year PhD student in the Circuits and Systems Group, Department of Electrical and Electronic Engineering, Imperial College London, supervised by Dr Yiren (Aaron) Zhao and Prof George A. Constantinides. My research interests are mainly efficient machine learning and AI acceleration.

Education

  • PhD in Electrical and Electronic Engineering, Imperial College London, Jan 2023 - Current
  • MSc in Electronics, The University of Edinburgh, Sep 2021 - Aug 2022
  • BEng in Automation, Beihang University, Sep 2017 - Jun 2021

Publications

  • [ICML2024 Workshop] Yuang Chen, Cheng Zhang, Xitong Gao, Robert D. Mullins, George A. Constantinides, Yiren Zhao. Optimised Grouped-Query Attention Mechanism for Transformers.

  • [ICML2024 Workshop] Zixi Zhang, Cheng Zhang, Xitong Gao, Robert D. Mullins, George A. Constantinides, Yiren Zhao. Unlocking the Global Synergies in Low-Rank Adapters.

  • [ICML2024] Cheng Zhang, Jianyi Cheng, George A. Constantinides, Yiren Zhao. LQER: Low-Rank Quantization Error Reconstruction for LLMs. The Forty-first International Conference on Machine Learning.

  • [FPL2024] Zhewen Yu, Sudarshan Sreeram, Krish Agrawal, Junyi Wu, Alexander Montgomerie-Corcoran, Cheng Zhang, Jianyi Cheng, Christos-Savvas Bouganis, Yiren Zhao. HASS: Hardware-Aware Sparsity Search for Dataflow DNN Accelerator.

  • [NeurIPS2023 Workshop] Cheng Zhang, Jianyi Cheng, Zhewen Yu, Yiren Zhao. MASE: An Efficient Representation for Software-Defined ML Hardware System Exploration. Workshop on ML for Systems at NeurIPS 2023.

  • [EMNLP2023] Cheng Zhang, Jianyi Cheng, Ilia Shumailov, George Anthony Constantinides, Yiren Zhao. Revisiting Block-based Quantisation: What is Important for Sub-8-bit LLM Inference? Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing.